hermes-web-ui

mirror of https://github.com/EKKOLearnAI/hermes-web-ui.git synced 2026-05-28 06:50:14 +00:00

Author	SHA1	Message	Date
ekko	74a3bd4acf	feat(chat-run-socket): record usage from Hermes session in syncFromHermes Call updateUsage() in syncFromHermes to record token usage data from Hermes ephemeral session to local DB. This ensures accurate usage tracking including: - input_tokens - output_tokens - cache_read_tokens - cache_write_tokens - reasoning_tokens - model The usage data comes from the Hermes session detail which contains accurate token counts from the upstream LLM provider. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 10:46:53 +08:00
ekko	dbe6c30d8e	fix(chat-run-socket): exclude last user message instead of comparing timestamps Replace timestamp-based filtering (m.timestamp !== now) with position-based filtering. This is more reliable because: 1. No precision issues with second-level timestamps 2. Handles edge cases where multiple messages have the same timestamp 3. Works correctly even if there's a small time difference between now and DB record New logic: 1. Filter valid messages first 2. Find the last user message from the end 3. Exclude it from history (it's the one we just added in handleRun) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 10:40:27 +08:00
ekko	541d296d58	fix(chat-run-socket): exclude current user message from conversation_history When loading conversation_history from DB, exclude the message that was just added (with timestamp === now) to avoid duplication in the upstream request. Since user messages are now written immediately to DB on run start, we need to filter them out when building history for the upstream call. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 10:39:18 +08:00
ekko	9ddf595111	fix(chat-run-socket): write user message to DB immediately on run start Changes: - Move addMessage() call to handleRun start, before conversation_history loading - Remove delayed addMessage() after history loading (no longer needed) - Remove useLocalSessionStore() check - always write user message immediately - Simplify syncFromHermes to always skip user messages This ensures user messages are persisted immediately when a run starts, improving reliability and user experience. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 10:37:20 +08:00
ekko	7540150bf8	fix(chat-run-socket): sync user messages from Hermes when not using local store When using Hermes state.db (not local store), user messages were never written to local DB because: 1. handleRun only calls addMessage() when useLocalSessionStore() is true 2. syncFromHermes was filtering out all user messages Fix: Conditionally sync user messages based on store mode: - Local store mode: skip user messages (already written in handleRun) - Hermes state.db mode: sync all messages including user messages Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 10:35:23 +08:00
ekko	485a028743	fix(chat-run-socket): use profile-specific upstream from GatewayManager Replace hardcoded UPSTREAM env var with dynamic lookup via gatewayManager.getUpstream(profile). This ensures each profile connects to its own gateway instance with correct port and host. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 10:32:27 +08:00
ekko	edfbfff6e8	feat(usage): enhance usage tracking with cache tokens and model info Backend changes: - Add cache_read_tokens, cache_write_tokens, reasoning_tokens, model fields - Migrate from session_id PRIMARY KEY to separate id column with session_id index - Update updateUsage() to accept data object instead of separate params - Add migration logic to preserve existing data during schema upgrade - Add UsageRecord interface for type safety Frontend changes: - Update UsageView to display new token types (cache, reasoning) - Update usage store to handle new usage structure - Update sessions API to fetch enhanced usage data Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 10:29:36 +08:00
ekko	3cb215a92b	fix(chat-run-socket): defer addMessage call to avoid duplicate in conversation_history - Move `const now` outside session_id block for broader scope - Defer addMessage() call until after conversation_history is loaded - This prevents the user message from appearing twice in history - Remove updateUsage call from calcAndUpdateUsage to avoid double counting Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 10:28:01 +08:00
ekko	5fa1fc60b9	feat(sessions): add local session store support to conversation endpoints Live mode (ConversationMonitorPane) now reads from local session-store when useLocalSessionStore() is enabled, instead of always hitting Hermes state.db. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-29 00:38:18 +08:00
ekko	95fc6692ec	fix(chat): use totalTokens for compression.started token_count Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-29 00:34:37 +08:00
ekko	e3c5352d69	fix(chat): unify token calculation via calcAndUpdateUsage and fix session search - Make calcAndUpdateUsage the single entry point for all inputTokens/outputTokens calculation, always loading from DB with snapshot awareness - Remove overrideInputTokens parameter; compression path calls calcAndUpdateUsage before and after compress, letting DB state be the source of truth - Add inputTokens + outputTokens as totalTokens for compression threshold comparison - Fix session search to match message content (not just title), return snippets and matched_message_id via two-step query - Fall back to preview for session title display when title is null - Remove isStreaming guard from newChat() to allow creating sessions anytime Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-29 00:20:13 +08:00
ekko	4ddd2c12a5	feat(chat): use ephemeral Hermes session per run and sync tool results from state.db - Generate ephemeral session_id for each Hermes run, sync complete data (including tool results) from Hermes state.db after run completion - Resolve tool_name from assistant message's tool_calls JSON (Hermes stores tool_name as NULL in its messages table) - Fall back to preview as title in mapSessionRow when title is empty - Set preview from first user message when creating local sessions - Enqueue ephemeral sessions for deferred deletion via gc_pending_session_deletes - Fix enqueueEphemeralDelete: use top-level import instead of require, set next_attempt_at to now (was 0, preventing drain) - Remove isStreaming guard from newChat() to allow creating sessions anytime Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-28 22:54:45 +08:00
ekko	94d581a112	feat(sessions): add local session store with SessionDeleter and config toggle - Add session-store.ts: self-built SQLite CRUD for sessions/messages - Add session-deleter.ts: timer-based singleton for deferred session deletion - Add SESSION_STORE env var (local\|remote) to toggle between local SQLite and Hermes CLI - Update sessions controller to branch on useLocalSessionStore() - Update chat-run-socket to persist messages to local DB on run completion - Improve SSE event handling: tool_call_id capture, finish_reason tracking - Update group-chat to use SessionDeleter instead of direct CLI delete - Update context-compressor to enqueue compression sessions for deferred deletion Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-28 22:09:47 +08:00
ekko	d4c0596204	fix(chat): remove upstream usage values and pre-send inputTokens overwrite - Remove all evt.usage/parsed.usage references, only use local countTokens - Remove pre-send inputTokens calculation that was overwriting resume value with compressed context, causing incorrect context drop (70k → 40k) - run.completed now recalculates inputTokens with current snapshot + full messages including new ones from this run Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-28 20:32:40 +08:00
ekko	acf1da16f6	feat(chat): add server-side sessionMap with message tracking and resume-based loading - Add sessionMap to ChatRunSocket consolidating activeRuns + sessionStates, tracking messages, isWorking status, events, and token usage per session - Load messages from DB on resume when not in memory, return via resumed event - Track streaming messages (user/assistant/tool/reasoning) into sessionMap so reconnecting clients get full message history without HTTP fetch - Calculate token usage locally with countTokens, snapshot-aware for compressed sessions - Add usage.updated event broadcast on run.completed with recalculated tokens - Replace HTTP fetchSession with Socket.IO resume for message loading - Add serverWorking state to drive streaming indicator from server isWorking status - Clear events immediately on run completion instead of delayed cleanup Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-28 20:12:39 +08:00
ekko	cacf8a2bac	feat(chat): replace HTTP+SSE with Socket.IO for chat runs and add context compression - Replace HTTP POST + SSE streaming with Socket.IO /chat-run namespace for decoupled message handling that survives client disconnect/refresh - Add SQLite-backed context compression with snapshot-based incremental updates - Unify server-side session state tracking (completedSessions + compressingSessions → sessionStates) for reliable state replay on reconnect - Filter compress_ sessions from session list queries - Add compression snapshot store with proper snake_case→camelCase column aliases - Delete temporary compress_ sessions after compression completes - Change compressed summary role from 'system' to 'user' - Add compression.started/completed events to frontend chat store Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-28 16:44:34 +08:00
ww	610f3eb9d0	feat(copilot): integrate GitHub Copilot provider with dynamic model list / 集成 GitHub Copilot provider 与动态模型列表 (#239 ) * feat(copilot): integrate GitHub Copilot provider with dynamic model list 集成 GitHub Copilot provider 与动态模型列表 EN: - New copilot-models service: fetch live model list from GitHub /models API - Filter noise IDs (accounts/, text-embedding, rerank prefixes) - Pass through preview/disabled metadata to frontend - Cache isolated per OAuth token (FNV-1a hash key) to prevent cross-account leak - Multi-source token resolution: env > apps.json > gh CLI - ModelSelector renders PREVIEW (orange) and UNAVAILABLE (gray, non-selectable) badges with tooltips - ProviderFormModal exposes Copilot OAuth login entry - New CopilotLoginModal component: guides gh auth login device flow - ProviderCard hides delete button for OAuth-only builtin providers (copilot/codex/nous) since their credentials live outside auth.json ZH: - 新增 copilot-models 服务：从 GitHub /models live API 拉取模型列表 - 噪音 ID 过滤（accounts/、text-embedding、rerank 前缀） - preview/disabled 元数据透传至前端 - 缓存按 OAuth token 隔离（FNV-1a hash key），避免切换 profile 串账号 - 多源 token 解析优先级：env > apps.json > gh CLI - ModelSelector 渲染 PREVIEW（橙色）/ UNAVAILABLE（灰色、不可选）badge，附 tooltip - ProviderFormModal 提供 Copilot OAuth 登录入口 - 新增 CopilotLoginModal 组件：引导 gh auth login 设备流程 - ProviderCard 对 OAuth-only builtin（copilot/codex/nous）隐藏删除按钮其凭证不在 auth.json，删除按钮原本无效 Tests / 测试: new copilot-models suite (cache isolation, noise filter, preview/disabled passthrough) + copilot-login-modal — 24/24 passed. Pre-existing sessions-db-lineage failure on upstream/main is unrelated. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * refactor(copilot): switch to explicit opt-in per maintainer feedback 回应 PR #239 review：上一版会自动把系统级 GitHub OAuth 凭证（VS Code Copilot 插件、gh CLI 登录态）当作 hermes provider 拉到列表里，对未在 hermes 中注册过 Copilot 的用户造成困扰。本次改为显式 opt-in：用户必须通过 Add Provider 主动添加，删除时按 token 来源决定是否清 ~/.hermes/.env，并避免误清理 VS Code / gh CLI 用户的全局凭证。 Address PR #239 review feedback. Previously Copilot would silently appear in the provider list whenever the host had any GitHub OAuth token (VS Code plugin, gh CLI login). This caused confusion for users who never explicitly registered Copilot in hermes. Now Copilot requires explicit opt-in via Add Provider; on delete we only clear ~/.hermes/.env when the token actually originated there, leaving VS Code / gh CLI credentials untouched. What changed - 新增 ~/.hermes-web-ui/config.json 的 copilotEnabled flag 控制可见性 - 即便能解析到 token，未启用时也不在列表中显示 - resolveCopilotOAuthTokenWithSource 区分 token 来源（env / gh-cli / apps-json） - ProviderFormModal 增加 GitHub Copilot 入口；无 token 时进 device flow modal - CopilotLoginModal 重写为 in-app device flow 状态机（不再要求用户在终端跑 gh） - 删除 Copilot 时仅 source='env' 才清 ~/.hermes/.env，并自动 fallback 默认模型 - 老用户升级兼容：若 default 仍指向已禁用的 copilot，后端清空 default 让前端兜底 API - POST /api/hermes/copilot-auth/check-token - POST /api/hermes/copilot-auth/enable - POST /api/hermes/copilot-auth/disable - POST /api/hermes/copilot-auth/start (device flow) - POST /api/hermes/copilot-auth/poll (device flow) Tests - tests/server/copilot-auth-controller.test.ts (11 cases) - tests/server/copilot-device-flow.test.ts (12 cases) - tests/client/copilot-login-modal.test.ts 重写覆盖状态机 Follow-ups (留作后续 PR) - device flow session 未绑定 profile，登录中切 profile 会写到错的 .env - copilot device-code 接口的 expires_in 字段未使用，硬编码 15 分钟超时 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-04-26 22:51:35 +08:00
ekko	0446385a37	chore: add v0.4.8 changelog and improve scroll behavior (#234 ) * chore: add v0.4.8 changelog and improve scroll behavior - Add v0.4.8 changelog entries for recent fixes - Fix forced scroll to bottom when returning from other tabs - Smooth session switch with loading transition overlay - Auto-scroll to bottom after mermaid diagram rendering - Bump version to 0.4.8 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: replace blob URLs with persistent download URLs and add image preview - Replace blob URLs with /api/hermes/download URLs after upload so attachments survive page refresh - Add click-to-preview overlay for image attachments - Move upload directory from /tmp to ~/.hermes-web-ui/upload Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: replace findLast with reverse+find for ES2022 compat Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * chore: bump TypeScript lib target from ES2022 to ES2023 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: add changelog entries for blob URL fix, image preview and upload dir Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-26 13:28:08 +08:00
Zhicheng Han	ed12e958d0	fix: report web ui version in dev health checks (#231 )	2026-04-26 10:55:08 +08:00
ekko	8db644496e	fix(sessions): optimize N+1 queries and fix search 500 on non-CJK input (#230 ) Replace per-session SQL queries in listSessionSummaries/searchSessionSummaries with a single bulk load via loadAllSessions() + in-memory map traversal, eliminating N+1 round-trips. Fix search 500 error for pure numbers, English letters, and other FTS5-incompatible input by extending the catch fallback beyond CJK-only to all FTS query failures. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-26 10:44:51 +08:00
Zhicheng Han	b68ba8bcb9	fix chat session lineage visibility (#228 )	2026-04-26 10:29:17 +08:00
Zhicheng Han	f1a6d97c8b	fix(sessions): harden compressed session lineage projection (#226 ) - Project compressed roots to their continuation tip in session lists. - Search title/content candidates through logical compression lineage. - Hydrate detail views along the requested continuation branch while preserving requested ids. - Scope model-context cache lookup by provider to avoid same-name cross-provider matches. - Add regression coverage for lineage and provider lookup behavior.	2026-04-26 10:10:01 +08:00
Zhicheng Han	d2ab2bca08	fix(sessions): 修复压缩续接会话详情为空 (#218 ) Session detail now prefers DB-backed reconstruction for compressed continuation chains, with CLI fallback preserved and pending-deletion guard covered by tests.	2026-04-25 22:23:33 +08:00
ekko	bc9b43f06a	fix: model switch reset, custom provider resolution and base_url_env cleanup (#212 ) * fix: reset entire config.model on model switch Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: resolve custom provider from CLI config and clean base_url_env on delete - When config.model.provider is "custom" (set by hermes CLI), match base_url + model against custom_providers to resolve custom:name - Clear base_url_env from .env when deleting a builtin provider Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-25 19:38:46 +08:00
Zhicheng Han	00c6b9532c	fix: make context length lookup provider-aware (#207 )	2026-04-25 18:57:22 +08:00
ekko	65e712edfc	fix: group chat UX polish and model switch cleanup (#205 ) * fix(chat): clamp context remaining tokens to 0 instead of showing negative Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: group chat mention popup position, timestamp style, and model switch cleanup - Move @ mention popup above input to avoid blocking the textarea - Fix .msg-time scoping (was nested inside .msg-header, now top-level) - Reduce timestamp opacity and set to 12px for subtler display - Clean up stale base_url/api_key from config.yaml on model switch Closes #204 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: remove unused variables in GroupChatInput Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-25 16:16:11 +08:00
ww	4bdcaa6258	feat: add Alibaba Coding Plan provider with .env base_url support (#200 ) * feat(providers): 新增 Alibaba Cloud (Coding Plan) 内置 provider 对齐 hermes-agent 上游 PR #15045（commit 727d1088），新增 alibaba-coding-plan provider，鉴权使用 ALIBABA_CODING_PLAN_API_KEY 环境变量，base_url 可通过 ALIBABA_CODING_PLAN_BASE_URL 覆盖。默认 base_url 使用国际版端点 coding-intl.dashscope.aliyuncs.com/v1，与上游 auth.py:255 保持一致。中国大陆 DashScope 账号（dashscope.aliyun.com 颁发的 sk-sp-* 密钥）需要通过 ALIBABA_CODING_PLAN_BASE_URL=https://coding.dashscope.aliyuncs.com/v1 （不带 -intl）覆盖，因为 -intl 端点对该类密钥返回 HTTP 401。该差异在源码注释中已说明。模型列表覆盖 8 个 Coding Plan 支持的模型：qwen3.5-plus、 qwen3-max-2026-01-23、qwen3-coder-next/plus、glm-5、glm-4.7、 kimi-k2.5、MiniMax-M2.5（基于实测可用列表）。 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * feat(providers): Alibaba Coding Plan 添加国内/国际区域切换在 ProviderFormModal 中针对 alibaba-coding-plan preset 增加一个 "区域"字段，可在国际版（coding-intl）与中国大陆（coding，无 -intl）两个端点之间切换，切换时自动更新 base_url。默认选中国际版以对齐上游 hermes-agent 默认值。中国大陆 DashScope 账号（dashscope.aliyun.com 颁发的 sk-sp-* 密钥）只需在表单里点一下 "中国大陆"即可，无需手动改 base_url 或设环境变量。 8 个 locale（zh/en/de/es/fr/ja/ko/pt）都补全了 region/regionIntl/ regionCn 三个 i18n key。 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * fix(providers): builtin provider 列表优先读取 base_url env override 之前服务端 getAvailable 在渲染 builtin provider 列表时直接用 PROVIDER_PRESETS 里的默认 base_url，忽略了用户保存到 .env 的 base_url override。这导致用户在 Alibaba Coding Plan 选了"中国大陆"保存后，列表里仍然显示国际版 URL。修复：envMapping.base_url_env 如果存在且 .env 中有值，优先使用该值；否则 fallback 到 preset 默认。 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-04-25 14:00:07 +08:00
356252190-star	12ae840234	fix: skip remote profiles in startAll() to prevent startup hang (#197 )	2026-04-25 09:22:01 +08:00
ekko	70ed0e0dc2	revert: harden Hermes stream recovery around tool-call boundaries (#189 ) (#192 ) Reverts #189 due to reported bugs. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-24 22:18:32 +08:00
Zhicheng Han	009acc1c28	fix: harden Hermes stream recovery around tool boundaries (#189 )	2026-04-24 21:42:42 +08:00
ekko	edd41e6eb7	fix: group chat mobile UX and UI polish (#188 ) * fix: group chat UI background colors and replace console.log in context-engine - Set message list background to $bg-card to match single chat - Set status-bar background to transparent - Replace all console.log/warn with logger in context-engine compressor Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: group chat mobile UX improvements - Add backdrop overlay for mobile sidebar with tap-to-close - Auto-collapse sidebar on room select in mobile - Move timestamp below message bubble - Widen msg-body max-width to 85% to match single chat - Add left padding to chat-header to avoid hamburger overlap Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-24 21:28:06 +08:00
ekko	ba72264542	feat: group chat session lifecycle, typing recovery, mention highlighting (#186 ) * feat: restore group chat system with Socket.IO and SQLite persistence - GroupChatServer: Socket.IO server with room management, message history, typing indicators - SQLite storage for rooms, messages, and agent configuration - AgentClients: manages AI agent connections via socket.io-client, forwards @mentions to Hermes gateway - REST API: room CRUD, agent management, invite codes - Agent auto-restoration on server restart - Tests for all REST endpoints Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: add context-engine design document for group chat compression Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: handle special-character session search * fix: keep unicode dotted session search on quoted FTS path * feat: add context engine and group chat frontend UI - Context engine: three-zone compression (head/tail/summary) with LLM summarization, incremental updates, TTL cache, and graceful degradation - Frontend: group chat page with Socket.IO client, room sidebar, message list, agent/member display, create/join-by-code modals - Integration: wire context engine into agent-clients before /v1/runs - Refactor ChatStorage to use global DB (getDb/ensureTable) with gc_ prefix - Add i18n keys for group chat to all 8 locales - Add sidebar nav entry and router for group chat page Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: remove leftover main branch code from merge conflict resolution The `isNumericQuery`, `hasUnsafeChars`, and `runLikeContentSearch` functions no longer exist — they were replaced by HEAD's `shouldUseLiteralContentSearch` and `runLiteralContentSearch`. This dead code block caused a TypeScript compile error after the merge. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: install missing socket.io dep and type ack params Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: enable WebSocket proxy and fix socket.io transport for group chat - Add ws: true to Vite proxy config so WebSocket upgrade requests are forwarded to the backend - Allow both polling and websocket transports on server and client (polling as fallback when WebSocket upgrade fails through proxy) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: separate socket.io path from REST routes for group chat socket.io was mounted at /api/hermes/group-chat which intercepted all REST requests to /api/hermes/group-chat/rooms etc, returning "Transport unknown". Changed socket.io path to /api/hermes/group-chat/ws to avoid conflicts. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: improve group chat UI, agent management, and socket.io reliability - Redesign GroupChatPanel with Naive UI, stacked agent avatars, and popover management - Match GroupChatInput style with single chat input, add IME composition handling - Add agent add/remove per room with profile selection and duplicate prevention - Use @multiavatar for SVG avatar generation with caching - Decouple joinRoom from socket.io, use REST API for data loading - Switch socket.io to default path with /group-chat namespace to avoid proxy conflicts - Restore agent connections after server is listening - Add getRoomDetail REST endpoint and duplicate agent prevention (409) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: server-side @mention routing with context compression status and queue - Move @mention detection from agent socket listeners to server-side processMentions() - Add per-room processing lock to block mention dispatch during compression - Queue mentions during processing, drain only the latest when ready - Emit context_status events (compressing/replying/ready) to room via Socket.IO - Frontend displays compression status indicator above input - Token-based compression trigger (100k threshold) with CJK-aware estimation - Fix compressor type errors (countTokens parameter type) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: improve group chat profile handling and session sync Refine group chat room/session behavior with per-room compression controls, sidebar updates, and better stale session cleanup so multi-profile group chat state stays consistent. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: group chat improvements — session lifecycle, typing recovery, mention highlighting - Fix cross-profile session deletion with deferred delete queue - Move saveSessionProfile to after gateway response confirmation - Replace all console.log with logger in group-chat modules - Add server-side typing/context_status state tracking for room rejoin - Fix @ mention popup position to follow cursor - Add @ mention highlighting (blue) in chat message content - Fix mention regex to match all occurrences after HTML tags - Enable esbuild minify and treeShaking - Move @multiavatar/multiavatar to devDependencies - Add i18n keys for group chat features - Update tests for new functionality Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * chore: bump version to 0.4.5 and move @multiavatar to devDependencies Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: Zhicheng Han <zhicheng.han@mathematik.uni-goettingen.de>	2026-04-24 20:41:14 +08:00
ekko	82965ae6e2	refactor: rewrite model-context to use js-yaml, add context_length to provider form (#177 ) * fix: context-length API returns 200K instead of actual model context Two bugs cause the /api/hermes/sessions/context-length endpoint to always return DEFAULT_CONTEXT_LENGTH (200K): 1. getModelContextLength ignores config.yaml model.context_length The function only checks models_dev_cache.json (which doesn't exist in default installations) and falls back to the hardcoded 200K default, completely ignoring the user's explicit model.context_length setting in config.yaml. 2. getDefaultModel regex fails when api_key/base_url come before default The regex /^model:\s\n\s+default:\s(.+)$/m assumes 'default' is the first child key under 'model:', but when api_key or base_url appear first in the YAML, the match fails. This causes getModelContextLength to short-circuit to DEFAULT_CONTEXT_LENGTH before even reaching the cache lookup. Fix: - Add getDefaultModelRobust() that extracts the entire model: block first, then searches for default: within it - Add getConfigContextLength() that reads model.context_length from config.yaml as a fallback (matching hermes-agent priority) - Update getModelContextLength() resolution order: 1. models_dev_cache.json (existing) 2. config.yaml model.context_length (new) 3. DEFAULT_CONTEXT_LENGTH (existing fallback) Closes #169 * refactor: rewrite model-context to use js-yaml, add context_length to provider form - Replace fragile regex-based YAML parsing with js-yaml for reliable config.yaml reads - Fix context_length resolution priority: config.yaml override > custom_providers > models_dev_cache > 200K default - Add context_length input field when adding custom providers in ProviderFormModal - Backend: persist context_length to custom_providers models.<model>.context_length in config.yaml - Add i18n keys (contextLength, contextLengthPlaceholder) to all 8 locales Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: use NInputNumber instead of NInput type=number for context_length NInput does not support type="number" in Naive UI. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: devilardis <53129661@qq.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-24 11:18:11 +08:00
Zhicheng Han	30e88797ef	fix: add gpt-5.5 to OpenAI Codex models (#175 )	2026-04-24 10:11:21 +08:00
ekko	f8283729ba	refactor: replace jobs proxy with local controller and optimize model loading (#174 ) * refactor: replace jobs proxy with local controller and optimize model loading - Add local jobs controller that directly fetches upstream gateway with profile support and 30s timeout, replacing unreliable proxy catch-all - Upstream errors (non-200) return 502 instead of leaking to frontend - Switch loadModels() from fetchAvailableModels (slow, fetches all provider APIs) to fetchConfigModels (reads config.yaml only) - Hide logo dance video in sidebar Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: resolve TypeScript errors from previous refactor - Remove unused imports (danceVideo, useTheme) in AppSidebar - Map ConfigModelsResponse.groups to AvailableModelGroup[] format Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-24 09:57:30 +08:00
ekko	88c7e25f78	fix(i18n): add i18n support for custom model feature in ModelSelector (#172 ) * feat(models): add custom model name input with provider selector - Add custom model input field at bottom of model selector modal - Add provider dropdown to specify target provider for custom model - Track custom models in app store and display with CUSTOM badge - Merge custom model into provider group list - Fix custom provider models being overwritten by API response (keep both) * Upload screenshot * fix(i18n): add i18n support for custom model feature in ModelSelector Replace hardcoded English strings (CUSTOM badge, placeholder, hint) with vue-i18n t() calls and add corresponding translation keys to all 8 locales (en, zh, ja, ko, fr, es, de, pt). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: toller892 <892@users.noreply.github.com> Co-authored-by: Tony <125938283+toller892@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-24 08:49:45 +08:00
ekko	30c94b226a	fix: add periodic log rotation to prevent unbounded log growth (#160 ) Log rotation previously only ran at startup, causing logs to grow indefinitely on long-running processes (reported up to 71GB/day). Now checks file size every 60 seconds and truncates when exceeding 3MB. Fixes #155 Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-23 19:49:31 +08:00
ekko	1abe308742	feat: add Node.js version warning, fix provider URL detection, and add v0.4.4 changelog (#146 ) - Display persistent warning bar when Node.js version < 23 - Fix provider model fetching to support non-v1 API versions (e.g. /v4) - Add v0.4.4 changelog entries to frontend - Bump version to 0.4.4 Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-23 12:57:42 +08:00
ww	0cc31ee999	feat: add file browser and file download with multi-backend support (#142 ) * feat: add file browser and file download with multi-backend support Adds a built-in File Browser page and a File Download system to Hermes Web UI, enabling users to browse, edit, preview, upload, and download files from the workspace directly from the web dashboard. File Browser (/hermes/files): - New view FilesView.vue plus components under components/hermes/files/ (FileTree, FileList, FileBreadcrumb, FileToolbar, FileContextMenu, FileEditor, FilePreview, FileRenameModal, FileUploadModal) - New Pinia store stores/hermes/files.ts for directory tree, selection, and editing state - New API module api/hermes/files.ts - New server routes routes/hermes/files.ts with CRUD, rename, upload, and directory listing - New service services/hermes/file-provider.ts with a pluggable provider architecture (local filesystem + multi-terminal backends) File Download: - New server route routes/hermes/download.ts and client API api/hermes/download.ts - Integration in chat messages (MessageItem.vue, MarkdownRenderer.vue) to surface downloadable file references Packaging: - package.json: add a prepare script so the package can be installed directly from a git URL with dist/ built automatically i18n: add files/download translations to en.ts and zh.ts. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * fix: use clipboard fallback for non-secure HTTP contexts navigator.clipboard is undefined on HTTP intranet deployments (only available in secure contexts). The previous synchronous calls threw silently and the success toast still fired, making 'copy' actions appear broken. - Add packages/client/src/utils/clipboard.ts with execCommand fallback via a hidden textarea - Use the helper in FileContextMenu (copy file path), CodexLoginModal (copy user code), NousLoginModal (copy user code), ChatPanel (copy session id) - Each call now awaits the result and shows success/failure based on the actual outcome Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-04-23 12:09:39 +08:00
ekko	1f91b902da	fix: handle special char search 500 & polish live badge (#144 ) * fix(search): handle numeric query FTS errors regardless of table existence Remove the `no such table: messages_fts` condition so numeric queries fall back to LIKE search on any FTS failure (malformed MATCH, missing table, etc.). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(search): handle special char queries, polish live badge UI - Add hasUnsafeChars() to catch FTS5-breaking queries (¥, @, #, etc.) and fall back to LIKE search, preventing 500 errors - Polish session live badge: smaller size, remove border/shadow, add pulsing dot indicator for a cleaner look - Remove spinner drop-shadow glow effect Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-23 11:18:56 +08:00
Zhicheng Han	5f40ae6258	feat(chat): add direct Live badge and harden Live monitor backend (#138 ) * feat(chat): add direct live badge to session rows * fix(live): use session DB for conversations monitor * docs: add chat vs live monitor direction plan * fix(search): avoid numeric session search 500 without FTS table	2026-04-23 10:49:00 +08:00
ekko	32dc084b66	fix: support both Codex and Nous auth structures in OAuth provider detection (#141 ) The isOAuthAuthorized check only looked for Codex's nested `providers.{key}.tokens.access_token` structure, missing Nous's flat `providers.nous.access_token`. Now checks both paths so all OAuth providers are correctly detected and displayed in the provider list. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-23 08:48:06 +08:00
ekko	df797d09b2	feat: add StepFun and Nous Portal provider support (#140 ) - Add StepFun provider (API key auth, STEPFUN_API_KEY) - Add Nous Portal provider with full OAuth device code flow (device code request → poll for token → mint agent key → save to auth.json) - Add NousLoginModal component for OAuth UI (user code display + verification link) - Update ProviderFormModal to handle Nous OAuth flow (hide API key fields) - Add nous-auth backend controller and routes - Update PROVIDER_ENV_MAP with stepfun and nous entries - Add i18n translations for Nous OAuth in all 8 locales Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-23 08:39:19 +08:00
ekko	70ddbd0bcd	feat: add username/password login, account settings, and changelog (#133 ) (#134 ) - Add username/password login as additional auth mechanism alongside existing token - First login must use token; password can be configured in Settings > Account - Password login returns the existing static token (no auth middleware changes) - Add account settings: setup, change password, change username, remove password - Add logout button to sidebar footer - Add version changelog popup (click version number in sidebar) - Support all 8 locales (en, zh, de, es, fr, ja, ko, pt) - Bump version to 0.4.3 Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-22 20:27:33 +08:00
ekko	6f69c69802	feat: add token usage tracking, context display, and dynamic context length (#132 ) * fix: specify TS_NODE_PROJECT for dev:server script ts-node/register resolves tsconfig from the entry file upward, finding the root solution-style tsconfig.json (no compilerOptions). This causes target to default to ES3, breaking MapIterator spread syntax (TS2802). Set TS_NODE_PROJECT env var to point to the server tsconfig which targets ES2024. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: add token usage tracking, context display, and dynamic context length - Intercept SSE proxy to capture run.completed events and persist token usage (input_tokens, output_tokens) per session to SQLite/JSON store - Display context usage bar in ChatInput showing used/total/remaining tokens - Resolve actual context length from Hermes models_dev_cache.json based on the active profile's default model (fallback 200K), with 5min in-memory cache - Move sessions-db.ts to db/hermes/ for unified database layer - Add usage store with SQLite + JSON fallback (auto-migration via ensureTable) - Fix proxy SSE path regex to match rewritten upstream path - Fix route ordering: /sessions/usage before /sessions/:id to avoid 404 - Fetch per-session usage on session enter instead of batch - Add unit tests for usage-store, db index, and proxy SSE interception Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-22 16:14:50 +08:00
cl1107	f27db3036a	feat: add session search modal (#128 )	2026-04-22 14:00:34 +08:00
Zhicheng Han	ffd825afe2	fix: keep self-update on the active install path (#123 )	2026-04-22 10:33:38 +08:00
Zhicheng Han	3f88553765	feat(web-ui): add pinned sessions and live monitor in Chat (#118 ) * feat: add single-page live session monitor and chat pinning * fix: restore full test green after main merge * fix: use Array.from instead of Set spread for ts-node compatibility [...new Set()] requires downlevelIteration which isn't enabled in ts-node dev mode, causing sonic-boom crash on startup. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: ekko <fqsy1416@gmail.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-22 08:09:58 +08:00
ekko	83ad9642e2	fix(models): fix builtin provider detection and model matching (#120 ) - Add glm-coding-plan to PROVIDER_ENV_MAP for proper env mapping - Rename GLMCodingPlan value from 'glm' to 'glm-coding-plan' (kebab-case) - Match custom providers against PROVIDER_PRESETS to reuse builtin models - Fix provider key matching in create/update (use entry.name consistently) - Clear stale base_url/api_key from config on provider create - Clear model config when all providers are removed - Add gateway restart on provider remove Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-22 00:11:39 +08:00
ekko	477af66232	fix: auth bypass, SPA serving, and provider improvements (#97 ) * feat(chat): polish syntax highlighting and tool payload rendering (#94) * [verified] feat(chat): polish syntax highlighting and tool payload rendering * [verified] fix(chat): tighten large tool payload rendering * docs: update data volume path in Docker docs Align documentation with docker-compose.yml change: hermes-web-ui-data -> hermes-web-ui, /app/dist/data -> /root/.hermes-web-ui Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * refactor: bundle server build and restructure service modules - Add build-server.mjs script for standalone server compilation - Add logger service with structured output - Restructure auth, gateway-manager, hermes-cli, hermes services - Update docker-compose volume mount path - Update tsconfig and entry point for bundled server Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * refactor: separate controllers from routes and centralize route registration - Extract business logic from route handlers into controllers/ - Add centralized route registry in routes/index.ts with public/auth/protected layers - Replace global auth whitelist with sequential middleware registration - Extract shared helpers to services/config-helpers.ts - Allow custom provider name to be user-editable in ProviderFormModal - Deduplicate custom providers by poolKey instead of base_url in getAvailable Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: auth bypass via path case, SPA serving, and provider improvements - Fix auth bypass: path case-insensitive check for /api, /v1, /upload - Fix SPA returning 401: skip auth for non-API paths (static files) - Fix profile switch: use local loading state instead of shared store ref - Auto-append /v1 to base_url when fetching models (frontend + backend) - Guard .env writing to built-in providers only - Add builtin field to provider presets, enable base_url input in form - Print auth token to console on startup (pino only writes to file) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Zhicheng Han <43314240+hanzckernel@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-21 12:35:48 +08:00

1 2

76 Commits