Call updateUsage() in syncFromHermes to record token usage data from Hermes
ephemeral session to local DB. This ensures accurate usage tracking including:
- input_tokens
- output_tokens
- cache_read_tokens
- cache_write_tokens
- reasoning_tokens
- model
The usage data comes from the Hermes session detail which contains
accurate token counts from the upstream LLM provider.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Replace timestamp-based filtering (m.timestamp !== now) with position-based filtering.
This is more reliable because:
1. No precision issues with second-level timestamps
2. Handles edge cases where multiple messages have the same timestamp
3. Works correctly even if there's a small time difference between now and DB record
New logic:
1. Filter valid messages first
2. Find the last user message from the end
3. Exclude it from history (it's the one we just added in handleRun)
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
When loading conversation_history from DB, exclude the message that was just
added (with timestamp === now) to avoid duplication in the upstream request.
Since user messages are now written immediately to DB on run start,
we need to filter them out when building history for the upstream call.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Changes:
- Move addMessage() call to handleRun start, before conversation_history loading
- Remove delayed addMessage() after history loading (no longer needed)
- Remove useLocalSessionStore() check - always write user message immediately
- Simplify syncFromHermes to always skip user messages
This ensures user messages are persisted immediately when a run starts,
improving reliability and user experience.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
When using Hermes state.db (not local store), user messages were never written
to local DB because:
1. handleRun only calls addMessage() when useLocalSessionStore() is true
2. syncFromHermes was filtering out all user messages
Fix: Conditionally sync user messages based on store mode:
- Local store mode: skip user messages (already written in handleRun)
- Hermes state.db mode: sync all messages including user messages
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Replace hardcoded UPSTREAM env var with dynamic lookup via gatewayManager.getUpstream(profile).
This ensures each profile connects to its own gateway instance with correct port and host.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Backend changes:
- Add cache_read_tokens, cache_write_tokens, reasoning_tokens, model fields
- Migrate from session_id PRIMARY KEY to separate id column with session_id index
- Update updateUsage() to accept data object instead of separate params
- Add migration logic to preserve existing data during schema upgrade
- Add UsageRecord interface for type safety
Frontend changes:
- Update UsageView to display new token types (cache, reasoning)
- Update usage store to handle new usage structure
- Update sessions API to fetch enhanced usage data
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
- Move `const now` outside session_id block for broader scope
- Defer addMessage() call until after conversation_history is loaded
- This prevents the user message from appearing twice in history
- Remove updateUsage call from calcAndUpdateUsage to avoid double counting
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
- Show rotating loading icon before session title when actively streaming
- Hide chat/live mode toggle buttons
- Fix isSessionLive to only return true during actual streaming
- Remove unused LIVE_BADGE_WINDOW_MS constant
- Fix resumeSession callback type to include inputTokens/outputTokens
- Remove unused fetchSessionUsageSingle import
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Live mode (ConversationMonitorPane) now reads from local session-store
when useLocalSessionStore() is enabled, instead of always hitting
Hermes state.db.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Make calcAndUpdateUsage the single entry point for all inputTokens/outputTokens
calculation, always loading from DB with snapshot awareness
- Remove overrideInputTokens parameter; compression path calls calcAndUpdateUsage
before and after compress, letting DB state be the source of truth
- Add inputTokens + outputTokens as totalTokens for compression threshold comparison
- Fix session search to match message content (not just title), return snippets
and matched_message_id via two-step query
- Fall back to preview for session title display when title is null
- Remove isStreaming guard from newChat() to allow creating sessions anytime
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Generate ephemeral session_id for each Hermes run, sync complete data
(including tool results) from Hermes state.db after run completion
- Resolve tool_name from assistant message's tool_calls JSON (Hermes
stores tool_name as NULL in its messages table)
- Fall back to preview as title in mapSessionRow when title is empty
- Set preview from first user message when creating local sessions
- Enqueue ephemeral sessions for deferred deletion via gc_pending_session_deletes
- Fix enqueueEphemeralDelete: use top-level import instead of require,
set next_attempt_at to now (was 0, preventing drain)
- Remove isStreaming guard from newChat() to allow creating sessions anytime
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Add session-store.ts: self-built SQLite CRUD for sessions/messages
- Add session-deleter.ts: timer-based singleton for deferred session deletion
- Add SESSION_STORE env var (local|remote) to toggle between local SQLite and Hermes CLI
- Update sessions controller to branch on useLocalSessionStore()
- Update chat-run-socket to persist messages to local DB on run completion
- Improve SSE event handling: tool_call_id capture, finish_reason tracking
- Update group-chat to use SessionDeleter instead of direct CLI delete
- Update context-compressor to enqueue compression sessions for deferred deletion
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Remove all evt.usage/parsed.usage references, only use local countTokens
- Remove pre-send inputTokens calculation that was overwriting resume value
with compressed context, causing incorrect context drop (70k → 40k)
- run.completed now recalculates inputTokens with current snapshot + full
messages including new ones from this run
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Add sessionMap to ChatRunSocket consolidating activeRuns + sessionStates,
tracking messages, isWorking status, events, and token usage per session
- Load messages from DB on resume when not in memory, return via resumed event
- Track streaming messages (user/assistant/tool/reasoning) into sessionMap
so reconnecting clients get full message history without HTTP fetch
- Calculate token usage locally with countTokens, snapshot-aware for compressed sessions
- Add usage.updated event broadcast on run.completed with recalculated tokens
- Replace HTTP fetchSession with Socket.IO resume for message loading
- Add serverWorking state to drive streaming indicator from server isWorking status
- Clear events immediately on run completion instead of delayed cleanup
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Replace HTTP POST + SSE streaming with Socket.IO /chat-run namespace
for decoupled message handling that survives client disconnect/refresh
- Add SQLite-backed context compression with snapshot-based incremental updates
- Unify server-side session state tracking (completedSessions + compressingSessions
→ sessionStates) for reliable state replay on reconnect
- Filter compress_ sessions from session list queries
- Add compression snapshot store with proper snake_case→camelCase column aliases
- Delete temporary compress_ sessions after compression completes
- Change compressed summary role from 'system' to 'user'
- Add compression.started/completed events to frontend chat store
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Build conversation_history before adding the new user message to the
session, so the message is sent only via `input` and not duplicated
in `conversation_history` as well.
Closes#257
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
- Hover over any message to reveal a copy icon button
- Click to copy the full message text to clipboard
- Shows success/error toast notification
- Skips tool messages (no copy button shown)
- i18n support for all 8 languages (EN/ZH/DE/ES/FR/JA/KO/PT)
- Dark mode compatible styling
Co-authored-by: 356252190-star <356252190-star@users.noreply.github.com>
Fixes#237
When web-ui sends a chat message, it includes conversation_history with only
user and assistant messages — tool calls and tool results are filtered out.
Meanwhile, it sends session_id in the request body but does NOT pass the
X-Hermes-Session-Id header.
Without the header, the gateway falls back to using the incomplete
conversation_history from the body, so the AI agent loses context about
previous tool calls, especially after idle periods.
Fix: Pass X-Hermes-Session-Id as a request header so the gateway loads
complete session history from the session DB (including tool messages).
Co-authored-by: 356252190-star <356252190-star@users.noreply.github.com>
- Remove isTransitioning overlay that caused white screen on session switch
- Simplify scroll logic: just scrollToBottom() on session change
- Remove changelog entry for removed transition feature
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
* chore: add v0.4.8 changelog and improve scroll behavior
- Add v0.4.8 changelog entries for recent fixes
- Fix forced scroll to bottom when returning from other tabs
- Smooth session switch with loading transition overlay
- Auto-scroll to bottom after mermaid diagram rendering
- Bump version to 0.4.8
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* fix: replace blob URLs with persistent download URLs and add image preview
- Replace blob URLs with /api/hermes/download URLs after upload so
attachments survive page refresh
- Add click-to-preview overlay for image attachments
- Move upload directory from /tmp to ~/.hermes-web-ui/upload
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* fix: replace findLast with reverse+find for ES2022 compat
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* chore: bump TypeScript lib target from ES2022 to ES2023
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* docs: add changelog entries for blob URL fix, image preview and upload dir
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Replace per-session SQL queries in listSessionSummaries/searchSessionSummaries
with a single bulk load via loadAllSessions() + in-memory map traversal,
eliminating N+1 round-trips. Fix search 500 error for pure numbers,
English letters, and other FTS5-incompatible input by extending the
catch fallback beyond CJK-only to all FTS query failures.
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
- Project compressed roots to their continuation tip in session lists.
- Search title/content candidates through logical compression lineage.
- Hydrate detail views along the requested continuation branch while preserving requested ids.
- Scope model-context cache lookup by provider to avoid same-name cross-provider matches.
- Add regression coverage for lineage and provider lookup behavior.
Session detail now prefers DB-backed reconstruction for compressed continuation chains, with CLI fallback preserved and pending-deletion guard covered by tests.
* fix: align group chat room sidebar background with session list
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* docs: add v0.4.7 changelog and remove v0.4.3/v0.4.1 entries
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* chore: remove unused video.mp4 asset
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
* fix: reset entire config.model on model switch
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* fix: resolve custom provider from CLI config and clean base_url_env on delete
- When config.model.provider is "custom" (set by hermes CLI), match
base_url + model against custom_providers to resolve custom:name
- Clear base_url_env from .env when deleting a builtin provider
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
* fix(chat): clamp context remaining tokens to 0 instead of showing negative
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* fix: group chat mention popup position, timestamp style, and model switch cleanup
- Move @ mention popup above input to avoid blocking the textarea
- Fix .msg-time scoping (was nested inside .msg-header, now top-level)
- Reduce timestamp opacity and set to 12px for subtler display
- Clean up stale base_url/api_key from config.yaml on model switch
Closes#204
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* fix: remove unused variables in GroupChatInput
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
#142 added a `prepare` script that triggers `npm run build` when
dist/ is missing. During Docker build, `npm install` runs before
source files and tsconfig.json are copied, causing build failure.
Use --ignore-scripts to defer build to the explicit step.
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
* fix: restore fetchAvailableModels to fix provider lost as custom (#174)
#174 replaced fetchAvailableModels with fetchConfigModels, but
config/models response lacks default_provider, label, base_url and
api_key fields. This caused selectedProvider to always be empty,
making all models appear as "custom" in the model selector.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* chore: bump version to 0.4.6
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
* fix: group chat UI background colors and replace console.log in context-engine
- Set message list background to $bg-card to match single chat
- Set status-bar background to transparent
- Replace all console.log/warn with logger in context-engine compressor
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* fix: group chat mobile UX improvements
- Add backdrop overlay for mobile sidebar with tap-to-close
- Auto-collapse sidebar on room select in mobile
- Move timestamp below message bubble
- Widen msg-body max-width to 85% to match single chat
- Add left padding to chat-header to avoid hamburger overlap
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>