hermes-webui

mirror of https://github.com/nesquena/hermes-webui.git synced 2026-05-27 04:00:37 +00:00

Author	SHA1	Message	Date
Frank Song	c7e52084ba	Harden messaging channel handoff	2026-05-03 16:35:50 +00:00
Frank Song	20ef643bb8	Add messaging session handoff summary	2026-05-03 16:35:22 +00:00
Hermes Bot	1148656370	Merge PR #1525 by @ai-ag2026: clear stale WebUI stream state proactively (refs #1471 ) Merge conflict resolution: kept HEAD's `CACHE_NAME = 'hermes-shell-__WEBUI_VERSION__'` (post-#1517 rename) over PR #1525's `'hermes-shell-__CACHE_VERSION__-stale-stream-cleanup1'` manual suffix. The renamed placeholder still auto-bumps with each release through the `quote(WEBUI_VERSION, safe="")` substitution, so the manual `-stale-stream-cleanup1` suffix is no longer needed to force-update existing service workers — the natural version bump (v0.50.278 → v0.50.279) already invalidates the old cache via `caches.delete(k)` for `k !== CACHE_NAME` in the SW activate handler. No behavioral regression: the SW cache still bumps on this release, just via the canonical version-token path. Co-authored-by: ai-ag2026 <ai-ag2026@users.noreply.github.com>	2026-05-03 16:06:42 +00:00
Manfred	6bce34c27e	fix: clear stale WebUI stream state Clear persisted active_stream_id and pending runtime fields when the server no longer has the referenced live stream. Also drop browser-side INFLIGHT state when the server reports a session idle and bump the service-worker cache so the frontend fix is delivered. Adds regression coverage for backend stale-stream cleanup, frontend inflight invalidation, and cache busting.	2026-05-03 11:46:42 +02:00
Frank Song	8f3dbe185d	fix: consolidate __CACHE_VERSION__ → __WEBUI_VERSION__ (#1509 ) __CACHE_VERSION__ (sw.js) and __WEBUI_VERSION__ (index.html) are functionally identical — both resolve to quote(WEBUI_VERSION, safe='') at request time. Two names exist for historical reasons (different files added at different times). Rename __CACHE_VERSION__ → __WEBUI_VERSION__ in: - static/sw.js (CACHE_NAME + VQ constant + comment) - api/routes.py (substitution string) - tests/test_pwa_manifest_sw.py (all assertions) Single canonical name. No behavior change — same ?v=vX.Y.Z query strings on the same URLs.	2026-05-03 14:59:37 +08:00
Hermes Bot	8f58688b66	test: lock /session/static MIME-type + auth fix; drop unused import - Add tests/test_session_static_assets.py (5 tests): * /session/static/style.css must return text/css (not text/html) * /session/static/ui.js must return application/javascript * /session/<id> still serves the HTML index (catch-all not weakened) * Path-traversal still sandboxed after prefix strip * /session/static/* matches /static/* auth-exemption policy - Drop unused 'from urllib.parse import urlparse as _up' import from PR #1505's added block (parsed._replace already gives a usable result). Co-authored-by: Rick Chew <rickchew@users.noreply.github.com>	2026-05-03 05:20:19 +00:00
Rick Chew	7cf2150b94	fix: serve static assets correctly under /session/* routes When the browser loads a session page at /session/<id>, it requests static assets relative to that path — e.g. /session/static/style.css. The /session/* catch-all in handle_get() intercepted those requests and returned the HTML index page (text/html), causing browsers to refuse the stylesheet with a MIME-type mismatch error. Two-part fix: - routes.py: add a guard before the /session/ catch-all that strips the /session prefix from /session/static/* paths and delegates to _serve_static(), so the correct Content-Type is returned. - auth.py: whitelist /session/static/* in check_auth() alongside /static/, so static assets on session pages are served without requiring an authenticated session (same policy as /static/). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-03 13:05:15 +08:00
Hermes Bot	8616033605	fix(onboarding,providers): probe LM Studio /models + align env var with agent CLI (#1499 #1500 ) Addresses both #1499 (onboarding wizard never probes the configured base URL) and #1500 (cross-tool env-var name divergence between webui and agent CLI). Surfaced together because they're both LM-Studio onboarding bugs that pile on top of each other — fixing only one leaves the broken UX. #1499 — Onboarding wizard probes <base_url>/models before persisting Pre-fix, `apply_onboarding_setup` accepted whatever `base_url` the user typed without ever fetching `<base_url>/models`. @chwps's log timeline in #1420 showed the wizard finishing in 239ms with zero outbound HTTP — onboarding silently persisted unreachable URLs and left users with empty model dropdowns they had to populate by hand-editing config.yaml. Backend: * New `probe_provider_endpoint(provider, base_url, api_key, timeout=5.0)` in `api/onboarding.py`. Stdlib-only (urllib + socket — no httpx dep). Returns `{ok, models}` on success; `{ok: False, error: <code>, detail}` on failure with stable error codes the frontend can switch on: invalid_url, dns, connect_refused, timeout, http_4xx, http_5xx, parse, unreachable. 256 KB response cap and 5s timeout keep a hostile or mis- pointed endpoint from blocking the wizard. * New `POST /api/onboarding/probe` route — thin JSON wrapper around the function above. Same local-network gate as `/api/onboarding/setup` because the body carries an `api_key` the user typed. * The probe response is NEVER persisted. Only the user's typed selection ends up in config.yaml; the probed model list just populates the wizard's dropdown. * SSRF: deliberately does NOT block private-IP ranges. The wizard is gated behind WebUI auth and the legitimate target IS a local LM Studio / Ollama / vLLM server. A "block private IPs" SSRF defense would make the feature useless for its primary use case. Frontend: * `static/onboarding.js`: - New `ONBOARDING.probe` state ({status, error, detail, models, probedKey}). - `_runOnboardingProbe()` — POSTs to /api/onboarding/probe, idempotent & cached on (provider, baseUrl, apiKey). - Debounced (400ms) on `oninput` of the base URL field. - Explicit "Test connection" button. - `nextOnboardingStep` blocks Continue at the setup step for any provider with `requires_base_url=True` until the probe succeeds. Same localized error renders inline. * `static/i18n.js`: 13 new keys × 9 locales (canonical English in `en`, English fallback with `// TODO: translate` markers in the other 8 — same convention as v0.50.271 #1488 voice-buttons). * `static/style.css`: probe banner + Test button styling (red-tinted error variant, green-tinted success variant, neutral probing state). Verified via manual repro on port 8789: * connect_refused → red banner, helpful "from Docker, try the host IP" hint, blocks Continue. * DNS failure → red banner, "could not resolve host '...'", blocks Continue. * Success against a mock /v1/models server → green banner, model dropdown populates from the probed list, Continue advances normally. #1500 — webui env var aligned with agent CLI (LM_API_KEY) The webui has long used `LMSTUDIO_API_KEY` for LM Studio's API key in both onboarding and Settings detection. The agent CLI runtime (hermes_cli/auth.py:177-183) reads `LM_API_KEY`. So a user who configured auth on their LM Studio instance got Settings → Providers reporting has_key=True (because webui saw its own LMSTUDIO_API_KEY) but the agent runtime ignored the key and fell back to LMSTUDIO_NOAUTH_PLACEHOLDER → 401 against the auth-enabled LM Studio server. Masked in practice for the no-auth majority. Picked Option B from the issue (defer to the agent — single source of truth) but mitigated the migration cliff by reading the legacy name as a fallback: * `api/onboarding.py:_SUPPORTED_PROVIDER_SETUPS["lmstudio"]`: - `env_var: "LM_API_KEY"` (canonical, what onboarding writes going forward). - `env_var_aliases: ["LMSTUDIO_API_KEY"]` (read-only fallback for pre-#1500 users so detection keeps working without forcing an .env rewrite). * `api/onboarding.py:_provider_api_key_present` reads aliases too. * `api/providers.py:_PROVIDER_ENV_VAR["lmstudio"] = "LM_API_KEY"`. * `api/providers.py:_PROVIDER_ENV_VAR_ALIASES["lmstudio"] = ("LMSTUDIO_API_KEY",)` — new dict, used by `_provider_has_key` and `get_providers`'s key_source resolution. Drops in cleanly when other providers later rename their env vars too. Verified: ``` before fix: webui writes LMSTUDIO_API_KEY → agent ignores it → 401 on chat after fix: webui writes LM_API_KEY → agent picks it up → chat works pre-#1500 .env with LMSTUDIO_API_KEY → still has_key=True in Settings → key_source='env_file' ``` Tests * `tests/test_issue1499_onboarding_probe.py` — 17 tests: 3 invalid_url variants, dns, connect_refused, success (OpenAI shape), success (bare-list shape), http_4xx, http_5xx, parse non-JSON, parse wrong-shape, api_key authorization header passthrough, "probe must not write to config.yaml or .env", PROBE_ERROR_CODES contract pin, 3 end-to-end route-level smoke tests against the live server fixture. * `tests/test_issue1500_lmstudio_env_var_alignment.py` — 5 tests: onboarding declares LM_API_KEY canonical with LMSTUDIO_API_KEY alias, onboarding writes ONLY the canonical name, legacy env var still detected post-migration, canonical takes precedence when both are set, _provider_api_key_present reads aliases. * `tests/test_issue1420_lmstudio_provider_env_var.py` — updated: the original 5-test #1420 suite now pins LM_API_KEY as canonical and LMSTUDIO_API_KEY as alias. Full suite: 3879 → 3901 passing (+22), 0 failures. Out of scope (explicitly NOT addressed here) The third LM Studio onboarding sub-bug from #1420's thread — that `apply_onboarding_setup` requires a non-empty api_key for lmstudio even though most LM Studio installs run keyless — remains. The agent's `LMSTUDIO_NOAUTH_PLACEHOLDER` substitution kicks in at runtime, but the onboarding wizard rejects the empty-key case at submit. Fixing this requires a UX decision (auto-write a sentinel? loosen the required-key check for self-hosted providers?) and is left as a separate follow-up. Closes #1499 Closes #1500 Co-authored-by: chwps <106549456+chwps@users.noreply.github.com> Co-authored-by: AdoneyGalvan <25235323+AdoneyGalvan@users.noreply.github.com>	2026-05-03 02:46:24 +00:00
Hermes Bot	bcfd8b2eac	chore(release): stamp v0.50.268 — 4-PR batch + Opus follow-ups (i18n + per-session fields + None title guard) - CHANGELOG.md: v0.50.268 entry detailing #1395 #1450 #1462 #1476 + Opus SHOULD-FIX followups - ROADMAP.md: bump to v0.50.268, 3800 tests collected - TESTING.md: bump header + total to 3800 SF-1 i18n fix: - static/i18n.js: session_meta_children key in all 10 locale blocks (en, ja, ru, es, de, zh, zh-Hant x2, pt, ko) - static/sessions.js: 2 callsites use t(session_meta_children, childCount) SF-2 #1462 per-session field carry-over: - api/routes.py: duplicate now carries personality, enabled_toolsets, context_length, threshold_tokens SF-3 #1462 None-title guard: - api/routes.py: (session.title or "Untitled") + " (copy)" Tests: - tests/test_stage268_opus_followups.py: 6 regression tests pinning SF-1 + SF-2 + SF-3 - tests/test_session_duplicate.py: 2 brittle assertions widened to accept new forms Follow-up issue filed: #1481 (PWA /sw.js whitelist vestige, Opus SF-4)	2026-05-02 17:54:58 +00:00
Hermes Bot	6303a30a87	Address review feedback: deepcopy independence, persist on duplicate, reset pinned/archived, 404 status Five fixes from the May 2 2026 maintainer review: 1. messages and tool_calls now use copy.deepcopy() — prior plain assignment shared list refs between source and duplicate, so appending a turn to one mutated the other. 2. copied_session.save() called explicitly — pre-fix, the duplicate was in-memory only until the user sent a turn. Refreshing mid-flow lost it. 3. pinned and archived reset to False — duplicating an archived conversation should produce a visible (un-archived) copy. 4. Missing-session error is now status=404 (was default 400). 5. Removed redundant `import uuid` / `import time` inside the handler — both are already at the top of routes.py. Test updates: - Two existing static-grep tests widened to accept the new `copy.deepcopy(session.messages)` form alongside the original `messages=session.messages`. - Five new static-grep regression tests pin each of the five fixes so reverting any single one trips a test. All 3775 tests pass. Co-authored-by: Alexey Dsov <AlexeyDsov@users.noreply.github.com>	2026-05-02 17:39:55 +00:00
AlexeyDsov	7c4c0142d5	feat(api): add /api/session/duplicate endpoint for session cloning\nNew endpoint creates independent session copies with all messages, model and workspace intact. Added 10 comprehensive regression tests for error handling and logic verification.	2026-05-02 11:59:45 +03:00
nesquena-hermes	c73f2ff387	v0.50.264 polish followups: i18n parity + assistant-output readability Closes #1442 (server-side _LOGIN_LOCALE missing ja/pt/ko) Closes #1443 (promote _isImeEnter helper to 6 other Safari Enter guards) Closes #1446 (glued-bold-heading lift for LLM thinking-block output) Closes #1447 (markdown heading visual hierarchy in chat messages) All four issues were filed by the Opus pre-release advisor on the v0.50.264 batch or by Cygnus via Discord (relayed by @AvidFuturist, May 1 2026). They share a common shape — narrow, well-scoped, independent of each other, all adding regression tests. == #1442: _LOGIN_LOCALE parity (api/routes.py + static/i18n.js) == Added entries for ja/pt/ko to the server-side _LOGIN_LOCALE dict that renders the localized login page BEFORE the JS i18n bundle loads. With v0.50.264 shipping Japanese as the 8th built-in locale, ja/pt/ko users were seeing the English login page even with their language preference set. While auditing static/i18n.js for English leakage, also fixed: - ko: 10 user-facing login/sign-out/password keys still in English - es: 3 sign-out/auth-disabled keys still in English Tests: tests/test_login_locale_parity.py (20 tests) — pins both invariants: (a) every locale in i18n.js LOCALES has a matching _LOGIN_LOCALE entry (b) every locale's login-flow keys (13 of them) are translated, not English == #1443: window._isImeEnter promotion == PR #1441 fixed the Safari IME-composition Enter race in the chat composer (`#msg`) by widening the guard from `e.isComposing` to a `_isImeEnter(e)` helper that combines three signals (isComposing \|\| keyCode===229 \|\| _imeComposing flag). Six other Enter-input handlers were left on the original narrow guard and would still drop IME composition Enters on Safari for Japanese/Chinese/Korean users. Promoted the helper to `window._isImeEnter` (defined in static/boot.js) and replaced the `e.isComposing` guards at all six sites: - static/sessions.js: session rename, project create, project rename - static/ui.js: app dialog (confirm/prompt), message edit, workspace rename The state-free part of the helper (`isComposing \|\| keyCode===229`) handles Safari's race for any focused input without needing per-input composition listeners — only `#msg` keeps the local `_imeComposing` flag. Tests: - tests/test_issue1443_ime_helper_promotion.py (9 tests) — pins each site + verifies no raw `e.isComposing` Enter-guards remain in sessions.js/ui.js - tests/test_ime_composition.py — alternation regex extended to accept the windowed helper form (loosen-test-on-shape-change pattern from v0.50.264 reflection notes) == #1446: glued-bold-heading lift (static/ui.js renderMd + Python mirror) == LLMs in thinking/reasoning mode emit "section headers" glued to the end of the previous paragraph with no whitespace: Para 1 text.Heading to Para 2 Para 2 text.Heading to Para 3 The renderer correctly produces inline `<strong>` per CommonMark, but it looks like trailing emphasis on the body text rather than a section break. Cygnus reported this as "Markdown feedback 2 of 3." Added a single regex pre-pass in renderMd(): s.replace(/([.!?])\\([^\n]{1,80})\\\n\n/g, '$1\n\n$2\n\n') Constraints chosen to avoid false positives: - Trigger only on `[.!?]` IMMEDIATELY before `` (no space) — almost always an LLM-glued heading, not intentional emphasis - Inner text ≤80 chars, no `` or newline (single-line only) - Trailing `\n\n` required — preserves "this is important to know." mid-paragraph emphasis untouched - Position: after rawPreStash restore, before fence_stash restore — fenced code blocks stay protected (their content is `\x00P` / `\x00F` tokens when the lift runs) Mirrored in tests/test_sprint16.py render_md() so both stay in sync. Tests: tests/test_issue1446_glued_heading_lift.py (17 tests, 5 of which drive the actual ui.js renderMd via node) — covers all 3 trigger forms (.!?), all 4 preserve-emphasis cases the issue spec'd, fenced/inline code protection, chained glued headings, source-level position pin, regex shape pin. == #1447: markdown heading visual hierarchy (static/style.css) == Pre-fix sizes in `.msg-body`: h1 18px, h2 16px, h3 14px (= body), h4 13px, h5 12px, h6 11px So h3 was indistinguishable from body and h4/h5/h6 were SMALLER than body. Cygnus's report: "Markdown feedback 3 of 3 — Headings seem to be missing across the board in Hermes. They're there, but all plaintext." New sizes: h1 24px (border-bottom) h2 20px (border-bottom) h3 17px h4 15px h5 14px (uppercase, tracked) h6 13px (uppercase, tracked, muted) All headings now `font-weight:700` + `color:var(--strong)` for stronger ink. h5/h6 use uppercase + letter-spacing for "label-style" affordance instead of being smaller-than-body. Synced .preview-md (file preview pane) to match exactly so a markdown file preview and a chat message render identically. Added missing h4/h5/h6 rules to .preview-md (it only had h1-h3 before). Updated data-font-size="small"/"large" h1-h6 overrides to scale proportionally with the new defaults. Hierarchy preserved at all three font-size settings. Tests: tests/test_issue1447_heading_hierarchy.py (9 tests) — pins the size hierarchy, the bottom borders on h1/h2, the uppercase affordance on h5/h6, the .preview-md sync, and the small/large override scaling. == Verification == pytest tests/ -q → 3748 passed (+56 new) bash ~/WebUI/scripts/run-browser-tests.sh → 20 + 11 PASS bash ~/WebUI/scripts/webui_qa_agent.sh 8789 → 23/23 PASS Visual confirmation in browser at port 8789: - Heading hierarchy clearly visible at all 6 levels - Glued-bold lift produces separate paragraphs as designed - window._isImeEnter accessible from any module after boot.js - Login page renders ja/pt/ko strings correctly (curl -s /login)	2026-05-02 04:19:28 +00:00
Ryan Jones	9de61a0b9a	feat: add opt-in webui extension hooks	2026-05-02 03:36:54 +00:00
nesquena-hermes	081e600b33	fix: context-window indicator broken on older sessions (#1436 ) Fix two-layer bug where `/api/session` returned `context_length=0` for sessions that pre-date #1318, then the frontend silently fell back to cumulative `input_tokens` and the 128K JS default, producing nonsense indicators like "100" capped from "890% used (context exceeded), 1.2M / 131.1k tokens used". Empirical impact: 23 of 75 sessions on dev server rendered >100% before this fix. #1356 fixed the same symptom on the live SSE path but missed the GET /api/session load path that older sessions go through. Two-layer fix: 1. Backend (api/routes.py:1295-1313) — resolve context_length via agent.model_metadata.get_model_context_length() when the persisted value is 0. Mirrors api/streaming.py:2333-2342. 2. Frontend (static/ui.js:1269) — drop the cumulative `input_tokens` fallback. When last_prompt_tokens is missing, render "·" + "tokens used" (existing !hasPromptTok branch) instead of computing a percentage from the cumulative total. 10 regression tests in tests/test_issue1436_context_indicator_load_path.py covering both layers + the empty-model edge case (avoids the 256K default-for-unknown-model trap that get_model_context_length('') returns). Verified live: claude-opus-4-7 session with input_tokens=5,226,479 now renders "·" + "5.3M tokens used" instead of "100" + "3987% used". Reported by @AvidFuturist. Closes #1436.	2026-05-02 01:43:00 +00:00
nesquena-hermes	f8007d43f3	v0.50.257: 4 Opus pre-release follow-ups + CHANGELOG + test fixes for #1415 stage-257 batch (PRs #1402 + #1415): Opus pre-release advisor caught 4 issues in stage-257: 1. MUST-FIX (security): api/oauth.py::_write_auth_json — tmp.replace() preserves the temp file umask (0644 default), so OAuth access/refresh tokens landed world-readable on shared systems. Fix: tmp.chmod(0o600) BEFORE rename, with try/except OSError that warns but does not abort. 2. SHOULD-FIX: _handle_cron_history and _handle_cron_run_detail accepted job_id as a path component without validation. Mirrors the rollback path-traversal vector caught in v0.50.255 (#1405). Path() / .. does NOT normalize. New regex ^[A-Za-z0-9_-][A-Za-z0-9_.-]{0,63}$ with explicit . / .. rejection. 3. SHOULD-FIX: _handle_cron_history int(offset)/int(limit) raised ValueError on malformed input → confusing 500. Now try/except + clamp to (max(0, offset), max(1, min(500, limit))). 4. NIT: same regex applied to _handle_cron_run_detail (defense-in-depth even though path-resolve check would catch it downstream). PR #1415 follow-up: 8 pre-existing tests in test_issue1106 and test_custom_provider_display_name asserted bare model IDs but #1415 changes named-custom-provider IDs to @custom:NAME:model form when active provider differs. Tests updated to use _strip_at_prefix helper to keep checking the same invariant in the new shape. 4 regression tests in test_v050257_opus_followups.py + 8 fixed pre-existing tests. Full suite: 3602 passed, 0 failed.	2026-05-01 18:30:41 +00:00
nesquena-hermes	bc17229a7d	Merge PR #1402 from bergeouss: P2 improvements — cron history, toolsets per session, Codex OAuth # Conflicts: # static/i18n.js	2026-05-01 18:20:05 +00:00
nesquena-hermes	6ad7a4cc83	Merge PR #1405 from bergeouss: P3 features (insights, rollback, voice mode, subagent tree, redact toggle)	2026-05-01 16:58:49 +00:00
nesquena-hermes	6f55b973e5	Merge PR #1390 from starship-s: preserve session provider context	2026-05-01 16:58:48 +00:00
bergeouss	ae40af03d7	feat: P3 improvements — insights panel, rollback UI, voice mode, subagent tree, api redact toggle - #464 Insights panel: usage analytics dashboard with session/message/token stats, model breakdown, activity by day/hour charts, token breakdown (GET /api/insights) - #466 Rollback UI: checkpoint list, diff viewer, restore confirmation (api/rollback.py, GET /api/rollback/{list,diff}, POST /api/rollback/restore) - #1333 Voice mode: turn-based STT→send→TTS loop using Web Speech API, progressive enhancement with pulsing indicator and auto-resume - #494 Subagent session tree: parent→children grouping in sidebar with expand/collapse chevrons, child count badges, localStorage persistence - #1396 API redact toggle: Settings checkbox to disable forced redaction for self-hosted users (lazy check at call-time, default ON) - #1385 Closed: compact tool activity toggle already exists in Settings - #497 Commented: proposed shared-file bridge for cross-process gateway approvals - i18n: tab_insights added to all 8 locales, voice/checkpoint keys to EN+RU	2026-05-01 13:43:10 +00:00
bergeouss	f4bfd9dca7	fix: address PR #1402 review feedback — cron sort, path traversal, OAuth robustness - Cron history: sort by mtime instead of lexicographic filename (more robust) - Path traversal: use resolve() + is_relative_to() instead of brittle string checks - _cron_output_snippet: document the contract for response heading extraction - _read_auth_json: catch JSONDecodeError specifically, log warning instead of silent swallow - OAuth timestamps: use ISO strings consistently (created_at, updated_at) - Credential id: use uuid4 instead of time-based truncated int (collision-safe)	2026-05-01 13:38:14 +00:00
bergeouss	8ae198e88c	feat: P2 improvements — cron history, toolsets per session, Codex OAuth - #468: Cron run history — GET /api/crons/history (metadata listing) + GET /api/crons/run (full output), lazy-load on click in Tasks panel - #493: Per-session toolset override — Session.enabled_toolsets field, POST /api/session/toolsets endpoint, streaming handler override, composer chip UI with dropdown (matches reasoning chip pattern) - #1362: In-app Codex OAuth — device-code flow (stdlib only, no httpx), SSE polling endpoint, onboarding wizard login button - #1240: Design proposal comment for provider/model source-of-truth	2026-05-01 12:42:21 +00:00
Dennis Soong	0ec4aad949	fix: anchor active sessions per browser tab	2026-05-01 19:52:05 +08:00
Hermes Agent	67193faf38	Apply Opus pre-release follow-ups for v0.50.253 Three small fixes from Opus review of the merged stage diff: 1. Strip 9 orphan wiki_* i18n keys (72 lines) from PR #1342 — leaked from a different branch, zero references outside i18n.js. 2. /branch endpoint: reject non-string session_id with explicit 400 (was raising TypeError → generic 500 from get_session()). 3. /branch endpoint: reject negative keep_count with explicit 400 (Python slice semantics on negative produces 'all but last N', confusing fork behavior). Plus tests/test_v050253_opus_followups.py — 3 regression tests pinning all three fixes. Verified: 3558 pytest passing.	2026-05-01 06:53:32 +00:00
starship-s	1bfc4a992a	Merge branch 'nesquena:master' into fix/provider-qualified-session-models	2026-05-01 00:35:43 -06:00
Hermes Agent	52bfceaa3b	Add /branch command to fork conversations from any message (#1342 , fixes #465 ) Fix: gate parent_session_id emission in compact() on truthiness so sessions without a fork link don't leak parent_session_id: None and break the v0.50.251 lineage end_reason gating in agent_sessions.py. The /branch endpoint sets the field on saved forks; everything else keeps the v0.50.251 sidebar lineage path as the canonical source.	2026-05-01 05:32:45 +00:00
starship-s	bdc328d034	fix: preserve webui model provider context Persist session model_provider separately from model IDs so active/default provider selections like gpt-5.5 remain bare while routing through OpenAI Codex. Keep @provider:model for picker disambiguation and runtime bridging, and preserve explicit OpenRouter plus custom/proxy base_url routing.	2026-04-30 23:23:47 -06:00
Hermes Agent	fc8898161e	Apply Opus pre-release follow-ups (force redaction, log profile fallback)	2026-05-01 05:07:09 +00:00
Hermes Agent	a6d831fc63	Cache /api/models/live with 60s TTL (#1378 )	2026-05-01 04:46:15 +00:00
Hermes Agent	d1e1c4eeec	Fix CLI session import fallback model default (#1386 )	2026-05-01 04:46:10 +00:00
NocGeek	89dcab8327	fix: persist manual cron run results (#1372 ) Manual WebUI cron runs previously called cron.scheduler.run_job(job) and then only cleared the in-memory running flag. That meant output could be dropped and job metadata like last_run_at / last_status was not updated after a manual run. This PR matches the scheduled cron path (cron/scheduler.py:1334-1364) exactly: - Save manual-run output via save_job_output - Mark manual runs complete via mark_job_run - Treat empty final_response as a soft failure with the same error string as the scheduled path - Record manual-run failures in job metadata via mark_job_run(False) - Keep _run_cron_tracked self-contained for worker-thread execution Includes 2 behavioral regression tests using monkeypatch.setitem on sys.modules to mock cron.scheduler.run_job + cron.jobs helpers — the right test pattern (exercises the real _run_cron_tracked code path). Split out from #1352 (the larger profile-aware-cron-panel PR that's on hold) per pre-release-review feedback. Self-contained, doesn't touch the held PR's profile-filtering scope. Co-authored-by: NocGeek <NocGeek@users.noreply.github.com>	2026-04-30 23:15:31 +00:00
Nathan Esquenazi	604b44a254	fix(clarify-sse): inline snapshot under _lock to avoid deadlock in handler The new _handle_clarify_sse_stream handler in #1355 holds clarify._lock and then calls clarify.get_pending(sid) under the lock. get_pending also acquires _lock internally — and clarify._lock is a non-reentrant threading.Lock(), so the second acquisition deadlocks the SSE handler thread the moment any client connects to /api/clarify/stream. Existing tests pass because they only exercise sse_subscribe, sse_unsubscribe, _clarify_sse_notify, and submit_pending directly — none of them invoke the route handler. The deadlock would only manifest when a real EventSource opens the connection. Reproduced with a tiny harness that holds _lock and calls get_pending: the worker thread is still blocked after a 2s timeout. With the fix, both empty and populated queue cases complete in <1ms. Fix: read clarify._gateway_queues / clarify._pending inline under the same _lock acquisition, mirroring the approval SSE handler's pattern at api/routes.py:2785-2793. No recursive lock; head-of-queue snapshot is identical to what get_pending would have returned. Added tests/test_pr1355_sse_handler_no_deadlock.py with three tests: - behavioural: empty queue snapshot completes within 2s - behavioural: populated queue snapshot returns the head entry - source-level invariant: routes.py must not call get_clarify_pending() inside `with _clarify_lock:` block (locks the regression in) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-30 14:39:37 -07:00
fxd-jason	d2d464aac3	feat(clarify): SSE long-connection for real-time clarify notifications (#1355 ) Replaces the 1.5s HTTP polling loop for clarify with a Server-Sent Events endpoint at /api/clarify/stream that pushes clarify events to the browser instantly. Mirrors the approval SSE pattern from v0.50.248 (#1350) including all the correctness lessons: - Atomic subscribe + initial snapshot under clarify._lock - _clarify_sse_notify called inside _lock for ordering guarantees (no notify-out-of-order race) - Notify passes head=q[0].data (head-fidelity, not the just-appended entry) - resolve_clarify also calls notify after pop so trailing clarifies surface immediately (no stuck-clarify bug) - Empty-state notify with None,0 after pop-empty so frontend hides the card - 30s keepalive comments, _CLIENT_DISCONNECT_ERRORS handling - Bounded queue (maxsize=16) with silent drop on full - Frontend: EventSource with automatic 3s HTTP polling fallback on onerror Co-authored-by: fxd-jason <wujiachen7@gmail.com>	2026-04-30 21:32:51 +00:00
nesquena-hermes	e68f74ac99	fix(approval): close SSE notify-ordering, head-fidelity, and trailing-approval gaps (Opus MUST-FIX A/C/D) Pre-release Opus review caught three correctness bugs in the original PR #1350 SSE wiring beyond the snapshot/subscribe race: A) Notify-ordering race (MUST-FIX A): _approval_sse_notify took _lock only for the subscriber-list snapshot, then released it before put_nowait. With two parallel submit_pending calls, T2's notify could fire before T1's, leaving the UI showing pending_count=1 while the server actually had 2 queued. C) Trailing approval lost (MUST-FIX C): _handle_approval_respond never called _approval_sse_notify after popping. With parallel tool-call approvals (#527), a second approval queued behind the one being responded to was invisible until the next event ever fired — in practice, the agent thread parked on it would appear hung. D) Payload showed tail not head (MUST-FIX D): payload built from the just-appended entry instead of queue[0]. /api/approval/pending returns the head; SSE returned the tail. Diverging contracts. Fix: - Split into _approval_sse_notify_locked (caller holds _lock, no internal locking) and _approval_sse_notify (convenience wrapper). - submit_pending: call _locked variant inside the queue-mutation lock, passing queue_list[0] as head. - _handle_approval_respond: call _locked variant inside the pop lock, passing the new head (or None/0 if queue is empty). - Restore fallback poll to 1500ms (was bumped to 3000ms; degraded-mode parity with v0.50.247 is more important than save 1.5s of polling). New regression tests in tests/test_pr1350_sse_notify_correctness.py: - test_second_submit_pending_sends_head_not_tail (D) - test_respond_to_first_pushes_second_as_new_head (C) - test_respond_to_only_pending_pushes_empty_state (C edge) - test_pending_count_is_monotonic_under_contention (A) Updated test_approval_sse.py to pin the new contract: - _approval_sse_notify_locked(session_key, head, total) - 1500ms fallback interval Total: 3411 tests passing. Co-authored-by: jasonjcwu <jasonjcwu@users.noreply.github.com>	2026-04-30 18:45:15 +00:00
nesquena-hermes	d6b9cfac23	release: v0.50.248 Bundles: - #1349 fix(ui): show context indicator percentage without explicit context_length - #1350 feat(approval): SSE long-connection for real-time approval notifications Pre-release fixes applied: - Inline subscribe + snapshot under a single _lock acquisition in _handle_approval_sse_stream() to close the snapshot/subscribe race flagged in pre-release review. A submit_pending() arriving between the snapshot read and subscribe call would have been lost (appended to _pending after our snapshot AND notified to subscribers before we joined). Now atomic. - Added tests/test_pr1350_sse_atomic_subscribe.py (4 source-level invariants covering the atomic-lock-block guarantee). Co-authored-by: jasonjcwu <jasonjcwu@users.noreply.github.com>	2026-04-30 18:34:37 +00:00
fxd-jason	932694aec6	feat(approval): SSE long-connection for real-time approval notifications (#1350 ) Replaces the 1.5s HTTP polling loop with a Server-Sent Events endpoint at /api/approval/stream that pushes approval events to the browser instantly. The backend uses a thread-safe subscriber registry (_approval_sse_subscribers) with bounded queues to prevent memory leaks from slow clients. Frontend uses EventSource with automatic fallback to 3s HTTP polling on SSE error. - Backend: subscribe/unsubscribe/notify lifecycle in api/routes.py - New route: GET /api/approval/stream?session_id= - submit_pending() now calls _approval_sse_notify() after queue append - Frontend: EventSource with onerror -> _startApprovalFallbackPoll() - 30s keepalive comments, _CLIENT_DISCONNECT_ERRORS handling - 42 new tests (static analysis + unit + concurrency) Co-authored-by: jasonjcwu <jasonjcwu@users.noreply.github.com>	2026-04-30 18:31:42 +00:00
nesquena-hermes	eb678d5b54	feat(cron): auto-assign cron job sessions to dedicated 'Cron Jobs' project (#1079 ) From PR #1345. Co-authored-by: bergeouss <bergeouss@users.noreply.github.com>	2026-04-30 17:13:59 +00:00
nesquena-hermes	fbe84d26e6	fix(ui+pwa): avoid stale Mermaid render errors and bust cached static asset URLs on every release From PR #1337. Co-authored-by: Dennis Soong <dso2ng@gmail.com>	2026-04-30 16:18:01 +00:00
nesquena-hermes	e2d33ffce4	fix(models): persist context_length/threshold_tokens/last_prompt_tokens in Session model (#1318 split) From PR #1341. Co-authored-by: fxd-jason <wujiachen7@gmail.com>	2026-04-30 16:17:59 +00:00
nesquena-hermes	d0f6ee2ef9	fix(cron): import run_job inside _run_cron_tracked to fix NameError (#1310 ) From PR #1317. Co-authored-by: fxd-jason <wujiachen7@gmail.com>	2026-04-30 15:24:30 +00:00
nesquena-hermes	ded9b7e1c4	release: v0.50.243 (#1302 ) release: v0.50.243 Batch release of 2 PRs. - #1301 — fix: remove PRIMARY chip badge + add Claude Opus 4.7 label Drops the chip-projected configured-model badge added in #1287 (chip width 235px → 164px). Adds Claude Opus 4.7 label entries so the picker no longer renders "Claude Opus 4 7" (missing dot). Independently reviewed and approved by nesquena (commit `c0bbd23`). - #1297 (@franksong2702) — fix: preserve cron output response snippets Fixes #1295. /api/crons/output now preserves the ## Response section when a large skill dump appears in the prompt section; falls back to file tail when no marker exists. Tests: 3254 passed, 2 skipped, 3 xpassed. Independently reviewed and approved by nesquena (commit `b262e4d`).	2026-04-29 21:06:30 -07:00
nesquena-hermes	0ad95cb16a	release: v0.50.241 (#1293 ) release: v0.50.241 Batch release of 4 PRs: - #1290 (@nickgiulioni1) — Inline audio/video media editor with playback speed controls and HTTP byte-range streaming. PDF/media previews in workspace file browser. Composer tray inline players for audio/video. (Rebased from #1232.) - #1287 (@renatomott) — Configured model badges (Primary / Fallback N) in the model picker, carried through to the composer chip. Persists through on-disk model cache. - #1289 (@franksong2702) — Appearance autosave for theme/skin/font-size in Settings; inline Saving / Saved / Failed status. Font size now persists to config.yaml. Refs #1003. - #1294 (@franksong2702) — Normalize agent session source metadata (raw_source / session_source / source_label) through /api/sessions and gateway watcher SSE snapshots. Existing source_tag / is_cli_session fields preserved. Refs #1013. Tests: 3254 passed, 2 skipped, 3 xpassed (was 3199 before this release). Independently reviewed and approved by nesquena (commit `d1738f6`).	2026-04-29 19:54:07 -07:00
nesquena-hermes	33a145a669	release: v0.50.240 ## Release v0.50.240 Batch release of 13 PRs that passed full triage + code review + test suite (3199 tests, 0 failures). --- ### Added - Compact tool activity mode (`simplified_tool_calling`, default on) — groups tool calls and thinking traces into a single collapsed "Activity" disclosure card per assistant turn. Also adds a new Calm Console theme with earth/slate palette and serif prose. @Michaelyklam — #1282 - PDF first-page preview — `MEDIA:` `.pdf` files render a canvas thumbnail via PDF.js CDN (4 MB cap). HTML sandbox iframe — `.html`/`.htm` files render inline in a sandboxed `<iframe srcdoc>` (256 KB cap). 10 i18n keys × 7 locales. @bergeouss — #1280, closes #480 #482 - Inline Excalidraw diagram preview — `.excalidraw` files render as pure SVG (no external deps; rectangles, ellipses, diamonds, text, lines, arrows, freehand; 512 KB cap). @bergeouss — #1279, closes #479 - Inline CSV table rendering — fenced `csv` blocks and `MEDIA:` CSV files render as scrollable HTML tables with auto-separator detection. @bergeouss — #1277, closes #485 - Inline SVG, audio, and video rendering — SVG as `<img>`, audio as `<audio controls>`, video as `<video controls>`. @bergeouss — #1276, closes #481 - Batch session select mode — multi-select sessions for bulk Archive/Delete/Move. 11 i18n keys × 7 locales. @bergeouss — #1275, closes #568 - Collapsible skill category headers — click to collapse/expand without re-render; state persists across filter cycles. @bergeouss — #1281 - `providers.only_configured` setting — opt-in flag to restrict the model picker to explicitly configured providers. @KingBoyAndGirl — #1268 - OpenCode Go model catalog — adds Kimi K2.6, DeepSeek V4 Pro/Flash, MiMo V2.5/Pro, Qwen3.6/3.5 Plus. @nesquena-hermes — #1284, closes #1269 ### Fixed - Profile `TERMINAL_CWD` TypeError — `_build_agent_thread_env()` helper merges env before `_set_thread_env()` call. @hi-friday — #1266 - Service worker subpath cache bypass — regex now matches `/api/` under any mount prefix. @Michaelyklam — #1278 - SSE client disconnect leaks* — `TimeoutError`/`OSError` treated as clean disconnects; server backlog 64, threads daemonized; session list renders before saved-session restore. @KayZz69 — #1267 - i18n locale corrections — Korean MCP strings (23), Chinese MCP strings (23), zh-Hant missing keys (41), de missing keys (229). @bergeouss — #1274, closes #1273 --- ### Test results ``` 3199 passed, 2 skipped, 3 xpassed in 72.79s ``` ### PRs on hold (not included) #1265 (draft), #1271 (superseded by #1266), #1272 (skipped XSS tests), #1232 (partial test run), #1222 (review questions open), #1134 (live-server tests), #1132 (superseded by #1134), #1108 (negative UX review), #1084 (empty description)	2026-04-29 17:42:32 -07:00
Hermes Agent	eeef360a74	Merge remote-tracking branch pr/1261 into stage/batch-v0.50.238	2026-04-29 15:51:54 +00:00
Hermes Agent	bd8fc6a2e2	fix(models): preserve @provider:model hint when hint matches active provider When the user explicitly selects @provider:model from the picker, _resolve_compatible_session_model() was stripping the prefix because the hint matched the active provider (hint_matches_active=True → return bare_model, True). This caused: - The picker to snap back to the first duplicate entry on next render - resolve_model_provider() to use the default provider instead of the explicitly selected one, running the agent on the wrong backend The hint_matches_active branch was intended for normalizing stale cross- provider session models. But an @provider:model where the hint IS the active provider is not stale — it is the user's deliberate selection. Fix: return (model, False) so the full @provider:model survives to resolve_model_provider() in config.py, which already handles it correctly. Updates test_active_at_provider_session_model_preserved_with_hint and adds test_issue1253_duplicate_model_id_active_provider_hint_preserved. Closes #1253	2026-04-29 15:18:43 +00:00
Hermes Agent	4ee80425f2	Merge remote-tracking branch 'refs/remotes/pr/1229' into stage/batch-v0.50.238	2026-04-29 15:17:57 +00:00
Hermes Agent	e2ff00f819	Merge remote-tracking branch pr/1247 into stage/batch-v0.50.238	2026-04-29 15:11:21 +00:00
Hermes Agent	8b9ad761f9	Merge remote-tracking branch pr/1251 into stage/batch-v0.50.238	2026-04-29 15:10:49 +00:00
KingBoyAndGirl	4e0d8da060	fix: restore GET /api/mcp/servers route inside handle_get() Problem: - GET /api/mcp/servers returned 404 error - MCP servers management UI could not load server list - Root cause: route was placed outside handle_get(), in unreachable code Root Cause: - The MCP servers GET route was incorrectly placed after handle_get() returned False (404) - handle_get() function returns False at line ~1224, so any code after it won't execute - The route was also in handle_post() area but without proper method checking Solution: - Moved GET /api/mcp/servers route inside handle_get() before the return False statement - Removed the misplaced route from the old location (originally around line 1636) - Also updated /api/profiles response format to include full profiles list Testing: - After restart: curl http://localhost:8787/api/mcp/servers returns {"servers": []} - No more 404 errors - WebUI can now properly load MCP servers list	2026-04-29 17:39:56 +08:00
Frank Song	1ed1ce219d	Preserve transcript across context compaction	2026-04-29 16:37:08 +08:00
KingBoyAndGirl	d184613752	fix: fetch live models for custom provider from model.base_url	2026-04-29 16:24:19 +08:00

1 2 3 4

171 Commits