hermes-webui

mirror of https://github.com/nesquena/hermes-webui.git synced 2026-05-27 04:00:37 +00:00

Author	SHA1	Message	Date
Dennis Soong	cbb251b823	fix: add sidebar cancel for running sessions	2026-05-03 08:46:36 +08:00
Hermes Bot	341b1ee6b6	fix(composer): distinct voice-mode icon, descriptive labels, opt-in pref (#1488 ) Composer footer rendered two near-identical mic icons whose tooltips both said "Voice input" — push-to-talk dictation and hands-free voice mode were visually indistinguishable. Researched how ChatGPT/Claude/Gemini solve the same problem and adopt the industry convention. Changes: - btnVoiceMode now uses Lucide audio-lines (6 vertical bars), the universal voice-conversation glyph. Also registered in LI_PATHS. - Distinct localized tooltips: voice_dictate ("Dictate") and voice_mode_toggle ("Voice mode"), with active-state flips (voice_dictate_active "Stop dictation", voice_mode_toggle_active "Exit voice mode"). Legacy voice_toggle key removed (it resolved to "Voice input" in every locale and caused the duplicate-tooltip bug). - Voice mode is opt-in via Settings -> Preferences -> "Hands-free voice mode button" (default off). Dictation mic stays visible by default, unchanged. localStorage-backed; panels.js onchange calls window._applyVoiceModePref() so the button appears/disappears immediately without reload. - 17 regression tests pin: distinct titles, audio-lines glyph, all 4 new keys in all 9 locales, removal of stale voice_toggle, English labels match convention, pref gating (no unconditional display='' left in boot.js), Settings checkbox + i18n, panels.js wiring, active-state tooltip flips. Browser-verified on port 8789: default state shows 1 mic; enabling the pref makes the audio-waveform button appear live; tooltips read "Dictate" and "Voice mode" distinctly. Closes #1488	2026-05-02 22:16:23 +00:00
Hermes Bot	9049d4d6b3	test(bootstrap): skip venv.EnvBuilder.create() in fail-loud test The test_ensure_python_fails_loudly_when_no_interpreter_can_import_agent test was passing locally but failing on CI runners because: 1. CI runners don't have REPO_ROOT/.venv/bin/python on the filesystem 2. The function path on missing venv calls venv.EnvBuilder(with_pip=True).create() 3. That internally calls subprocess.check_output() — a different code path than the monkey-patched bootstrap.subprocess.run, which only stubs run(). 4. CI fails with: AttributeError: NoneType has no attribute stdout The behavior under test is "what happens when no interpreter can import both WebUI deps and the agent" — NOT the venv-creation path. So we sidestep EnvBuilder by setting REPO_ROOT to tmp_path with a pre-existing .venv/bin/python file. The venv-existence check passes, EnvBuilder is skipped, the stubbed _python_can_run_webui_and_agent returns False on the final check, and the expected RuntimeError fires. Co-authored-by: ccqqlo <ccqqlo@users.noreply.github.com>	2026-05-02 19:45:54 +00:00
Hermes Bot	0076f3d9ab	test(bootstrap): widen ensure_python_has_webui_deps stub for rebase onto v0.50.269 The PR added an `agent_dir` parameter to ensure_python_has_webui_deps. The test_bootstrap_foreground.py tests (added in #1478) had `lambda p: p` stubs that were 1-arg only. Widened to `lambda a, *kw: a[0]` so the stubs accept the new signature on the rebased base. Co-authored-by: ccqqlo <ccqqlo@users.noreply.github.com>	2026-05-02 19:35:42 +00:00
milo	634f90a807	fix: validate WebUI launcher can import agent	2026-05-02 19:32:21 +00:00
Hermes Bot	715a80569d	fix(bootstrap): --foreground mode for process supervisors (#1478 )	2026-05-02 18:04:44 +00:00
Hermes Bot	6aa2190cc6	fix(boot): restore inflight session on bfcache pageshow (#1480 )	2026-05-02 18:04:44 +00:00
Hermes Bot	26b332612d	fix(api): add pending_user_message to Session.compact() (#1479 )	2026-05-02 18:04:44 +00:00
Hermes Bot	bcfd8b2eac	chore(release): stamp v0.50.268 — 4-PR batch + Opus follow-ups (i18n + per-session fields + None title guard) - CHANGELOG.md: v0.50.268 entry detailing #1395 #1450 #1462 #1476 + Opus SHOULD-FIX followups - ROADMAP.md: bump to v0.50.268, 3800 tests collected - TESTING.md: bump header + total to 3800 SF-1 i18n fix: - static/i18n.js: session_meta_children key in all 10 locale blocks (en, ja, ru, es, de, zh, zh-Hant x2, pt, ko) - static/sessions.js: 2 callsites use t(session_meta_children, childCount) SF-2 #1462 per-session field carry-over: - api/routes.py: duplicate now carries personality, enabled_toolsets, context_length, threshold_tokens SF-3 #1462 None-title guard: - api/routes.py: (session.title or "Untitled") + " (copy)" Tests: - tests/test_stage268_opus_followups.py: 6 regression tests pinning SF-1 + SF-2 + SF-3 - tests/test_session_duplicate.py: 2 brittle assertions widened to accept new forms Follow-up issue filed: #1481 (PWA /sw.js whitelist vestige, Opus SF-4)	2026-05-02 17:54:58 +00:00
Dennis Soong	5e806f6fd8	fix: restore inflight session on bfcache pageshow	2026-05-03 01:53:01 +08:00
Hermes Bot	6a26e82c22	fix(bootstrap): address Opus pre-merge review feedback (#1478 ) Three changes from the pre-merge Opus review: MUST-FIX — XPC_SERVICE_NAME false-positive on macOS Terminal macOS launchd sets `XPC_SERVICE_NAME` in EVERY Terminal-spawned shell, not just real services. Typical noise values: `"0"` (truthy in Python!) and `"application.com.apple.Terminal.<UUID>"`. A bare `os.environ.get(name)` existence check would auto-promote interactive `./start.sh` runs to foreground mode on every Mac dev machine — silently breaking the most common installation path (no /health probe, no browser open, no log file, hanging shell). Fix: new `_is_real_supervisor_value()` helper that filters noise. For `XPC_SERVICE_NAME` specifically, reject `"0"` and any `"application."` prefix. Real launchd plists use reverse-DNS Label form (`com.<rdns>.<svc>`) which still triggers correctly. 7 new tests in `TestXPCServiceNameNoiseFilter`: - 4 noise values (`0`, Terminal.app, iTerm2, VSCode) → no detection - 3 real Label forms → correct detection - Mixed env with XPC noise + real INVOCATION_ID → falls through to systemd SHOULD-FIX 1* — Test env leakage The original `clean_env` fixture stripped supervisor-detection env vars but not the resolved bootstrap vars (HERMES_WEBUI_HOST/PORT/AGENT_DIR) that `main()` mutates onto `os.environ`. After `test_foreground_exports_resolved_env_vars` ran, later tests would import bootstrap with polluted defaults (DEFAULT_HOST="0.0.0.0" instead of "127.0.0.1"). Existing assertions still passed (tautological vs DEFAULT_), but it was a footgun for future tests. Fix: extend `clean_env` to also `delenv` the three resolved vars before each test. SHOULD-FIX 2* — Pre-execv executability guard If `discover_launcher_python` returns a path that doesn't exist or isn't executable, `os.execv` raises OSError → wrapper catches → SystemExit(1) → supervisor restarts → loop forever. That's exactly the failure mode this PR is supposed to eliminate. Fix: `os.access(python_exe, os.X_OK)` check before execv. Converts infinite supervisor loop into a single visible RuntimeError. 1 new test in `TestForegroundExecutabilityGuard` pinning that the guard fires before execv when the python path is non-executable. Docs — supervisor.md updates - New section explaining the XPC_SERVICE_NAME noise filter and what values trigger / don't trigger detection - New section listing supervisors that are NOT auto-detected (runit, daemontools, PM2, Foreman/Honcho, custom shell-script supervisors) with explicit recommendation to set HERMES_WEBUI_FOREGROUND=1 Verification - 3820 tests pass (+9 from this commit's new tests vs the original PR push of 3811) - Filter manually verified end-to-end with the live os.environ: XPC=0 → None, XPC=application.* → None, XPC=com.example.foo → triggers - run-browser-tests.sh ALL CHECKS PASSED on the worktree Items deferred from the Opus review - #4 chdir target may not exist: REPO_ROOT comes from __file__.resolve() so it's stable; not a real concern in practice - #6 two startup messages in foreground mode: cosmetic, useful for diagnostics - #7 stricter explicit-only mode: leaves user the override of just not passing --foreground (current behavior) - #8 test stub return value: trivial, can fix later if regression surface - #9 argparse positional-after-option ordering: test reads fine These can be follow-up issues if anyone hits them.	2026-05-02 17:52:13 +00:00
youzhi	b804b66238	Fix session list pending message payload	2026-05-03 01:44:38 +08:00
Hermes Bot	273888df48	fix(sidebar): nest child sessions under lineage roots (#1450 )	2026-05-02 17:41:05 +00:00
Hermes Bot	7c1b53258a	feat(api): /api/session/duplicate endpoint for session cloning (#1462 )	2026-05-02 17:41:05 +00:00
Hermes Bot	f0ed4aaa59	fix(sessions): sync URL after session id rotation (#1395 )	2026-05-02 17:41:05 +00:00
Hermes Bot	6303a30a87	Address review feedback: deepcopy independence, persist on duplicate, reset pinned/archived, 404 status Five fixes from the May 2 2026 maintainer review: 1. messages and tool_calls now use copy.deepcopy() — prior plain assignment shared list refs between source and duplicate, so appending a turn to one mutated the other. 2. copied_session.save() called explicitly — pre-fix, the duplicate was in-memory only until the user sent a turn. Refreshing mid-flow lost it. 3. pinned and archived reset to False — duplicating an archived conversation should produce a visible (un-archived) copy. 4. Missing-session error is now status=404 (was default 400). 5. Removed redundant `import uuid` / `import time` inside the handler — both are already at the top of routes.py. Test updates: - Two existing static-grep tests widened to accept the new `copy.deepcopy(session.messages)` form alongside the original `messages=session.messages`. - Five new static-grep regression tests pin each of the five fixes so reverting any single one trips a test. All 3775 tests pass. Co-authored-by: Alexey Dsov <AlexeyDsov@users.noreply.github.com>	2026-05-02 17:39:55 +00:00
Hermes Bot	f84b6a4e2f	fix(bootstrap): add --foreground mode for process supervisors (#1458 Bug #1 ) Issue #1458 reports persistent-host crashes (≥1/day) when running the WebUI under launchd KeepAlive on macOS. Root cause: `bootstrap.py` calls `subprocess.Popen([python, "server.py"], start_new_session=True)`, probes /health, then exits 0. Under any process supervisor (launchd, systemd, supervisord, runit, s6), the supervisor sees its tracked PID exit, marks the program as "completed," and respawns it. The new bootstrap fails to bind port 8787 (orphaned server still has it), exits non-zero, supervisor respawns again — loop until the orphan crashes for some other reason and the next respawn finds the port free. This PR addresses Bug #1 of the three failure modes tracked in #1458: the `bootstrap.py` double-fork breaking process supervisors. Bug #2 (state.db FD leak) and Bug #3 (HTTP-unhealthy wedge) remain open under the same issue — they need diagnosis data before a fix can land. Changes ------- 1. `bootstrap.py`: - New `--foreground` argparse flag with help text mentioning launchd / systemd / supervisord. - New `_detect_supervisor()` that returns the env var name for any supervisor it detects: `INVOCATION_ID` / `JOURNAL_STREAM` / `NOTIFY_SOCKET` (systemd, s6), `XPC_SERVICE_NAME` (launchd), `SUPERVISOR_ENABLED` (supervisord), or `HERMES_WEBUI_FOREGROUND` for the explicit user opt-in. Truthy values for the explicit opt-in: `1` / `true` / `yes` / `on` (case-insensitive). - `main()` branches on `args.foreground or _detect_supervisor()`: - Foreground path: chdir to `agent_dir or REPO_ROOT`, then `os.execv(python, [python, server_path])` to replace the bootstrap process image with the server. The supervisor sees the long-lived server as the original child. No `wait_for_health` probe — the supervisor's KeepAlive / Restart=on-failure handles liveness. - Default path: unchanged. Spawn server as detached child via `Popen + start_new_session=True`, probe /health, return 0. This still works for interactive `bash start.sh` invocations. - Resolved env vars (HOST/PORT/STATE_DIR/AGENT_DIR) are now mutated on `os.environ` directly instead of into a local `env` copy so they are inherited across `os.execv`. 2. `docs/supervisor.md` (new): runnable launchd plist, systemd .service, and supervisord conf examples + a diagnostic recipe (`lsof` + ppid chain) for catching the orphan-loop in production. 3. `.gitignore`: allowlist `docs/supervisor.md` (the directory uses an opt-in pattern; matches the existing `!docs/docker.md` precedent). 4. `tests/test_bootstrap_foreground.py` (new): 35 regression tests covering the argparse flag, `_detect_supervisor()` behavior across all five supervisor env vars, the explicit opt-in's truthy/falsy values, and `main()`'s execv-vs-Popen routing decision under each input combination. `os.execv` is monkeypatched in the routing tests — we pin the structural choice (which call is made, with which args, in which cwd, with which env) not the post-exec behavior. Why this scope and no more -------------------------- Bug #2 (state.db FD leak) lists 5 candidate paths and asks the reporter for `lsof -p <pid> \| sort \| uniq -c \| sort -rn \| head -20` output to disambiguate. Until that data lands, any "fix" would be speculative — explicitly out of scope per the contributor-pickup comment on the issue. Bug #3 (launchd-running, port-listening, HTTP-unhealthy) was added in @stefanpieter's reply comment. Diagnosis is in flight; no concrete fix shape yet. Also out of scope. Running locally end-to-end verifies the behavior: ``` [bootstrap] Starting Hermes Web UI on http://127.0.0.1:8789 (foreground mode: --foreground) $ pgrep -af 'server.py' 2997632 /home/.../python /tmp/wt-fix-1458/server.py $ ps -o ppid -p 2997632 2997581 ← bash that ran bootstrap.py — same PID as the original bootstrap $ ps -p 2997581 -o cmd ... bootstrap.py ... ← but exec'd into server.py ``` The same PID that bash forked for `bootstrap.py` is now `server.py`. A supervisor watching that PID would correctly observe the long-lived server. No double-fork. Verification ------------ - 3811 tests pass (`pytest tests/` — full suite, +51 from this PR plus master-merge-in) - All 35 new bootstrap-foreground tests pass - `bash scripts/run-browser-tests.sh` PASS (HTTP API checks against worktree) - `bash scripts/webui_qa_agent.sh 8789` PASS (23/23 visual QA) - Live verified: server starts cleanly under both `--foreground` and `HERMES_WEBUI_FOREGROUND=1`; PID lineage confirms no double-fork Closes #1458 (Bug #1 only). Bugs #2 and #3 remain tracked under the issue.	2026-05-02 17:37:54 +00:00
Hermes Bot	3abae9aca7	chore(release): stamp v0.50.267 — 7 contributor PR batch + Opus follow-up - CHANGELOG.md: v0.50.267 entry detailing #1454/#1474/#1461/#1465/#1467/#1460/#1473 + Opus advisor SHOULD-FIX trailing-empty guard for _norm_model_id - ROADMAP.md: bump to v0.50.267, 3776 tests collected - TESTING.md: bump header + total to 3776 - api/config.py: trailing-empty fallback in _norm_model_id (parts[-1] or s) - static/ui.js: mirror trailing-empty fallback in _normalizeConfiguredModelKey - tests/test_norm_model_id_trailing_empty_guard.py: 5 regression tests	2026-05-02 17:03:25 +00:00
Hermes Bot	c517339bce	fix(sessions): batch session actions + in-flight reload recovery (#1473 )	2026-05-02 16:49:55 +00:00
youzhi	a90e38f033	Fix string i18n placeholder interpolation	2026-05-02 23:05:55 +08:00
youzhi	40d2563d51	Fix batch session actions and inflight reload	2026-05-02 22:45:49 +08:00
Dennis Soong	3aafe52985	test: tighten inflight stream reuse invariants	2026-05-02 22:29:14 +08:00
Dennis Soong	6f0c5d6e1a	fix: reuse inflight session stream	2026-05-02 19:12:26 +08:00
AlexeyDsov	7c4c0142d5	feat(api): add /api/session/duplicate endpoint for session cloning\nNew endpoint creates independent session copies with all messages, model and workspace intact. Added 10 comprehensive regression tests for error handling and logic verification.	2026-05-02 11:59:45 +03:00
nesquena-hermes	c73f2ff387	v0.50.264 polish followups: i18n parity + assistant-output readability Closes #1442 (server-side _LOGIN_LOCALE missing ja/pt/ko) Closes #1443 (promote _isImeEnter helper to 6 other Safari Enter guards) Closes #1446 (glued-bold-heading lift for LLM thinking-block output) Closes #1447 (markdown heading visual hierarchy in chat messages) All four issues were filed by the Opus pre-release advisor on the v0.50.264 batch or by Cygnus via Discord (relayed by @AvidFuturist, May 1 2026). They share a common shape — narrow, well-scoped, independent of each other, all adding regression tests. == #1442: _LOGIN_LOCALE parity (api/routes.py + static/i18n.js) == Added entries for ja/pt/ko to the server-side _LOGIN_LOCALE dict that renders the localized login page BEFORE the JS i18n bundle loads. With v0.50.264 shipping Japanese as the 8th built-in locale, ja/pt/ko users were seeing the English login page even with their language preference set. While auditing static/i18n.js for English leakage, also fixed: - ko: 10 user-facing login/sign-out/password keys still in English - es: 3 sign-out/auth-disabled keys still in English Tests: tests/test_login_locale_parity.py (20 tests) — pins both invariants: (a) every locale in i18n.js LOCALES has a matching _LOGIN_LOCALE entry (b) every locale's login-flow keys (13 of them) are translated, not English == #1443: window._isImeEnter promotion == PR #1441 fixed the Safari IME-composition Enter race in the chat composer (`#msg`) by widening the guard from `e.isComposing` to a `_isImeEnter(e)` helper that combines three signals (isComposing \|\| keyCode===229 \|\| _imeComposing flag). Six other Enter-input handlers were left on the original narrow guard and would still drop IME composition Enters on Safari for Japanese/Chinese/Korean users. Promoted the helper to `window._isImeEnter` (defined in static/boot.js) and replaced the `e.isComposing` guards at all six sites: - static/sessions.js: session rename, project create, project rename - static/ui.js: app dialog (confirm/prompt), message edit, workspace rename The state-free part of the helper (`isComposing \|\| keyCode===229`) handles Safari's race for any focused input without needing per-input composition listeners — only `#msg` keeps the local `_imeComposing` flag. Tests: - tests/test_issue1443_ime_helper_promotion.py (9 tests) — pins each site + verifies no raw `e.isComposing` Enter-guards remain in sessions.js/ui.js - tests/test_ime_composition.py — alternation regex extended to accept the windowed helper form (loosen-test-on-shape-change pattern from v0.50.264 reflection notes) == #1446: glued-bold-heading lift (static/ui.js renderMd + Python mirror) == LLMs in thinking/reasoning mode emit "section headers" glued to the end of the previous paragraph with no whitespace: Para 1 text.Heading to Para 2 Para 2 text.Heading to Para 3 The renderer correctly produces inline `<strong>` per CommonMark, but it looks like trailing emphasis on the body text rather than a section break. Cygnus reported this as "Markdown feedback 2 of 3." Added a single regex pre-pass in renderMd(): s.replace(/([.!?])\\([^\n]{1,80})\\\n\n/g, '$1\n\n$2\n\n') Constraints chosen to avoid false positives: - Trigger only on `[.!?]` IMMEDIATELY before `` (no space) — almost always an LLM-glued heading, not intentional emphasis - Inner text ≤80 chars, no `` or newline (single-line only) - Trailing `\n\n` required — preserves "this is important to know." mid-paragraph emphasis untouched - Position: after rawPreStash restore, before fence_stash restore — fenced code blocks stay protected (their content is `\x00P` / `\x00F` tokens when the lift runs) Mirrored in tests/test_sprint16.py render_md() so both stay in sync. Tests: tests/test_issue1446_glued_heading_lift.py (17 tests, 5 of which drive the actual ui.js renderMd via node) — covers all 3 trigger forms (.!?), all 4 preserve-emphasis cases the issue spec'd, fenced/inline code protection, chained glued headings, source-level position pin, regex shape pin. == #1447: markdown heading visual hierarchy (static/style.css) == Pre-fix sizes in `.msg-body`: h1 18px, h2 16px, h3 14px (= body), h4 13px, h5 12px, h6 11px So h3 was indistinguishable from body and h4/h5/h6 were SMALLER than body. Cygnus's report: "Markdown feedback 3 of 3 — Headings seem to be missing across the board in Hermes. They're there, but all plaintext." New sizes: h1 24px (border-bottom) h2 20px (border-bottom) h3 17px h4 15px h5 14px (uppercase, tracked) h6 13px (uppercase, tracked, muted) All headings now `font-weight:700` + `color:var(--strong)` for stronger ink. h5/h6 use uppercase + letter-spacing for "label-style" affordance instead of being smaller-than-body. Synced .preview-md (file preview pane) to match exactly so a markdown file preview and a chat message render identically. Added missing h4/h5/h6 rules to .preview-md (it only had h1-h3 before). Updated data-font-size="small"/"large" h1-h6 overrides to scale proportionally with the new defaults. Hierarchy preserved at all three font-size settings. Tests: tests/test_issue1447_heading_hierarchy.py (9 tests) — pins the size hierarchy, the bottom borders on h1/h2, the uppercase affordance on h5/h6, the .preview-md sync, and the small/large override scaling. == Verification == pytest tests/ -q → 3748 passed (+56 new) bash ~/WebUI/scripts/run-browser-tests.sh → 20 + 11 PASS bash ~/WebUI/scripts/webui_qa_agent.sh 8789 → 23/23 PASS Visual confirmation in browser at port 8789: - Heading hierarchy clearly visible at all 6 levels - Glued-bold lift produces separate paragraphs as designed - window._isImeEnter accessible from any module after boot.js - Login page renders ja/pt/ko strings correctly (curl -s /login)	2026-05-02 04:19:28 +00:00
Dennis Soong	082f3d45b7	fix: nest child sessions under lineage roots	2026-05-02 12:09:36 +08:00
nesquena-hermes	4ee9368464	Opus pre-release follow-ups for PR #1445 REQUIRED: - _fully_unquote_path range(3) -> range(10) — defense-in-depth so quadruple- encoded .. is rejected by validator instead of slipping through (not exploitable but contract violation) - docs/EXTENSIONS.md trust-model callout moved to top of file with explicit 'don't enable in untrusted env / don't point at user-writable dir' guidance NICE-TO-HAVE (taken since Nathan asked for all fixes big and small): - URL list cap at _MAX_URL_LIST=32 to avoid pathological rendering - One-shot WARNING log for rejected URLs (silent drop now visible to admin) - One-shot WARNING log for URL list truncation - MIME map: ttf (font/ttf), otf (font/otf), wasm (application/wasm) 5 regression tests in tests/test_pr1445_opus_followups.py pin all invariants.	2026-05-02 03:49:40 +00:00
nesquena-hermes	73cb3c1948	stage-265: test fix + CHANGELOG for v0.50.265	2026-05-02 03:42:58 +00:00
Ryan Jones	9de61a0b9a	feat: add opt-in webui extension hooks	2026-05-02 03:36:54 +00:00
nesquena-hermes	e6e9868625	Opus pre-release follow-up: blur resets _imeComposing flag Opus advisor caught a recoverable footgun in PR #1441's manual flag: if focus is lost mid-composition (window blur or older Safari WebKit IME quirk), compositionend may never fire and _imeComposing stays true until the next full composition cycle. Result: Enter-to-send is silently broken until page reload — an unrecoverable stuck state for something that's supposed to be transient. Add a blur listener that also resets the flag. Cheap belt-and-suspenders against the stuck state. Adds 1 regression test pinning the listener. (other Opus findings logged in /tmp/stage-264-brief.md as follow-up issues: _LOGIN_LOCALE parity for ja/pt/ko, promote _isImeEnter to the 6 other Safari-affected Enter guards in sessions.js + ui.js)	2026-05-02 02:56:48 +00:00
nesquena-hermes	241bdafd28	test: bump locale-count assertions for new ja locale (8 -> >=8/9)	2026-05-02 02:50:40 +00:00
nesquena-hermes	71cf06cd1c	test: pr1441 IME helper guards + pr1439 ja locale parity - Loosen test_ime_composition._ime_guarded_enter_pattern to accept the new _isImeEnter(e) helper (PR #1441 widened guard for Safari + 229 keyCode + manual _imeComposing flag). Original e.isComposing-only pattern still matches via alternation. - Add test_pr1441_ime_safari_guard.py (6 tests): pin the 3-guard helper, compositionstart sets manual flag, compositionend defers reset to next tick (Safari race), null-guard $('msg') for non-chat pages, send-Enter uses helper, dropdown-Enter uses helper. - Add test_japanese_locale.py (8 tests): mirror Chinese/Korean templates, block exists, representative translations, full key parity with English, no extra keys, duplicates mirror en exactly, placeholders preserved, arrow-function values mirrored, _label uses Japanese script.	2026-05-02 02:44:59 +00:00
Dennis Soong	9e894a2555	fix: sync URL after session id rotation	2026-05-02 10:35:40 +08:00
nesquena-hermes	584974c9d2	fix(renderer): line-anchor fence regex to prevent mid-line ``` corruption (#1438 ) The markdown fence regex /```([\s\S]?)```/g had no line anchoring. A literal triple backtick inside code block content (e.g. a regex with ``` in a lookbehind, or a script that documents fences) terminated the outer fence at the wrong place. The leaked tail then went through bold/italic/inline-code passes, eating `` characters as italic markers and emitting literal </strong> tags into the rendered output. CommonMark §4.5 requires that an opening code fence be the first non-whitespace content of a line (up to 3 spaces of indent allowed) and that the closing fence also start a line. This patch updates 3 sites + the Python mirror to use that invariant: static/ui.js:1559 renderMd() fenced-block stash (assistant messages) static/ui.js:66 _renderUserFencedBlocks() (user messages) static/ui.js:2599 _stripForTTS() (TTS speech pre-strip) tests/test_sprint16.py Python mirror Pattern: (^\|\n)[ ]{0,3}```(?:([\s\S]?)\n)?[ ]{0,3}```(?=\n\|$) The non-capturing (?:...\n)? group keeps empty fences (```\n```) working; without it, a body+\n is required and the closing fence on the very next line no longer matches. The lead group (^\|\n) is prefixed back to the stash token so paragraphs above don't bleed into the <pre> block. 20 regression tests in tests/test_issue1438_fence_anchoring.py cover: - Cygnus's exact repro from Discord (May 1 2026) - Inline ``` mid-paragraph (must not open fence) - Partial/streaming fence with no close (must not eat content) - Empty fences with and without language tag - 3-space indented fences (allowed) vs 4-space (not a fence) - Multiple adjacent blocks - Bold/italic/inline-code surviving after a fence - Source-level guards on all 3 patched sites + lead-prefix invariant Empirical browser verification (live JS, on bug repro): Before fix: </code></pre>[^\n]<em>\|%%[ \t]</em>... ← truncated, italic leak After fix: <pre><code>...```[^\n]\|%%...</code></pre> ← intact, regex preserved Tests: 3678 passed (+20 from new test file, was 3658), 0 failures. Reported-By: Cygnus (Discord) Relayed-By: @AvidFuturist Closes #1438	2026-05-02 02:30:20 +00:00
nesquena-hermes	081e600b33	fix: context-window indicator broken on older sessions (#1436 ) Fix two-layer bug where `/api/session` returned `context_length=0` for sessions that pre-date #1318, then the frontend silently fell back to cumulative `input_tokens` and the 128K JS default, producing nonsense indicators like "100" capped from "890% used (context exceeded), 1.2M / 131.1k tokens used". Empirical impact: 23 of 75 sessions on dev server rendered >100% before this fix. #1356 fixed the same symptom on the live SSE path but missed the GET /api/session load path that older sessions go through. Two-layer fix: 1. Backend (api/routes.py:1295-1313) — resolve context_length via agent.model_metadata.get_model_context_length() when the persisted value is 0. Mirrors api/streaming.py:2333-2342. 2. Frontend (static/ui.js:1269) — drop the cumulative `input_tokens` fallback. When last_prompt_tokens is missing, render "·" + "tokens used" (existing !hasPromptTok branch) instead of computing a percentage from the cumulative total. 10 regression tests in tests/test_issue1436_context_indicator_load_path.py covering both layers + the empty-model edge case (avoids the 256K default-for-unknown-model trap that get_model_context_length('') returns). Verified live: claude-opus-4-7 session with input_tokens=5,226,479 now renders "·" + "5.3M tokens used" instead of "100" + "3987% used". Reported by @AvidFuturist. Closes #1436.	2026-05-02 01:43:00 +00:00
nesquena-hermes	26d0f45791	fix: new-chat guard ignores in-flight streams (#1432 ) + profile form auto-capitalizes typed values (#1423 ) Two unrelated UX bugs, both small surgical fixes with regression tests. Issue #1432 — "+" button doesn't open new chat during streaming ================================================================ Reported by @Olyno: clicking "+" after sending a first message keeps redirecting to the same chat instead of opening a new blank conversation, making parallel chats impossible until the first response finishes. Root cause: static/boot.js:691 (and the Cmd/Ctrl+K branch at :844) had an empty-session guard from #1171 that skipped newSession() when message_count===0: if(S.session && (S.session.message_count\|\|0)===0){ $('msg').focus(); closeMobileSidebar(); return; } But during the first user turn of a brand-new session, message_count is still 0 server-side because the user message hasn't been merged into s.messages yet. The guard treated that as "empty" and silently dropped the click, blocking parallel chats for the entire stream duration. Fix: Tighten the predicate to also exclude in-flight state: if(S.session && (S.session.message_count\|\|0)===0 && !S.busy && !S.session.active_stream_id && !S.session.pending_user_message){ $('msg').focus(); closeMobileSidebar(); return; } Same predicate applied to the Cmd/Ctrl+K handler at :844. The in-flight signal (active_stream_id \|\| pending_user_message) is the same one _restoreSettledSession() in messages.js:1081 already uses to decide whether a session is "settled" — keeping both call sites aligned. Verified end-to-end: with S.busy=true and pending_user_message set, the old guard returned `block=true` (= the bug), the new guard returns `block=false` (= fixed). With a truly empty session (no busy, no pending), both old and new guards still block — preserving #1171 behavior. Issue #1423 — Profile name field auto-capitalizes typed values ============================================================== Self-reported (Mac app, May 1 2026): typing `hello` into the New Profile "Name" field shows `Hello` after blur/autofill, contradicting the "Lowercase letters, numbers, hyphens, underscores only" hint right next to it. The form lowercases on submit so stored data is correct, but the displayed value during typing is misleading. Root cause: static/panels.js:2532 had only autocomplete="off": <input type="text" id="profileFormName" placeholder="..." autocomplete="off" required> Missing three attributes that actually prevent the misbehavior: - autocapitalize="none" — mobile keyboards (iOS Safari, Android Chrome, WKWebView in the Mac app) auto-capitalize the first letter without it - autocorrect="off" — Safari runs autocorrect on blur, can rewrite hello→Hello - spellcheck="false" — desktop browsers may run spellcheck on blur Fix: Add the three attributes to profileFormName. Also added to profileFormBaseUrl since URLs are similarly bad targets for autocapitalize/autocorrect. profileFormApiKey is type="password" and already has correct browser behavior. Verified end-to-end against the live DOM: openProfileCreate() → getElementById('profileFormName').getAttribute(...) returns the new attributes correctly, with required preserved. Tests ----- 3648 passed, 2 skipped, 3 xpassed (was 3640 — added 8 new regression tests in test_1432_newchat_and_1423_profile_input.py). One pre-existing test had to be widened: tests/test_mobile_layout.py test_new_conversation_closes_mobile_sidebar grabbed only the first 500 chars of the btnNewChat handler block to scan for closeMobileSidebar. The new comment block pushed closeMobileSidebar past that window even though both calls are still present. Bumped the window to 1500 chars and the shortcut-block lines from 12 to 24 to match the multi-line guard. Closes #1432 Closes #1423 Reported by @Olyno (#1432, GitHub)	2026-05-02 00:52:41 +00:00
nesquena-hermes	8ceeef3716	Apply Opus pre-release fixes: dropdown resize guard + display:block Three fixes from Opus advisor review of stage-261: 1. CRITICAL: dropdown-survives-resize bug. The composerToolsetsDropdown is a DOM sibling of composerToolsetsWrap, not a child, so CSS hiding the wrap does not cascade-hide an open dropdown. If a user opens the dropdown at composer-footer >= 1100px and then opens the workspace panel (or resizes the window), the dropdown would stay open without a visible anchor. Fixed in three places (defense-in-depth): - resize listener: closes dropdown when chip.offsetParent === null - _positionToolsetsDropdown: closes if chip hidden (defense-in-depth) - toggleToolsetsDropdown: early-returns if chip hidden (defense against future #1431 redesign code that might invoke from elsewhere) 2. MEDIUM: display:flex changed to display:block to match sibling wraps (.composer-profile-wrap, .composer-model-wrap, .composer-reasoning-wrap all use the natural block display). 3. Added 3 new regression tests to pin all three guards. Refs #1431, #1433.	2026-05-02 00:21:15 +00:00
nesquena-hermes	a6884ca40f	Make composer-footer toolsets chip responsive instead of always-hidden Replaces PR #1433 unconditional JS display:none with a CSS @container query that shows the chip only at composer-footer widths >= 1100px. JS now clears inline style instead of setting display:none, so the CSS responsive cascade is the single source of truth. Also removed inline style=\"display:none\" from index.html so the CSS base rule provides the default-hidden state. 10 regression tests pin the base hide, wide-container show, narrow-container hide (520px container query), mobile viewport hide (640px @media), JS does not force display:none, JS clears inline style, /api/session/toolsets and the dropdown machinery (toggleToolsetsDropdown, _populateToolsetsDropdown) are preserved. Refs #1431, #1433.	2026-05-02 00:04:12 +00:00
nesquena-hermes	b57525241b	v0.50.260: Docker reliability batch - PR #1428 + broader UX/docs improvements + Opus advisor fixes Combines PR #1428 (UID/GID alignment) with a broader Docker reliability pass that addresses recurring user reports about compose files not working. Constituent PR: - #1428 sunnysktsang - Align agent UID/GID with webui (fixes #1399). Two- and three-container compose files had agent at UID 10000 (image default) and webui at UID 1000 (WANTED_UID default), causing permission denied on shared hermes-home volume. All services now use ${UID:-1000}. Plus broader Docker UX overhaul: - All 3 compose files document HERMES_SKIP_CHMOD/HERMES_HOME_MODE escape hatches inline (the v0.50.254 fix wasn't surfaced for Docker users). - New .env.docker.example template covering UID/GID, paths, password, permission handling. UID/GID are uncommented with placeholder values per Opus advisor (so macOS users don't skim past). - New docs/docker.md - comprehensive guide: 5-min quickstart, failure mode table with one-line fixes, bind-mount migration, multi-container architecture diagram, macOS Docker Desktop VirtioFS note, link to community sunnysktsang/hermes-suite all-in-one image. - README Docker section rewritten - clearer quickstart, failure-mode table, link to docs/docker.md. Stale /root/.hermes references removed. Plus Opus pre-release advisor MUST-FIX: - HERMES_HOME_MODE has DIFFERENT semantics in the WebUI vs the agent image. WebUI: credential-file mode threshold (0640 allows group bits). Agent: HERMES_HOME directory mode (default 0700). 0640 on a directory has no owner-execute bit, so the agent can't traverse its own home and bricks. My initial draft recommended HERMES_HOME_MODE=0640 in agent service blocks - corrected to 0750 across all 4 surfaces (compose files, .env.docker.example, docs/docker.md). 3 regression tests pin the asymmetry. 12 regression tests total in test_v050260_docker_invariants.py. Full suite: 3627 passed, 0 failed. Nathan explicitly authorized merge with my own review + Opus only, no independent review needed.	2026-05-01 23:10:52 +00:00
nesquena-hermes	69ab856d37	test fix: skip test_session_db_close_is_idempotent when hermes_state not on import path CI-only failure: test_session_db_close_is_idempotent imported hermes_state from /home/hermes/.hermes/hermes-agent which exists locally but NOT on the GH Actions runner that only has the WebUI repo. Use importlib.util.find_spec to detect availability and pytest.skip when the agent repo isn't present. The source-level pin in test_cached_agent_reuse_closes_old_session_db catches revert of the close() call; the runtime idempotency test is added confirmation when both repos are co-located. Local: 5 passed. CI: 4 passed + 1 skipped (idempotency).	2026-05-01 22:45:18 +00:00
nesquena-hermes	c75ce33280	v0.50.259: Opus pre-release follow-up — close _session_db on LRU eviction + CHANGELOG + 5 regression tests PR #1421 (SessionDB WAL handle leak fix on cached-agent reuse path) had a sibling leak at the LRU eviction site that I caught during pre-review: api/streaming.py SESSION_AGENT_CACHE.popitem(last=False) was discarding the evicted entry with `evicted_sid, _ = ...`. The agent's _session_db was dropped on the floor and only released when GC eventually finalized the agent — which on a long-running server may be never (cyclic refs, extension types holding C handles, etc.). Same fix shape as #1421: capture the evicted entry, call _evicted_agent._session_db.close() explicitly. SessionDB.close() is idempotent + thread-safe (with self._lock: if self._conn:), so the double-close-is-benign property still holds. 5 regression tests in test_v050259_sessiondb_fd_leak.py: - Source-level: cached-agent reuse path closes before replace - Source-level: LRU eviction path captures + closes evicted agent - Behavioral: SessionDB.close() is idempotent (3 calls safe) - Behavioral: cached-agent reuse with mock — close called exactly once - Behavioral: LRU eviction with mock — only evicted agent's DB closes Full suite: 3615 passed, 0 failed. Nathan explicitly authorized 'just go ahead and merge it as a small release' since the PR is 9 LOC, focused, has Opus pre-release follow-up + tests, and matches the empirically-confirmed leak shape (73-handle leak at EMFILE).	2026-05-01 22:42:53 +00:00
nesquena-hermes	399f12ac96	v0.50.258: Opus follow-up — fix multi-param redirect-encoding bug + CHANGELOG PR #1419 (login session TTL + redirect-back + connectivity probe) had a real bug in the server-side ?next= construction: quote(path, safe='/:@!$&'()*+,;=') keeps ? and & literal, so: (a) /api/sessions?limit=50&offset=0 round-trips as /api/sessions?limit=50 — the inner & terminates the outer next= value and offset=0 leaks as a top-level outer query the login page ignores. (b) An attacker-controlled path with embedded &next=https://evil.com injects a second top-level next parameter. Browsers parse first-match (benign), Python parse_qs parses last-match (the evil URL) — the parser-divergence is a footgun even though _safeNextPath() in login.js rejects the actual exploit. Fix: encode the entire path?query blob with safe='/' so ?, &, = all percent-encode. The outer next then holds exactly one path-with-query string the browser auto-decodes once. 6 regression tests in test_v050258_opus_followups.py pin round-trip behavior across simple paths, single-query, multi-param queries, attacker-injection neutralization, and the SESSION_TTL=30d constant. Full suite: 3610 passed, 0 failed.	2026-05-01 21:30:10 +00:00
nesquena-hermes	c78bcddda6	v0.50.257: CRITICAL Opus finding — fix non-functional per-session toolset override Opus pre-release advisor caught a 5th issue not covered by my initial follow-up sweep, this one CRITICAL: PR #1402 #493 per-session toolset override silently no-op'd every time. Bug: api/streaming.py:1755 called _session_meta.get('enabled_toolsets') on the result of Session.load_metadata_only(). It returns a Session INSTANCE, not a dict. .get() raised AttributeError, which the surrounding bare except swallowed silently. The toolset chip in the UI saved correctly to disk, but the streaming agent always ran with global toolsets. Fix: use getattr(_session_meta, 'enabled_toolsets', None). Two new regression tests: - Source-level: forbid the .get() / [] dict-access shape. - Runtime: Session.load_metadata_only must return a Session instance. Full suite: 3604 passed, 0 failed.	2026-05-01 18:36:24 +00:00
nesquena-hermes	f8007d43f3	v0.50.257: 4 Opus pre-release follow-ups + CHANGELOG + test fixes for #1415 stage-257 batch (PRs #1402 + #1415): Opus pre-release advisor caught 4 issues in stage-257: 1. MUST-FIX (security): api/oauth.py::_write_auth_json — tmp.replace() preserves the temp file umask (0644 default), so OAuth access/refresh tokens landed world-readable on shared systems. Fix: tmp.chmod(0o600) BEFORE rename, with try/except OSError that warns but does not abort. 2. SHOULD-FIX: _handle_cron_history and _handle_cron_run_detail accepted job_id as a path component without validation. Mirrors the rollback path-traversal vector caught in v0.50.255 (#1405). Path() / .. does NOT normalize. New regex ^[A-Za-z0-9_-][A-Za-z0-9_.-]{0,63}$ with explicit . / .. rejection. 3. SHOULD-FIX: _handle_cron_history int(offset)/int(limit) raised ValueError on malformed input → confusing 500. Now try/except + clamp to (max(0, offset), max(1, min(500, limit))). 4. NIT: same regex applied to _handle_cron_run_detail (defense-in-depth even though path-resolve check would catch it downstream). PR #1415 follow-up: 8 pre-existing tests in test_issue1106 and test_custom_provider_display_name asserted bare model IDs but #1415 changes named-custom-provider IDs to @custom:NAME:model form when active provider differs. Tests updated to use _strip_at_prefix helper to keep checking the same invariant in the new shape. 4 regression tests in test_v050257_opus_followups.py + 8 fixed pre-existing tests. Full suite: 3602 passed, 0 failed.	2026-05-01 18:30:41 +00:00
youzhi	59e07f3fff	Fix WebUI custom provider routing	2026-05-02 02:11:41 +08:00
nesquena-hermes	0f594ec714	fix: register 5 missing Lucide icons (TTS speaker + queue chevron + insights cards) (#1413 ) The li() helper in static/icons.js logs console.warn and returns '' when an icon name is not in LI_PATHS. Five icon names referenced by static/.js were never registered, so their host elements rendered as empty 0-size buttons / containers despite display:flex. Five missing icons added: - 'volume-2' — TTS speaker on every assistant message (ui.js:3376; regression from #499; surfaced after #1411 fixed CSS specificity in v0.50.255) - 'chevron-up' — queue pill chevron (ui.js:2178; the '▲' fallback only fired when li was undefined, not when it returned '') - 'hash' — Insights 'Messages' stat card (panels.js:883) - 'cpu' — Insights 'Tokens' stat card (panels.js:884) - 'dollar-sign' — Insights 'Cost' stat card (panels.js:885) The Insights icons are a fresh regression from #1405 (v0.50.255). Adds tests/test_issue1413_li_path_coverage.py — three tests: 1. Walk every li('NAME', ...) call across static/.js, assert NAME is registered in LI_PATHS. Prevents the entire class of bug. 2. Pin the five icons added by this fix so removal gets a clear error message. 3. Pin the warn+empty-string contract of li() so the diagnostic story in the test docstring stays accurate. Reported by @AvidFuturist via Telegram, 2026-05-01. Fixes #1413	2026-05-01 17:57:34 +00:00
nesquena-hermes	fcba6fda1c	Merge PR #1411 from nesquena-hermes: TTS toggle CSS specificity collision (#1409 ) + Ollama env var bleed (#1410 ) # Conflicts: # CHANGELOG.md	2026-05-01 17:34:28 +00:00
nesquena-hermes	5ce516ed38	v0.50.255: Opus follow-ups (4 fixes) + CHANGELOG Opus pre-release advisor caught 4 issues in stage-255 (#1390 + #1405): 1. MUST-FIX: api/rollback.py path-traversal — _checkpoint_root() / ws_hash / checkpoint did NOT normalize Path() / "../escape", so an authenticated caller could read or restore from another allowlisted workspace via ../<other-ws-hash>/<sha>. New _validate_checkpoint_id() regex-guards with ^[A-Za-z0-9_-][A-Za-z0-9_.-]{0,63}$ and rejects . and .. literals. Both get_checkpoint_diff and restore_checkpoint validate. 2. SHOULD-FIX: redact_session_data perf cliff — the new api_redact_enabled toggle in #1405 called uncached load_settings() per string, recursed across messages[] and tool_calls[]. For a 50-message session: hundreds of disk reads per /api/session response. Now read once at the top and thread _enabled through via private kwarg. 3. SHOULD-FIX: voice-mode wrong-session TTS — the patched autoReadLastAssistant fires globally; if the user navigated to a different session between sending and stream completion, TTS would speak the wrong session\\s reply. New _voiceModeThinkingSid closure captures S.session.session_id at thinking-time; _speakResponse bails to _startListening() on mismatch. 4. NIT: rollback._inspect_checkpoint had bare Exception in the except tuple alongside specific catches, swallowing everything. Now (TimeoutExpired, OSError) only. 6 regression tests in test_v050255_opus_followups.py. Full suite: 3587 passed, 2 skipped, 3 xpassed.	2026-05-01 17:19:53 +00:00
nesquena-hermes	0e9bd651a4	fix: TTS toggle CSS specificity collision (#1409 ) + Ollama env var bleed (#1410 ) Two unrelated UX/Settings bugs, both small surgical fixes with regression tests. Issue #1409 — TTS toggle has no effect ======================================= Reported via Discord: ticking Settings → Voice → "Text-to-Speech for responses" did nothing. The speaker icon never appeared on assistant messages despite the checkbox saving to localStorage correctly. Root cause (CSS specificity collision): static/panels.js _applyTtsEnabled() set btn.style.display = enabled ? '' : 'none' on every .msg-tts-btn. The '' branch removes the inline override, after which the .msg-tts-btn { display:none; } rule from style.css re-hides the button. Both branches left the icon hidden, so the toggle has been silently broken since #499 first shipped the TTS feature. Fix (body-class toggle, Option B from the issue): - panels.js: _applyTtsEnabled now toggles body.classList('tts-enabled') - style.css: new compound selector body.tts-enabled .msg-tts-btn { display:inline-flex; align-items:center; } - default-hidden rule (.msg-tts-btn{display:none;}) preserved so the icon stays hidden by default (CSS-only state) - boot.js paths that already call _applyTtsEnabled(localStorage…) work unchanged — the new function applies state at the body level instead of inline-styling individual buttons, so the rule survives renderMd() re-renders without re-querying every button Verified end-to-end against live server: getComputedStyle on a probe .msg-tts-btn returns display:flex when body has tts-enabled, display:none when it doesn't. Two regression tests in TestIssue1409TtsToggleBodyClass explicitly check for the body-class shape and forbid the broken inline-style pattern. Issue #1410 — Ollama (local) shows "API key configured" when only Ollama Cloud key is set ================================================================= Reported via Discord: configuring Ollama Cloud lit up the local Ollama card too. Both providers were mapped to OLLAMA_API_KEY in api/providers.py _PROVIDER_ENV_VAR. Root cause: api/providers.py:47-48 "ollama": "OLLAMA_API_KEY", "ollama-cloud": "OLLAMA_API_KEY", _provider_has_key("ollama") found the value the user set for Ollama Cloud and returned True. But the runtime code path in hermes_cli/runtime_provider.py only consumes OLLAMA_API_KEY when the base URL hostname is ollama.com (Ollama Cloud) — local Ollama is keyless by default and reaches a custom base URL with no auth. The WebUI was reporting "configured" for a key local Ollama doesn't even read. Fix (Option A from the issue body, preferred): - Drop bare "ollama" from _PROVIDER_ENV_VAR with an inline comment explaining why - _provider_has_key("ollama") falls through to the config.yaml branch, which already supports providers.ollama.api_key for local users who genuinely need to set a token - ollama-cloud retains its OLLAMA_API_KEY mapping unchanged Verified end-to-end against live server with OLLAMA_API_KEY=sk-cloud-key-test in env: GET /api/providers reports has_key=True only for ollama-cloud, and has_key=False for bare ollama. Two regression tests in TestIssue1410OllamaEnvVarBleed cover the bleed-prevention case AND the "local user with config.yaml api_key still reports configured" case to guard against over-correction. Tests ----- 3572 passed, 2 skipped, 3 xpassed (was 3567 — added 5 new regression tests). Closes #1409 Closes #1410 Reported by @AvidFuturist (Discord, May 1 2026)	2026-05-01 17:14:51 +00:00
nesquena-hermes	6ad7a4cc83	Merge PR #1405 from bergeouss: P3 features (insights, rollback, voice mode, subagent tree, redact toggle)	2026-05-01 16:58:49 +00:00

1 2 3 4 5 ...

427 Commits