hermes-webui

mirror of https://github.com/nesquena/hermes-webui.git synced 2026-05-25 11:10:18 +00:00

Author	SHA1	Message	Date
Hermes Agent	9aad249e5a	chore(release): stamp v0.50.295 — 3-PR batch + Opus pass Constituent PRs: #1637 by @Michaelyklam — protect raw pre from glued-bold lift (closes #1451) #1639 by @bergeouss — macOS auto-scroll race + custom:* provider list (closes #1360, #1619) #1642 by @nesquena-hermes — YAML/JSON/diff code block newlines (closes #1618, #1463) Opus advisor SHIP verdict on stage-295. One observation absorbed: - api/config.py:2533 dead-code comment per Opus (defensive belt-and-braces for #1619 fallback; load-bearing fix is in routes.py /api/models/live) PR #1641 (Michaelyklam parallel-discovery duplicate of #1642) closed as superseded; UI media adopted with co-author trailer. 4245 → 4255 tests passing (+10).	2026-05-04 18:37:52 +00:00
bergeouss	324aeaaded	fix: macOS auto-scroll momentum race (#1360 ) + custom:* provider model list (#1619 ) #1360 — On macOS WKWebView, trackpad momentum scrolling fires scroll events that interleave with the _programmaticScroll setTimeout(0) guard. A mid-momentum scroll event either gets swallowed (_programmaticScroll still true) or falsely reports nearBottom (momentum hasn't settled), keeping _scrollPinned=true and snapping the viewport back down. Fix: rAF-debounce the scroll listener so the nearBottom check runs at the next paint frame when the browser's scroll position has settled. Added a hysteresis counter requiring 2 consecutive near-bottom samples before re-pinning, preventing accidental re-pin during deceleration. #1619 — When a custom:* provider (e.g. custom:relay via custom_providers) has models that overlap with auto-detected models from base_url /v1/models, the dedup logic at config.py:2263 skipped them all. The named custom group ended up empty, and the continue at line 2334 silently discarded the auto-detected models. Result: only the default model appeared. Fix 1 (config.py): When custom:* named group has 0 models after dedup, fall back to auto_detected_models_by_provider instead of dropping them. Fix 2 (routes.py): Extended /api/models/live fallback to handle custom:* slugs (not just bare "custom") for both custom_providers config lookup and base_url live fetch.	2026-05-04 18:23:04 +00:00
test	6bbf913e22	Stage 294: PR #1631 — streaming stability trio (closes #1623 , #1624 , #1625 ) by @nesquena-hermes — APPROVED	2026-05-04 17:13:08 +00:00
nesquena-hermes	66b925f59d	fix(cache): stamp /api/models disk cache with WebUI version + schema version (#1633 ) Closes #1633. STATE_DIR/models_cache.json was persisted across server restarts without any version stamp, so a Docker container update from version A to B read the cache file written by version A — users saw stale picker contents (missing models, phantom provider groups) for up to 24 hours until either the TTL expired, an unrelated provider edit triggered invalidate_models_cache(), or they manually deleted the file. Reporter Deor (Discord) updated to v0.50.292 — which contained fixes for #1538, #1539, and #1568 — did a hard refresh and cleared site data, and still saw byte-for-byte identical picker contents because the server kept reading the v0.50.281 cache file off the host-mounted state volume. Fix: * _save_models_cache_to_disk() stamps payloads with _webui_version (resolved lazily from api.updates.WEBUI_VERSION via sys.modules lookup to avoid the api.config <-> api.updates circular import) and _schema_version = 2. * New _is_loadable_disk_cache() validator checks both stamps in addition to shape. Mismatch on either field rejects the load. * _load_models_cache_from_disk() calls the new validator and strips the disk-only metadata before returning, so the rest of the code sees the same shape it always did. * _is_valid_models_cache() kept loose (shape-only) so in-memory cache writes that never touch disk don't fail validation. Schema version is independent of the WebUI version stamp so future cache-shape changes can invalidate older releases without relying on a tag bump alone. Early-init edge case (api.updates not yet loaded) skips the version check rather than wedging the boot — at worst an unstamped file is written once and rejected on the next call. Updated existing tests/test_model_cache_metadata.py to use subset/ round-trip semantics rather than byte-for-byte equality, since the disk payload now has additional stamps. The four response-shape fields still round-trip verbatim; the load result is unchanged (stamps stripped). 19 new regression tests. 4180 -> 4199 tests pass.	2026-05-04 17:03:02 +00:00
nesquena-hermes	040cb8af70	Apply Opus pre-release SHOULD-FIX + NITs (in-PR per release policy) SHOULD-FIX: rate-limit _repair_stale_pending repair-firing telemetry. Switch from unconditional logger.warning to age-keyed: WARNING when pending_age < 5min (the diagnostically valuable race window — actual leak-path candidates that slipped past the grace guard) and DEBUG for the long-tail (orphaned sidecars from prior process lifetimes). Prevents reconnect loops on stuck sessions from flooding the log while preserving the diagnostic signal we want for tuning _REPAIR_STALE_PENDING_GRACE_SECONDS empirically. NIT: _LOCAL_SERVER_PROVIDERS expanded with lm-studio (hyphenated alias used in some custom_providers configs and already recognized at api/config.py:2189 for SSRF host trust) and localai (LocalAI project). Test parametrize expanded from 7 to 11 names, also covering pre-existing koboldcpp and textgen for symmetry. +4 regression tests. NIT (docs): CHANGELOG callout for the RFC1918 behavior change. Internal- network OpenAI-compatible proxies now preserve the model prefix on private-IP base_urls. Documented the migration path: configure as a custom_providers entry to bypass the local-server detection. NIT (deferred, optional): narrowing the heuristic to is_loopback only is left as future work; the broader scope was an explicit goal in the bug body and Opus flagged it as SHOULD-DISCUSS-but-not-block. 4184 -> 4188 passing. 0 regressions. ~10 LOC absorbed total.	2026-05-04 16:50:22 +00:00
nesquena-hermes	bea57beba9	fix(streaming): SSE heartbeat alignment, repair grace period, local-server model id preservation (#1623 , #1624 , #1625 ) Closes #1623 — Lower SSE app heartbeat from 30s to 5s at every long-lived handler (main agent, terminal, gateway-watcher, approval-poller, clarify-poller). Kernel TCP keepalive declares peer dead at 25s worst-case (10s KEEPIDLE + 5s KEEPINTVL * 3 KEEPCNT, added v0.50.289 #1581). 30s app heartbeat let the kernel tear sockets down on flaky networks before the app sent its first keepalive byte — drops at ~10s during long thinking phases. New named constant _SSE_HEARTBEAT_INTERVAL_SECONDS=5; regression test pins the inequality (app_heartbeat * 2 <= kernel_window) so future tuning can't re-introduce the misalignment. Closes #1624 — Add 30s grace period to _repair_stale_pending() trigger. Without it, any narrow race between the streaming thread clearing pending_user_message and STREAMS.pop(stream_id) produces a false-positive 'Previous turn did not complete.' marker on a turn that finished correctly (reproducible after every command-approval turn). Defense-in-depth, not the root-cause fix — the actual streaming-thread leak path is tracked separately. Falsy pending_started_at (legacy sidecars) treated as 'old enough' so legitimate legacy-data recovery still works. Plus logger.warning telemetry on every legitimate repair so the next batch of user reports tells us whether the underlying race still fires. Closes #1625 — Local model servers (LM Studio, Ollama, llama.cpp, vLLM, TabbyAPI, koboldcpp, textgen-webui) now keep the full HuggingFace-style model id (e.g. 'qwen/qwen3.6-27b' instead of stripped 'qwen3.6-27b'). New _LOCAL_SERVER_PROVIDERS set + _base_url_points_at_local_server() loopback/ RFC1918 heuristic — either signal triggers no-strip. Backward compat preserved for OpenAI-compatible proxies on public hosts (LiteLLM at litellm.example.com still strips openai/gpt-5.4 -> gpt-5.4). Updated the existing #230/#433 test to reflect that #1625 supersedes the strip-on-custom rule for loopback hosts (see api/config.py and test_model_resolver.py docstring update). Reported by @akarichan8231 in Discord on 2026-05-04. 42 regression tests across: tests/test_issue1623_sse_heartbeat_alignment.py (3) tests/test_issue1624_repair_stale_pending_grace.py (9) tests/test_issue1625_local_server_model_id_preservation.py (30) 4142 -> 4184 passing. 0 regressions.	2026-05-04 16:49:43 +00:00
Hermes Agent	f3e066b53c	chore(release): stamp v0.50.293 — 3-PR batch + 2 Opus follow-ups absorbed Constituent PRs: #1627 by @franksong2702 — show Hermes Agent version (closes #1606) #1629 by @nesquena-hermes — profile isolation trio (closes #1611, #1612, #1614) #1630 by @Michaelyklam — provider config cleanup regression test (#1597 follow-up) Opus advisor SHIP verdict + 2 SHOULD-FIX absorbed in-release: - load_projects() re-reads from disk inside lock to close migration startup race - _detect_agent_version() uses --dirty for symmetry with _detect_webui_version() 4142 → 4180 tests passing.	2026-05-04 16:33:57 +00:00
test	838645fd50	Stage 293: PR #1629 — profile isolation trio (closes #1611 , #1612 , #1614 ) by @nesquena-hermes — APPROVED	2026-05-04 16:21:29 +00:00
test	341b4c7abd	Stage 293: PR #1627 — show Hermes Agent version in Settings (closes #1606 ) by @franksong2702	2026-05-04 16:20:39 +00:00
nesquena-hermes	6bc0f9c4d5	Apply Opus pre-release SHOULD-FIX + NITs (in-PR per release policy) SHOULD-FIX #1 (renamed-root client cross-alias): drop strict-equality client filter at static/sessions.js:1853. Server-side _profiles_match cross-aliases 'default'-tagged rows to a renamed root 'kinni'; the strict-equality client would reject them, dropping every legacy session for renamed-root users. The server is now solely authoritative for profile scoping. SHOULD-FIX #2 (messaging-source dedupe ordering): _keep_latest_messaging_session_per_source now runs AFTER the profile filter at api/routes.py:2078. Before, it ran on the merged-cross-profile list with profile-blind keys, discarding the older profile's row across profiles before the scope filter — leaving zero rows for any messaging identity the active profile shared with another profile. NIT #3: _projects_migrated flag now set only AFTER successful save_projects. NIT #4: cleaned dead test code in test_is_root_profile_invalidation_drops_stale. NIT #5: _create_profile_fallback's clone_from=='default' literal now routes through _is_root_profile() for parity with the 5 other callsites. +2 regression tests pin the SHOULD-FIX shapes: - test_keep_latest_messaging_runs_after_profile_filter (source-string ordering) - test_static_sessions_js_trusts_server_profile_scoping (no client re-filter) 4173 -> 4175 tests pass. 0 regressions.	2026-05-04 16:17:26 +00:00
Michael Lam	b6c695e1ab	test: cover provider config cleanup path	2026-05-04 09:04:07 -07:00
nesquena-hermes	e8862632ed	fix(profiles): scope sessions, projects, and root-profile resolution to active profile (#1611 , #1612 , #1614 ) Closes #1611 — /api/sessions filters by active profile by default; ?all_profiles=1 opt-in for aggregate views; new _profiles_match() helper honours renamed-root cross-aliasing; static/sessions.js drops the s.is_cli_session bypass; toggle-on re-fetches with all_profiles=1 instead of slicing client-cached rows. Closes #1612 — new _is_root_profile() central helper consults list_profiles_api() for is_default=True matches alongside the legacy 'default' alias. Replaces five literal-default callsites in api/profiles.py. Memoized with explicit invalidation hooks at create + delete. Sticky active_profile file write now stores '' for renamed root, consistent with the legacy empty==root contract. Closes #1614 — projects carry a profile field stamped at create-time; /api/projects filters by active profile; /api/projects/{create,rename,delete} and /api/session/move reject ops on cross-profile projects with 404; new _PROJECTS_MIGRATION migration in load_projects() back-tags untagged projects from any session that uses them, fall back to 'default'; ensure_cron_project keys lookup by (name, profile) so each profile gets its own Cron Jobs project. 31 regression tests (9+11+11) pin the renamed-root resolution, server-side profile scoping shape, helper invariants, cross-alias matching, migration behavior, and active-profile guards on every project mutation endpoint. 4148 tests pass. Reporter: @stefanpieter Co-authored-by: stefanpieter <noreply@github.com>	2026-05-04 16:03:05 +00:00
Frank Song	59efb42dcd	Show Hermes Agent version in settings	2026-05-04 23:57:56 +08:00
Hermes Agent	1549a10510	chore(release): stamp v0.50.292 — 12-PR batch + Opus follow-ups absorbed Constituent PRs: #1597 by @Michaelyklam — pytest config-path isolation #1598 by @Michaelyklam — multi-tab SSE broadcast (closes #1584) #1599 by @Sanjays2402 — _pending_started_at truthy-check (closes #1595) #1600 by @Michaelyklam — streaming markdown subpath/fallback #1601 by @Michaelyklam — subpath frontend routes #1602 by @ai-ag2026 — cross-source continuation #1603 by @ai-ag2026 — git remote name preservation #1605 by @ai-ag2026 — update banner branch labels #1608 by @franksong2702 — cron broad-except removal (closes #1578) #1609 by @franksong2702 — server.py socket cleanup (closes #1583) #1621 by @franksong2702 — fork indicator polish (fixes #1613) #1622 by @s905060 — paste text-with-image (closes #1620) Opus advisor SHIP verdict + 2 SHOULD-FIX absorbed in-release: • #1598 ordering race fixed (offline-buffer replay moved inside lock) • #1601 sessions.js:1440 gateway SSE probe baseURI parity fix 4117 → 4142 tests passing.	2026-05-04 15:45:41 +00:00
test	21eb8a89bf	Stage 292: PR #1598 — broadcast SSE stream events to multiple tabs (closes #1584 ) by @Michaelyklam	2026-05-04 15:34:17 +00:00
test	8a10532d29	Stage 292: PR #1601 — keep frontend routes under subpath mounts by @Michaelyklam	2026-05-04 15:34:08 +00:00
test	b6702fbeae	Stage 292: PR #1602 — keep cross-source continuations separate in sidebar by @ai-ag2026	2026-05-04 15:33:32 +00:00
test	51848fb67d	Stage 292: PR #1603 — preserve git remote names in update links by @ai-ag2026	2026-05-04 15:33:32 +00:00
test	165356e744	Stage 292: PR #1608 — tighten worker-side broad-except in _run_cron_tracked (closes #1578 ) by @franksong2702	2026-05-04 15:33:32 +00:00
test	5b4ab72452	Stage 292: PR #1597 — isolate pytest Hermes config path by @Michaelyklam	2026-05-04 15:33:32 +00:00
Frank Song	cdcd6021cc	fix(cron): tighten worker-side broad-except in _run_cron_tracked (closes #1578 ) Remove the try/except Exception wrapper around cron_profile_context_for_home(...).__enter__() in _run_cron_tracked. A silent fallback to ctx=None would leave the worker thread unpinned against process-global HERMES_HOME, silently corrupting cross-profile state — the same class of bug as #1573. Add regression test to catch any future re-introduction.	2026-05-04 16:28:33 +08:00
Manfred	3c93d5a702	fix: keep cross-source continuations separate in sidebar	2026-05-04 09:30:47 +02:00
Manfred	93251e5bcb	fix: preserve git remote names in update links	2026-05-04 09:30:47 +02:00
Michael Lam	e9d7d5e427	fix: keep frontend routes under subpath mounts	2026-05-04 00:06:58 -07:00
Sanjay Santhanam	14fac05dc9	fix(streaming): use truthy-check for _pending_started_at fallback Switch the per-turn duration fallback from `is not None` to a truthy check so None, missing-attr, and an explicit 0 all uniformly fall back to time.time(). Without this, a 0 timestamp (e.g. via a buggy migration or manual file edit) would yield `time.time() - 0` ≈ wall-clock-since-epoch, displaying nonsense like 'Done in 56 years 4 months ...'. In practice pending_started_at is always set via int(time.time()) so this is a hardening fix, not a live-bug fix. Also drop the brittle source-string assertion in the regression test that pinned the literal expression. The behavioural test test_done_handler_persists_duration_on_last_assistant_message already proves the duration field is set; pinning the source line broke twice during the v0.50.290 release pipeline alone (Opus tightening + maintainer revert). Fixes #1595 Signed-off-by: Sanjay Santhanam <51058514+Sanjays2402@users.noreply.github.com>	2026-05-03 23:21:19 -07:00
Michael Lam	22187d2b4c	fix: resolve provider config cleanup path	2026-05-03 23:13:10 -07:00
Michael Lam	6c5bc95b3b	fix: broadcast SSE events to all tabs	2026-05-03 22:43:11 -07:00
nesquena-hermes	3369a08f37	fix(updates): use merge-base for compare URL so 'What's new?' link resolves Closes #1579. api/updates.py was building the GitHub compare URL from local HEAD short SHA: repoUrl + '/compare/' + curSha + '...' + newSha where curSha = `git rev-parse --short HEAD` Whenever local HEAD diverges from upstream — unpushed work, dirty stage branches, forks, in-flight rebases, release-time merge commits whose SHA only lives in the maintainer's local history — the compare URL points at a SHA github.com has never seen and returns the standard 404 page. Reporter (@ai-ag2026) observed: https://github.com/nesquena/hermes-webui/compare/c660c7f...86cb22e → 404 because c660c7f was an unpushed local commit. The right base is `git merge-base HEAD <compare_ref>` — the most recent commit local and upstream share. Since `git fetch` succeeded just before, the merge-base is guaranteed to exist on the upstream GitHub repo. Behavior matrix: Pure-behind clone (no local commits): merge-base == HEAD; URL unchanged. Behind + local-only commits (#1579): merge-base != HEAD; URL points at public ancestor instead of local HEAD. merge-base failure (shallow clone): current_sha=None; JS link guard suppresses link rather than emitting a known-broken URL. Also hardens static/ui.js: reset the link's href and display:none on every banner render, so a stale link from a prior render can't survive a re-render where the new payload has current_sha=null. Tests: - test_current_sha_is_merge_base_not_local_HEAD — reporter's scenario - test_current_sha_equals_HEAD_when_no_local_commits — backward compat - test_current_sha_falls_back_to_None_when_merge_base_fails — defensive - test_whats_new_link_resets_display_and_href_on_every_render - test_whats_new_link_suppressed_when_curSha_falsy - test_reporter_url_shape_no_longer_produces_invalid_compare_url 4094 → 4100 passing. 0 regressions.	2026-05-04 05:26:19 +00:00
Hermes Bot	d15b0a2929	Stage 290: PR #1592 — turn duration display 'Done in 1m 12s' by @Michaelyklam	2026-05-04 04:51:43 +00:00
Hermes Bot	38a9878821	Stage 290: PR #1591 — first-turn sidebar visibility (optimistic upsert) by @Michaelyklam	2026-05-04 04:51:43 +00:00
Michael Lam	0eddb0580e	fix: document turn duration fallback	2026-05-03 21:12:07 -07:00
Michael Lam	f3fa106cd7	feat: show agent turn duration	2026-05-03 20:20:17 -07:00
Michael Lam	9ed0639319	fix: show first-turn chats in sidebar immediately	2026-05-03 20:10:05 -07:00
Michael Lam	c93c7efd20	docs: explain relative login script path	2026-05-03 19:44:02 -07:00
Michael Lam	f0e6a9b788	fix: keep login assets out of service worker cache	2026-05-03 18:18:27 -07:00
Hermes Bot	c07999f0ce	Stage 288: PR #1572 — collapse duplicate provider groups (closes #1568 ) by @nesquena-hermes — APPROVED	2026-05-03 22:37:43 +00:00
Hermes Bot	421f40c2cf	Stage 288: PR #1571 — cron profile isolation (closes #1573 ) by @kowenhaoai — APPROVED + reviewer fix + post-review tightening	2026-05-03 22:37:43 +00:00
nesquena-hermes	df03055def	Address review feedback: tighten profile-resolution error handling Three small follow-ups from the review: 1. Remove the over-broad except Exception around get_active_hermes_home() in _handle_cron_run. The function is in-memory dict reads + one Path.is_dir() stat — if it raises from inside a request handler, api.profiles is in a state we shouldn't be making cron decisions in. A silent fallback to _profile_home=None re-introduces the exact bug #1573 fixes (worker thread runs unpinned against process-global HERMES_HOME). Better to 500 the request than risk silent cross- profile state corruption. 2. Add a thread-safety note on os.environ mutation in api/profiles.py explaining why _cron_env_lock is sufficient — CPython env-var assignment is GIL-protected at the bytecode level but the multi-step read-modify-write pattern (snapshot prev → assign new → restore on exit) is not atomic without explicit serialization. The lock makes the entire context-manager body run-to-completion serially, including any subprocess.Popen() calls inside run_job() that inherit the env. 3. New regression test (test_cron_run_does_not_silently_swallow_profile_resolution_errors) pinning the no-silent-fallback contract via source-level assertion. Catches future re-introduction of the over-broad except clause. Co-authored-by: kowenhaoai <kowenhaoai@users.noreply.github.com>	2026-05-03 22:29:57 +00:00
nesquena-hermes	458cf38ac9	fix(picker): collapse duplicate provider groups + guard provider-id-as-model.default (closes #1568 ) Reporter (Deor, Discord #report-bugs, May 03 2026 14:19 PT, relayed by @AvidFuturist) saw the Settings → Default Model dropdown rendering the OpenCode Go provider as TWO separate optgroups: "OpenCode Go" (the canonical one with all 14 catalog models) and "Opencode_Go" (a phantom group containing one self-referential entry). Three structural causes, all in api/config.py:_build_available_models_uncached: 1. Detection-path id leakage. The detection block at line ~1980 reads cfg["providers"] keys verbatim. If the user's config has ``providers.opencode_go.api_key`` (underscore variant) AND another path adds the canonical ``opencode-go`` (e.g. via active_provider), both end up in detected_providers and the build loop creates two distinct provider groups with the second labelled via the ``pid.title()`` fallback as ``"Opencode_Go"``. 2. Injection-block rogue model. The default-model injection block at line ~2598 puts ANY ``model.default`` string into the picker as a fake option. A stray ``model.default: opencode_go`` (provider id mistakenly used as a model id) surfaces as a phantom model labelled ``"Opencode GO"``. 3. Empty-group bleed. When a non-canonical provider id makes it into detected_providers but has no entry in _PROVIDER_MODELS, the build loop creates an optgroup with zero models — pure UI noise. This PR addresses all three: - New `_canonicalise_provider_id()` helper that folds underscores to hyphens, lowercases, and applies alias resolution only when the alias target is itself a canonical id in `_PROVIDER_DISPLAY`. The last constraint avoids round-tripping ``x-ai`` (canonical) through the alias table to ``xai`` (which the WebUI doesn't index by). - Detection-path canonicalisation. The cfg["providers"] scan applies the helper before adding to detected_providers. Same treatment in the only_show_configured intersection so that mode doesn't accidentally exclude the canonical id when configured_providers only contains the underscore-variant key. - Post-collection dedup pass that re-canonicalises every entry in detected_providers — belt-and-braces against future regressions in any of the ~25 ``detected_providers.add(...)`` callsites without auditing each one. Idempotent for already-canonical ids. - Provider-id guard on the model.default injection block. When the injected value matches a known provider display name or alias (after underscore/case normalisation), skip the injection and emit a `logger.warning` instead. Real unknown model ids (newly released models, custom endpoints) still get injected — only provider-shaped values are rejected. - Empty-group filter at end of build. Drop optgroups with zero models. Custom: groups (`provider_id` starts with `custom:`) are exempt — users may want an empty card visible as a reminder. Tests ----- `tests/test_issue1568_duplicate_provider_groups.py` (17 tests): - TestCanonicaliseProviderId (8): unit tests pinning helper behaviour — canonical preserved, underscore folded, case folded, aliases resolved, x-ai not round-tripped, empty input, unknown ids normalised, idempotence - TestProviderGroupDedup (4): end-to-end picker behaviour — underscored providers-key produces ONE group not two (Deor's case), uppercase providers-key collapsed, aliased keys (z-ai → zai) collapsed, happy path unchanged - TestDefaultModelProviderIdGuard (3): provider id as model.default doesn't inject phantom + WARNING logged; alias as model.default also caught; legitimate unknown model IDs (forward-compat) still injected - TestEmptyGroupFilter (2): empty optgroups dropped from picker; custom: providers exempted from filter Plus one structural test fix in `tests/test_issue604_all_providers_model_picker.py:test_cfg_providers_only_adds_known` — widened the regex window from 500 to 1500 chars so the new documentation comment block doesn't push `_PROVIDER_MODELS` past the substring slice. Pre-existing brittle window pattern, not a new issue. Verification ------------ Live on port 8789 with Deor's exact reproduction config (`providers.opencode_go.api_key` + `model.provider: opencode-go`): /api/models groups: 1 (was 2) Browser <select> optgroups: 1 (was 2) Total options under "OpenCode Go": 14 (was 14 in real group + 0 in phantom group) Five-scenario sweep all collapse to ONE provider group: \| Config shape \| Pre-fix \| Post-fix \| \|---\|---\|---\| \| Hyphenated provider + underscored providers-key (Deor's case) \| 2 groups \| 1 group ✅ \| \| Hyphenated provider + UPPERCASE providers-key \| 2 groups \| 1 group ✅ \| \| Aliased providers-key (z-ai resolved to zai) \| 2 groups \| 1 group ✅ \| \| model.default = provider-id (orig #1568 scenario) \| 15 models with phantom \| 14 models, no phantom ✅ \| \| Happy path (canonical-only) \| 1 group \| 1 group ✅ \| 4070 pytest passed (was 4053 → 4070, +17 from this PR). 3 CI runs to follow on push. QA harness 11/11 passed. JS unaffected — pure backend fix. Reporter: Deor (Discord #report-bugs, May 03 2026 14:19 PT) Relayed by: @AvidFuturist	2026-05-03 22:04:58 +00:00
貓鷹閣 Hermes	2a8311a788	fix(cron): scheduled jobs panel respects active profile Wrap all /api/crons* endpoints in cron_profile_context so the TLS-active profile's jobs.json is read/written, not the process-default one. Before: cron.jobs._get_jobs_file() reads HERMES_HOME from os.environ (process-global) at call time, bypassing WebUI's per-request thread-local profile. Result: the Scheduled jobs panel always showed the default profile's jobs regardless of which profile the user selected via cookie, and CRUD operations silently wrote to the wrong jobs.json. Fix: - api/profiles.py: new cron_profile_context (HTTP/TLS) and cron_profile_context_for_home (worker threads) context managers. Both hold a module-level lock, swap os.environ['HERMES_HOME'], and re-patch cron.jobs module-level constants (HERMES_DIR/CRON_DIR/JOBS_FILE/ OUTPUT_DIR are import-time snapshots that don't participate in the module's lazy __getattr__ path). - api/routes.py: wrap all 12 cron endpoints (GET + POST). For /api/crons/run, capture the TLS-active home at dispatch time and pass it into the background thread so cron output lands in the right profile directory. Tests: 3 new regression tests in test_scheduled_jobs_profile_isolation.py cover TLS-based pinning, explicit-home pinning, and serialization of concurrent contexts. Full cron + profile test suite (24 tests) passes. Refs: ~/.hermes/patches/hermes-webui_scheduled-jobs-profile-isolation.patch Obsidian: Hermes_Patches/20260504_Hermes_WebUI_Scheduled_Jobs_Profile_Isolation.md	2026-05-04 06:00:17 +08:00
nesquena-hermes	a2b793be4f	fix(picker): Nous Portal featured-set cap + endpoint symmetry (closes #1567 ) Two related dropdown bugs in one PR — same root shape (model-picker endpoints disagreeing about which Nous Portal models exist) plus the preemptive UX guard against the picker becoming unusable on large-tier Nous accounts. #1567 — Endpoint disagreement ============================= Reporter (Deor, Discord, May 03 2026) saw Settings → Providers card showing "Nous Portal — 396 models · OAuth" while the in-conversation picker dropdown listed only the four hardcoded curated entries. Two structural causes: 1. ``api/providers.py:get_providers`` iterates ALL OAuth providers regardless of authentication state and unconditionally live-fetches the catalog. 2. ``api/config.py:_build_available_models_uncached`` only iterates providers in ``detected_providers``, gated on ``hermes_cli.models.list_available_providers().authenticated``. That flag can disagree with ``get_auth_status(<id>).logged_in`` on some hermes_cli versions. When the disagreement happens for Nous, the picker silently falls through to the curated 4-entry static list while the providers card keeps showing the live catalog — exactly the asymmetry users report. Plus: the Nous live-fetch branch in `_build_available_models_uncached` fell back to the same curated 4-entry list when `provider_model_ids` returned an empty list (transient failure / OAuth refresh in flight), which doubles down on the disagreement instead of healing it. UX cap (the design concern Nathan flagged on triage) ==================================================== Even with the disagreement fixed, dumping a 397-model catalog into a flat dropdown is unusable. We trim the visible picker to a curated ~15-entry featured set when the catalog exceeds 25 models, and surface the rest under a new ``extra_models`` field so: - ``/model`` slash autocomplete (commands.js) covers the full catalog - ``_dynamicModelLabels`` (ui.js) hydrates from both lists, so a model selected from outside the featured slice still gets a proper label - The optgroup label gets ``" (15 of 397)"`` appended so the user understands the dropdown is intentionally trimmed, not broken - The providers card surfaces ``models_total`` separately so the header still reads "397 models · OAuth" - A small "+N more" disclosure pill appears at the end of the rendered pill list (only fires for non-OAuth providers — OAuth cards never render pills) with a tooltip pointing at the slash command Featured selection rules ------------------------ Deterministic; same algorithm runs in both `/api/models` and `/api/models/live` so background enrichment doesn't undo the trim: 1. Always include the user's currently-selected model (sticky — no orphan IDs in the dropdown after a refresh) 2. Always include every entry from the curated static ``_PROVIDER_MODELS["nous"]`` list whose id maps onto a live id 3. Top up to 15 by walking ``_NOUS_VENDOR_PRIORITY`` round-robin (one model per vendor each pass) so no vendor monopolises the slots Changes by file =============== api/config.py - New `_format_nous_label` neighbour: `_NOUS_FEATURED_THRESHOLD = 25`, `_NOUS_FEATURED_TARGET = 15`, `_NOUS_VENDOR_PRIORITY` tuple, `_build_nous_featured_set()` helper (~80 LOC) - `_build_available_models_uncached` Nous branch: - Apply featured-set cap with sticky-selection signal - Return `extra_models` alongside `models` for the catalog tail - Decorate optgroup label with truncation count - Drop stale-4 fallback when authenticated but live-fetch empty (omit the group entirely; truth lives in the providers card and the next cache rebuild will heal it) - Keep stale-4 fallback when hermes_cli is unavailable (test envs, package mismatches) — that's a different failure mode - Detection symmetry: explicit `get_auth_status("nous").logged_in` check after the existing `list_available_providers()` loop, so the picker matches the providers card on hermes_cli versions where the two signals disagree api/providers.py:get_providers - Apply same featured-set cap so card body doesn't render 397 pills - Add `models_total` field reporting full catalog size (used by frontend for the "N models · OAuth" header text) api/routes.py:_handle_live_models - Apply same featured-set cap for `/api/models/live` so background enrichment via `_fetchLiveModels()` doesn't undo the dropdown trim - Use sticky-selection from `cfg["model"]["model"]` matching the main endpoint's logic static/ui.js:populateModelDropdown - Hydrate `_dynamicModelLabels` from `g.extra_models` so a selection outside the visible dropdown still renders with its proper label static/commands.js:_loadSlashModelSubArgs - Iterate `group.extra_models` so `/model` autocomplete covers the full catalog (not just the trimmed featured slice) static/panels.js:_buildProviderCard - Header count uses `p.models_total` (full catalog size) instead of `p.models.length` (trimmed slice) - Render trailing "+N more" disclosure pill when `models.length < models_total` with a tooltip pointing at the slash command static/style.css - New `.provider-card-model-tag-more` rule (italic, dashed border, cursor:help, no select) — visually distinct from real model pills Tests ===== `tests/test_issue1567_nous_picker_capacity_and_symmetry.py` (20 tests): - TestBuildNousFeaturedSet (8): unit tests on the helper — small-catalog no-op, large-catalog cap to target, disjoint+complete invariants, priority-vendor round-robin guarantee, sticky selection with and without `@nous:` prefix, curated-flagship preservation, empty-catalog handling, determinism - TestApiModelsLargeCatalog (2): /api/models cap behavior end-to-end on a synthetic 397-model catalog vs a 20-model catalog - TestNousDetectionSymmetry (2): picker includes Nous when `get_auth_status` agrees but `list_available_providers` disagrees; picker omits Nous when both disagree - TestNousLiveFetchEmpty (2): authenticated + empty-fetch omits group; hermes_cli unavailable still falls back to static-4 - TestProvidersCardPickerSymmetry (1): both endpoints agree on exactly the same featured-set IDs + total catalog count - TestFrontendExtrasContract (4): static-source assertions pinning the JS contract for `extra_models`, `models_total`, and the "+N more" disclosure Verified live on port 8789 (30-model catalog): - /api/models Nous group: provider="Nous Portal (15 of 30)", 15 models, 15 extra_models - /api/models/live?provider=nous: 15 entries (matches main path) - /api/providers Nous card: models_total=30, models=15 - Browser dropdown after backfill: 15 options, 30 entries in _dynamicModelLabels - Sticky selection: Claude Opus 4.7 (the active model) in the featured slice as expected 4073 pytest passed (was 4053 → 4073, +20 from this PR). 3 CI test runs (3.11/3.12/3.13) green. QA harness 11/11 passed. Reporter: Deor (Discord #report-bugs, May 03 2026 14:15 PT) Relayed by: AvidFuturist	2026-05-03 21:44:22 +00:00
Manfred	064b2734d1	fix: block self-update restart during active streams	2026-05-03 21:13:43 +00:00
Dutch AI Agency	732c995d91	fix(#1560 ): refuse password change when HERMES_WEBUI_PASSWORD env var is set Settings password silently no-opped when HERMES_WEBUI_PASSWORD was set: the env var takes precedence in api.auth.get_password_hash(), but the UI happily POSTed _set_password and returned a green "Saved" toast while every subsequent login still required the env-var password. Same for Disable Auth (_clear_password=true). Backend (api/routes.py): - GET /api/settings now exposes password_env_var: bool so the UI knows the field is shadowed. - POST /api/settings refuses _set_password and _clear_password with HTTP 409 + a clear message naming HERMES_WEBUI_PASSWORD when the env var is set. Short-circuits BEFORE save_settings() so settings.json is not touched. Frontend (static/index.html, static/panels.js, static/i18n.js): - Added settingsPasswordEnvLock banner div in the System pane. - panels.js reads settings.password_env_var, disables the password field, swaps in a localized "locked" placeholder, reveals the banner, and hides the Disable Auth button (its POST would 409 anyway). - New i18n keys password_env_var_locked and password_env_var_locked_placeholder added to all 9 locales (en, ja, ru, es, de, zh, zh-Hant, pt, ko). Tests: - tests/test_issue1560_password_env_var_lock.py: requirement-pinning (handler exposes flag, 409 on set/clear, banner div, panels.js wiring, i18n in all 9 locales, env var name in messages, live HTTP smoke when env unset). - tests/test_1560_password_env_var_no_op.py: behavioral via FakeHandler (real status codes for env-set/unset/blank, settings.json hash unchanged after 409, panels.js disable+banner+placeholder+disable-auth-hidden). Both files run clean: 23 passed in 2.04s. test_issue1139_password_remote.py unaffected (4/4 still pass).	2026-05-03 20:59:32 +00:00
Hermes Bot	0c6c6b3bb1	fix: absorb Opus advisor doc-only SHOULD-FIX nits (1) api/session_recovery.py: removed misleading dated-format comment claim. YYYYMMDD_HHMMSS_*.json files don't start with '_' so the underscore- skip wouldn't apply to them anyway. Replaced with the truthful general statement: any future non-session JSON marked with the '_' convention is skipped automatically. (2) CHANGELOG.md: fixed self-referential typo. v0.50.284 obviously couldn't have said 'v0.50.285' inside its release notes — the quoted text was 'after deploying v0.50.284'. Pure documentation. No behavior change. Tests still pass (8/8 in tests/test_metadata_save_wipe_1558.py).	2026-05-03 20:54:02 +00:00
Hermes Bot	1a7eaf518f	fix(session-recovery): skip _index.json + harden _msg_count against non-dict JSON (v0.50.284 follow-up) v0.50.284 shipped startup self-heal in api/session_recovery.py that crashed on the very first JSON file it scanned in the production session directory. Verified live on the prod server immediately after the v0.50.284 deploy: [recovery] startup recovery failed: 'list' object has no attribute 'get' Root cause: the production session dir contains _index.json — a top-level LIST of session metadata dicts (not a dict). _msg_count() did data.get('messages') which raises AttributeError on a list. The broad except Exception in server.py's startup hook swallowed the error and the recovery silently no-op'd for every user — defeating the entire purpose of the v0.50.284 release. Fix is three small defensive changes: 1. _msg_count() — added isinstance(data, dict) guard. Non-dict-shaped JSON files now return -1 (the harmless 'unknown count' sentinel) instead of raising AttributeError. 2. recover_all_sessions_on_startup() — skips any file whose name starts with '_' (the existing project convention for non-session metadata files like _index.json). These are convention-marked as system files, not session payloads. 3. recover_all_sessions_on_startup() — wraps recover_session(path) in try/except Exception so a single malformed file can't break recovery for the rest. Logs and continues. 2 new regression tests: - test_recover_all_sessions_on_startup_skips_non_session_index_json - test_msg_count_returns_neg1_for_non_dict_top_level 4026 → 4028 tests passing (+2). Net effect: any user wiped between v0.50.279 and v0.50.284 deploys whose session has a .bak shadow will now get auto-recovered on first launch of v0.50.285, as v0.50.284's release notes promised. Closes #1558 (follow-up — the original P0 was closed by v0.50.284 but the recovery half didn't actually run in production).	2026-05-03 20:50:06 +00:00
Hermes Bot	da3932a7ef	fix(stage-284): absorb Opus advisor SHOULD-FIX items (5+6 LOC) Both flagged by pre-release Opus advisor; both clearly defensive and small enough to absorb in-release per the reviewer-flagged-fix-in-release-not-followup policy. SHOULD-FIX #1 (api/routes.py:_clear_stale_stream_state, ~25 LOC): After the metadata-only reload (#1559 Layer 2), the local 'session' variable is reassigned to the full-load object but the caller still holds the original metadata-only stub. /api/session then returns the stale active_stream_id at routes.py:1791, causing the frontend to attempt one ghost SSE reconnect before recovering. Fix: capture original_stub at function entry, then patch its in-memory active_stream_id and pending_* fields to None after both the early-return (full-load already cleared) path AND the successful-mutation path. Now the caller's read returns fresh state, no ghost reconnect. SHOULD-FIX #2 (api/models.py:Session.save, ~20 LOC): The .bak write at api/models.py:436 used write_text() which truncates- then-writes — a crash mid-write or concurrent backup-producing save could leave a torn .bak. Recovery defends correctly (JSONDecodeError → returns -1 → 'no_action'), so the failure mode was 'backup lost' not 'spurious restore'. Fix: tmp + os.replace pattern matching the main file write at line 446-453. Now backup either lands cleanly or doesn't land at all. 4026/4026 tests pass post-absorb.	2026-05-03 20:41:00 +00:00
Hermes Bot	c97c634197	Stage 284: PR #1559 — P0 hotfix metadata-only save wipe (#1558 )	2026-05-03 19:56:32 +00:00
Dutch AI Agency	45f25235a8	fix: guard stale stream cleanup with session lock	2026-05-03 21:37:38 +01:00
Hermes Bot	166f439eeb	fix: correct issue references #1557 → #1558 (nesquena review feedback) The PR title and body correctly say 'Closes #1558' but every code comment, the test file name, error-message strings, docstrings, and the original commit body referenced #1557 instead. Independent reviewer flagged this: > The 17 wrong references won't auto-close issue #1558 from the commit > message — and the test file name will be misleading for future archeology. > Worth a one-pass s/#1557/#1558/g (and rename test file → > test_metadata_save_wipe_1558.py) before merge so the artifacts agree > with reality. This commit: - Renames tests/test_metadata_save_wipe_1557.py → test_metadata_save_wipe_1558.py - Replaces 17 #1557 references with #1558 across: - tests/test_metadata_save_wipe_1558.py (7 refs) - api/models.py (5 refs in Session.save guard + backup safeguard comments) - api/routes.py (2 refs in _clear_stale_stream_state docstring + log) - api/session_recovery.py (3 refs) - server.py (3 refs in startup self-heal block) Verified: 6/6 tests in tests/test_metadata_save_wipe_1558.py pass with the renamed file + updated references.	2026-05-03 19:55:14 +00:00
nesquena-hermes	1d9a0cbba1	fix(P0 #1557 ): metadata-only Session.save() was wiping conversation history v0.50.279 introduced api.routes._clear_stale_stream_state() (#1525) which calls session.save() to clear stale active_stream_id/pending_* fields. The helper is called from /api/session and /api/session/status — both of which load the session with metadata_only=True. Session.load_metadata_only() synthesizes a stub with messages=[] (its whole purpose: fast metadata read without parsing the 400KB+ messages array). Session.save() unconditionally writes self.messages to disk via os.replace(), so saving a metadata-only stub atomically overwrites the on-disk JSON with messages=[], wiping the entire conversation. Production trigger: every SSE reconnect cycle after a server restart polls /api/session/status, which fans out to _clear_stale_stream_state, which saves the metadata-only stub. The user reported losing 1000+ message conversations and seeing 'Reconnecting…' loops on every prompt — the reconnect loop kept the cycle running until the conversation was empty. Fix: three layers, defense in depth. (1) api/models.py: load_metadata_only() now sets _loaded_metadata_only=True on the returned stub. Session.save() raises RuntimeError if that flag is set — a hard guard so any future caller making the same mistake cannot wipe data, only crash visibly. (2) api/routes.py: _clear_stale_stream_state() now detects the metadata-only flag and re-loads the full session with metadata_only=False before mutating persisted state. The full-load path also runs _repair_stale_pending() which independently clears the stream flags, so the explicit clear becomes a no-op in most cases — but messages stay intact. (3) api/models.py + api/session_recovery.py: every save() that would SHRINK the messages array (the precise failure shape of #1557) first snapshots the previous file to <sid>.json.bak. Server.py runs recover_all_sessions_on_startup() at boot — any session whose live JSON has fewer messages than its .bak is restored automatically. Idempotent on clean state. Backup overhead is zero on the normal grow-the-conversation path. Reproducer (master): test_metadata_only_save_does_not_wipe_messages goes from 1000 messages to 0 in a single save() call. After the fix, 1000 messages survive. Tests: 6 new regression tests in tests/test_metadata_save_wipe_1557.py covering all three layers. Full pytest: 4019 → 4025 (+6, all green). Live verified on port 8789: write 1000-msg session with stale active_stream_id, hit /api/session/status, /api/session — file ends with 1002 messages (_repair_stale_pending injects an error-marker pair on full reload, harmless existing behavior), active_stream_id cleared, pending cleared, no Reconnecting loop. Closes #1557. Reported by AvidFuturist via user feedback on v0.50.282.	2026-05-03 19:45:10 +00:00

1 2 3 4 5 ...

483 Commits