hermes-webui

mirror of https://github.com/nesquena/hermes-webui.git synced 2026-05-25 19:20:16 +00:00

Author	SHA1	Message	Date
Michael Lam	0eddb0580e	fix: document turn duration fallback	2026-05-03 21:12:07 -07:00
Michael Lam	f3fa106cd7	feat: show agent turn duration	2026-05-03 20:20:17 -07:00
Hermes Bot	c07999f0ce	Stage 288: PR #1572 — collapse duplicate provider groups (closes #1568 ) by @nesquena-hermes — APPROVED	2026-05-03 22:37:43 +00:00
Hermes Bot	421f40c2cf	Stage 288: PR #1571 — cron profile isolation (closes #1573 ) by @kowenhaoai — APPROVED + reviewer fix + post-review tightening	2026-05-03 22:37:43 +00:00
nesquena-hermes	df03055def	Address review feedback: tighten profile-resolution error handling Three small follow-ups from the review: 1. Remove the over-broad except Exception around get_active_hermes_home() in _handle_cron_run. The function is in-memory dict reads + one Path.is_dir() stat — if it raises from inside a request handler, api.profiles is in a state we shouldn't be making cron decisions in. A silent fallback to _profile_home=None re-introduces the exact bug #1573 fixes (worker thread runs unpinned against process-global HERMES_HOME). Better to 500 the request than risk silent cross- profile state corruption. 2. Add a thread-safety note on os.environ mutation in api/profiles.py explaining why _cron_env_lock is sufficient — CPython env-var assignment is GIL-protected at the bytecode level but the multi-step read-modify-write pattern (snapshot prev → assign new → restore on exit) is not atomic without explicit serialization. The lock makes the entire context-manager body run-to-completion serially, including any subprocess.Popen() calls inside run_job() that inherit the env. 3. New regression test (test_cron_run_does_not_silently_swallow_profile_resolution_errors) pinning the no-silent-fallback contract via source-level assertion. Catches future re-introduction of the over-broad except clause. Co-authored-by: kowenhaoai <kowenhaoai@users.noreply.github.com>	2026-05-03 22:29:57 +00:00
nesquena-hermes	458cf38ac9	fix(picker): collapse duplicate provider groups + guard provider-id-as-model.default (closes #1568 ) Reporter (Deor, Discord #report-bugs, May 03 2026 14:19 PT, relayed by @AvidFuturist) saw the Settings → Default Model dropdown rendering the OpenCode Go provider as TWO separate optgroups: "OpenCode Go" (the canonical one with all 14 catalog models) and "Opencode_Go" (a phantom group containing one self-referential entry). Three structural causes, all in api/config.py:_build_available_models_uncached: 1. Detection-path id leakage. The detection block at line ~1980 reads cfg["providers"] keys verbatim. If the user's config has ``providers.opencode_go.api_key`` (underscore variant) AND another path adds the canonical ``opencode-go`` (e.g. via active_provider), both end up in detected_providers and the build loop creates two distinct provider groups with the second labelled via the ``pid.title()`` fallback as ``"Opencode_Go"``. 2. Injection-block rogue model. The default-model injection block at line ~2598 puts ANY ``model.default`` string into the picker as a fake option. A stray ``model.default: opencode_go`` (provider id mistakenly used as a model id) surfaces as a phantom model labelled ``"Opencode GO"``. 3. Empty-group bleed. When a non-canonical provider id makes it into detected_providers but has no entry in _PROVIDER_MODELS, the build loop creates an optgroup with zero models — pure UI noise. This PR addresses all three: - New `_canonicalise_provider_id()` helper that folds underscores to hyphens, lowercases, and applies alias resolution only when the alias target is itself a canonical id in `_PROVIDER_DISPLAY`. The last constraint avoids round-tripping ``x-ai`` (canonical) through the alias table to ``xai`` (which the WebUI doesn't index by). - Detection-path canonicalisation. The cfg["providers"] scan applies the helper before adding to detected_providers. Same treatment in the only_show_configured intersection so that mode doesn't accidentally exclude the canonical id when configured_providers only contains the underscore-variant key. - Post-collection dedup pass that re-canonicalises every entry in detected_providers — belt-and-braces against future regressions in any of the ~25 ``detected_providers.add(...)`` callsites without auditing each one. Idempotent for already-canonical ids. - Provider-id guard on the model.default injection block. When the injected value matches a known provider display name or alias (after underscore/case normalisation), skip the injection and emit a `logger.warning` instead. Real unknown model ids (newly released models, custom endpoints) still get injected — only provider-shaped values are rejected. - Empty-group filter at end of build. Drop optgroups with zero models. Custom: groups (`provider_id` starts with `custom:`) are exempt — users may want an empty card visible as a reminder. Tests ----- `tests/test_issue1568_duplicate_provider_groups.py` (17 tests): - TestCanonicaliseProviderId (8): unit tests pinning helper behaviour — canonical preserved, underscore folded, case folded, aliases resolved, x-ai not round-tripped, empty input, unknown ids normalised, idempotence - TestProviderGroupDedup (4): end-to-end picker behaviour — underscored providers-key produces ONE group not two (Deor's case), uppercase providers-key collapsed, aliased keys (z-ai → zai) collapsed, happy path unchanged - TestDefaultModelProviderIdGuard (3): provider id as model.default doesn't inject phantom + WARNING logged; alias as model.default also caught; legitimate unknown model IDs (forward-compat) still injected - TestEmptyGroupFilter (2): empty optgroups dropped from picker; custom: providers exempted from filter Plus one structural test fix in `tests/test_issue604_all_providers_model_picker.py:test_cfg_providers_only_adds_known` — widened the regex window from 500 to 1500 chars so the new documentation comment block doesn't push `_PROVIDER_MODELS` past the substring slice. Pre-existing brittle window pattern, not a new issue. Verification ------------ Live on port 8789 with Deor's exact reproduction config (`providers.opencode_go.api_key` + `model.provider: opencode-go`): /api/models groups: 1 (was 2) Browser <select> optgroups: 1 (was 2) Total options under "OpenCode Go": 14 (was 14 in real group + 0 in phantom group) Five-scenario sweep all collapse to ONE provider group: \| Config shape \| Pre-fix \| Post-fix \| \|---\|---\|---\| \| Hyphenated provider + underscored providers-key (Deor's case) \| 2 groups \| 1 group ✅ \| \| Hyphenated provider + UPPERCASE providers-key \| 2 groups \| 1 group ✅ \| \| Aliased providers-key (z-ai resolved to zai) \| 2 groups \| 1 group ✅ \| \| model.default = provider-id (orig #1568 scenario) \| 15 models with phantom \| 14 models, no phantom ✅ \| \| Happy path (canonical-only) \| 1 group \| 1 group ✅ \| 4070 pytest passed (was 4053 → 4070, +17 from this PR). 3 CI runs to follow on push. QA harness 11/11 passed. JS unaffected — pure backend fix. Reporter: Deor (Discord #report-bugs, May 03 2026 14:19 PT) Relayed by: @AvidFuturist	2026-05-03 22:04:58 +00:00
貓鷹閣 Hermes	2a8311a788	fix(cron): scheduled jobs panel respects active profile Wrap all /api/crons* endpoints in cron_profile_context so the TLS-active profile's jobs.json is read/written, not the process-default one. Before: cron.jobs._get_jobs_file() reads HERMES_HOME from os.environ (process-global) at call time, bypassing WebUI's per-request thread-local profile. Result: the Scheduled jobs panel always showed the default profile's jobs regardless of which profile the user selected via cookie, and CRUD operations silently wrote to the wrong jobs.json. Fix: - api/profiles.py: new cron_profile_context (HTTP/TLS) and cron_profile_context_for_home (worker threads) context managers. Both hold a module-level lock, swap os.environ['HERMES_HOME'], and re-patch cron.jobs module-level constants (HERMES_DIR/CRON_DIR/JOBS_FILE/ OUTPUT_DIR are import-time snapshots that don't participate in the module's lazy __getattr__ path). - api/routes.py: wrap all 12 cron endpoints (GET + POST). For /api/crons/run, capture the TLS-active home at dispatch time and pass it into the background thread so cron output lands in the right profile directory. Tests: 3 new regression tests in test_scheduled_jobs_profile_isolation.py cover TLS-based pinning, explicit-home pinning, and serialization of concurrent contexts. Full cron + profile test suite (24 tests) passes. Refs: ~/.hermes/patches/hermes-webui_scheduled-jobs-profile-isolation.patch Obsidian: Hermes_Patches/20260504_Hermes_WebUI_Scheduled_Jobs_Profile_Isolation.md	2026-05-04 06:00:17 +08:00
nesquena-hermes	a2b793be4f	fix(picker): Nous Portal featured-set cap + endpoint symmetry (closes #1567 ) Two related dropdown bugs in one PR — same root shape (model-picker endpoints disagreeing about which Nous Portal models exist) plus the preemptive UX guard against the picker becoming unusable on large-tier Nous accounts. #1567 — Endpoint disagreement ============================= Reporter (Deor, Discord, May 03 2026) saw Settings → Providers card showing "Nous Portal — 396 models · OAuth" while the in-conversation picker dropdown listed only the four hardcoded curated entries. Two structural causes: 1. ``api/providers.py:get_providers`` iterates ALL OAuth providers regardless of authentication state and unconditionally live-fetches the catalog. 2. ``api/config.py:_build_available_models_uncached`` only iterates providers in ``detected_providers``, gated on ``hermes_cli.models.list_available_providers().authenticated``. That flag can disagree with ``get_auth_status(<id>).logged_in`` on some hermes_cli versions. When the disagreement happens for Nous, the picker silently falls through to the curated 4-entry static list while the providers card keeps showing the live catalog — exactly the asymmetry users report. Plus: the Nous live-fetch branch in `_build_available_models_uncached` fell back to the same curated 4-entry list when `provider_model_ids` returned an empty list (transient failure / OAuth refresh in flight), which doubles down on the disagreement instead of healing it. UX cap (the design concern Nathan flagged on triage) ==================================================== Even with the disagreement fixed, dumping a 397-model catalog into a flat dropdown is unusable. We trim the visible picker to a curated ~15-entry featured set when the catalog exceeds 25 models, and surface the rest under a new ``extra_models`` field so: - ``/model`` slash autocomplete (commands.js) covers the full catalog - ``_dynamicModelLabels`` (ui.js) hydrates from both lists, so a model selected from outside the featured slice still gets a proper label - The optgroup label gets ``" (15 of 397)"`` appended so the user understands the dropdown is intentionally trimmed, not broken - The providers card surfaces ``models_total`` separately so the header still reads "397 models · OAuth" - A small "+N more" disclosure pill appears at the end of the rendered pill list (only fires for non-OAuth providers — OAuth cards never render pills) with a tooltip pointing at the slash command Featured selection rules ------------------------ Deterministic; same algorithm runs in both `/api/models` and `/api/models/live` so background enrichment doesn't undo the trim: 1. Always include the user's currently-selected model (sticky — no orphan IDs in the dropdown after a refresh) 2. Always include every entry from the curated static ``_PROVIDER_MODELS["nous"]`` list whose id maps onto a live id 3. Top up to 15 by walking ``_NOUS_VENDOR_PRIORITY`` round-robin (one model per vendor each pass) so no vendor monopolises the slots Changes by file =============== api/config.py - New `_format_nous_label` neighbour: `_NOUS_FEATURED_THRESHOLD = 25`, `_NOUS_FEATURED_TARGET = 15`, `_NOUS_VENDOR_PRIORITY` tuple, `_build_nous_featured_set()` helper (~80 LOC) - `_build_available_models_uncached` Nous branch: - Apply featured-set cap with sticky-selection signal - Return `extra_models` alongside `models` for the catalog tail - Decorate optgroup label with truncation count - Drop stale-4 fallback when authenticated but live-fetch empty (omit the group entirely; truth lives in the providers card and the next cache rebuild will heal it) - Keep stale-4 fallback when hermes_cli is unavailable (test envs, package mismatches) — that's a different failure mode - Detection symmetry: explicit `get_auth_status("nous").logged_in` check after the existing `list_available_providers()` loop, so the picker matches the providers card on hermes_cli versions where the two signals disagree api/providers.py:get_providers - Apply same featured-set cap so card body doesn't render 397 pills - Add `models_total` field reporting full catalog size (used by frontend for the "N models · OAuth" header text) api/routes.py:_handle_live_models - Apply same featured-set cap for `/api/models/live` so background enrichment via `_fetchLiveModels()` doesn't undo the dropdown trim - Use sticky-selection from `cfg["model"]["model"]` matching the main endpoint's logic static/ui.js:populateModelDropdown - Hydrate `_dynamicModelLabels` from `g.extra_models` so a selection outside the visible dropdown still renders with its proper label static/commands.js:_loadSlashModelSubArgs - Iterate `group.extra_models` so `/model` autocomplete covers the full catalog (not just the trimmed featured slice) static/panels.js:_buildProviderCard - Header count uses `p.models_total` (full catalog size) instead of `p.models.length` (trimmed slice) - Render trailing "+N more" disclosure pill when `models.length < models_total` with a tooltip pointing at the slash command static/style.css - New `.provider-card-model-tag-more` rule (italic, dashed border, cursor:help, no select) — visually distinct from real model pills Tests ===== `tests/test_issue1567_nous_picker_capacity_and_symmetry.py` (20 tests): - TestBuildNousFeaturedSet (8): unit tests on the helper — small-catalog no-op, large-catalog cap to target, disjoint+complete invariants, priority-vendor round-robin guarantee, sticky selection with and without `@nous:` prefix, curated-flagship preservation, empty-catalog handling, determinism - TestApiModelsLargeCatalog (2): /api/models cap behavior end-to-end on a synthetic 397-model catalog vs a 20-model catalog - TestNousDetectionSymmetry (2): picker includes Nous when `get_auth_status` agrees but `list_available_providers` disagrees; picker omits Nous when both disagree - TestNousLiveFetchEmpty (2): authenticated + empty-fetch omits group; hermes_cli unavailable still falls back to static-4 - TestProvidersCardPickerSymmetry (1): both endpoints agree on exactly the same featured-set IDs + total catalog count - TestFrontendExtrasContract (4): static-source assertions pinning the JS contract for `extra_models`, `models_total`, and the "+N more" disclosure Verified live on port 8789 (30-model catalog): - /api/models Nous group: provider="Nous Portal (15 of 30)", 15 models, 15 extra_models - /api/models/live?provider=nous: 15 entries (matches main path) - /api/providers Nous card: models_total=30, models=15 - Browser dropdown after backfill: 15 options, 30 entries in _dynamicModelLabels - Sticky selection: Claude Opus 4.7 (the active model) in the featured slice as expected 4073 pytest passed (was 4053 → 4073, +20 from this PR). 3 CI test runs (3.11/3.12/3.13) green. QA harness 11/11 passed. Reporter: Deor (Discord #report-bugs, May 03 2026 14:15 PT) Relayed by: AvidFuturist	2026-05-03 21:44:22 +00:00
Manfred	064b2734d1	fix: block self-update restart during active streams	2026-05-03 21:13:43 +00:00
Dutch AI Agency	732c995d91	fix(#1560 ): refuse password change when HERMES_WEBUI_PASSWORD env var is set Settings password silently no-opped when HERMES_WEBUI_PASSWORD was set: the env var takes precedence in api.auth.get_password_hash(), but the UI happily POSTed _set_password and returned a green "Saved" toast while every subsequent login still required the env-var password. Same for Disable Auth (_clear_password=true). Backend (api/routes.py): - GET /api/settings now exposes password_env_var: bool so the UI knows the field is shadowed. - POST /api/settings refuses _set_password and _clear_password with HTTP 409 + a clear message naming HERMES_WEBUI_PASSWORD when the env var is set. Short-circuits BEFORE save_settings() so settings.json is not touched. Frontend (static/index.html, static/panels.js, static/i18n.js): - Added settingsPasswordEnvLock banner div in the System pane. - panels.js reads settings.password_env_var, disables the password field, swaps in a localized "locked" placeholder, reveals the banner, and hides the Disable Auth button (its POST would 409 anyway). - New i18n keys password_env_var_locked and password_env_var_locked_placeholder added to all 9 locales (en, ja, ru, es, de, zh, zh-Hant, pt, ko). Tests: - tests/test_issue1560_password_env_var_lock.py: requirement-pinning (handler exposes flag, 409 on set/clear, banner div, panels.js wiring, i18n in all 9 locales, env var name in messages, live HTTP smoke when env unset). - tests/test_1560_password_env_var_no_op.py: behavioral via FakeHandler (real status codes for env-set/unset/blank, settings.json hash unchanged after 409, panels.js disable+banner+placeholder+disable-auth-hidden). Both files run clean: 23 passed in 2.04s. test_issue1139_password_remote.py unaffected (4/4 still pass).	2026-05-03 20:59:32 +00:00
Hermes Bot	0c6c6b3bb1	fix: absorb Opus advisor doc-only SHOULD-FIX nits (1) api/session_recovery.py: removed misleading dated-format comment claim. YYYYMMDD_HHMMSS_*.json files don't start with '_' so the underscore- skip wouldn't apply to them anyway. Replaced with the truthful general statement: any future non-session JSON marked with the '_' convention is skipped automatically. (2) CHANGELOG.md: fixed self-referential typo. v0.50.284 obviously couldn't have said 'v0.50.285' inside its release notes — the quoted text was 'after deploying v0.50.284'. Pure documentation. No behavior change. Tests still pass (8/8 in tests/test_metadata_save_wipe_1558.py).	2026-05-03 20:54:02 +00:00
Hermes Bot	1a7eaf518f	fix(session-recovery): skip _index.json + harden _msg_count against non-dict JSON (v0.50.284 follow-up) v0.50.284 shipped startup self-heal in api/session_recovery.py that crashed on the very first JSON file it scanned in the production session directory. Verified live on the prod server immediately after the v0.50.284 deploy: [recovery] startup recovery failed: 'list' object has no attribute 'get' Root cause: the production session dir contains _index.json — a top-level LIST of session metadata dicts (not a dict). _msg_count() did data.get('messages') which raises AttributeError on a list. The broad except Exception in server.py's startup hook swallowed the error and the recovery silently no-op'd for every user — defeating the entire purpose of the v0.50.284 release. Fix is three small defensive changes: 1. _msg_count() — added isinstance(data, dict) guard. Non-dict-shaped JSON files now return -1 (the harmless 'unknown count' sentinel) instead of raising AttributeError. 2. recover_all_sessions_on_startup() — skips any file whose name starts with '_' (the existing project convention for non-session metadata files like _index.json). These are convention-marked as system files, not session payloads. 3. recover_all_sessions_on_startup() — wraps recover_session(path) in try/except Exception so a single malformed file can't break recovery for the rest. Logs and continues. 2 new regression tests: - test_recover_all_sessions_on_startup_skips_non_session_index_json - test_msg_count_returns_neg1_for_non_dict_top_level 4026 → 4028 tests passing (+2). Net effect: any user wiped between v0.50.279 and v0.50.284 deploys whose session has a .bak shadow will now get auto-recovered on first launch of v0.50.285, as v0.50.284's release notes promised. Closes #1558 (follow-up — the original P0 was closed by v0.50.284 but the recovery half didn't actually run in production).	2026-05-03 20:50:06 +00:00
Hermes Bot	da3932a7ef	fix(stage-284): absorb Opus advisor SHOULD-FIX items (5+6 LOC) Both flagged by pre-release Opus advisor; both clearly defensive and small enough to absorb in-release per the reviewer-flagged-fix-in-release-not-followup policy. SHOULD-FIX #1 (api/routes.py:_clear_stale_stream_state, ~25 LOC): After the metadata-only reload (#1559 Layer 2), the local 'session' variable is reassigned to the full-load object but the caller still holds the original metadata-only stub. /api/session then returns the stale active_stream_id at routes.py:1791, causing the frontend to attempt one ghost SSE reconnect before recovering. Fix: capture original_stub at function entry, then patch its in-memory active_stream_id and pending_* fields to None after both the early-return (full-load already cleared) path AND the successful-mutation path. Now the caller's read returns fresh state, no ghost reconnect. SHOULD-FIX #2 (api/models.py:Session.save, ~20 LOC): The .bak write at api/models.py:436 used write_text() which truncates- then-writes — a crash mid-write or concurrent backup-producing save could leave a torn .bak. Recovery defends correctly (JSONDecodeError → returns -1 → 'no_action'), so the failure mode was 'backup lost' not 'spurious restore'. Fix: tmp + os.replace pattern matching the main file write at line 446-453. Now backup either lands cleanly or doesn't land at all. 4026/4026 tests pass post-absorb.	2026-05-03 20:41:00 +00:00
Hermes Bot	c97c634197	Stage 284: PR #1559 — P0 hotfix metadata-only save wipe (#1558 )	2026-05-03 19:56:32 +00:00
Dutch AI Agency	45f25235a8	fix: guard stale stream cleanup with session lock	2026-05-03 21:37:38 +01:00
Hermes Bot	166f439eeb	fix: correct issue references #1557 → #1558 (nesquena review feedback) The PR title and body correctly say 'Closes #1558' but every code comment, the test file name, error-message strings, docstrings, and the original commit body referenced #1557 instead. Independent reviewer flagged this: > The 17 wrong references won't auto-close issue #1558 from the commit > message — and the test file name will be misleading for future archeology. > Worth a one-pass s/#1557/#1558/g (and rename test file → > test_metadata_save_wipe_1558.py) before merge so the artifacts agree > with reality. This commit: - Renames tests/test_metadata_save_wipe_1557.py → test_metadata_save_wipe_1558.py - Replaces 17 #1557 references with #1558 across: - tests/test_metadata_save_wipe_1558.py (7 refs) - api/models.py (5 refs in Session.save guard + backup safeguard comments) - api/routes.py (2 refs in _clear_stale_stream_state docstring + log) - api/session_recovery.py (3 refs) - server.py (3 refs in startup self-heal block) Verified: 6/6 tests in tests/test_metadata_save_wipe_1558.py pass with the renamed file + updated references.	2026-05-03 19:55:14 +00:00
nesquena-hermes	1d9a0cbba1	fix(P0 #1557 ): metadata-only Session.save() was wiping conversation history v0.50.279 introduced api.routes._clear_stale_stream_state() (#1525) which calls session.save() to clear stale active_stream_id/pending_* fields. The helper is called from /api/session and /api/session/status — both of which load the session with metadata_only=True. Session.load_metadata_only() synthesizes a stub with messages=[] (its whole purpose: fast metadata read without parsing the 400KB+ messages array). Session.save() unconditionally writes self.messages to disk via os.replace(), so saving a metadata-only stub atomically overwrites the on-disk JSON with messages=[], wiping the entire conversation. Production trigger: every SSE reconnect cycle after a server restart polls /api/session/status, which fans out to _clear_stale_stream_state, which saves the metadata-only stub. The user reported losing 1000+ message conversations and seeing 'Reconnecting…' loops on every prompt — the reconnect loop kept the cycle running until the conversation was empty. Fix: three layers, defense in depth. (1) api/models.py: load_metadata_only() now sets _loaded_metadata_only=True on the returned stub. Session.save() raises RuntimeError if that flag is set — a hard guard so any future caller making the same mistake cannot wipe data, only crash visibly. (2) api/routes.py: _clear_stale_stream_state() now detects the metadata-only flag and re-loads the full session with metadata_only=False before mutating persisted state. The full-load path also runs _repair_stale_pending() which independently clears the stream flags, so the explicit clear becomes a no-op in most cases — but messages stay intact. (3) api/models.py + api/session_recovery.py: every save() that would SHRINK the messages array (the precise failure shape of #1557) first snapshots the previous file to <sid>.json.bak. Server.py runs recover_all_sessions_on_startup() at boot — any session whose live JSON has fewer messages than its .bak is restored automatically. Idempotent on clean state. Backup overhead is zero on the normal grow-the-conversation path. Reproducer (master): test_metadata_only_save_does_not_wipe_messages goes from 1000 messages to 0 in a single save() call. After the fix, 1000 messages survive. Tests: 6 new regression tests in tests/test_metadata_save_wipe_1557.py covering all three layers. Full pytest: 4019 → 4025 (+6, all green). Live verified on port 8789: write 1000-msg session with stale active_stream_id, hit /api/session/status, /api/session — file ends with 1002 messages (_repair_stale_pending injects an error-marker pair on full reload, harmless existing behavior), active_stream_id cleared, pending cleared, no Reconnecting loop. Closes #1557. Reported by AvidFuturist via user feedback on v0.50.282.	2026-05-03 19:45:10 +00:00
Hermes Bot	c73a5eb384	Stage 283: PR #1553 — silent credential self-heal on 401 (#1401 ) by @bergeouss	2026-05-03 19:19:02 +00:00
Hermes Bot	e4e53f9ef4	Stage 283: PR #1552 — Gateway status card in Settings (#1457 ) by @bergeouss	2026-05-03 19:19:02 +00:00
Hermes Bot	fd6e409021	Stage 283: PR #1551 — Reveal in File Manager workspace context menu (#1424 ) by @bergeouss	2026-05-03 19:19:02 +00:00
Hermes Bot	cee61fb1d9	Stage 283: PR #1550 — auto-assign session to filtered project (#1468 ) by @bergeouss	2026-05-03 19:19:02 +00:00
Hermes Bot	4daa09da7f	Stage 283: PR #1549 — What's new? link in update banner (#1512 ) by @bergeouss	2026-05-03 19:19:02 +00:00
Hermes Bot	16c53e5bcf	Stage 283: PR #1548 (augmented) — OpenRouter free-tier live fetch (#1426 ) by @bergeouss	2026-05-03 19:19:02 +00:00
Hermes Bot	9a7728f06b	Stage 283: PR #1543 — recover pending turn after stale stream restart by @ai-ag2026 (follow-up to #1471 )	2026-05-03 19:19:01 +00:00
Hermes Bot	babca37ea6	Stage 283: PR #1545 — remove phantom /sw.js from PUBLIC_PATHS (#1481 ) by @bergeouss	2026-05-03 19:19:01 +00:00
Hermes Bot	0750da5b37	fix(models): structural OpenRouter free-tier visibility — live fetch + augment fallback (#1426 ) Augments @bergeouss's PR #1548 v2 with the structural fix the issue actually requested. The original PR added 5 hardcoded entries to _FALLBACK_MODELS which would rot fast as OpenRouter's free-tier roster turns over monthly. Adds proper live-fetch logic to the OpenRouter group population so the free-tier list stays fresh without requiring a code release every time a new free model lands. api/config.py:2120 — replaces the static _FALLBACK_MODELS slice with: 1. Live curated catalog via hermes_cli.models.fetch_openrouter_models() — applies the tool-support filter (Kilo-Org/kilocode#9068). 2. Free-tier live fetch — direct call to https://openrouter.ai/api/v1/models, filtered to free-tier-only (pricing.prompt == 0 AND pricing.completion == 0, OR :free suffix), bypasses the tool-support filter so newly-added free variants appear even before OpenRouter annotates them with tools. Capped at 30 entries to keep the picker usable. 3. Defense-in-depth fallback to _FALLBACK_MODELS (which retains @bergeouss's hardcoded list for offline / test envs). 4. Deduplication via seen_ids — model in both surfaces appears once. 5 new tests + 1 fixed test in tests/test_minimax_provider.py (scoped the provider='MiniMax' assertion to direct-MiniMax routes by filtering for 'minimax/' prefix and excluding ':free' since the OpenRouter free-tier variant minimax/minimax-m2.5:free correctly carries provider='OpenRouter'). Co-authored-by: bergeouss <[email protected]>	2026-05-03 19:18:44 +00:00
bergeouss	1c5bce92cb	feat: add gateway status card to Settings → System (#1457 )	2026-05-03 19:02:17 +00:00
bergeouss	a085b71511	feat: add Reveal in File Manager to workspace file context menu (#1424 )	2026-05-03 19:02:16 +00:00
bergeouss	0fbaafa110	feat: auto-assign project when filtering by project on new session (#1468 )	2026-05-03 19:02:15 +00:00
bergeouss	c94f9c70ce	feat: add 'What's new?' link to update banner (#1512 )	2026-05-03 19:02:14 +00:00
bergeouss	f60db40133	fix: include OpenRouter free-tier models in fallback list (#1426 )	2026-05-03 19:02:13 +00:00
bergeouss	8fe593fa38	feat: silent credential self-heal on 401 errors (#1401 )	2026-05-03 18:32:53 +00:00
bergeouss	237010f8bd	fix: remove phantom /sw.js from PUBLIC_PATHS whitelist (#1481 )	2026-05-03 18:18:14 +00:00
nesquena-hermes	c21e3086a2	docs: align _format_nous_label docstring examples with actual output Per review observation on PR #1544: the docstring claimed 'Gemini 3.1 Pro Preview' and 'Nemotron 3 Super 120B A12B' but the helper reuses _format_ollama_label's 3-letter-token rule, which uppercases 'PRO' (and the existing rule for tokens like 'a12b' renders 'A12b' not 'A12B'). Update the examples to match actual behavior — labels are unchanged, only the docstring. Pure-comment change, no behavioral effect. Test counts unchanged (4013 passed).	2026-05-03 18:12:01 +00:00
nesquena-hermes	bff8cb2b58	fix: Nous Portal full live catalog + dropdown cache invalidation on provider remove Closes #1538, #1539. Two related dropdown-staleness bugs reported by Deor (Discord, May 03 2026). #1538 — Nous Portal picker showed only 4 hardcoded models ========================================================= The Settings → Default Model picker, the composer model dropdown, the /model slash command, and the Settings → Providers card all showed only four Nous models (Claude Opus 4.6, Claude Sonnet 4.6, GPT-5.4 Mini, Gemini 3.1 Pro Preview) because `_PROVIDER_MODELS["nous"]` had four hardcoded entries and `_build_available_models_uncached()` fell through to the generic `pid in _PROVIDER_MODELS` branch. The actual Nous Portal catalog has 30 models live — Claude Opus 4.7, GPT-5.5, Kimi K2.6, MiniMax M2.7, Gemini 3.1 Pro/Flash, several Xiaomi/Tencent/StepFun entries, and more. Fix: - New `_format_nous_label()` helper in `api/config.py` — reuses the `_format_ollama_label()` token rules, drops the vendor namespace, and appends ` (via Nous)` so labels disambiguate from same-named direct- provider entries (e.g. "Claude Opus 4.7" via direct Anthropic). - New `elif pid == "nous":` branch in `_build_available_models_uncached()` mirroring the Ollama Cloud pattern: live-fetch through `hermes_cli.models.provider_model_ids("nous")`, prefix every id with `@nous:` (matches the existing routing convention from PR-era #854 and pinned in tests/test_nous_portal_routing.py), fall back to the curated 4-entry static list when hermes_cli is unavailable. - Same fix applied to `api/providers.py:get_providers()` — that's the separate code path that builds Settings → Providers card models, and it had the identical bug shape. #1539 — Removed provider lingered in dropdowns until restart ============================================================ After Settings → Providers → Remove, the provider's models still appeared in every model dropdown until the page was reloaded. The server-side TTL cache was correctly flushed (`set_provider_key()` calls `invalidate_models_cache()` on both add and remove) but JS-side caches were never dropped: - `_slashModelCache` / `_slashModelCachePromise` (commands.js) — feeds the `/model` slash-command suggestions. - `_dynamicModelLabels` / `window._configuredModelBadges` (ui.js) — populated by `populateModelDropdown()` on app boot and profile switch. Pre-fix, `_removeProviderKey()` only called `loadProvidersPanel()` which refreshed the providers card list but never asked any consumer to re-fetch /api/models. Fix: - `static/commands.js`: new `_invalidateSlashModelCache()` helper that nulls both cache slots, exposed on `window` (typeof-guarded so the module remains importable in headless vm contexts — needed by the existing tests/test_cli_only_slash_commands.py harness). - `static/panels.js`: new `_refreshModelDropdownsAfterProviderChange()` helper that calls the invalidator + `populateModelDropdown()`, wrapped in try/catch so the providers panel update never breaks if a downstream module hasn't loaded yet. Both `_saveProviderKey` and `_removeProviderKey` invoke it (defense-in-depth: same staleness shape applies to the add path too). Tests ----- - `tests/test_issue1538_nous_live_catalog.py` (12 tests): live-fetch surfaces ≥20 entries, every id starts with `@nous:`, every label ends with ` (via Nous)`, recent flagships (Opus 4.7, GPT-5.5, Kimi K2.6, Gemini 3.1 Pro, MiniMax M2.7) reach the dropdown, static fallback works when hermes_cli raises, label formatter unit tests (vendor namespace stripping, variant rendering, MiniMax mixed-case), the curated static list and its routing invariants are preserved. - `tests/test_issue1539_provider_removal_dropdown_invalidation.py` (11 tests): invalidator helper exists and clears both cache slots, exposed on window with typeof guard, both save and remove paths invoke the dropdown flush, helper calls both invalidator and populateModelDropdown, helper is resilient to missing modules, helper does not block panel refresh, server-side `set_provider_key → invalidate_models_cache` invariant pinned. Verified live on port 8789: `/api/models` Nous group returns 30 models (was 4); browser `document.getElementById('modelSelect')` exposes 30 options under the "Nous Portal" group; the dropdown-flush helper is callable from the browser and round-trip rebuild keeps the dropdown at 30 options. Test counts: - Full pytest: 4013 passed, 2 skipped, 3 xpassed, 0 failures (was 3990 → 4013, +23 from this PR). - QA harness pytest: 20 passed. - Browser API sanity: 11/11 passed. - Agent Browser CDP: 21/23 passed (the 2 SSE liveness failures reproduce on master and are unrelated to this PR).	2026-05-03 18:12:01 +00:00
Manfred	afaeb03532	fix: recover pending turn after stale stream restart	2026-05-03 20:00:56 +02:00
Dutch AI Agency	e4d2704ce8	fix: resolve local models from configured base url	2026-05-03 17:04:46 +00:00
Hermes Bot	0cbada7228	Stage 280: PR #1404 — cross-channel messaging handoff (Frank Song, rebased onto master)	2026-05-03 16:51:34 +00:00
Frank Song	c7e52084ba	Harden messaging channel handoff	2026-05-03 16:35:50 +00:00
Frank Song	20ef643bb8	Add messaging session handoff summary	2026-05-03 16:35:22 +00:00
nesquena	df0d904d87	fix(streaming): pass agent.reasoning_effort into WebUI agents (salvages #1531 ) Spliced from #1531 by @Asunfly: take Change-1 only (the actual bug fix + cache signature inclusion) and skip Change-2 (auxiliary title-route extra_body change) which is a separate scope concern. ## What Two surgical fixes in api/streaming.py: 1. Line 1820 — `_cfg.cfg.get(...)` → `_cfg.get(...)`. `get_config()` returns a plain dict (not a wrapper exposing `.cfg`). The buggy line raised AttributeError that the surrounding try/except swallowed, so `_reasoning_config` was always None regardless of what `/reasoning <level>` had been set to. Verified locally — `api/streaming.py:1959` already correctly used `_cfg.get(...)` in the same function, so the same `_cfg` was being read two different ways in one file. 2. Line 1888 — added `_reasoning_config or {}` to `_sig_blob`. Without this, switching effort mid-session would fail to take effect because the per-session agent cache key would still match the old entry. Mirrors how `resolved_provider` / `resolved_base_url` already participate in the signature. ## Why splice instead of merge #1531 directly @Asunfly force-pushed a Change-2 onto #1531 after the original review that removes `extra_body={"reasoning": {"enabled": False}}` from `generate_title_raw_via_aux` (the auxiliary title-generation route). That intent is reasonable (let operator-configured `extra_body.reasoning` flow through to the title route) but it touches a different surface and deserves its own PR. The narrow concern is operators who selected a reasoning-capable auxiliary title model without explicitly setting `reasoning.enabled=False` in the task config — pre-Change-2 the WebUI defended against accidental reasoning on the title hot path; post-Change-2 those configs would reason on every new conversation`s title, with cost and latency implications. ## What is NOT in this PR - The `generate_title_raw_via_aux` extra_body refactor (Change-2 from #1531). - The `test_does_not_override_configured_reasoning_extra_body` test (guards Change-2). Asunfly can re-open that as its own focused PR. ## Tests Two new R17b/R17c regression assertions in tests/test_regressions.py: - `test_streaming_reads_reasoning_effort_from_config_dict` — static-source guard: `_cfg.cfg` must not return to streaming.py - `test_streaming_agent_cache_signature_includes_reasoning_config` — catches removal of `_reasoning_config` from `_sig_blob` ## Closes - Closes #1531 (the Change-1 portion ships here; Asunfly can re-open Change-2 as a separate PR if desired) Co-authored-by: Asunfly <[email protected]>	2026-05-03 16:34:25 +00:00
Hermes Bot	a5e6b9dc8b	Merge PR #1526 by @ai-ag2026: pass WebUI max_tokens into agent + classify OpenRouter quota phrases (refs #1524 )	2026-05-03 16:06:55 +00:00
Hermes Bot	1148656370	Merge PR #1525 by @ai-ag2026: clear stale WebUI stream state proactively (refs #1471 ) Merge conflict resolution: kept HEAD's `CACHE_NAME = 'hermes-shell-__WEBUI_VERSION__'` (post-#1517 rename) over PR #1525's `'hermes-shell-__CACHE_VERSION__-stale-stream-cleanup1'` manual suffix. The renamed placeholder still auto-bumps with each release through the `quote(WEBUI_VERSION, safe="")` substitution, so the manual `-stale-stream-cleanup1` suffix is no longer needed to force-update existing service workers — the natural version bump (v0.50.278 → v0.50.279) already invalidates the old cache via `caches.delete(k)` for `k !== CACHE_NAME` in the SW activate handler. No behavioral regression: the SW cache still bumps on this release, just via the canonical version-token path. Co-authored-by: ai-ag2026 <ai-ag2026@users.noreply.github.com>	2026-05-03 16:06:42 +00:00
Hermes Bot	437eae00be	Merge PR #1532 by @ai-ag2026: recover WebUI-origin state.db sessions when JSON sidecar missing (refs #1471 )	2026-05-03 16:06:04 +00:00
Manfred	9c0a16fdd6	fix: recover WebUI-origin state.db sessions	2026-05-03 15:41:56 +02:00
Manfred	dbb0879956	fix: pass WebUI max_tokens to agents Read configured max_tokens from config.yaml, pass it into WebUI-created AIAgent instances when supported, and include it in the agent cache signature. Also classify OpenRouter quota phrasing such as more credits, can only afford, and fewer max_tokens. Adds regression coverage for max_tokens propagation, cache signature isolation, and quota error classification.	2026-05-03 11:46:42 +02:00
Manfred	6bce34c27e	fix: clear stale WebUI stream state Clear persisted active_stream_id and pending runtime fields when the server no longer has the referenced live stream. Also drop browser-side INFLIGHT state when the server reports a session idle and bump the service-worker cache so the frontend fix is delivered. Adds regression coverage for backend stale-stream cleanup, frontend inflight invalidation, and cache busting.	2026-05-03 11:46:42 +02:00
Frank Song	8f3dbe185d	fix: consolidate __CACHE_VERSION__ → __WEBUI_VERSION__ (#1509 ) __CACHE_VERSION__ (sw.js) and __WEBUI_VERSION__ (index.html) are functionally identical — both resolve to quote(WEBUI_VERSION, safe='') at request time. Two names exist for historical reasons (different files added at different times). Rename __CACHE_VERSION__ → __WEBUI_VERSION__ in: - static/sw.js (CACHE_NAME + VQ constant + comment) - api/routes.py (substitution string) - tests/test_pwa_manifest_sw.py (all assertions) Single canonical name. No behavior change — same ?v=vX.Y.Z query strings on the same URLs.	2026-05-03 14:59:37 +08:00
Hermes Bot	6381ab1b8a	fix(model-picker): deepcopy auto_detected_models per group to stop dedup bleed-across (#1511 root cause) Supersedes contributor PR #1511 (lost9999), which removed the label-suffix logic in _deduplicate_model_ids() but left the underlying shared-reference bug intact — IDs would still be silently corrupted across provider groups, just with cleaner-looking labels. ## Bug shape When multiple unconfigured providers (Ollama / HuggingFace / custom endpoints / Google Gemini CLI / Xiaomi / etc.) all fell through to the 'else' branch in api/config.py:get_models_grouped() that ends with: groups.append({..., "models": auto_detected_models}) every group ended up sharing the SAME list reference AND the SAME dicts inside. When _deduplicate_model_ids() then mutated those dicts to add @provider_id: prefixes and provider-name parentheticals, the changes were applied to every group that referenced the same dict. Visible symptom: user 'vishnu' reported the dropdown showing 'Deepseek V4 Flash (Xiaomi) (Ollama) (HuggingFace) (Google-Gemini-Cli)' on every group. Hidden symptom (worse): the 'id' field collapsed to '@xiaomi:deepseek-v4-flash' on every group too, so clicking the entry under any group routed the request to Xiaomi. ## Fix api/config.py:2078 — wrap auto_detected_models in copy.deepcopy() at the groups.append site so each group gets its own independent dicts. The existing _deduplicate_model_ids() logic is correct and unchanged; the bug was in the assignment site, not the dedup function. The single-parenthetical disambiguation in labels is retained because the composer chip (composer-model-label) shows the model label without the optgroup header context — 'Deepseek V4 Flash (Ollama)' is more useful than ambiguous 'Deepseek V4 Flash' there. ## Tests tests/test_issue1511_dedup_shared_reference.py — 3 new tests: - test_groups_have_independent_model_lists: structural invariant pin - test_unconfigured_providers_no_shared_dedup_bleed: end-to-end against the corrected code path; verifies each group gets its own @provider_id: prefix and exactly ONE provider parenthetical per disambiguated label - test_shared_reference_pre_fix_demonstrates_corruption: documents the broken state that motivated the fix Full suite: 3925 → 3928 passing (+3 new, 0 regressions). Co-authored-by: lost9999 <56498264+lost9999@users.noreply.github.com>	2026-05-03 06:41:11 +00:00
Hermes Bot	8f58688b66	test: lock /session/static MIME-type + auth fix; drop unused import - Add tests/test_session_static_assets.py (5 tests): * /session/static/style.css must return text/css (not text/html) * /session/static/ui.js must return application/javascript * /session/<id> still serves the HTML index (catch-all not weakened) * Path-traversal still sandboxed after prefix strip * /session/static/* matches /static/* auth-exemption policy - Drop unused 'from urllib.parse import urlparse as _up' import from PR #1505's added block (parsed._replace already gives a usable result). Co-authored-by: Rick Chew <rickchew@users.noreply.github.com>	2026-05-03 05:20:19 +00:00

1 2 3 4 5 ...

450 Commits