hermes-webui

mirror of https://github.com/nesquena/hermes-webui.git synced 2026-05-26 03:30:36 +00:00

Author	SHA1	Message	Date
nesquena-hermes	72b077ecce	Stage 320: PR #1889 — deduplicate workspace-prefixed user turns by @ai-ag2026	2026-05-08 15:48:28 +00:00
ai-ag2026	f6d09e06ca	fix: deduplicate workspace-prefixed user turns	2026-05-08 15:37:10 +00:00
nesquena-hermes	518453545c	Stage 320: PR #1865 — interim_assistant streaming in runtime + live UI by @franksong2702	2026-05-08 15:37:09 +00:00
nesquena-hermes	035c537281	Stage 320: PR #1861 — overwrite session usage per turn by @franksong2702	2026-05-08 15:37:09 +00:00
Frank Song	c1a9d7ce79	fix: overwrite session usage per turn	2026-05-08 15:37:09 +00:00
Frank Song	82c7367cef	Add interim_assistant streaming path to WebUI	2026-05-08 15:37:09 +00:00
Michael Lam	01b9c82dc9	fix: honor configured max_turns in WebUI agents Read agent.max_turns when constructing streaming WebUI AIAgent instances, pass it as max_iterations when supported, and include it in the per-session agent cache signature so budget changes take effect. Add regression coverage for the config read, constructor kwarg, and cache key.	2026-05-08 15:37:08 +00:00
Michael Lam	e31b7e72d6	fix: show auto-compression running state	2026-05-07 18:41:13 +00:00
Michael Lam	048f1fa24e	fix: keep assistant-only stream deltas on current turn	2026-05-07 06:25:16 +00:00
Frank Song	91f99d8194	fix(oauth): serialize Anthropic env fallback reads	2026-05-07 02:47:19 +00:00
Michael Lam	2d20842450	fix: surface Codex usage exhaustion errors	2026-05-07 01:39:52 +00:00
ai-ag2026	a7b04bbc1e	fix: preserve pending user turn on stream errors	2026-05-06 22:47:58 +02:00
nesquena-hermes	29878259ca	docs(troubleshooting): bake the #1695 diagnostic flow into the error message + a new troubleshooting doc Closes #1695. @Patrick-81 reported the bare "AIAgent not available -- check that hermes-agent is on sys.path" error on a symlinked install (~/Programmes/hermes-agent linked to ~/hermes-agent). The maintainer's response — three diagnostic commands plus `pip install -e .` in the agent dir — fixed it for them. This PR captures both halves of that learning so the next user with the same shape doesn't have to file a new issue: 1. Error message diagnostic block. New helper `_aiagent_import_error_detail()` in api/streaming.py builds a multi-line diagnostic when the import fails, including: - the running Python interpreter - HERMES_WEBUI_AGENT_DIR (set value, or "(not set)") - sys.path entries that mention hermes/agent (or "no entries mention..." — itself a strong diagnostic signal) - the most-common fix (`pip install -e .` in the agent dir) - a pointer to docs/troubleshooting.md The original error message string is preserved as the FIRST line so existing log scrapers and docs-search keep matching. Helper is kept as a separate function so it stays out of the hot path until we actually need to raise — building it on every successful import would be wasted work. 2. New docs/troubleshooting.md. Symptom → Why → Diagnostic commands → Fix → When-to-file-a-bug template, with one entry to start: the "AIAgent not available" flow Patrick-81 walked through. Future recurring failure modes follow the same template. Required a one-line addition to .gitignore — docs/* is gitignored with an allowlist, and the new file needed `!docs/troubleshooting.md` to be tracked. 3. README link. docs/troubleshooting.md added to the `## Docs` section so users know where to look first. 13 regression tests in tests/test_1695_aiagent_import_error_detail.py: 9 for the helper output shape (preserves original message line, includes running python, shows HERMES_WEBUI_AGENT_DIR set/unset both ways, includes pip-install-e hint, points at troubleshooting doc, lists relevant sys.path entries when present, says "no entries..." when absent, output is multi-line) plus 4 for the docs-presence regression (file exists, has the AIAgent section, includes pip install -e ., describes the diagnostic chain with readlink + agent/__init__.py verification). 190 streaming/aiagent tests pass after the change. ast.parse on api/streaming.py clean. CI failure on prior push was due to the docs/* gitignore swallowing the new troubleshooting.md file silently — this commit adds the allowlist entry so the file is tracked.	2026-05-05 22:14:07 +00:00
Michael Lam	f97b040985	fix: raise persisted tool snippet cap	2026-05-05 13:46:54 -07:00
Manfred	52e7916cb8	fix: avoid adaptive title refresh session lock deadlock	2026-05-05 12:51:13 +02:00
Michael Lam	c94ec31dec	feat: show LLM Gateway routing metadata	2026-05-05 02:26:55 +00:00
test	34b060d993	Stage 296: PR #1648 — session save mode config (closes #1406 ) by @Michaelyklam	2026-05-04 21:26:52 +00:00
Michael Lam	3ad8846a27	fix: show TPS in assistant message headers	2026-05-04 21:26:43 +00:00
Michael Lam	876a670387	feat: add session save mode config	2026-05-04 14:05:49 -07:00
Sanjay Santhanam	14fac05dc9	fix(streaming): use truthy-check for _pending_started_at fallback Switch the per-turn duration fallback from `is not None` to a truthy check so None, missing-attr, and an explicit 0 all uniformly fall back to time.time(). Without this, a 0 timestamp (e.g. via a buggy migration or manual file edit) would yield `time.time() - 0` ≈ wall-clock-since-epoch, displaying nonsense like 'Done in 56 years 4 months ...'. In practice pending_started_at is always set via int(time.time()) so this is a hardening fix, not a live-bug fix. Also drop the brittle source-string assertion in the regression test that pinned the literal expression. The behavioural test test_done_handler_persists_duration_on_last_assistant_message already proves the duration field is set; pinning the source line broke twice during the v0.50.290 release pipeline alone (Opus tightening + maintainer revert). Fixes #1595 Signed-off-by: Sanjay Santhanam <51058514+Sanjays2402@users.noreply.github.com>	2026-05-03 23:21:19 -07:00
Michael Lam	0eddb0580e	fix: document turn duration fallback	2026-05-03 21:12:07 -07:00
Michael Lam	f3fa106cd7	feat: show agent turn duration	2026-05-03 20:20:17 -07:00
bergeouss	8fe593fa38	feat: silent credential self-heal on 401 errors (#1401 )	2026-05-03 18:32:53 +00:00
nesquena	df0d904d87	fix(streaming): pass agent.reasoning_effort into WebUI agents (salvages #1531 ) Spliced from #1531 by @Asunfly: take Change-1 only (the actual bug fix + cache signature inclusion) and skip Change-2 (auxiliary title-route extra_body change) which is a separate scope concern. ## What Two surgical fixes in api/streaming.py: 1. Line 1820 — `_cfg.cfg.get(...)` → `_cfg.get(...)`. `get_config()` returns a plain dict (not a wrapper exposing `.cfg`). The buggy line raised AttributeError that the surrounding try/except swallowed, so `_reasoning_config` was always None regardless of what `/reasoning <level>` had been set to. Verified locally — `api/streaming.py:1959` already correctly used `_cfg.get(...)` in the same function, so the same `_cfg` was being read two different ways in one file. 2. Line 1888 — added `_reasoning_config or {}` to `_sig_blob`. Without this, switching effort mid-session would fail to take effect because the per-session agent cache key would still match the old entry. Mirrors how `resolved_provider` / `resolved_base_url` already participate in the signature. ## Why splice instead of merge #1531 directly @Asunfly force-pushed a Change-2 onto #1531 after the original review that removes `extra_body={"reasoning": {"enabled": False}}` from `generate_title_raw_via_aux` (the auxiliary title-generation route). That intent is reasonable (let operator-configured `extra_body.reasoning` flow through to the title route) but it touches a different surface and deserves its own PR. The narrow concern is operators who selected a reasoning-capable auxiliary title model without explicitly setting `reasoning.enabled=False` in the task config — pre-Change-2 the WebUI defended against accidental reasoning on the title hot path; post-Change-2 those configs would reason on every new conversation`s title, with cost and latency implications. ## What is NOT in this PR - The `generate_title_raw_via_aux` extra_body refactor (Change-2 from #1531). - The `test_does_not_override_configured_reasoning_extra_body` test (guards Change-2). Asunfly can re-open that as its own focused PR. ## Tests Two new R17b/R17c regression assertions in tests/test_regressions.py: - `test_streaming_reads_reasoning_effort_from_config_dict` — static-source guard: `_cfg.cfg` must not return to streaming.py - `test_streaming_agent_cache_signature_includes_reasoning_config` — catches removal of `_reasoning_config` from `_sig_blob` ## Closes - Closes #1531 (the Change-1 portion ships here; Asunfly can re-open Change-2 as a separate PR if desired) Co-authored-by: Asunfly <[email protected]>	2026-05-03 16:34:25 +00:00
Manfred	dbb0879956	fix: pass WebUI max_tokens to agents Read configured max_tokens from config.yaml, pass it into WebUI-created AIAgent instances when supported, and include it in the agent cache signature. Also classify OpenRouter quota phrasing such as more credits, can only afford, and fewer max_tokens. Adds regression coverage for max_tokens propagation, cache signature isolation, and quota error classification.	2026-05-03 11:46:42 +02:00
nesquena-hermes	c75ce33280	v0.50.259: Opus pre-release follow-up — close _session_db on LRU eviction + CHANGELOG + 5 regression tests PR #1421 (SessionDB WAL handle leak fix on cached-agent reuse path) had a sibling leak at the LRU eviction site that I caught during pre-review: api/streaming.py SESSION_AGENT_CACHE.popitem(last=False) was discarding the evicted entry with `evicted_sid, _ = ...`. The agent's _session_db was dropped on the floor and only released when GC eventually finalized the agent — which on a long-running server may be never (cyclic refs, extension types holding C handles, etc.). Same fix shape as #1421: capture the evicted entry, call _evicted_agent._session_db.close() explicitly. SessionDB.close() is idempotent + thread-safe (with self._lock: if self._conn:), so the double-close-is-benign property still holds. 5 regression tests in test_v050259_sessiondb_fd_leak.py: - Source-level: cached-agent reuse path closes before replace - Source-level: LRU eviction path captures + closes evicted agent - Behavioral: SessionDB.close() is idempotent (3 calls safe) - Behavioral: cached-agent reuse with mock — close called exactly once - Behavioral: LRU eviction with mock — only evicted agent's DB closes Full suite: 3615 passed, 0 failed. Nathan explicitly authorized 'just go ahead and merge it as a small release' since the PR is 9 LOC, focused, has Opus pre-release follow-up + tests, and matches the empirically-confirmed leak shape (73-handle leak at EMFILE).	2026-05-01 22:42:53 +00:00
Wali Reheman	9b987eefb0	fix: close previous SessionDB before replacing on cached agent SessionDB WAL handles leak when streaming.py creates a new SessionDB instance per request and replaces the cached agent's _session_db without closing the old one. Each orphaned connection holds 2 FDs (.db + .db-wal), causing FD exhaustion and EMFILE crashes after ~73 messages. Fix: close the previous _session_db before replacing it on cached agents, mirroring the close-before-replace pattern used elsewhere in the codebase.	2026-05-01 13:51:21 -07:00
nesquena-hermes	c78bcddda6	v0.50.257: CRITICAL Opus finding — fix non-functional per-session toolset override Opus pre-release advisor caught a 5th issue not covered by my initial follow-up sweep, this one CRITICAL: PR #1402 #493 per-session toolset override silently no-op'd every time. Bug: api/streaming.py:1755 called _session_meta.get('enabled_toolsets') on the result of Session.load_metadata_only(). It returns a Session INSTANCE, not a dict. .get() raised AttributeError, which the surrounding bare except swallowed silently. The toolset chip in the UI saved correctly to disk, but the streaming agent always ran with global toolsets. Fix: use getattr(_session_meta, 'enabled_toolsets', None). Two new regression tests: - Source-level: forbid the .get() / [] dict-access shape. - Runtime: Session.load_metadata_only must return a Session instance. Full suite: 3604 passed, 0 failed.	2026-05-01 18:36:24 +00:00
nesquena-hermes	bc17229a7d	Merge PR #1402 from bergeouss: P2 improvements — cron history, toolsets per session, Codex OAuth # Conflicts: # static/i18n.js	2026-05-01 18:20:05 +00:00
bergeouss	8ae198e88c	feat: P2 improvements — cron history, toolsets per session, Codex OAuth - #468: Cron run history — GET /api/crons/history (metadata listing) + GET /api/crons/run (full output), lazy-load on click in Tasks panel - #493: Per-session toolset override — Session.enabled_toolsets field, POST /api/session/toolsets endpoint, streaming handler override, composer chip UI with dropdown (matches reasoning chip pattern) - #1362: In-app Codex OAuth — device-code flow (stdlib only, no httpx), SSE polling endpoint, onboarding wizard login button - #1240: Design proposal comment for provider/model source-of-truth	2026-05-01 12:42:21 +00:00
starship-s	bdc328d034	fix: preserve webui model provider context Persist session model_provider separately from model IDs so active/default provider selections like gpt-5.5 remain bare while routing through OpenAI Codex. Keep @provider:model for picker disambiguation and runtime bridging, and preserve explicit OpenRouter plus custom/proxy base_url routing.	2026-04-30 23:23:47 -06:00
nesquena-hermes	f53556b3ff	fix(cancel-stream): rename tool_calls to _partial_tool_calls (Opus MUST-FIX) Opus pass-2 review of v0.50.251 caught a critical regression in PR #1375: The cancel-partial message stored captured tool calls under the 'tool_calls' key. That key is whitelisted by _API_SAFE_MSG_KEYS so _sanitize_messages_for_api forwarded the entries to the next-turn LLM call. But the captured entries use the WebUI internal shape ({name, args, done, duration, is_error}) — they don't have the OpenAI/Anthropic id + function: {name, arguments} envelope. Strict providers (OpenAI, Anthropic, Z.AI/GLM) would 400 on the malformed entries. Net effect: the very cancel-then-continue scenario PR #1375 aimed to improve becomes a hard fail. Fix: - Rename the persisted key to '_partial_tool_calls' (underscore- prefixed private key NOT in _API_SAFE_MSG_KEYS, so sanitize correctly strips it). - Update static/messages.js hasMessageToolMetadata check to also recognize _partial_tool_calls for UI rendering. - Update test_issue1361_cancel_data_loss.py assertion to check _partial_tool_calls (and tool_calls as legacy fallback). Plus 2 NIT fixes from the same Opus review: NIT 1 (api/profiles.py:153): re.match → re.fullmatch for consistency with other _PROFILE_ID_RE callers in the codebase. The trailing- newline footgun ($ matches before final \n in re.match) is now closed. Without #1373's is_dir() guard, a name like 'valid\n' would have created a directory named 'valid\n' on Linux. Doesn't escape <HERMES_HOME>/profiles/ via Path joining, but unintended. NIT 2 (test_issue798.py): R19j coverage gaps — added trailing- newline tests, length-boundary tests (64-char valid, 65-char rejected), single-char minimum, and non-ASCII / Unicode-trick tests. New regression test (tests/test_pr1375_partial_tool_calls_sanitize.py): - test_partial_tool_calls_field_not_forwarded_to_llm: pins that sanitize-for-API strips _partial_tool_calls + reasoning + does NOT have tool_calls on a partial message - test_legitimate_tool_calls_are_preserved_for_completed_turns: pins that real OpenAI-shape tool_calls on completed turns survive sanitize unchanged Tests: 3486 passing (3484 → 3486, +2 sanitize tests).	2026-04-30 23:43:23 +00:00
bergeouss	c5f4f569d6	fix(#1361 ): preserve reasoning, tool calls, and partial output on Stop/Cancel (#1375 ) Three distinct data-loss paths fixed: §A — Reasoning text was accumulated in a thread-local _reasoning_text inside _run_agent_streaming. cancel_stream() never saw it because it went out of scope when the thread was interrupted. Now mirrored to a new shared dict STREAM_REASONING_TEXT keyed by stream_id, populated in on_reasoning() and the reasoning branch of on_tool(), read in cancel_stream(). §B — Live tool calls in thread-local _live_tool_calls were similarly invisible to cancel_stream(). Now mirrored to STREAM_LIVE_TOOL_CALLS on tool.started + tool.completed. §C — Reasoning-only streams produced no partial message because the thinking-block regex strip returned empty string and the `if _stripped:` guard skipped the append. Now appends the partial message when EITHER content text, reasoning trace, OR tool calls exist. Mirrors the existing STREAM_PARTIAL_TEXT pattern from #893 exactly: same dict creation in _run_agent_streaming, same _live_config fallback in cancel_stream, same cleanup in _periodic_checkpoint. 8 regression tests in tests/test_issue1361_cancel_data_loss.py covering all three sections plus tools+text combinations. Co-authored-by: bergeouss <bergeouss@users.noreply.github.com>	2026-04-30 23:24:29 +00:00
nesquena-hermes	bbdacdca5c	fix: context window indicator overflow (#1356 ) - api/streaming.py SSE payload now falls back to agent.model_metadata.get_model_context_length when compressor doesn't supply context_length (mirrors the session-save fallback shipped in v0.50.247). - api/streaming.py also falls back to s.last_prompt_tokens to avoid using the cumulative input_tokens counter. - static/ui.js tracks rawPct separately from pct and shows '(context exceeded)' tooltip when rawPct > 100 instead of misleading '100% used (0% left)'. - static/messages.js clears 'Uploading...' composer status after upload completes. Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com>	2026-04-30 21:32:45 +00:00
nesquena-hermes	880350312a	fix(streaming): fallback to model_metadata for context_length when compressor missing (#1318 follow-up) (#1348 ) * fix(streaming): fallback to model_metadata for context_length when compressor missing (#1318 follow-up) PR #1318 (shipped in v0.50.246 via PR #1341 + commit `a5c10d5`) persisted context_length on the session so the context-ring indicator survives page reloads. But the writer only fired when agent.context_compressor was present and reported a non-zero value. Fresh agents, interrupted streams, or compressors without the attribute would still leave s.context_length=0 — and the indicator would still show 0% on reload. This follow-up adds a fallback that calls agent.model_metadata.get_model_context_length(model, base_url) when the compressor didn't populate the value. The function returns a sensible static context window for any known model (with a 256K default for unknown models). Wrapped in a broad try/except because older hermes-agent builds may not expose the helper. Sourced from PR #1344 (@jasonjcwu) — extracted into this focused follow-up after #1344 was closed as superseded by #1341. Adds 6 structural tests covering: import + call presence, falsy-gate, agent.model/base_url passing, exception swallowing, save() ordering, result assignment. Closes the data-flow gap in #1318 for the compressor-missing case. * test: relax pr1341 block-size assertion to accommodate the new fallback --------- Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com>	2026-04-30 10:27:56 -07:00
nesquena-hermes	a5c10d594d	fix(streaming): persist context_length on session — completes #1318 fix Pre-release Opus + nesquena review on v0.50.246 caught that PR #1341 added the data-structure scaffolding (Session.__init__ accepts the 3 fields, save() persists them, compact() exposes them, GET /api/session returns them) but did NOT add the writer that actually populates them. Without a writer, the user-visible bug (context-ring shows 0% after page reload) was NOT fixed by #1341 alone — the fields stayed None forever because nothing wrote to s.context_length anywhere. Adds the writer at api/streaming.py:2188 (post-merge per-turn save block, before s.save()) so the values from agent.context_compressor land on disk and survive page reloads. Also moves the SSE usage payload comment to clarify that the live SSE payload and the session-level persistence are now distinct paths (payload below, persistence above). Adds tests/test_pr1341_context_window_persistence.py — 6 structural + round-trip tests covering Session __init__/save/compact, the routes response, and the streaming.py writer placement. Closes #1318 (the actual user-visible bug, not just the scaffolding).	2026-04-30 16:42:32 +00:00
nesquena-hermes	f328f3b843	fix(cancel): gate substring guard on pending_started_at timestamp (Opus review) Pre-release Opus review on v0.50.246 caught a SHOULD-FIX in PR #1338's cancel_stream synthesis: the symmetric substring guard (_pending_user in _last_content OR _last_content in _pending_user) was too loose. Common confirmation replies ("ok", "yes", "go") in the prior turn would match longer follow-up prompts ("ok please continue"), the synthesis would be skipped, and the user's typed text would be lost — exactly the data-loss bug #1298 was supposed to fix. The fix: gate the substring check on a timestamp comparison. Only treat the latest user turn as 'already merged by the streaming thread' if its timestamp is at or after pending_started_at. Earlier turns whose content happens to be a substring of the pending must not short-circuit synthesis. Also drops the symmetric (_last_content in _pending_user) branch — that direction was the false-positive vector. Keeps the equality and prefix match (workspace-prefix tolerance from the streaming thread). Adds tests/test_issue1298_cancel_and_activity.py:: test_cancel_synthesizes_when_prior_turn_content_is_substring_of_pending — regression for the exact 'ok' → 'ok please continue' scenario.	2026-04-30 16:28:20 +00:00
nesquena-hermes	d4b055c30b	fix(streaming+ui): preserve user message on cancel + persist activity-panel expand state (#1298 ) From PR #1338. Already independently APPROVED by nesquena before being absorbed into v0.50.246. CHANGELOG entries from this PR were dropped during squash (the v0.50.245 section is already shipped); they will be re-added under [v0.50.246] in the release commit. Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com>	2026-04-30 16:18:41 +00:00
nesquena-hermes	09e12e3c60	fix(streaming): handle list fallback_providers config in addition to single fallback_model dict From PR #1339. Co-authored-by: Jim Dawdy <jimdawdy@Jims-MacBook-Pro.local>	2026-04-30 16:18:00 +00:00
nesquena-hermes	5bde48bb6e	fix(streaming): compare compression_count against per-turn snapshot to stop repeated banner From PR #1316. Co-authored-by: qxxaa <mrhanoi@outlook.com>	2026-04-30 15:24:31 +00:00
nesquena-hermes	33a145a669	release: v0.50.240 ## Release v0.50.240 Batch release of 13 PRs that passed full triage + code review + test suite (3199 tests, 0 failures). --- ### Added - Compact tool activity mode (`simplified_tool_calling`, default on) — groups tool calls and thinking traces into a single collapsed "Activity" disclosure card per assistant turn. Also adds a new Calm Console theme with earth/slate palette and serif prose. @Michaelyklam — #1282 - PDF first-page preview — `MEDIA:` `.pdf` files render a canvas thumbnail via PDF.js CDN (4 MB cap). HTML sandbox iframe — `.html`/`.htm` files render inline in a sandboxed `<iframe srcdoc>` (256 KB cap). 10 i18n keys × 7 locales. @bergeouss — #1280, closes #480 #482 - Inline Excalidraw diagram preview — `.excalidraw` files render as pure SVG (no external deps; rectangles, ellipses, diamonds, text, lines, arrows, freehand; 512 KB cap). @bergeouss — #1279, closes #479 - Inline CSV table rendering — fenced `csv` blocks and `MEDIA:` CSV files render as scrollable HTML tables with auto-separator detection. @bergeouss — #1277, closes #485 - Inline SVG, audio, and video rendering — SVG as `<img>`, audio as `<audio controls>`, video as `<video controls>`. @bergeouss — #1276, closes #481 - Batch session select mode — multi-select sessions for bulk Archive/Delete/Move. 11 i18n keys × 7 locales. @bergeouss — #1275, closes #568 - Collapsible skill category headers — click to collapse/expand without re-render; state persists across filter cycles. @bergeouss — #1281 - `providers.only_configured` setting — opt-in flag to restrict the model picker to explicitly configured providers. @KingBoyAndGirl — #1268 - OpenCode Go model catalog — adds Kimi K2.6, DeepSeek V4 Pro/Flash, MiMo V2.5/Pro, Qwen3.6/3.5 Plus. @nesquena-hermes — #1284, closes #1269 ### Fixed - Profile `TERMINAL_CWD` TypeError — `_build_agent_thread_env()` helper merges env before `_set_thread_env()` call. @hi-friday — #1266 - Service worker subpath cache bypass — regex now matches `/api/` under any mount prefix. @Michaelyklam — #1278 - SSE client disconnect leaks* — `TimeoutError`/`OSError` treated as clean disconnects; server backlog 64, threads daemonized; session list renders before saved-session restore. @KayZz69 — #1267 - i18n locale corrections — Korean MCP strings (23), Chinese MCP strings (23), zh-Hant missing keys (41), de missing keys (229). @bergeouss — #1274, closes #1273 --- ### Test results ``` 3199 passed, 2 skipped, 3 xpassed in 72.79s ``` ### PRs on hold (not included) #1265 (draft), #1271 (superseded by #1266), #1272 (skipped XSS tests), #1232 (partial test run), #1222 (review questions open), #1134 (live-server tests), #1132 (superseded by #1134), #1108 (negative UX review), #1084 (empty description)	2026-04-29 17:42:32 -07:00
Hermes Agent	4ee80425f2	Merge remote-tracking branch 'refs/remotes/pr/1229' into stage/batch-v0.50.238	2026-04-29 15:17:57 +00:00
Hermes Agent	ea4d381e43	Merge remote-tracking branch 'pr/1248' into stage/batch-v0.50.238	2026-04-29 14:29:05 +00:00
Hermes Agent	2bdf5c77d4	Merge remote-tracking branch 'pr/1245' into stage/batch-v0.50.238	2026-04-29 14:29:05 +00:00
happy5318	65e5690772	fix: add LRU limit to SESSION_AGENT_CACHE to prevent memory bloat The agent cache stores full AIAgent instances (each holding complete conversation history) without size limit. Long-running servers with many sessions can accumulate unbounded memory usage. Changes: - Replace dict with OrderedDict for LRU tracking - Add SESSION_AGENT_CACHE_MAX = 50 limit - Evict least-recently-used entries when cache exceeds limit - Call move_to_end() on cache hits to maintain LRU order This prevents memory exhaustion on servers with many active sessions.	2026-04-29 17:35:12 +08:00
yzp12138	0fe59831fe	tests: add regression tests + magic-byte image validation for native image attachments	2026-04-29 17:01:01 +08:00
Frank Song	1ed1ce219d	Preserve transcript across context compaction	2026-04-29 16:37:08 +08:00
Dennis Soong	8a74ea89e7	fix: apply profile terminal env in webui sessions	2026-04-29 14:12:59 +08:00
starship-s	014f16c359	fix: harden session sidecar repair	2026-04-29 04:31:36 +00:00
starship-s	8bfd8b28d5	fix: stuck sidecar recovery	2026-04-29 04:31:12 +00:00

1 2 3

121 Commits