hermes-webui

mirror of https://github.com/nesquena/hermes-webui.git synced 2026-05-29 13:10:17 +00:00

Author	SHA1	Message	Date
Isla Liu	37c3e84ad2	test(session): cover lazy journal retry give-up paths	2026-05-20 20:55:08 +08:00
Isla Liu	2a303de2a3	fix(session): preserve retry budget while journal is still arriving	2026-05-20 20:55:07 +08:00
Isla Liu	d5a185d9c6	fix(session): serialize lazy journal retry per session	2026-05-20 20:48:38 +08:00
Isla Liu	1957785332	fix(session): address Copilot round-2 review — correct stale comment and drop unused fixture arg Two non-functional cleanups from the second Copilot pass: 1. The inline comment in `test_error_marker_no_preserved_as_draft` said the legacy "user message above was preserved" wording was used for the post-retry-give-up case. The actual implementation demotes give-up markers to a different neutral wording ("Partial output may have been lost."). Comment rewritten to match the contract. 2. The regression test `test_lost_response_recovered_on_second_read` declared a `monkeypatch` parameter it never used. Dropped.	2026-05-20 13:08:08 +08:00
Isla Liu	9870e8f111	fix(session): address Copilot review — scope tool-card dedupe by stream id + tighten docs Four code-review comments from the automated Copilot reviewer on this PR: 1. `_journal_tool_already_present` dedupe was session-wide, so a legitimately-repeated tool (e.g. a second `terminal: ls` in an earlier turn) could cause the retry path to falsely skip materializing the recovered tool card. The helper now takes a keyword `stream_id` argument; when supplied, a tool card whose `_recovered_stream_id` is set AND differs from the candidate is no longer treated as a duplicate. Untagged tool cards (live tools, or tool cards carried over from a pre-tagging core transcript) still match, preserving the existing 'core transcript already has this tool, don't duplicate' invariant. Two new tests in `TestJournalToolDedupeScoping` cover both legs of the rule. 2./3. The troubleshooting FAQ pointed at `~/.hermes/webui/sessions/session_<sid>.json` and `~/.hermes/_run_journal/...`. The actual sidecar filename has no `session_` prefix and the run-journal lives under the WebUI sessions dir (`~/.hermes/webui/sessions/_run_journal/<sid>/<stream>.jsonl`, default). Both paths fixed and an explicit note added about `HERMES_WEBUI_STATE_DIR` overriding the state root. 4. Drop unused `json` / `queue` / `Path` imports from `tests/test_session_lost_response_regression.py` so the file stops carrying noise that future linting would flag.	2026-05-20 12:18:03 +08:00
Isla Liu	e8cd0bcc66	test(session): end-to-end regression for lost-response self-heal Reproduces the production failure mode: 1. Stage 1 — sidecar repair runs while the run-journal for the dead stream is empty on disk. Assert the marker arms the lazy-retry hook (`_pending_journal_recovery=True`, `_journal_retry_stream_id`, `_journal_retry_attempts=0`, `_journal_retry_first_seen_ts`) and does NOT carry the legacy "no agent output was recovered" wording. Pending sidecar fields are cleared regardless. 2. Stage 2 — journaled token / tool / tool_complete / token events appear on disk. Call `get_session(sid)` and assert the marker self-heals: wording promotes to "recovered from the run journal", journaled assistant rows + tool card land above the marker in chronological order, all retry meta is stripped. Without the lazy-retry path this test fails at the very first assertion (marker still carries the legacy no-output wording).	2026-05-20 11:58:37 +08:00

6 Commits