Skip to content

chore(classifier): reconstruct wall-clock for v0.2.0 collection run#2

Merged
toonight merged 1 commit into
mainfrom
chore/reconstruct-collection-timestamps
Apr 30, 2026
Merged

chore(classifier): reconstruct wall-clock for v0.2.0 collection run#2
toonight merged 1 commit into
mainfrom
chore/reconstruct-collection-timestamps

Conversation

@toonight

Copy link
Copy Markdown
Owner

Summary

The author still had /tmp/mnemoscope-full.log from the original v0.2.0 collection — its created/modified timestamps give a tight bound on the wall-clock end-to-end. Both fields are flagged as reconstructed from the log mtime, not measured by the collector, in measurements-meta.json#_note and model.json#dataset_collection.wall_clock_human.

value
started_at 2026-04-29T04:11:20-04:00
ended_at 2026-04-30T03:13:50-04:00
wall_clock_s 82950 (23h02m30s)
per-cell average ~207 s over 400 cells

wall_clock_s_grading_only and the token totals stay null — they would be guesses, not audit. b33b5dc already wired the collector to record both for every future run.

Test plan

  • train.py re-ran with seed=42 on the unchanged measurements.csv, re-selected rf at R²=0.5827 / MAE=0.1386
  • model.onnx is byte-identical to the published one (sklearn is deterministic with the same seed)
  • model.json#dataset_collection now reads the updated meta sidecar (embedded dataset_collection metadata from classifier/measurements-meta.json printed)

🤖 Generated with Claude Code

…lection run

The author's machine still had /tmp/mnemoscope-full.log from the original
collection — its created/modified timestamps bound the wall-clock end-to-end:

    started_at: 2026-04-29T04:11:20-04:00
    ended_at:   2026-04-30T03:13:50-04:00
    wall_clock_s: 82950   (23h 02m 30s, ~207 s/cell over 400 cells)

Both fields are flagged as *reconstructed* from the log mtime in
measurements-meta.json#_note and model.json#dataset_collection.wall_clock_human;
they bound the orchestrator's elapsed time but include any idle gaps between
cells, so they are not equivalent to a sum of per-cell durations.

Token totals stay null. The OpenAI-compat usage block was not persisted
during this run, and inferring tokens from the haystack target sizes alone
would be a guess, not an audit. b33b5dc already wired
research/replication/run.py to capture tokens_in / tokens_out for every
future cell.

train.py re-ran on the unchanged measurements.csv with the same seed (42),
re-selected rf at R²=0.5827 / MAE=0.1386, and re-emitted an ONNX file
identical (byte-for-byte) to the published one. Only model.json changed,
to embed the updated measurements-meta.json under dataset_collection.
@toonight toonight merged commit fae3103 into main Apr 30, 2026
2 checks passed
@toonight toonight deleted the chore/reconstruct-collection-timestamps branch April 30, 2026 10:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant