chore(classifier): reconstruct wall-clock for v0.2.0 collection run by toonight · Pull Request #2 · toonight/Mnemoscope

toonight · 2026-04-30T10:44:03Z

Summary

The author still had /tmp/mnemoscope-full.log from the original v0.2.0 collection — its created/modified timestamps give a tight bound on the wall-clock end-to-end. Both fields are flagged as reconstructed from the log mtime, not measured by the collector, in measurements-meta.json#_note and model.json#dataset_collection.wall_clock_human.

	value
`started_at`	`2026-04-29T04:11:20-04:00`
`ended_at`	`2026-04-30T03:13:50-04:00`
`wall_clock_s`	`82950` (23h02m30s)
per-cell average	~207 s over 400 cells

wall_clock_s_grading_only and the token totals stay null — they would be guesses, not audit. b33b5dc already wired the collector to record both for every future run.

Test plan

train.py re-ran with seed=42 on the unchanged measurements.csv, re-selected rf at R²=0.5827 / MAE=0.1386
model.onnx is byte-identical to the published one (sklearn is deterministic with the same seed)
model.json#dataset_collection now reads the updated meta sidecar (embedded dataset_collection metadata from classifier/measurements-meta.json printed)

🤖 Generated with Claude Code

…lection run The author's machine still had /tmp/mnemoscope-full.log from the original collection — its created/modified timestamps bound the wall-clock end-to-end: started_at: 2026-04-29T04:11:20-04:00 ended_at: 2026-04-30T03:13:50-04:00 wall_clock_s: 82950 (23h 02m 30s, ~207 s/cell over 400 cells) Both fields are flagged as *reconstructed* from the log mtime in measurements-meta.json#_note and model.json#dataset_collection.wall_clock_human; they bound the orchestrator's elapsed time but include any idle gaps between cells, so they are not equivalent to a sum of per-cell durations. Token totals stay null. The OpenAI-compat usage block was not persisted during this run, and inferring tokens from the haystack target sizes alone would be a guess, not an audit. b33b5dc already wired research/replication/run.py to capture tokens_in / tokens_out for every future cell. train.py re-ran on the unchanged measurements.csv with the same seed (42), re-selected rf at R²=0.5827 / MAE=0.1386, and re-emitted an ONNX file identical (byte-for-byte) to the published one. Only model.json changed, to embed the updated measurements-meta.json under dataset_collection.

toonight merged commit fae3103 into main Apr 30, 2026
2 checks passed

toonight deleted the chore/reconstruct-collection-timestamps branch April 30, 2026 10:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore(classifier): reconstruct wall-clock for v0.2.0 collection run#2

chore(classifier): reconstruct wall-clock for v0.2.0 collection run#2
toonight merged 1 commit into
mainfrom
chore/reconstruct-collection-timestamps

toonight commented Apr 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

toonight commented Apr 30, 2026

Summary

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant