Codemap in-repo test bench

Single committed corpus for regression-testing Codemap — no CODEMAP_ROOT pointing at external apps required for maintainers or CI.

Piece	Path	Role
Corpus	`minimal/`	Source tree indexed by goldens, benchmark, agent-eval
Golden scenarios	`golden/scenarios.json`	Tier A scenario inventory (SQL + recipe ids)
Golden snapshots	`golden/minimal/`	Expected query JSON (committed)
Capability map	`CAPABILITIES.json`	Capability groups → fixture files → `goldenScenarios` ids
Agent eval	`scripts/agent-eval/` + probes in `scenarios.json`	MCP-on vs traditional arms
Benchmark scenarios	`benchmark/scenarios.example.json`	Speed comparisons (optional override)

Commands (from repo root)

bun run test:golden          # index minimal + compare all scenarios
bun run test:agent-eval      # probe/live harness on same corpus
bun run check                # unit + golden + agent-eval (CI)
bun scripts/query-golden.ts --update   # refresh snapshots after intentional changes

Tier B / external trees

bun run test:golden:external remains for consumers validating Codemap against a private checkout (gitignored goldens). It is not part of the Codemap maintainer test bench.

Optional scale (second corpus or rename): docs/roadmap.md · harness map: docs/testing-coverage.md.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Codemap in-repo test bench

Commands (from repo root)

Tier B / external trees

Uh oh!

Uh oh!

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Codemap in-repo test bench

Commands (from repo root)

Tier B / external trees