CLI Behavior

This page describes the current public command surface.

Implemented Now

The prooflog binary exists and exposes these top-level commands:

prooflog init
prooflog doctor
prooflog doctor --parser
prooflog ingest --codex
prooflog proof --since main

prooflog init creates a local TOML config file and initializes the local SQLite database schema.

On Unix-like systems, prooflog init sets the config and DB files to owner-readable/writable only.

prooflog doctor reads the config file and prints resolved paths, storage status, Codex root status, Codex JSONL count, and git repo status. On Unix-like systems, it warns when config or DB file permissions are broader than owner-only.

prooflog doctor --parser reads local storage and prints count-only parser diagnostics for raw events, malformed lines, unknown event shapes, extracted rows, and proof facts. It does not print raw JSONL, parse errors, command output, message text, local source paths, or transcript excerpts.

The current config stores:

database path
Codex root
redaction defaults

prooflog ingest --codex discovers local Codex .jsonl files, records file metadata, stores non-empty raw JSONL lines in SQLite, rebuilds raw/message/command-output FTS indexes, derives session/message/command/approval/file-change rows, and classifies supported verification, failure, and failure-resolution evidence into local proof facts.

prooflog proof --since main emits a proof report with scope, changed files, Codex evidence, parser warning counts, verification, failures, risks, redacted report excerpts for obvious secrets, a conservative READY/NOT READY/UNKNOWN decision, why, and next steps. The default output is plain text. Use --format md for a PR-pasteable Markdown report, --format json for experimental machine-readable output, or --format text to request plain text explicitly.

Local Paths

ProofLog uses one app-owned local home directory by default:

$HOME/.prooflog/config.toml
$HOME/.prooflog/prooflog.db

If HOME is not set, ProofLog falls back to USERPROFILE for Windows-style environments.

The default Codex root is:

$HOME/.codex

Planned Behavior

prooflog init will later extend storage hardening around directories and SQLite sidecar files if needed.

prooflog doctor will later add deeper parser diagnostics and richer git edge-case handling.

prooflog ingest --codex will later add additional proof facts as report needs harden.

prooflog proof --since main will later harden the experimental JSON schema as report needs settle.

Current Argument Contract

prooflog proof requires --since <REF> and supports --repo <PATH> and --format text|md|json.

prooflog ingest requires --codex.

prooflog init and prooflog doctor support --db <PATH> and --codex-root <PATH> overrides.

These contracts are covered by integration tests so future implementations keep the initial UX stable.

Proof Git Context

prooflog proof --since <REF> currently resolves git context and changed-file stats before report generation.

It prints:

repository root
current branch, or a detached HEAD label
current HEAD
merge base for --since <REF>
dirty working tree status
changed file count
total additions and deletions
docs-only status
per-file status, path, additions, and deletions
parser warning counts when malformed or unknown raw events exist
risky path level, count, categories, and reasons
risky command counts, families, severity, and reasons
conservative proof decision status and reasons

Use --repo <PATH> to inspect a repository other than the current working directory. Running outside a git repository or passing an invalid base ref fails with an actionable error.

Risk Path Classification

prooflog proof --since <REF> prints a Risk: section based on changed file paths.

Current path categories are:

auth
identity
security
secrets
config
infra
migration
CI/CD
production
Kubernetes
Terraform
database
release

Docs-only changes are reported as risk level: low with zero risky files. Mixed non-doc changes are risk level: elevated when any changed path matches a category. The classifier is deterministic and path-based; it does not inspect file contents or make a final readiness decision.

Risky Command Classification

prooflog proof --since <REF> prints a Risky commands: section from commands in relevant and ambiguous Codex sessions.

Current command families are:

aws
kubectl
terraform
helm
docker
gh
rm
chmod
chown
curl
scp
ssh

Risky commands are reported with family, session, command subject, severity, and reason. Production-like or destructive arguments such as prod, production, --force, delete, destroy, apply, rm -rf, chmod 777, terraform apply, kubectl delete, and similar patterns raise severity to high. ProofLog reports these commands; it does not block them, execute them, or print command output.

Codex Session Correlation

prooflog proof --since <REF> reads the local ProofLog database when available and prints relevant and ambiguous Codex session counts.

Strong relevant signals include:

session workspace path matching the repo root
command cwd inside the repo
file-change paths overlapping changed files

Weak file-name-only overlap is reported as ambiguous rather than hidden. Missing or empty local storage reports zero relevant and ambiguous sessions without failing the proof flow.

Parser Warnings

prooflog proof --since <REF> reports grouped parser warning counts when local raw events contain malformed JSONL lines or unknown valid event shapes.

Parser warnings include:

malformed lines
unknown event shapes

These warnings are counts only. Proof reports do not print raw JSONL content, parse error text, raw transcript text, local source file paths, or command output by default. Missing or empty local storage reports zero parser warnings.

Report Redaction

Proof reports redact obvious secret-like values from user-facing text, Markdown, and JSON output. Current redaction covers bearer tokens, OpenAI-style sk-/sk-proj- keys, AWS access key ids, private-key markers, and long prefixed secret values in command-derived report excerpts.

Redaction applies to report strings such as verification subjects, failure subjects, risky command text, session titles, and decision reasons. Raw local SQLite rows still preserve the original source events and command strings for local auditability.

Proof Decision

prooflog proof --since <REF> prints a Decision: section with status: READY, status: NOT READY, or status: UNKNOWN, plus one or more deterministic reason: lines.

Current decision rules are intentionally conservative:

READY requires changed files, at least one relevant Codex session, at least one relevant passed verification fact, no unresolved relevant verification failures, and no ambiguous relevant failure-resolution facts.
NOT READY means relevant evidence proves a verification failure remains unresolved.
UNKNOWN covers missing local storage, no relevant sessions, no changed files, no relevant verification evidence, unknown-only verification evidence, ambiguous-only evidence, or ambiguous failure resolution.

Decision reasons may include session ids, verification command subjects, and status summaries. They do not print command output or raw transcript text.

No-Evidence Next Actions

UNKNOWN and NOT READY reports include deterministic next actions based on the strongest decision reason:

missing local proof database: run prooflog init and prooflog ingest --codex, then rerun proof
no changed files: choose a base ref with changed files, then rerun proof
no relevant Codex sessions: ingest Codex history for this repository, then rerun proof
no relevant verification evidence: run verification commands for this change, ingest Codex history, then rerun proof
ambiguous-only evidence: rerun verification from this repository so evidence can be linked directly
unresolved verification failures: resolve the listed verification failures and rerun proof

Invalid git refs and non-git directories are runtime errors rather than proof reports. They exit with code 3 and keep actionable stderr.

JSON Report

prooflog proof --format json emits an experimental machine-readable report with schema_version: 1.

The JSON report includes:

scope: repo, branch, since ref, HEAD, merge base, and dirty state
changed: changed-file counts, diff stats, docs-only state, and per-file summaries
codex: relevant and ambiguous session summaries and correlation signals
parser_warnings: malformed line and unknown event-shape counts
verification: safe verification fact counts and summaries
failures: safe failure-resolution counts and summaries
risks: risky changed-path and risky-command summaries
decision: READY/NOT READY/UNKNOWN status and reasons
next_actions: suggested next steps

The JSON report does not include raw transcript text, raw command output, raw parse error text, raw JSONL content, or raw diff text by default. It uses the same decision and exit-code behavior as plain text and Markdown.

Proof Exit Codes

prooflog proof maps proof decisions to process exit codes:

0: READY
1: NOT READY
2: UNKNOWN
3: runtime ProofLog errors such as invalid git refs, non-git directories, config errors, parser errors, or storage errors

Clap usage errors, such as unsupported --format values or missing required arguments, keep clap's standard error behavior.

SQLite Schema

The initialized DB records migration version 2 and creates these MVP tables:

schema_migrations
codex_files
sessions
raw_events
messages
commands
approvals
file_changes
proof_facts

It also creates these FTS5 tables:

raw_events_fts
messages_fts
command_output_fts

The schema is raw-first. Current ingest populates codex_files, raw_events, sessions, messages, commands, approvals, file_changes, and verification/failure/resolution rows in proof_facts; later parser work will add risk facts.

Codex Discovery

prooflog ingest --codex --codex-root <path> recursively discovers lowercase .jsonl files under the configured root.

For each discovered file, it records:

path
size
modified time
SHA-256 hash

Repeated ingest uses size and modified time as a fast unchanged-file gate before hashing or reading file content. New or metadata-changed files are SHA-256 hashed, and files with changed hashes are streamed into raw storage. If metadata changed but the hash is unchanged, file metadata is refreshed without re-parsing raw lines. Symlinked directories are skipped to avoid loops.

This fast path is designed for local Codex history, where files are append-oriented and ordinary filesystem metadata is reliable. It does not try to detect adversarial same-size rewrites with a preserved modified time.

Raw Event Storage

For each discovered file, prooflog ingest --codex reads JSONL content line-by-line and stores one raw_events row for each non-empty physical line.

Each raw event row records:

source file id
line number
raw line text without the line ending
line SHA-256 hash
event type when a known top-level string field is present
event time when a known top-level string field is present
parse error when the line is malformed JSON

Malformed JSON lines do not abort ingest. Unknown valid JSON shapes are preserved with NULL derived metadata. Empty lines are skipped and counted in ingest output.

prooflog proof reports malformed-line and unknown-event-shape counts from stored raw events when those counts are non-zero. This makes parser uncertainty visible without exposing raw local session content.

Current ingest output includes:

files discovered
files ingested
files skipped
raw events stored
raw events skipped
raw events removed
malformed lines
unknown event shapes
warning count

Warning details are grouped under Warnings: only when present. After ingest, raw_events_fts is rebuilt from stored raw events, messages_fts is rebuilt from derived messages, and command_output_fts is rebuilt from derived commands for internal diagnostics. These are not user-facing search commands, and richer derived parser extraction remains planned follow-up work.

Session Derivation

After storing raw events, ingest derives sessions rows from parseable events with a top-level session_id.

When available, each derived session records:

Codex session id
workspace path
model
title or summary
first event timestamp
latest event timestamp
linked raw event count
parse status

Missing optional metadata is stored as NULL. Raw events with a known session id are linked back to the derived sessions.id.

Message Derivation

Ingest derives messages rows from parseable message events with known roles and non-empty text.

When available, each derived message records:

raw event link
session link
role
message text
event timestamp

Unknown message shapes and empty message text are skipped instead of guessed. Message text is indexed in messages_fts for internal diagnostics, but ingest does not print raw message text by default.

Command Derivation

Ingest derives commands rows from parseable command events with a known command string.

When available, each derived command records:

raw event link
session link
command string
cwd
status
exit code
output text
start and end timestamps

Unknown command shapes and missing command strings are skipped instead of guessed. Command/output text is indexed in command_output_fts for internal diagnostics, but ingest does not print command output by default.

Verification Proof Facts

Ingest classifies supported verification commands from derived commands rows and stores them in proof_facts with kind = 'verification'.

Currently recognized command families include:

cargo test, cargo build, cargo clippy
go test, go build, golangci-lint
pytest, ruff
npm test, npm run build, npm run lint, npm run typecheck
pnpm test, pnpm build, pnpm lint, pnpm typecheck
make test, make build
tsc, eslint

Each verification fact records the linked session, linked command, command subject, conservative status, and detector reason. Exit code 0 is passed; non-zero exit codes are failed; missing or ambiguous command outcomes are unknown. Unknown command families are not classified.

Failure Proof Facts

Ingest also classifies explicit command failure evidence from derived commands rows and stores it in proof_facts with kind = 'failure'.

Failure signals include:

non-zero exit code
failure-like status such as failure, failed, fail, or error
output tokens such as error, failed, permission denied, timed out, no such file, command not found, sandbox, or network

Each failure fact records the linked session, linked command, command subject, failed status, and a deterministic reason that names the strongest signal. Exit code 0 and success-like statuses are treated as non-failure for this detector.

Failure Resolution Proof Facts

Ingest derives failure_resolution proof facts for failed verification commands.

Resolution status is:

resolved when a later passing verification command is an exact rerun or a compatible broader rerun of the same detector
unresolved when no later passing verification command of the same detector exists
unknown when later passing evidence exists but compatibility is ambiguous

Resolution facts are linked to the failed command and preserve a deterministic reason. Passing commands from unrelated verification detectors do not resolve failures. The proof decision engine consumes these facts conservatively and prefers UNKNOWN over false READY when evidence is incomplete.

Approval Derivation

Ingest derives approvals rows from parseable approval events.

When available, each derived approval records:

raw event link
session link
requested action
decision
sandbox mode
command
event timestamp

Missing approval fields are stored as NULL. Unknown approval shapes are skipped instead of guessed. Ingest does not print approval commands or raw transcript content by default.

File-Change Derivation

Ingest derives file_changes rows from parseable file-change events with a known path.

When available, each derived file change records:

raw event link
session link
path
change type
diff text
lines added
lines deleted

Missing optional file-change fields are stored as NULL. Missing paths and unknown file-change shapes are skipped instead of guessed. Ingest does not print raw diff text by default. Final decisions and proof report behavior are planned follow-up work.

Permission Warnings

On Unix-like systems, ProofLog expects config and DB files to use mode 0600.

If prooflog doctor finds broader permissions, it prints a warning with a chmod 600 <path> fix. Permission warnings do not currently make doctor fail.

Doctor Readiness

prooflog doctor currently reports:

config path and resolved config values
SQLite open status, migration version, FTS5 availability, and journal mode
required FTS table availability
Codex root state and recursive .jsonl file count
current git repo root and branch when available
warnings for missing Codex root, no JSONL files, missing git repo, and unsafe file permissions

Warnings are non-fatal. Critical config and database errors still return a non-zero exit code.

Parser Diagnostics

prooflog doctor --parser reports local parser health from the existing SQLite database:

raw events
malformed lines
unknown event shapes
sessions
messages
commands
approvals
file changes
proof facts
fixture reminder

The output is count-only and deterministic. It is intended for parser debugging and fixture maintenance, not browsing local session history. Missing local storage returns an actionable error asking the user to initialize and ingest before running parser diagnostics.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLI Behavior

Implemented Now

Local Paths

Planned Behavior

Current Argument Contract

Proof Git Context

Risk Path Classification

Risky Command Classification

Codex Session Correlation

Parser Warnings

Report Redaction

Proof Decision

No-Evidence Next Actions

JSON Report

Proof Exit Codes

SQLite Schema

Codex Discovery

Raw Event Storage

Session Derivation

Message Derivation

Command Derivation

Verification Proof Facts

Failure Proof Facts

Failure Resolution Proof Facts

Approval Derivation

File-Change Derivation

Permission Warnings

Doctor Readiness

Parser Diagnostics

FilesExpand file tree

cli.md

Latest commit

History

cli.md

File metadata and controls

CLI Behavior

Implemented Now

Local Paths

Planned Behavior

Current Argument Contract

Proof Git Context

Risk Path Classification

Risky Command Classification

Codex Session Correlation

Parser Warnings

Report Redaction

Proof Decision

No-Evidence Next Actions

JSON Report

Proof Exit Codes

SQLite Schema

Codex Discovery

Raw Event Storage

Session Derivation

Message Derivation

Command Derivation

Verification Proof Facts

Failure Proof Facts

Failure Resolution Proof Facts

Approval Derivation

File-Change Derivation

Permission Warnings

Doctor Readiness

Parser Diagnostics