crosscheck

🌏 中文

Building crosscheck with crosscheck.

crosscheck

A lightweight orchestration layer that makes your AI coding agents review each other's work — then fix it.

When Claude Code opens a PR, Codex reviews it. When Codex opens a PR, Claude reviews it. If issues are found on a Claude-authored PR, Claude commits fixes and pushes them back. All of this runs on your laptop, against your existing subscriptions, with a single command.

GitHub PR  →  crosscheck watch  →  AI review posted  →  fixes committed

Inspired by Symphony — OpenAI's spec-driven multi-agent framework. Where Symphony coordinates agents at the product level, crosscheck stays at the level engineers already live in: pull requests, diffs, and code review. No new abstractions — just the CR + fix loop, automated.

Why

Coding agents ship fast. They also make confident, plausible mistakes. The fix isn't a human reviewer on every AI-authored PR — it's the other AI reviewing it. Claude and Codex have complementary blind spots; cross-vendor review catches more issues than either model alone, without adding human latency to every commit.

crosscheck wires that loop:


Local execution, always listening	Runs on your machine. `crosscheck watch` opens a tunnel and keeps running. No cloud, no infra, no SaaS.
Subscription-funded	Uses `claude` and `codex` CLIs against your existing Claude Pro/Max and ChatGPT Plus/Pro plans. No per-token API billing. Reviews are free once you're subscribed.
Cross-vendor by default	Claude reviews Codex PRs; Codex reviews Claude PRs. Each model brings different training and different failure modes. The overlap is where bugs hide.
Self-improving	`crosscheck diagnose` surfaces failure patterns from logs. `crosscheck optimize` feeds them to your AI and updates reviewer instructions automatically.

Who this is for

The AI-native developer — You use Claude Code and Codex to ship features across multiple repos. crosscheck routes every PR to the rival AI, closes the loop with auto-fix PRs, and requires zero new infrastructure beyond your existing subscriptions.

The solo founder or indie hacker — You're the only engineer and your agents run around the clock. Without crosscheck, no one reviews what the agent ships. With it, every PR gets a second opinion before it lands — automatically, while you sleep.

The team scaling AI — Your org has adopted AI coding tools across multiple repos. crosscheck gives you consistent coverage out of the box, a self-improving instruction set via crosscheck optimize, and a quantified ROI story via crosscheck impact --money when leadership asks whether the AI investment is paying off.

Quick start

# 1. Install crosscheck and the agent CLIs
npm install -g @motivation-labs/crosscheck
npm install -g @anthropic-ai/claude-code && claude        # Claude Pro/Max subscription
npm install -g @openai/codex && codex login --device-auth # ChatGPT Plus/Pro subscription
brew install gh && gh auth login                          # GitHub CLI

# 2. Guided setup — auth, repos, review mode, workflow pipeline
crosscheck onboard

# 3. Run continuously
crosscheck watch

crosscheck onboard walks you through seven steps: environment check, deployment mode (personal vs team), repo selection, review mode (cross-vendor or single-vendor), workflow pipeline (review-only / review+fix / review+fix+recheck), connection type (localhost.run vs smee.io), and config write. crosscheck watch then opens a tunnel (localhost.run by default, or smee.io if you selected it), auto-registers the webhook, and starts listening.

How it works

┌────────────────────────────────────────────────────────────────┐
│  Your laptop                                                    │
│                                                                 │
│  crosscheck watch                                               │
│    ├── SSH tunnel (localhost.run)  ◄──── GitHub webhook         │
│    ├── Webhook server (:7891)                                   │
│    └── PR handler                                               │
│         ├── detect origin   (Claude Code? Codex? other?)        │
│         ├── clone PR branch                                     │
│         ├── run reviewer    (cross-vendor assignment)           │
│         ├── post review comment                                 │
│         └── post-review auto-fix                               │
│              ├── authoring vendor reads review comment          │
│              └── opens fix PR targeting original branch         │
└────────────────────────────────────────────────────────────────┘

Routing uses a four-signal chain: PR body patterns → commit Co-Authored-By: trailers → branch prefix (claude/ or codex/) → author_routes config fallback. Generated with [Claude Code] → Codex reviews. Generated with [OpenAI Codex] → Claude reviews. allowed_authors restricts reviews to your agent accounts.

Post-review auto-fix runs after the review when issues are found. The vendor that authored the PR (fixer: same-as-author) reads its own review comment and opens a new fix PR targeting the original branch. You review and merge the fix PR — the original PR then updates automatically. Controlled by post_review.auto_fix in your config.

The feedback loop closes via crosscheck diagnose → crosscheck optimize. Failure patterns and quality signals from ~/.crosscheck/logs/ feed back into improved reviewer instructions — no manual config editing required.

watch output

$ crosscheck watch

  "Move fast and review things."

crosscheck watch

  profile   personal · cross-vendor · balanced
  users     your-github-login (5 repos)
  auto-fix  on_issues · same-as-author · pull_request
  config    ./crosscheck.config.yml  ← edit to change above

  9:18:14 AM  ✓ tunnel ready: https://abc123.lhr.life
  9:18:15 AM  ✓ webhook registered for your-org/your-repo
Waiting for PR events — Ctrl+C to stop.

PR #47 opened: add retry logic for flaky network calls
  origin=claude  reviewer=codex
  codex reviewing... (12s)
  review complete (12s)
  posted → github.com/your-org/your-repo/pull/47
  NEEDS WORK
  auto-fix  claude fixing...
  fix PR #48 opened → github.com/your-org/your-repo/pull/48

PR #49 opened: implement caching layer
  origin=codex  reviewer=claude
  claude reviewing... (18s)
  review complete (18s)
  posted → github.com/your-org/your-repo/pull/49
  APPROVE

Commands

crosscheck init                     # check prerequisites, write starter config
crosscheck onboard                  # guided setup — pick repos and set deployment mode
crosscheck review <pr-url>          # one-shot review of a specific PR
crosscheck run <pr-url>             # full workflow: review → auto-fix → recheck
crosscheck watch                    # local dev — tunnel + auto-webhook + listening
crosscheck serve                    # always-on — fixed port, register webhook once
crosscheck status                   # auth state, config, log summary, CLI versions
crosscheck diagnose                 # surface failure patterns from review logs
crosscheck optimize [--apply]       # update reviewer instructions from diagnose output
crosscheck impact [--money]         # time saved, issues caught, code quality trends
crosscheck issue                    # draft and file a bug report from recent error logs

Configuration

Config lives at ~/.crosscheck/config.yml by default — persistent across all projects, no per-repo file needed. Run crosscheck init to generate it. Project-local crosscheck.config.yml overrides the global config when present.

# Which repos/orgs to watch (at least one required)
orgs:
  - your-org                      # covers every repo in the org

# Only review PRs from these GitHub accounts
# Auto-filled with your login by `crosscheck init` or first `crosscheck watch`
routing:
  allowed_authors:
    - your-github-login  # auto-detected from gh auth

# Review depth
quality:
  tier: balanced                  # fast | balanced | thorough

# Optional spend cap
budget:
  per_review_usd: 2.0
  codex_monthly_usd: 50

# Tunnel backend (watch mode only)
# localhost.run — zero install, reconnects automatically (default)
# smee         — stable channel URL, queues events while offline
tunnel:
  backend: localhost.run

# Post-review auto-fix — opens a fix PR when issues are found
post_review:
  auto_fix:
    enabled: true
    trigger: on_issues            # on_issues | always | never
    min_severity: warning         # error | warning | info
    fixer: same-as-author         # the vendor that wrote the PR also fixes it
    delivery:
      mode: pull_request          # pull_request | commit | comment

Full configuration reference: get-started.md

Customization home

Everything crosscheck learns and every choice you make persists in ~/.crosscheck/. Back this directory up and a reinstall is instant — just run crosscheck onboard and press Enter through each step to confirm your previous settings.

File	Written by	Purpose
`config.yml`	`onboard`, `init`	Deployment, repos, mode, vendors, quality tier, tunnel, routing, budget
`workflow.yml`	`onboard`	Global pipeline steps with per-step inline instructions. Written on first onboard, never overwritten — edit freely
`webhook-secret`	auto-generated	HMAC secret for GitHub webhook verification. Reused across restarts
`logs/YYYY-MM-DD.ndjson`	`watch`, `serve`	Structured review event log — feeds `diagnose`, `optimize`, `impact`

Per-project overrides (checked before the global files):

File	Purpose
`.crosscheck/workflow.yml` (in repo)	Per-project pipeline — takes priority over `~/.crosscheck/workflow.yml`
`.crosscheck/AGENT.md` (in repo)	Per-project harness for `crosscheck optimize` — takes priority over bundled `AGENT.md`

Self-improving reviews

Every review outcome is logged to ~/.crosscheck/logs/YYYY-MM-DD.ndjson. Over time, patterns emerge — which commands the reviewer tries to run (and fails), verdict distributions, review duration trends.

# See what's going wrong
$ crosscheck diagnose

crosscheck diagnose  (2026-01-01 → 2026-05-08 · 3 log files)

  Reviews       47 total  —  28 APPROVE  14 NEEDS WORK  5 BLOCK
  Failure rate  codex 12%  /  claude 4%

  Suggestions
  ─────────────────────────────────────────────────────────────
  ✦ codex runs `npm test` during review (7 occurrences)
    → add to instructions: "Do not run npm, tsc, or test commands."
  ✦ 3 reviews timed out on large PRs (>400 lines changed)
    → consider quality.tier: fast for PRs above a size threshold

# Apply the suggested fixes automatically
$ crosscheck optimize --apply
  agent  claude (lower failure rate: 4% vs codex 12%)
  writing ~/.crosscheck/workflow.yml (review step)
  + Do not run npm, tsc, jest, or any build/test commands.
  + Flag PRs over 400 lines changed as too large to review thoroughly.
  done

# Measure the compounding value
$ crosscheck impact --money

crosscheck impact  (all time · 47 reviews)

  Time saved
  ──────────────────────────────────────────────
  Reviews run              47
  Avg AI review time       ~14 min
  Assumed human time       60 min  ⓘ
  Total time saved         ~43 h

  Issues caught
  ──────────────────────────────────────────────
  APPROVE              28   (60%)
  NEEDS WORK           14   (30%)  ← actionable feedback
  BLOCK                 5   (11%)  ← potential bugs / breaking changes
  Total issues caught  19

  Estimated value: ~$8,450
  (43h × $150/hr + 19 issues × $150/issue)

Deployment

Laptop — `crosscheck watch`

Zero configuration. SSH tunnel through localhost.run handles NAT — no port-forwarding, no cloud account. If the tunnel goes silent without exiting, the health check detects it within ~2 minutes and forces a reconnect + webhook re-registration.

crosscheck watch
# → opens tunnel, registers webhook, starts listening

Server — `crosscheck serve`

Bind to a fixed port on a machine with a public IP. Register the webhook once.

crosscheck serve
# → listens on :7891, you register https://your-server/webhook manually

smee.io — stable relay (optional)

localhost.run drops events if your laptop is offline when a PR opens. smee.io queues them and replays on reconnect — useful when the reviewer machine isn't always on.

npm install -g smee-client
# Visit https://smee.io/new — copy the channel URL

# ~/.crosscheck/config.yml
tunnel:
  backend: smee
  smee_channel: https://smee.io/your-channel-id

crosscheck registers the webhook automatically on first watch start — no manual webhook setup needed.

Requirements

	Minimum
Node.js	18+
Claude Code CLI	latest — `npm install -g @anthropic-ai/claude-code`
Codex CLI	latest — `npm install -g @openai/codex`
GitHub CLI	2.65+ — `brew install gh`

GITHUB_TOKEN is derived automatically when gh auth login has been run. No manual export required.

Documentation


get-started.md	Full setup guide — prerequisites, all commands and flags, complete config reference, how it works, FAQ
crosscheck.config.example.yml	Annotated config file with every option
AGENT.md	Harness document used by `crosscheck optimize` — how the AI improves reviewer instructions
CHANGELOG.md	Release notes — what's new in each version

Contributing

Issues and PRs welcome at github.com/Motivation-Labs/crosscheck.

Name		Name	Last commit message	Last commit date
Latest commit History 259 Commits
.codex		.codex
.github/workflows		.github/workflows
assets		assets
src		src
.gitignore		.gitignore
.npmignore		.npmignore
AGENT.md		AGENT.md
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
ISSUE.md		ISSUE.md
LICENSE		LICENSE
README.md		README.md
README.zh.md		README.zh.md
continuous-improvement.md		continuous-improvement.md
crosscheck.config.example.yml		crosscheck.config.example.yml
crosscheck.config.yml		crosscheck.config.yml
get-started.md		get-started.md
get-started.zh.md		get-started.zh.md
package-lock.json		package-lock.json
package.json		package.json
prd.md		prd.md
tsconfig.json		tsconfig.json
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🌏 中文

crosscheck

Why

Who this is for

Quick start

How it works

watch output

Commands

Configuration

Customization home

Self-improving reviews

Deployment

Laptop — `crosscheck watch`

Server — `crosscheck serve`

smee.io — stable relay (optional)

Requirements

Documentation

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🌏 中文

crosscheck

Why

Who this is for

Quick start

How it works

watch output

Commands

Configuration

Customization home

Self-improving reviews

Deployment

Laptop — crosscheck watch

Server — crosscheck serve

smee.io — stable relay (optional)

Requirements

Documentation

Contributing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Laptop — `crosscheck watch`

Server — `crosscheck serve`

Packages