Professor of economics at Charles University, Prague. Meta-analyst and applied macroeconomist working on publication bias, p-hacking, and Bayesian model averaging (often with Zuzana Iršová). Affiliated with CEPR and the Stanford METRICS center.
I use this page as a working notebook for a few AI-assisted research-auditing experiments: a small family of tools that bring adversarial review to empirical papers, grant proposals, and referee reports. The newest and most ambitious is a Claude-only workshop (CRUCIBLE) that argues a paper to the sentence and then, optionally, rebuilds it. It grew out of a manual, human-in-the-loop audit protocol (multi-model: ChatGPT, Gemini, Grok, Claude) and its automation for the Claude Code + Codex command line. That protocol is the methodology to cite. The same approach covers a domain application for ERC grant drafts, alongside the meta-analysis work in the labs below.
The tools surface criticism for a human to weigh. They are not a substitute for judgment.
The underlying Duel/MAD protocol has been independently endorsed by Prof. Bob Reed (University of Canterbury / MAER-Net).
| If you want to… | Use | What it is |
|---|---|---|
| Workshop and (optionally) rebuild a whole paper | paper-workshop | CRUCIBLE, the flagship. A Claude-only fleet of rival-tradition referees argues the paper to the sentence (every comment grounded in an exact quote, no acceptance-probability scores), then rebuilds it on request: a tracked redline, your own code re-run so the numbers are real, and a replication package. The newest of these tools and the least battle-tested. Effectiveness isn't measured yet. |
| Run a manual, human-in-the-loop adversarial audit of a high-stakes paper | research-audit-duel-protocol | The protocol CRUCIBLE grew out of: the Duel + MAD workflows across ChatGPT, Gemini, Grok, and Claude, with a worked example. Independently endorsed, and the methodology to cite. |
| Automate that audit across Claude + Codex, without writing code | mad-research | Three Claude Code skills: one-shot Codex calls, collaborative build with cross-review, and a three-stream adversarial audit. The cross-model (Claude + Codex) automation; it produces an audit memo. |
| Triage an ERC Starting/Consolidator draft before peer review | erc-ai-feedback | A prompt + rubric for a structured pre-review against ERC criteria. Doesn't replace human reviewers; it frees workshop time for what AI can't address. |
| Curious what kids can build? | race | A small two-player browser racing game our kids (ages 8–12) made with Claude Code, set in our town of Litomyšl. |
Iršová & Havránek (2026), doi:10.5281/zenodo.19105954. The core protocol repos ship a CITATION.cff, so GitHub's "Cite this repository" works there too.
- Website: tomashavranek.cz
- ORCID: 0000-0002-3158-2539
- Labs: meta-analysis.cz · easymeta.org
Getting started: CRUCIBLE and mad-research are Claude Code skills, so they need a Claude subscription (mad-research also calls Codex). research-audit-duel-protocol and erc-ai-feedback are just prompts and documents, with nothing to install. Each repo's README has the steps.
A note on confidentiality: these tools send your text to third-party AI providers. Each repo's README states what goes where, so read it before sending anything embargoed, under double-blind review, or otherwise non-public.


