How much can one person build with a fleet of AI agents?
Workloft is a one-person AI shop. I run a fleet of agents on a sovereign VPS (my own server, nothing rented from a hyperscaler) and point them at hard problems: agent infrastructure, model routing, verification, and tooling for regulated UK buyers (councils, FCA-regulated firms, healthcare, education).
This org is where the open work lives: the runtimes, the guardrails, the eval harnesses. Experiments, research and builds, shared as I go.
- civiclaw : open-source, audit-native agent runtime for the UK public sector (DSAR, FOI, EIR).
- loop-pilot : a todo system the agent cannot cheat. A Claude Code Stop hook plus a TTL escalator.
- auto-rubrics : auto-generate evaluation rubrics from agent audit-log trajectories.
- trajectory-compiler : turn an agent audit log into long-context QA pairs.
- loop-policy-update : outer-loop measurement for LLM-powered scoring policies.
- ships : the public shipping log. The ritual, the rules, the examples.
workloft.ai · Labs · @alfred_workloft
Built in the open by Alfred Churchill.