CrashWise

Automated vulnerability discovery for C/C++ targets via LLM-driven harness synthesis, coverage-guided fuzzing (AFL++/libFuzzer), and crash triage.


Runtime	Python 3.11+
License	MIT
Fuzzers	AFL++, libFuzzer
Sanitizers	ASan, UBSan
Orchestration	Temporal
Status	Pre-alpha

crashwise run https://github.com/madler/zlib

Pipeline

Clone → Build (ASan+UBSan+source-cov) → Harness Synthesis → Fuzz → Triage → Report
                                              ↑                          │
                                              └── Self-Correction Loop ──┘

Scans public headers, resolves typedefs, scores entry points by attack surface
LLM generates a LLVMFuzzerTestOneInput harness; validates compilation + 5s sanity gate
On failure: GDB backtrace → LLM diagnosis → fix → retry (max 3 attempts)
Executes AFL++/libFuzzer in sandboxed Docker containers with coverage feedback
On plateau: identifies blocker (magic value, length check, checksum, state machine), rewrites harness
Classifies crashes by type, deduplicates by stack hash, scores exploitability

Quick Start

git clone https://github.com/yahyatoubali/Crashwise.git && cd Crashwise
pip install -e .
docker compose up -d

# .env (minimum)
CRASHWISE_LLM_MODEL=claude-sonnet-4-5
ANTHROPIC_API_KEY=sk-ant-...

crashwise run --timeout 600 https://github.com/madler/zlib

See docs/INSTALL.md for full setup.

CLI Reference

Command	Description
`crashwise init`	Detect build system, generate `crashwise.yaml` manifest
`crashwise run <repo-url>`	Submit fuzzing campaign (blocking)
`crashwise run --detach <url>`	Submit and return immediately
`crashwise doctor`	Validate system prerequisites
`crashwise setup`	Install missing build dependencies
`crashwise dashboard`	Launch Streamlit UI on `localhost:8501`
`crashwise signal <id> pause_hunt`	Pause a running campaign
`crashwise signal <id> force_pivot`	Force MAB strategy switch

Configuration

Required

Variable	Description
`CRASHWISE_LLM_MODEL`	Model identifier for harness synthesis
`ANTHROPIC_API_KEY`	API key (or equivalent for chosen provider)

Optional

Variable	Default	Description
`AI_PROVIDER`	(disabled)	Crash triage backend: `ollama`, `venice`, `openai_compatible`
`AI_MODEL`	—	Model for triage (e.g., `llama3.1:8b`)
`OLLAMA_URL`	`http://localhost:11434`	Ollama endpoint
`DATABASE_URL`	`sqlite+aiosqlite:///./crashwise.db`	Async SQLAlchemy URL
`REDIS_URL`	`redis://localhost:6379/0`	Redis for distributed state
`TEMPORAL_HOST`	`localhost:7233`	Temporal server address

Supported LLM providers: Anthropic, OpenAI, NVIDIA NIM, Together AI, Groq, Ollama, vLLM, any OpenAI-compatible endpoint.

Architecture

┌──────────────────────────────────────────────────────────────┐
│  CLI / API / Dashboard                                        │
├──────────────────────────────────────────────────────────────┤
│  Temporal Workflows (durable, retryable)                      │
│  └─ MainFuzzingWorkflow → 23 Activities                      │
├──────────────────────────────────────────────────────────────┤
│  AI Agents (LangGraph)                                        │
│  ├─ Harness Synthesis (analyze → generate → validate → retry)│
│  ├─ Coverage Analysis (blocker identification + dict gen)     │
│  ├─ Crash Triage (ASAN/GDB → severity → dedup)              │
│  └─ Exploit Generation (PoC synthesis)                        │
├──────────────────────────────────────────────────────────────┤
│  Execution (Docker sandbox)                                   │
│  ├─ AFL++ (multi-strategy, Thompson Sampling MAB)            │
│  ├─ libFuzzer (in-process, coverage-guided)                  │
│  └─ QEMU/KVM (kernel targets)                               │
├──────────────────────────────────────────────────────────────┤
│  Storage: PostgreSQL │ Redis │ R2/S3 │ SQLite (dev)          │
└──────────────────────────────────────────────────────────────┘

Harness Synthesis

Regex-based static analysis of .h files identifies function declarations
Typedef resolution maps library-specific types to canonical C types (Bytef → unsigned char)
Entry points scored by argument shape: (const uint8_t*, size_t) = 1.0, (const char*) = 0.7
LangGraph state machine: analyze_code → generate_harness → validate_harness → [retry|end]
Struct definitions extracted from headers and injected into LLM context
Usage examples mined from test/ and examples/ directories

Coverage Feedback Loop

Source-based coverage via -fprofile-instr-generate -fcoverage-mapping
llvm-cov export produces line-level hit/miss data (lcov format)
Coverage analyzer identifies blockers: magic values, length checks, checksums, state machines, null guards
Dictionary generator extracts comparison literals into custom.dict for token-aware mutation
MAB strategist (Thompson Sampling) pivots between 5 fuzzer configurations on plateau

Execution Sandbox

Every fuzzer container runs with:

Constraint	Value
Network	`--network none`
Filesystem	`--read-only`
Capabilities	`--cap-drop ALL`
PIDs	`--pids-limit 1024`
Scratch	Size-capped tmpfs on `/tmp` and `/dev/shm`
Disk quota	`--storage-opt size=5G` (overlay2+xfs+pquota)

AFL++ containers additionally receive --cap-add SYS_PTRACE for forkserver operation.

Workflow Durability

Built on Temporal:

Campaigns survive worker crashes; activities resume from last heartbeat
Exponential backoff retry with non-retryable error classification
Horizontal scaling via additional worker processes
God-Mode signals: pause_hunt, force_pivot, inject_seed for live operator control

See docs/architecture.md for the full technical reference.

Target Compatibility

Compatible	Requires Manual Tuning	Unsupported
C/C++ libraries with CMake/Make/Meson	Bazel builds, complex monorepos	Closed-source binaries
Parser libraries (image, archive, crypto, font)	Custom toolchains, autotools edge cases	Windows-only (MSVC)
Projects with existing fuzz harnesses	Deeply nested struct-init APIs	Managed languages
Standard `(buf, size)` entry points	Callback-driven APIs (SAX parsers)	Network daemons (stateful protocols)

Validated targets: zlib, libpng, libjpeg-turbo, freetype, openssl, libxml2, harfbuzz, libarchive, pcre2.

Scope & Limitations

Detects

Class	Sanitizer
Heap/stack buffer overflow	ASan
Use-after-free, double-free	ASan
Null pointer dereference	ASan (SIGSEGV)
Integer overflow	UBSan
Uninitialized reads	ASan (partial)

Does Not Detect

Class	Reason
Race conditions / TOCTOU	Single-threaded harnesses; no TSan
Deadlocks, livelock	No thread scheduling manipulation
Logic bugs in async code	Incompatible with byte-mutation model
Timing side-channels	Requires statistical analysis, not fuzzing

CrashWise generates single-threaded LLVMFuzzerTestOneInput harnesses instrumented with ASan+UBSan only. Concurrency bugs require ThreadSanitizer or rr.

Development

git clone https://github.com/yahyatoubali/Crashwise.git && cd Crashwise
uv sync  # or: pip install -e ".[dev]"

uv run pytest tests/ -v        # test
uv run ruff check crashwise/   # lint
uv run mypy crashwise/         # type check

License

MIT — see LICENSE.

Built by Yahya Toubali.

Name		Name	Last commit message	Last commit date
Latest commit History 185 Commits
.github		.github
corpus		corpus
crashwise		crashwise
docs		docs
harnesses		harnesses
infra/temporal/dynamicconfig		infra/temporal/dynamicconfig
scripts		scripts
tests		tests
.editorconfig		.editorconfig
.env.example		.env.example
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
Dockerfile.frontend		Dockerfile.frontend
Dockerfile.worker		Dockerfile.worker
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
docker-compose.yaml		docker-compose.yaml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CrashWise

Pipeline

Quick Start

CLI Reference

Configuration

Required

Optional

Architecture

Harness Synthesis

Coverage Feedback Loop

Execution Sandbox

Workflow Durability

Target Compatibility

Scope & Limitations

Detects

Does Not Detect

Development

License

About

Uh oh!

Releases

Sponsor this project

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

CrashWise

Pipeline

Quick Start

CLI Reference

Configuration

Required

Optional

Architecture

Harness Synthesis

Coverage Feedback Loop

Execution Sandbox

Workflow Durability

Target Compatibility

Scope & Limitations

Detects

Does Not Detect

Development

License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Contributors

Uh oh!

Languages