[EPIC] IronCache: vision, tenets, scope, and the research-to-design plan

# [EPIC]: IronCache vision and master index

> The most efficient Redis-wire-compatible cache in the world, shipped as one Rust static binary.
> Tenets, ranked and non-negotiable in this order: **Compatible > Efficient > Simple > Scalable > AI-Driven.**

This issue is the index of the whole project. Every other issue hangs off the map at the bottom. If an issue is not in the map, it is either an orphan to be re-parented or a duplicate to be closed.

## Why this matters

Every incumbent in this space is wrong in a specific, fixable way. Each wrong is mapped to the tenet it violates.

- **Redis 8, single-threaded data core (violates Efficient).** The keyspace is owned by one thread; the rest of the box is spent on I/O threads and housekeeping. Throughput-per-core is capped by design, and the standard answer is "run more shards / more processes," which is operational tax, not efficiency. IronCache treats per-core throughput as the headline number.
- **Valkey, inherited the same ceiling (violates Efficient).** Valkey is the credible community fork and our compatibility oracle, but it carries Redis's single-threaded keyspace architecture. It improves around the edges; it does not remove the core constraint. We respect it as the bar to beat on compatibility while beating it on efficiency.
- **KeyDB, multi-threaded but a locked shared keyspace, and dormant (violates Efficient and Simple).** KeyDB multithreaded by sharing one keyspace behind locks, trading contention for cores, and the project is effectively dormant. Shared-nothing thread-per-core removes the lock instead of optimizing it.
- **DragonflyDB, vertical-only and C++ (violates Scalable and Simple).** Dragonfly proved shared-nothing throughput is real, but it scales vertically (scale-up the box) rather than out, and it is a C++ codebase. IronCache is shared-nothing AND has a single-node-first, slot-ready path to horizontal distribution, in memory-safe Rust.
- **Memcached, no compatibility contract (violates Compatible).** Fast and simple, but it offers no rich data types and no Redis wire contract, so it is not a drop-in. We commit to a published RESP compatibility contract instead of "mostly works."
- **Garnet, managed runtime (violates Simple and Efficient).** Garnet shows the design space (RESP + tiered storage + great numbers) but runs on a managed .NET runtime with a GC and a runtime dependency. IronCache ships as one static native binary with no runtime, no GC pauses, and predictable tail latency.

The gap nobody fills: a memory-safe, single-static-binary, RESP-compatible cache whose primary axis of competition is **efficiency per core and memory per item**, that scales out, and that is honest about what it does not do.

## The five tenets, with measurable intent

1. **Compatible (highest).** Redis-wire compatibility is a published contract, not a vibe. We define compatibility tiers (Tier 0-4), pin a Valkey/Redis differential oracle, and refuse to ship a behavior claim without a differential or conformance test. Intent: an unmodified mainstream client and the common command set work against IronCache unchanged.
2. **Efficient.** The headline metrics are throughput-per-core and memory-at-a-fixed-hit-ratio, not aggregate ops/sec on a big box. Intent: beat Valkey on per-core throughput and beat Redis on bytes-per-stored-item at equal hit ratio, both proven on a reproducible harness.
3. **Simple.** One static binary, one config file, eviction and a memory ceiling ON by default, no sidecars or mandatory proxy, no managed runtime. Intent: install-to-first-GET measured in seconds, operable by reading one INFO output.
4. **Scalable.** Single-node-first, but the storage layout is slot-ready from day one so horizontal distribution is an unlock, not a rewrite. Intent: a Redis-Cluster-compatible client contract and online slot migration without a write freeze.
5. **AI-Driven (lowest, and strictly off the data path).** A background advisor that selects experts and autotunes bounded knobs against an efficiency objective. Intent: measurable headroom over a tuned W-TinyLFU + SIEVE baseline, with hard guardrails, hysteresis, rollback, and a kill-switch. Never per-request inference on the hot path.

The ranking is a tie-breaker rule: when two designs conflict, the higher tenet wins. Compatibility beats a clever efficiency trick; efficiency beats a scaling convenience; and AI never wins against any of the other four.

## Prior art

We do not get to assert "fastest" or "most memory-efficient" without receipts. Prior-art foundations, the pinned competitor landscape, and every quantitative claim we make about an incumbent live in `docs/PRIOR_ART.md`, with each claim recorded, sourced, and falsifiable in `docs/prior-art/claims.yaml`. Claims without a verifying test or a written correction are non-goals by policy. See #6 for the verification process and #9 for the measured single-core bar.

## What IronCache IS and is NOT

**IS:** a Rust, single static binary, RESP/Redis-wire-compatible, shared-nothing thread-per-core in-memory cache, with eviction and a memory ceiling on by default, transparent value compression, opt-in forkless snapshotting, a single-node-first but slot-ready distribution path, and an off-path AI advisor.

**IS NOT (committed non-goals):**
- No embedded scripting VM / Lua on the hot path; native atomic ops cover the common cases instead. See #10, #23.
- No Memcached protocol, no RDMA transport, no managed runtime. See #10.
- No `fork()`+copy-on-write snapshotting, no mandatory proxy, and no reliance on host THP/overcommit tuning. See #11.
- No strong consistency / zero write loss in the default async replication mode; strong consistency is an opt-in tier. See #12.
- No per-request neural/ML inference on the data path. See #13.
- No consistency or efficiency claim without its test or a written correction. See #14.

## Open decisions (cross-cutting)

These cut across multiple pillars and gate the architecture:
- Headline benchmark definition and methodology: throughput-per-core and memory-at-fixed-hit-ratio. See #7.
- Core concurrency model: shared-nothing thread-per-core. See #24.
- Default global allocator and memory-accounting strategy. See #41.
- Default eviction policy (SIEVE vs S3-FIFO vs W-TinyLFU-fronted FIFO). See #46.
- Ship a memory ceiling and eviction ON by default. See #45.
- Durability stance (ephemeral default, opt-in snapshot, later tiers). See #59.
- Default replication and consistency model (async primary/replica + WAIT). See #76.
- Compatibility tiering definition (Tier 0-4). See #16.
- Single-node-first roadmap with slot-ready storage layout. See #69.

## Acceptance criteria

The project has earned its name when, on the reproducible harness (#8) against the pinned oracle (#96), all of the following hold and each is backed by a committed test:

- **Throughput-per-core:** sustained single-core GET/SET throughput strictly exceeds Valkey 9.x single-core on identical hardware and payload mix, with the multiple reported (target: >= 1.5x per core on the standard mixed workload).
- **Memory-per-item:** resident bytes-per-stored-item at a fixed 95 percent hit ratio is below Redis 8 on the value-size survey corpus (target: <= 0.7x Redis bytes/item at equal hit ratio), measured with compression in its default posture.
- **Tail latency:** p99.9 GET latency at the target per-core throughput stays under a fixed bound with no GC-class pauses (target: p99.9 <= 1 ms at the documented load), demonstrating the no-managed-runtime claim.
- **Install-to-first-GET:** from downloading the single binary to a successful GET against a running default instance in under 60 seconds, with zero required config edits (eviction and memory ceiling already on).
- **Redis-conformance bar:** 100 percent pass on the declared Tier 0/1 command surface in the differential suite against pinned redis-server/valkey-server, with documented and tested behavior for every Tier 2+ deviation. See #95, #97, #16.

No headline claim ships without the corresponding row in `docs/prior-art/claims.yaml` and a green test, per #14.

## Issue map

Every planned issue is listed here, grouped by milestone. Nothing is orphaned.

### M0, Charter, claims, and decisions to make before building
- #2: scope, ranked tenets, five-pillar charter (the governing META).
- #3: glossary and load-bearing system invariants.
- #4: ADR index, decision register, and record format.
- #5: issue-tree coherence and deduplication audit.
- #6: prior-art foundations, claim verification, pinned competitor landscape.
- #7: decide headline metrics = throughput-per-core and memory-at-fixed-hit-ratio.
- #9: measure the single-core bar vs Redis 8 / Valkey / Dragonfly / Garnet.
- #10: committed non-goals register (scripting, Memcached protocol, RDMA, managed runtime).
- #11: non-goal, fork()+COW snapshotting, mandatory proxy, host THP/overcommit tuning.
- #12: non-goal, strong consistency / zero write loss in default async mode.
- #13: non-goal, per-request neural/ML inference on the data path.
- #14: non-goal, no consistency or efficiency claim without its test/correction.
- #16: decide and publish the compatibility tiering (Tier 0-4).
- #26: runtime bake-off, monoio vs glommio vs tokio+epoll on GET/SET.
- #32: hot-shard mitigation and memory-reclamation strategy under shard-per-core.
- #37: adaptive vs fixed encoding-conversion thresholds.
- #42: benchmark jemalloc/mimalloc/snmalloc under a cache workload.
- #45: decide to ship a memory ceiling and eviction ON by default.
- #47: benchmark SIEVE/S3-FIFO/W-TinyLFU/ARC/LIRS on cachemon corpus plus KV traces.
- #53: decide default codec = zstd low-level; LZ4 and none as policy options.
- #57: cache value-size and compressibility distribution survey.
- #59: decide durability stance (ephemeral default, opt-in snapshot, warm-restart, later tiers).
- #61: bound and enforce snapshot memory overhead; fast parallel restart.
- #69: decide single-node-first roadmap with slot-ready storage layout.
- #78: research per-shard Raft for an opt-in strongly-consistent tier.
- #80: post-ketama consistent hashing for internal placement.
- #89: define the advisor objective metric (efficiency, not raw hit ratio).
- #90: quantify advisor headroom over a tuned W-TinyLFU + SIEVE baseline.
- #96: adopt Valkey 9.x as RESP differential oracle and head-to-head baseline.

### M1, Core architecture, decisions locked, foundational designs
- #8: reproducible benchmark and memory-model harness.
- #15: RESP protocol surface, parser, and compatibility tiers (protocol EPIC).
- #17: RESP3 reply-shaping policy and error-string fidelity.
- #18: Redis-compatible error-string catalog.
- #22: security surface (AUTH, requirepass, ACL, embedded TLS).
- #24: decide shared-nothing thread-per-core as the core concurrency model.
- #25: shared-nothing core runtime and the async/io stack (runtime EPIC).
- #30: decide transaction and scripting surface scope.
- #31: decide to design the runtime for determinism to enable DST.
- #33: decide epoch-based reclamation (crossbeam-epoch) vs custom drain-list.
- #34: narrow-waist storage API (Read/Upsert/Delete/RMW) under the RESP layer.
- #35: hash table, data-structure encodings, per-key object layout (datastructures EPIC).
- #36: decide per-shard single-thread HashMap vs shared concurrent map fallback.
- #41: decide default global allocator and memory-accounting strategy.
- #43: online defragmentation strategy.
- #44: decide THP and snapshot stance (MADV_NOHUGEPAGE heap, non-fork serialization).
- #46: decide the default eviction policy.
- #48: pluggable EvictionPolicy trait and ghost queue.
- #49: W-TinyLFU frequency admission filter (CM-sketch + doorkeeper + aging).
- #50: map Redis maxmemory-policy names onto IronCache's internal engine.
- #52: transparent value compression strategy (compression EPIC).
- #54: decide C-bound zstd vs pure-Rust zstd for the static binary.
- #55: ZDICT per-prefix dictionary training, versioning, and tagging.
- #58: persistence, forkless snapshot, storage-engine architecture (persistence EPIC).
- #65: decide to reject RocksDB/LSM as the core cold engine; hybrid-log vs F2.
- #68: single-node to multi-node distribution (clustering EPIC).
- #70: Redis-Cluster-compatible client contract (16384 slots, CRC16, hash tags, MOVED/ASK).
- #71: decide internal shard map decoupled from the 16384 compatibility slots.
- #72: decide keyspace partition count as dual-purpose shard/migration unit.
- #73: Raft-managed authoritative slot map and in-binary HA control plane.
- #74: SWIM + Lifeguard data-plane membership and failure detection.
- #76: decide default replication and consistency model (async + WAIT).
- #81: single static binary, CLI, and single-binary operations (binary EPIC).
- #82: decide clap subcommands vs argv[0] symlink mode-switching and artifact signing.
- #86: observability (Prometheus /metrics, INFO/SLOWLOG/LATENCY parity).
- #88: AI-driven background advisor (expert selection + bounded knob autotuning) (AI EPIC).
- #91: advisor safety guardrails (bounded knobs, hysteresis, rollback, kill-switch).
- #94: AI-assisted development pipeline with adversarial claim verification.
- #95: conformance, differential, fuzz, property, and DST testing stack (testing EPIC).

### M2, Advanced engine, distribution, and the harder tests
- #19: MULTI/EXEC/DISCARD/WATCH with optimistic locking and no rollback.
- #20: unified server-push channel (Pub/Sub, sharded Pub/Sub, keyspace notifications, CSC).
- #21: CLIENT TRACKING (BCAST + RESP3 push default, per-client table, RESP2 REDIRECT).
- #23: native atomic-op set covering common scripting use cases without a Lua VM.
- #27: runtime/IO abstraction layer keeping monoio/glommio/tokio swappable.
- #28: io_uring fast path with registered buffers and multishot ops.
- #29: cross-shard coordinator and transaction/scripting surface.
- #38: segmented extendible-hash index (Dash-style) with SIMD fingerprint probing.
- #39: intset and HyperLogLog sparse/dense encodings for wire compatibility.
- #40: OBJECT ENCODING / DEBUG OBJECT compatibility mapping.
- #56: compression interaction with mutating commands and hot-key cost.
- #60: forkless versioned point-in-time snapshot and diskless full-sync.
- #62: mmap warm-restart (graceful shutdown + state file + pointer fixup).
- #63: segment + atomic manifest durable log with corruption recovery.
- #64: HybridLog storage engine with in-place hot-set updates.
- #66: tiered RAM->SSD value store (extstore-inspired).
- #67: io_uring snapshot/tiering write path with SQPOLL and fallback.
- #51: TTL expiration via per-shard timing wheel with lazy backstop and background reclamation.
- #75: atomic, snapshot-streamed online slot migration without write freeze.
- #77: offset-based async replication with adaptive, disk-spillable backlog.
- #79: opt-in active-active CRDT mode (reject blanket LWW; principled CRDT/HLC).
- #83: ironcache upgrade with verified rollback.
- #84: packaging, cross-build matrix, reproducible builds, SBOM, musl penalty research.
- #85: TOML config with CONFIG GET/SET/REWRITE parity and live reload.
- #87: continuously-reported online Belady-MIN gap metric.
- #92: off-path per-value compression decision model.
- #93: offline Belady-MIN and learned-Belady oracle in the benchmark harness.
- #97: differential testing against pinned redis-server/valkey-server.
- #98: property-based and model-based tests for every data type.
- #99: Jepsen + Elle test plan for clustering/replication.
- #100: seeded fault-injection and corruption scenarios.

## References

- `docs/PRIOR_ART.md`, competitor landscape, the specific incumbent claims above, and their sources.
- `docs/prior-art/claims.yaml`, machine-readable claim register; every quantitative claim has a row and a verifying test or correction.
- `docs/research/`, research-issue outputs (benchmark bake-offs, eviction corpus results, value-size survey, runtime bake-off, allocator benchmarks).

## Post-audit additions (2026-06-13)

The pre-implementation audit (see [docs/AUDIT.md](../blob/main/docs/AUDIT.md)) filed these issues. Decompositions of too-large issues:

- from #11: #101, #102, #103
- from #22: #104, #105, #106
- from #29: #107, #108, #109
- from #35: #110, #111, #112, #113
- from #39: #114, #115, #116
- from #41: #117, #118
- from #82: #119, #120
- from #84: #121, #122, #123, #124, #125
- from #88: #126, #127

Coverage-gap issues:

- M0: #132, #146, #155, #156, #157, #162
- M1: #128, #129, #133, #136, #137, #138, #140, #141, #142, #144, #145, #147, #149, #150, #152, #153, #154, #159
- M2: #130, #131, #134, #135, #139, #143, #148, #151, #158, #160, #161, #163

## Implementation readiness

Sequencing of the whole tree into a critical path to first code lives in #164 and [docs/ROADMAP.md](../blob/main/docs/ROADMAP.md). The **Implementation Readiness** milestone holds the 42-issue gate set; `wave:0..3` labels carry the order; `critical-path` marks the thin first slice.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[EPIC] IronCache: vision, tenets, scope, and the research-to-design plan #1

[EPIC]: IronCache vision and master index

Why this matters

The five tenets, with measurable intent

Prior art

What IronCache IS and is NOT

Open decisions (cross-cutting)

Acceptance criteria

Issue map

M0, Charter, claims, and decisions to make before building

M1, Core architecture, decisions locked, foundational designs

M2, Advanced engine, distribution, and the harder tests

References

Post-audit additions (2026-06-13)

Implementation readiness

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

[EPIC] IronCache: vision, tenets, scope, and the research-to-design plan #1

Description

[EPIC]: IronCache vision and master index

Why this matters

The five tenets, with measurable intent

Prior art

What IronCache IS and is NOT

Open decisions (cross-cutting)

Acceptance criteria

Issue map

M0, Charter, claims, and decisions to make before building

M1, Core architecture, decisions locked, foundational designs

M2, Advanced engine, distribution, and the harder tests

References

Post-audit additions (2026-06-13)

Implementation readiness

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions