Starling Memory

English | 中文

Agent memory with a multi-subject social mind and brain-inspired dynamics.

Starling gives an LLM agent something a vector store can't: a persistent, evolving model of each person it talks to — a picture of the other, what the agent believes about them, and what the agent thinks they believe — together with brain-like memory dynamics: fast write / slow consolidation, priority replay, reconsolidation (without overwrite), adaptive forgetting, salience modulation, and prospective triggers.

It is not user_id isolation plus vector-RAG. It's a three-part system — a data model, a runtime scheduler, and a retrieval planner — that can sit on top of mem0 / Letta / cognee / Graphiti rather than replace them.

C++20 core, raw SQLite, with Python (pybind11) bindings.

Why it's different

Seven things mainstream agent-memory stacks collectively lack:

Cognizer is a first-class entity, not a user_id column. Memory is attributed to cognitive subjects.
Statement replaces Fact. Every write is "who, when, on what evidence, about whom, with what modality and polarity, holds what judgment."
Second-order Theory of Mind in the data model — nested Statements + nesting_depth + adaptive ToM order ("I believe that you believe…").
A brain-like six-state lifecycle — consolidation_state ∈ {VOLATILE, REPLAYING_CONSOLIDATING, REPLAYING_RECONSOLIDATING, CONSOLIDATED, ARCHIVED, FORGOTTEN}.
Reconsolidation never overwrites. Recalling a memory opens a plastic window; the old version enters a supersedes chain instead of being deleted.
Real prospection — typed Triggers (time / event / state / compound) + a Commitment five-state machine that wakes the agent without a user query.
Perspective-aware retrieval + mind summaries — retrieval is reconstructed per (querier, perspective, intent, goal) with explicit abstention, not a fan-out of tools.

Non-goals: it does not rewrite the vector store, do training, or chase formal completeness.

Three axioms

I — No isolated facts, only statements attributed to a subject. Every memory is Statement(holder, subject, predicate, object, modality, polarity, time, evidence, confidence). One shape solves attribution, conflict, retraction, perspective, and second-order ToM.
II — Two timescales cooperate (Complementary Learning Systems). Writes land in the Hippocampus (VOLATILE), then pass through Replay, pattern separation/completion, and reconsolidation before rising into Neocortex as stable semantics / norms / skills / personas.
III — Memory is reconstructed for the current goal, not replayed like a tape (Conway SMS). Retrieval returns a perspective-shaped mind summary, with the option to abstain.

Architecture

A Statement Bus is the spine — every read and write goes through it. Around it sit twelve subsystems (full design in docs/design/):

Subsystem	Responsibility
Substrate (`04`)	SQLite persistence, capability/preflight, Projection Index
Statement Bus (`05`)	Outbox-backed event bus; all reads/writes pass through it
Governance (`05`)	RuntimeHealth (READY/DEGRADED/UNREADY), preflight gate
EngramStore (`06`)	Evidence storage + ingest policy
Hippocampus (`06`)	VOLATILE buffer, pattern separation, Affect Buffer
Neocortex (`07`)	Persona / CommonGround containers (materialized, CAS-versioned)
Cognizer Hub (`08`)	Subject registry, alias normalization, social-graph edges
Theory of Mind (`09`)	Mentalizing primitives, nested beliefs, common ground
Replay Scheduler (`10`)	Forgetting curve, SWR priority sampling, 5 consolidation ops
Reconsolidation (`11`)	Plastic windows, arbitration, supersedes chains
Prospective Loop (`12`)	Commitment state machine, typed Triggers, ActionGuard
Retrieval (`13`)	Perspective filter, semantic recall, context-pack builder

Tech stack

C++20 · raw SQLite (≥3.46) · libcurl · nlohmann/json · OpenSSL · pybind11 · Python ≥3.11 · CMake (≥3.27) + Ninja · ctest + pytest.

Dashboard (optional): FastAPI · uvicorn · SvelteKit (Svelte 5) · Tailwind · TypeScript (Node ≥20). See Dashboard.

Build & test

Recommended path from the repo root:

python3 -m venv .venv
source .venv/bin/activate
python -m pip install -U pip
pip install -r requirements-build.txt
python scripts/configure_build.py --build --test

python scripts/configure_build.py --build --test configures first, then builds and runs C++ tests. It does not install the Python package. The script is local-first: it reuses tools and dependency sources from .venv, conda package caches, Homebrew, system packages, existing build/_deps sources, and finally the active conda environment as a fallback before letting CMake FetchContent use the network for missing nlohmann/json or GoogleTest. On Linux it uses build-linux by default; on macOS it uses build-macos.

Direct CMake users can run the generic development preset:

cmake --preset dev
cmake --build --preset dev
ctest --preset dev

Or use the platform-specific build directory presets:

cmake --preset linux-local        # Linux
cmake --build --preset linux-local
ctest --preset linux-local

cmake --preset macos-local        # macOS
cmake --build --preset macos-local
ctest --preset macos-local

These direct CMake paths do not use the script's local-first dependency hints. The linux-local and macos-local presets select build-linux / build-macos and a Release build, but still rely on CMake's normal dependency discovery. Use them when CMake can already find SQLite >= 3.46, OpenSSL, libcurl, Linux ICU, pybind11, Ninja, and the other build inputs; otherwise use python scripts/configure_build.py.

Python editable install, required for from starling import _core, examples such as python examples/quickstart.py, and the Dashboard's Python package prerequisite:

python scripts/configure_build.py --python-editable

After changing C++ sources, migrations/, or bindings, rebuild the C++ tree:

python scripts/configure_build.py --build

# Or, after configuring with a direct CMake preset:
cmake --build --preset linux-local   # Linux
cmake --build --preset macos-local   # macOS

If you use Python editable imports, examples, or the Dashboard, rerun python scripts/configure_build.py --python-editable after C++/migration/binding changes so the installed _core extension is refreshed.

Prerequisites

Python ≥3.11 — check with python3 --version.
A C++20 compiler + git. macOS: xcode-select --install (installs Apple Clang + git). Linux: sudo apt install build-essential git.
The C++ core links SQLite, OpenSSL, libcurl, ICU, and uses nlohmann/json plus GoogleTest for C++ tests. On Linux, install the dev headers when system packages are preferred: sudo apt install libsqlite3-dev libssl-dev libcurl4-openssl-dev libicu-dev.

After the editable install, from starling import _core (the bound C++ core) and the starling.Memory facade work; python examples/quickstart.py runs an offline end-to-end demo.

Troubleshooting

Stale build/: use build-linux, build-macos, or another fresh build dir.
SQLite: Starling requires SQLite >= 3.46.
Linux ICU: install libicu-dev or provide ICU CMake variables.
Offline FetchContent: rerun after a prior successful download or pass -DFETCHCONTENT_SOURCE_DIR_JSON=... and -DFETCHCONTENT_SOURCE_DIR_GOOGLETEST=....
Conda linker wrappers: avoid CMake caches containing compiler_compat.
Conda libstdc++ conflicts: the helper script filters broad conda root rpaths and, for conda libcurl on Linux, adds narrow package-cache runtime paths. If configuring manually, do not add all of CONDA_PREFIX/lib or ~/miniconda3/lib to rpath.

How it works: scripts/configure_build.py drives the CMake + Ninja build with explicit local dependency hints, while scikit-build-core remains the build backend declared in pyproject.toml for Python editable installs. The migrations/*.sql are embedded into the binary at build time, so the schema travels with the core — there is no runtime migration step.

The Python surface is the bound C++ core: from starling import _core, with the application-facing starling.Memory facade on top (open / remember / recall / tick / render_working_set / close). See tests/python/ and examples/ for runnable examples of writing statements, retrieval, replay, reconsolidation, and the commitment lifecycle.

Dashboard

A web observability + interaction surface: a FastAPI engine-API fronted by a SvelteKit + Tailwind UI, organized as a brain-region map. Pages span interaction (chat-with-memory converse, raw remember / recall, working-set render, commitment reminders), cognitive inspection (statement explorer, cognizer social graph, commitment state machine, and Lens provenance forensics for a single memory), dynamics (replay with a per-batch dream log, lifecycle state distribution, decay forecast, conflict probe), vitals (read-only observability: pump lag, queue depth, recall-receipt attribution), and an overview + eval-report landing. Live updates stream over WebSocket; inspection panels read SQLite read-only, while commands — converse, remember, tick, plus interventions (forget, review, replay_trigger, reconsolidate) — run through the engine (with UI-configurable models).

Prerequisites: the Python package installed (see Build & test) plus Node ≥20 and npm (for the one-time frontend build).

One-click run:

source .venv/bin/activate
pip install -e ".[dashboard]"
python scripts/run_dashboard.py

That single command loads the unified config (~/.starling/starling.json, created on first run), generates a bearer token, builds the SvelteKit frontend if missing (needs Node; --no-build skips it), and serves the API + the static frontend on one port. It prints a login URL with the token in the URL fragment:

Dashboard ready → http://127.0.0.1:8787/#token=…

Open it — the frontend reads the token from the fragment (which browsers never send to the server, so it stays out of access logs) and stores it. No second terminal, no env vars to memorize.

Configure models in the UI: the dashboard starts with no model configured; inspection panels, recall, tick, and working-set work immediately (offline embedder), while remember (statement extraction) returns 409 until you set one. Open Settings (/settings) to manage a named provider registry (each entry: provider / model / base URL / API key) and bind each role — extraction, embedding, chat, consolidation — to a provider. extraction and embedding are wired today; chat drives converse. Changes are hot-swapped — no restart. Changing the embedding provider re-embeds existing memories (the vector dimension changes).

Unified config — ~/.starling/starling.json (0600): one file holds everything — db_path (defaults to ~/.starling/dashboard.db, auto-created), agent, tenant, token (auto-generated), host, port, cors_origins, the named-provider registry (providers), and the per-role bindings (roles), including each provider's API key. Legacy single-{llm, embedder} files are auto-migrated on load, so existing configs keep working with no manual edit. The file is gitignored and chmod-0600. Override its path with STARLING_CONFIG or --config; environment variables (STARLING_DASH_*, OPENAI_*) still override the file for CI / ephemeral use.

Remote access: bind a public interface (a token is always present, hence required), then share the printed #token= URL:

export STARLING_DASH_HOST=0.0.0.0
python scripts/run_dashboard.py

Put it behind a TLS reverse proxy. Security: all secrets live only in starling.json (0600, gitignored) plus process memory (a transient env-swap at adapter-build time) — they never enter git, the SQLite memory DB, or logs. GET /api/config returns only key_set booleans, never the token or full keys. The token travels in the URL fragment (never in access logs) and is compared in constant time. The SPA fallback has a path-traversal guard, the WebSocket endpoint enforces an Origin check (anti-CSWSH), and the static shell is public while every /api + /ws route is token-gated.

Tests: pytest tests/python/test_dashboard_*.py (config / engine / auth / inspection / commands / WebSocket, offline-deterministic); in dashboard/web/, npx vitest run (unit) and npx playwright test (e2e smoke).

Dev hot-reload: for frontend iteration, run the backend (python scripts/run_dashboard.py) and, in a second terminal, cd dashboard/web && npm run dev (Vite on :5173, proxying /api + /ws to :8787).

Repository layout

include/    C++ public headers (starling/...)
src/        C++ implementation (bus, evidence, extractor, cognizer, tom,
            replay, reconsolidation, neocortex, projection, vector,
            embedding, retrieval, prospective, affect, persistence)
bindings/   pybind11 module
python/     Python package (starling) — incl. the Memory facade and the
            dashboard FastAPI api (python/starling/dashboard)
migrations/ SQL schema migrations (embedded into the core at build time)
scripts/    Python tooling — eval harnesses + run_dashboard.py launcher
dashboard/  Dashboard frontend (dashboard/web, SvelteKit) + run docs
tests/      ctest (tests/cpp) + pytest (tests/python)
docs/       system design (docs/design) + specs & plans (docs/superpowers)

Documentation

The authoritative design lives in docs/design/system_design.md (main document: axioms, topology, data ontology, roadmap, trade-offs) plus twelve subsystem documents in docs/design/subsystems_design/. Per-milestone specs and implementation plans are under docs/superpowers/.

License

A license has not yet been chosen. Until one is added, all rights are reserved by the authors.

Name		Name	Last commit message	Last commit date
Latest commit History 735 Commits
.github/workflows		.github/workflows
bindings/python		bindings/python
cmake		cmake
dashboard		dashboard
docs		docs
examples		examples
include/starling		include/starling
integrations/openclaw		integrations/openclaw
migrations		migrations
python/starling		python/starling
scripts		scripts
src		src
tests		tests
.clang-format		.clang-format
.clang-tidy		.clang-tidy
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
CMakeLists.txt		CMakeLists.txt
CMakePresets.json		CMakePresets.json
README.md		README.md
README.zh-CN.md		README.zh-CN.md
pyproject.toml		pyproject.toml
requirements-build.txt		requirements-build.txt
requirements-dev.txt		requirements-dev.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Starling Memory

Why it's different

Three axioms

Architecture

Tech stack

Build & test

Dashboard

Repository layout

Documentation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Starling Memory

Why it's different

Three axioms

Architecture

Tech stack

Build & test

Dashboard

Repository layout

Documentation

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages