feat(memory_fs): LLM synthesis of MEMORY/SKILL from descriptions by sairin1202 · Pull Request #436 · NevaMind-AI/memU

sairin1202 · 2026-06-18T08:00:55Z

Stacked on #435 (base = feat/memory-file-artifacts).

Summary

Makes the MEMORY and SKILL bypasses synthesize directly from the shared multimodal description trunk via the LLM, instead of rendering already-extracted records. This realizes the "description → 3 sibling bypasses" model end-to-end. INDEX.md stays a deterministic table of contents.

Opt-in and additive: the existing memorize/extract/category pipeline is untouched; synthesis only changes what export_memory_files() writes.

How it works

New memu.memory_fs.MemorySynthesizer (prompts in memu.prompts.memory_fs):
- MEMORY.md: one LLM pass turns all per-source descriptions into a consolidated memory document (Profile / Preferences / Goals / Key Events).
- skill/<name>/SKILL.md: one LLM pass extracts skills as a JSON array of {name, body}, each written as its own doc (kebab-case slug, collision-safe).
MemoryFileExporter.export(...) gains optional memory_body / skills overrides; when present they replace the deterministic rendering. INDEX.md is always deterministic.
Gated by memory_files_config.synthesize (default off) + synthesis_llm_profile. Diff/manifest, scoping, and per-service lock all carry over.

Test plan

tests/test_memory_fs_synthesis.py (5): synthesizer parse/empty/helpers, exporter override path, full service wiring with a fake LLM (no network)
tests/test_memory_files.py (7) still green
Full suite: 91 passed, 1 skipped; ruff + mypy clean on changed files

Notes / follow-ups

"Description" currently = Resource.caption. A richer persisted per-source description (full preprocessed text, not just the caption) would improve synthesis quality — deferred.
Synthesis is whole-store each run (no incremental/diff-scoped LLM yet); pairs naturally with the future EntityCoordinator to run off dirty descriptions only.

Made with Cursor

…ions Add an opt-in synthesis mode where MEMORY.md and the skill/ tree are generated directly from the shared per-source descriptions by an LLM, instead of being rendered from already-extracted records. - New `MemorySynthesizer` (+ prompts in `memu.prompts.memory_fs`) turns the description trunk into a consolidated memory doc and a JSON list of skills. - Exporter gains `memory_body` / `skills` overrides; INDEX.md stays deterministic in both modes. - Gated by `memory_files_config.synthesize` (default off) and `synthesis_llm_profile`; existing memorize/extract pipeline is untouched. Tests use a fake LLM client (no network). Co-authored-by: Cursor <cursoragent@cursor.com>

Mirror the "submit the changed part of the file system" model: - First run (no MEMORY.md/skill tree) initializes the tree from all in-scope descriptions; subsequent runs incrementally merge only the changed sources' descriptions into the existing MEMORY.md body and skill bodies (upsert by slug). - Add MemorySynthesizer.update() with MEMORY_UPDATE_PROMPT / SKILL_UPDATE_PROMPT. - Add exporter read helpers (artifacts_exist / read_memory_body / read_skills) to feed the update path from disk. - MemoryService._build_memory_files() centralizes the init-vs-update decision; export_memory_files() keeps doing a full (re)initialization. - Gate an optional post-memorize hook behind memory_files_config.update_on_memorize that drives the builder with the just-created resources; best-effort so an export error never fails memorize. INDEX.md stays deterministic (recomputed from the current source set). Co-authored-by: Cursor <cursoragent@cursor.com>

sairin1202 and others added 2 commits June 18, 2026 00:59

sairin1202 mentioned this pull request Jun 18, 2026

refactor(app): decouple embedding, remove Rust, split flows, narrow DB protocol #437

Open

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(memory_fs): LLM synthesis of MEMORY/SKILL from descriptions#436

feat(memory_fs): LLM synthesis of MEMORY/SKILL from descriptions#436
sairin1202 wants to merge 2 commits into
feat/memory-file-artifactsfrom
feat/memory-fs-synthesis

sairin1202 commented Jun 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

sairin1202 commented Jun 18, 2026

Summary

How it works

Test plan

Notes / follow-ups

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant