Purpose
Rename "ingestion plugin" to "chunking strategy" in the new lamb-kb-server and align the surrounding vocabulary.
In standard RAG terminology, ingestion refers to the full pipeline from an external source into a queryable vector store — i.e. import → chunking → embedding → vector storage end-to-end. The plugins currently called "ingestion plugins" in lamb-kb-server implement only the chunking step; embedding and storage already live in separate plugin families (embedding vendor, vector-DB backend). Calling the chunking-only plugin an "ingestion plugin" misnames it and confuses new contributors.
After this rename, the codebase should consistently:
- use "ingestion" only for the full pipeline / workflow (e.g. ingestion pipeline, ingestion endpoint, ingestion job, ingestion worker, library-only ingestion) — those usages stay as they are; and
- use "chunking strategy" for the individual plugin that performs the chunking step.
The thesis already adopts this convention and the codebase should follow.
Acceptance criteria
- Rename the abstract base class (currently
IngestionPlugin or equivalent) and its four subclasses (Simple, Parent-Child, By Page, By Section) to use "ChunkingStrategy" naming
- Rename the plugin directory
backend/plugins/ingestion/ (or equivalent) to backend/plugins/chunking/
- Update the plugin registry, registration decorators, and discovery walk
- Update internal docstrings, comments, and CLAUDE.md
- Update the README of
lamb-kb-server
- Update test names and test descriptions in
backend/tests/
- For HTTP routes and request/response schemas that surface "ingestion plugin" terminology, either rename or expose a backward-compatible alias so existing LAMB backend callers do not break
- Coordinate the LAMB backend's
knowledge_store_client.py to use the new field/path names once the server change lands
- Confirm that all remaining uses of "ingestion" in the codebase refer to the full pipeline, not to the chunking-only plugin
Author: @NoveliaYuki
Purpose
Rename "ingestion plugin" to "chunking strategy" in the new
lamb-kb-serverand align the surrounding vocabulary.In standard RAG terminology, ingestion refers to the full pipeline from an external source into a queryable vector store — i.e. import → chunking → embedding → vector storage end-to-end. The plugins currently called "ingestion plugins" in
lamb-kb-serverimplement only the chunking step; embedding and storage already live in separate plugin families (embedding vendor, vector-DB backend). Calling the chunking-only plugin an "ingestion plugin" misnames it and confuses new contributors.After this rename, the codebase should consistently:
The thesis already adopts this convention and the codebase should follow.
Acceptance criteria
IngestionPluginor equivalent) and its four subclasses (Simple, Parent-Child, By Page, By Section) to use "ChunkingStrategy" namingbackend/plugins/ingestion/(or equivalent) tobackend/plugins/chunking/lamb-kb-serverbackend/tests/knowledge_store_client.pyto use the new field/path names once the server change landsAuthor: @NoveliaYuki