feat(#384): rename ingestion plugin → chunking strategy in lamb-kb-server #384

@NoveliaYuki

Purpose

Rename "ingestion plugin" to "chunking strategy" in the new lamb-kb-server and align the surrounding vocabulary.

In standard RAG terminology, ingestion refers to the full pipeline from an external source into a queryable vector store, i.e. import → chunking → embedding → vector storage end-to-end. The plugins currently called "ingestion plugins" in lamb-kb-server implement only the chunking step; embedding and storage already live in separate plugin families (embedding vendor, vector-DB backend). Calling the chunking-only plugin an "ingestion plugin" misnames it and confuses new contributors.

After this rename, the codebase should consistently:

  • use "ingestion" only for the full pipeline / workflow (e.g. ingestion pipeline, ingestion endpoint, ingestion job, ingestion worker, library-only ingestion) — those usages stay as they are; and
  • use "chunking strategy" for the individual plugin that performs the chunking step.

The thesis already adopts this convention and the codebase should follow.
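
To make the intended vocabulary concrete, here is a minimal sketch of how the two terms could map onto code. Every name in it (`ChunkingStrategy`, `SimpleChunkingStrategy`, `run_ingestion_pipeline`, the `embed` callable, the list-based `store`) is hypothetical and chosen for illustration; none is taken from the lamb-kb-server codebase, and the fixed-size splitter is only a stand-in for the real Simple strategy.

```python
from abc import ABC, abstractmethod
from typing import Callable, List, Tuple


class ChunkingStrategy(ABC):
    """Hypothetical base class: exactly one step of the ingestion pipeline."""

    @abstractmethod
    def chunk(self, text: str) -> List[str]:
        """Split a document's text into chunks."""


class SimpleChunkingStrategy(ChunkingStrategy):
    """Illustrative fixed-size splitter (an assumption, not the server's logic)."""

    def __init__(self, chunk_size: int = 4) -> None:
        if chunk_size <= 0:
            raise ValueError("chunk_size must be positive")
        self.chunk_size = chunk_size

    def chunk(self, text: str) -> List[str]:
        step = self.chunk_size
        return [text[i:i + step] for i in range(0, len(text), step)]


def run_ingestion_pipeline(
    text: str,
    strategy: ChunkingStrategy,
    embed: Callable[[str], List[float]],
    store: List[Tuple[str, List[float]]],
) -> int:
    """'Ingestion' names the whole flow: chunk, then embed, then store."""
    chunks = strategy.chunk(text)
    for piece in chunks:
        store.append((piece, embed(piece)))
    return len(chunks)
```

In this sketch the word "ingestion" appears only on the function that drives all stages, while the pluggable class carries the "chunking strategy" name.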

Acceptance criteria

  • Rename the abstract base class (currently IngestionPlugin or equivalent) and its four subclasses (Simple, Parent-Child, By Page, By Section) to use "ChunkingStrategy" naming
  • Rename the plugin directory backend/plugins/ingestion/ (or equivalent) to backend/plugins/chunking/
  • Update the plugin registry, registration decorators, and discovery walk
  • Update internal docstrings, comments, and CLAUDE.md
  • Update the README of lamb-kb-server
  • Update test names and test descriptions in backend/tests/
  • For HTTP routes and request/response schemas that surface "ingestion plugin" terminology, either rename or expose a backward-compatible alias so existing LAMB backend callers do not break
  • Coordinate an update to the LAMB backend's knowledge_store_client.py so that it uses the new field/path names once the server change lands
  • Confirm that all remaining uses of "ingestion" in the codebase refer to the full pipeline, not to the chunking-only plugin

Author: @NoveliaYuki

Labels: documentation, enhancement
