#74, #75, #76 by LinseCed · Pull Request #87 · SprintStartProject/sprintstart-ai

LinseCed · 2026-06-11T11:43:22Z

This went a little bit out of scope : )

What was Done:
Replace the flat "retrieve → prompt → stream" chat path with an agentic pipeline, adds native LLM tool-calling, and ships a CLI test client.

How it fits together

ChatOrchestrator (SSE wrapper)
└─ OrchestratorAgent ← routes the question
└─ AgentTool("synthesis") ← exposes a sub-agent as a callable tool
└─ SynthesisAgent ← answers from the knowledge base
├─ RetrieveTool ├─ GrepTool └─ FetchFileTool
Every Agent runs the same two phases: gather (call tools until it has enough) then answer (synthesize a reply). Tools
are the leaves that touch data; an AgentTool lets one agent call another as just another tool.

Agents (src/agents/)

Agent (base) — shared engine. gather_stream loops up to max_steps, asking the LLM with the tool catalogue and
running returned tool calls until it stops; answer_stream synthesizes the final reply. Emits an Invocation per
tool/sub-agent used (drives tool_use events). User query fenced with a per-request random marker (injection guard).
OrchestratorAgent — top-level router. Delegates to the most relevant sub-agent; answers greetings/meta directly.
Streams a single delegation's answer straight through, or synthesizes once from multiple delegations' summaries.
SynthesisAgent — knowledge-base specialist. Picks among retrieve/grep/fetch_file to gather context, then answers
strictly from gathered sources.
ChatOrchestrator — not an Agent; wraps OrchestratorAgent and serializes its output as the SSE stream (tool_use →
token → citation → done).

Tools (src/agents/tools/)

Tool (base) / ToolRegistry — Pydantic-validated args, execute never raises, registry dispatches by name and exposes
JSON-schema specs to the LLM.
RetrieveTool — semantic + keyword search over the vector store; for conceptual/open-ended questions.
GrepTool — case-insensitive substring search; for exact identifiers or phrases.
FetchFileTool — returns all chunks of a named file; explicit extension matches exactly, bare name matches by stem.
AgentTool — adapts a sub-agent to the tool interface so agents can be composed; returns a deferred Delegation
(sub-agent gathers now, synthesizes only if needed).

LLM clients (src/llm/)

LLMClient protocol gains chat(messages, tools) plus ToolCall/ToolSpec/ChatResult; implemented natively for OpenAI
and Ollama (replaces the JSON workaround). Ollama tool-call IDs are uuid4-unique.

API & CLI

chat route runs through a new get_orchestrator dependency; SSE adds tool_use events (new ToolUseEvent schema).
ChatRequest no longer takes top_k/min_score (the pipeline owns retrieval depth). New scripts/chat_cli.py terminal
client (ask / ingest / history).

Tests

New tests/agents/ suite (agent loop, orchestrator, tools) and a ScriptedLLMClient stub that drives the tool loop
deterministically; added OpenAI/Ollama tool-calling coverage.

Type of PR — pick one:

Functional — adds or changes user-visible behavior
Non-functional — improves a measurable property (perf, security, a11y …)
Mixed — both
Internal — refactor / docs / tests only

1 Process baseline — every box must be ticked, on every PR

All acceptance criteria in the linked issue are met
1 review approval from a non-author
CI green: lint, type-check, unit tests, build, secret-scan
No secrets / tokens / credentials in the diff
No new TODO / FIXME without a follow-up issue
Docs updated if behavior, API, config, or architecture changed
No regression of other NFR baselines (a11y / perf / security)

2 Outcome proof — fill the bullets that match your PR type

Functional / Mixed PR:

≥ 1 black-box test that exercises the acceptance criteria
e.g. upload a markdown file, then assert the chat answer cites it

Non-functional / Mixed PR:

Measurement that proves the acceptance criteria, attached to the PR
e.g. benchmark output for chat p95 < 2 s, axe audit for a11y ≥ 90, scan report for 0 critical CVEs

3 Cross-cutting impact — tick the areas this PR touches

For each area below, ask: "does my PR touch this?"

-> If yes → tick the area and complete its sub-checks (they become mandatory).
-> If no → skip it.
-> If nothing applies → tick "None of the above".

…e-orchestrator

…g fixes

…ator' into 75-create-slim-pipeline-orchestrator

LinseCed and others added 19 commits June 9, 2026 17:52

Add all files + implement v1 of the synthesis agent

f16b22e

Full agentic workflow

6750555

Merge remote-tracking branch 'origin/dev' into 75-create-slim-pipelin…

5483262

…e-orchestrator

Add OpenAI & Ollama native Tool Support and remove JSON workaround

778ae70

Small CLI Client for manual testing

4e47597

Refactoring, hardening against prompt injection, Formatting, Small bi…

db7027f

…g fixes

Quickfix for consitency

c6e8fe0

Add AGENT_DEBUG to README and .env.example

0a85291

Merge branch 'dev' into 75-create-slim-pipeline-orchestrator

e738ffe

Upate README.md to include tool use event

faec9e3

Merge remote-tracking branch 'origin/75-create-slim-pipeline-orchestr…

7e01411

…ator' into 75-create-slim-pipeline-orchestrator

Retry if first try didnt call any tools

9f0ee16

Quickfix

c575812

Add seeding to improve performance for local models

1f2e369

Set Temmperatur to 0.1, some finetuning

c08e9d5

Add SplitLLMClient and rethink OrchestratorAgent

9ee5659

Remove greeting heuristic

c53e150

Make it slim again

7976346

Make test client render markdown live

2e39937

LinseCed requested review from Afif-del and DaniloTatti June 12, 2026 16:57

Afif-del approved these changes Jun 13, 2026

View reviewed changes

LinseCed merged commit 9418e8c into dev Jun 13, 2026
4 checks passed

LinseCed deleted the 75-create-slim-pipeline-orchestrator branch June 14, 2026 22:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

#74, #75, #76#87

#74, #75, #76#87
LinseCed merged 19 commits into
devfrom
75-create-slim-pipeline-orchestrator

LinseCed commented Jun 11, 2026 •

edited by DaniloTatti

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

LinseCed commented Jun 11, 2026 • edited by DaniloTatti Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

1 Process baseline — every box must be ticked, on every PR

2 Outcome proof — fill the bullets that match your PR type

3 Cross-cutting impact — tick the areas this PR touches

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

LinseCed commented Jun 11, 2026 •

edited by DaniloTatti

Loading