open-research

Local deep-research multi-agent system: Planner / Finder / Summarizer / Reviewer / Writer pipeline with SSE telemetry, durable sessions, and PDF + Markdown report exports.

App Screenshots

Home page with a search active

Result screen with PDF download, Markdown download, and preview with sources

Config panel

Sessions menu

Production-oriented local deep-research platform with:

multi-agent orchestration (Planner -> Finder -> Summarizer -> Reviewer -> Writer)
durable session + document persistence (SQLite on host-mounted path)
real-time streaming execution telemetry (SSE)
frontend Mission Control workspace + dedicated result screen
configurable per-run research controls (iterations, sources, memory, report length)

What Is Implemented

Backend: FastAPI + LangGraph orchestration + Ollama adapter
Frontend: React + Vite + Zustand + Framer Motion
Persistence:
- session snapshots
- event stream history
- final report documents (JSON + Markdown)
Session memory:
- new runs can use recent completed sessions as planning context
UI:
- System / Light / Dark theme modes
- bounded unified workspace panel (agent pipeline + event log aligned height)
- session history drawer with quick open/download/delete actions
- separate full result screen
Testing:
- backend test suite under backend/tests/

Architecture

Runtime flow

User starts research from frontend.
Backend creates a session and starts graph execution.
Graph emits events through SSE (/api/research/{session_id}/events).
Backend persists events + session state in SQLite.
Writer produces final report.
Backend persists report as:
- report_json
- report_markdown
Frontend opens dedicated result screen for final report.

Agent flow

Planner: decomposes query into sub-questions
Finder: discovers diverse sources
Summarizer: extracts evidence from fetched content
Reviewer: detects gaps and decides iteration
Writer: synthesizes final report with citations

Repository Layout

backend/
  app/
    agents/
    api/
    core/
    models/
  tests/
frontend/
  src/
    components/
    hooks/
    pages/
    stores/
    types/
ollama/

Prerequisites

Docker + Docker Compose
AMD GPU + ROCm setup (for current Ollama container profile)
Free ports: 5173, 8000, 11434

Environment Configuration

Copy env template:

cp .env.example .env

Set GPU group IDs for your machine:

getent group video | cut -d: -f3
getent group render | cut -d: -f3

Configure persistence path (host machine):

# Host path for Ollama model data
OLLAMA_MODELS_DIR=./data/ollama
OLLAMA_MODEL=gpt-oss:20b
OLLAMA_CONTEXT_LENGTH=8192
OLLAMA_KEEP_ALIVE=5m

# Host path for backend persistent data
BACKEND_DATA_DIR=./data/backend

# Path used inside backend container
DATABASE_PATH=/app/data/research.db

docker-compose.yml maps:

${OLLAMA_MODELS_DIR}:/ollama-models
${BACKEND_DATA_DIR}:/app/data

This keeps both Ollama models and backend sessions/documents on your PC between restarts/redeploys.

Quick Start

docker compose up --build -d

Open:

Frontend: http://localhost:5173
Backend API: http://localhost:8000
Custom API docs: http://localhost:8000/custom-docs

Health checks:

curl http://localhost:8000/health
curl http://localhost:8000/api/status

Research Runtime Options

Per-session options are sent in POST /api/research/start.

Option	Type	Purpose
`maxIterations`	int	Max reviewer/planner loops
`maxSources`	int	Global source cap
`maxSourcesPerQuestion`	int	Cap per sub-question
`searchResultsPerQuery`	int	Search hits requested per query
`sourceDiversity`	bool	Domain diversity enforcement
`reportLength`	`short	medium
`includeSessionMemory`	bool	Include prior completed sessions
`sessionMemoryLimit`	int	Max prior sessions loaded
`summarizerSourceLimit`	int	Max sources deep-summarized

Example request

curl -X POST http://localhost:8000/api/research/start \
  -H "Content-Type: application/json" \
  -d '{
    "query": "Latest practical advances in retrieval-augmented generation",
    "options": {
      "maxIterations": 4,
      "maxSources": 16,
      "maxSourcesPerQuestion": 5,
      "searchResultsPerQuery": 6,
      "sourceDiversity": true,
      "reportLength": "long",
      "includeSessionMemory": true,
      "sessionMemoryLimit": 3,
      "summarizerSourceLimit": 8
    }
  }'

API Surface

Core endpoints

GET /health
GET /api/status
POST /api/research/start
GET /api/research/{session_id}/events (SSE)
POST /api/research/{session_id}/stop
GET /api/research/{session_id}/status
GET /api/research/sessions
DELETE /api/research/sessions/{session_id}
GET /api/research/sessions/{session_id}/report
GET /api/research/sessions/{session_id}/documents
GET /api/research/sessions/{session_id}/documents/{document_id}

Development diagnostics endpoints

The project currently includes api/test/* endpoints for validating individual agents and streaming behavior in local/dev environments.

Frontend UX Notes

Workspace includes:
- top research input + progress tracker
- unified panel with agent pipeline (left) and event log (right)
- session history in a slide-out drawer
Running sessions can be resumed after refresh; event history is replayed before live events continue.
Result report is rendered on a separate screen (ResultScreen) instead of inline with running telemetry.
Theme supports system/light/dark with persistent preference.

Persistence Model

SQLite tables:

sessions
session_events
session_documents

Saved documents per completed session:

<session_id>-json (report_json)
<session_id>-markdown (report_markdown)

Ollama Data Migration (If You Previously Used Named Volume)

If older runs used Docker named volume open-research_ollama_data, migrate once:

mkdir -p ./data/ollama
docker run --rm \
  -v open-research_ollama_data:/from \
  -v "$(pwd)/data/ollama:/to" \
  alpine sh -c "cp -a /from/. /to/"

Then set OLLAMA_MODELS_DIR=./data/ollama in .env and start normally with the bind mount.

Testing

Backend tests (recommended inside Docker)

docker exec deepresearch-backend sh -lc "cd /app && PYTHONPATH=/app uv run --extra dev pytest -q tests"

Frontend production build

docker compose build frontend

Operations

Start / stop

docker compose up --build -d
docker compose down

Logs

docker compose logs -f backend
docker compose logs -f frontend
docker compose logs -f ollama

Rebuild one service

docker compose build backend
docker compose up -d backend

Production Hardening Checklist

If deploying outside local machine scope, add:

reverse proxy (TLS termination, request limits)
authn/authz for API + UI
stricter network boundaries (no public Ollama port)
backup strategy for ${BACKEND_DATA_DIR}
centralized logging/metrics
CORS policy tailored to your domain topology

Known Constraints

System is optimized for local/self-hosted operation.
Ollama model and ROCm requirements are hardware-dependent.
Full research latency depends on model speed and web source availability.

License

MIT for original code in this repository (FastAPI orchestration backend, React frontend, scripts, Compose configs). Third-party services pulled at runtime (Ollama, the underlying LLM, web search providers) retain their own upstream licenses.

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
backend		backend
docs/diagrams		docs/diagrams
frontend		frontend
ollama		ollama
screenshots		screenshots
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
start.sh		start.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

open-research

App Screenshots

What Is Implemented

Architecture

Runtime flow

Agent flow

Repository Layout

Prerequisites

Environment Configuration

Quick Start

Research Runtime Options

Example request

API Surface

Core endpoints

Development diagnostics endpoints

Frontend UX Notes

Persistence Model

Ollama Data Migration (If You Previously Used Named Volume)

Testing

Backend tests (recommended inside Docker)

Frontend production build

Operations

Start / stop

Logs

Rebuild one service

Production Hardening Checklist

Known Constraints

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

open-research

App Screenshots

What Is Implemented

Architecture

Runtime flow

Agent flow

Repository Layout

Prerequisites

Environment Configuration

Quick Start

Research Runtime Options

Example request

API Surface

Core endpoints

Development diagnostics endpoints

Frontend UX Notes

Persistence Model

Ollama Data Migration (If You Previously Used Named Volume)

Testing

Backend tests (recommended inside Docker)

Frontend production build

Operations

Start / stop

Logs

Rebuild one service

Production Hardening Checklist

Known Constraints

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages