FastAPI Chatbot

Author: Alyona Carolina Ivanova Araujo
Email: alenacivanovaa@gmail.com
Version: 1.0.1

A production-ready chatbot API built with FastAPI that leverages TF-IDF vectorization and cosine similarity for intelligent natural language processing and response generation.

What It Does

Automated conversational AI through an intelligent 3-stage pipeline:

graph LR
    A[User Message] --> B[Session Validation]
    B --> C[NLP Processing<br/>TF-IDF + Cosine Similarity]
    C --> D[Response Matching]
    D --> E[Confidence Check]
    E --> F[Intelligent Response]
    
    style A fill:#e3f2fd
    style B fill:#f3e5f5
    style C fill:#fff3e0
    style D fill:#e8f5e9
    style E fill:#fce4ec
    style F fill:#e3f2fd

Stage	Input	Output	Technology
Validate	Session ID + User message	Session verification	UUID-based tracking
Process	Raw text input	Vectorized representation	TF-IDF (scikit-learn)
Match	Vector + Knowledge base	Best matching responses	Cosine similarity
Deliver	Matched response	JSON API response	FastAPI + Pydantic

Performance Metrics

70%+ Test coverage with comprehensive unit and integration tests
<100ms Average response time (excluding NLP processing)
100% Type coverage with mypy static analysis
0.3+ Default confidence threshold for quality responses
Multilingual English and Norwegian support out of the box

Overview

This project implements a RESTful chatbot service following Clean Architecture principles with comprehensive session management, structured response matching, and enterprise-grade development tooling. The application is fully containerized and includes a complete CI/CD pipeline for automated testing, quality assurance, and deployment.

Why This Project Stands Out

Clean Architecture: Proper separation of concerns with domain-driven design
Test-Driven Development: Comprehensive unit and integration tests with 70%+ coverage
CI/CD Ready: Full GitHub Actions pipeline with matrix testing (Python 3.11, 3.12)
Modern Tooling: uv, Ruff, Black, mypy, Bandit, pre-commit hooks
Production Ready: Docker support, structured logging, environment-based configuration
Multilingual: English and Norwegian support out of the box
Type Safe: Full type hints with mypy static type checking

Key Features

Core Functionality

FastAPI Framework: High-performance async REST API with automatic OpenAPI/Swagger documentation
NLP Engine: TF-IDF vectorization with cosine similarity for intelligent response matching
Session Management: UUID-based conversation tracking with configurable TTL
Multilingual Support: English (en) and Norwegian (nb) with extensible language system
Confidence Scoring: Adjustable threshold for response quality control

Architecture & Design

Clean Architecture: Domain-centric design with dependency inversion principle
Repository Pattern: Abstracted data access with interface-based design
Dependency Injection: Loose coupling through FastAPI's DI system
Factory Pattern: Centralized object creation and lifecycle management
Service Layer: Business logic isolated from HTTP and infrastructure concerns

Development Excellence

Comprehensive Testing: Unit tests, integration tests, and API endpoint tests
Code Coverage: 70%+ coverage with HTML and XML reports
Code Formatting: Black (88 chars) + Ruff import sorting
Static Analysis: Ruff linting + mypy type checking
Security Scanning: Bandit for code security analysis
Makefile Automation: Streamlined development workflow commands
Pre-commit Hooks: Automated quality checks before every commit

DevOps & Deployment

Docker Ready: Full containerization with docker-compose support
CI/CD Pipeline: GitHub Actions with matrix testing (Python 3.11, 3.12)
Package Management: Setuptools configuration with versioning
Configuration Management: Environment-based settings with pydantic-settings
Structured Logging: Configurable log levels and formatting
CORS Support: Configurable cross-origin resource sharing

Quick Start

Get up and running in minutes:

# 1. Clone the repository
git clone https://github.com/AlyonaCIA/FastAPI_Chatbot.git
cd FastAPI_Chatbot

# 2. Install uv (if not already installed)
curl -LsSf https://astral.sh/uv/install.sh | sh

# 3. Setup development environment (recommended)
make install-dev

# 4. Run the application
make run

# 5. Test the API
curl -X POST "http://localhost:8080/api/v1/conversations/start" \
  -H "Content-Type: application/json" \
  -d '{"language": "en"}'

# 6. View interactive documentation
open http://localhost:8080/docs

The API will be available at http://localhost:8080 with:

Swagger UI: http://localhost:8080/docs
ReDoc: http://localhost:8080/redoc
OpenAPI Schema: http://localhost:8080/openapi.json

Technology Stack

Core Framework & Runtime

Python 3.11/3.12: Modern Python with latest performance improvements
FastAPI 0.109.0: High-performance async web framework
Uvicorn 0.27.0: Lightning-fast ASGI server
Pydantic 2.6.0: Data validation and settings management with pydantic-settings

NLP & Data Processing

scikit-learn 1.4.0: TF-IDF vectorization and cosine similarity
cachetools 6.1.0: Response caching for performance optimization

Development & Quality Tools

uv: Ultra-fast Python package installer and resolver (replacing pip)
pytest 8.0.0: Modern Python testing framework with plugins
- pytest-cov: Code coverage measurement
- pytest-mock: Mock object integration
- pytest-asyncio: Async test support
Black 24.1.0: Uncompromising code formatter (88 char line length)
Ruff 0.3.0: Extremely fast Python linter (replaces flake8, isort, and more)
mypy 1.8.0: Static type checker for type safety
Bandit 1.7.0: Security-focused static analyzer
pre-commit 3.6.0: Git hook framework for automated quality checks
- Trailing whitespace removal
- YAML/TOML validation
- Large file detection
- Branch protection (dev/main)
- Security checks
- Typo detection

Build & Deployment Tools

Makefile: Comprehensive task automation
Hatchling: Modern Python build backend
Docker: Containerization for consistent deployments

Architecture

The FastAPI Chatbot follows Clean Architecture principles with Domain-Driven Design (DDD), ensuring clear separation of concerns, high testability, and maintainability. The architecture is organized into concentric layers where dependencies flow inward toward the domain.

High-Level System Overview

%%{init: {'theme':'base', 'themeVariables': {'primaryColor':'#e3f2fd','primaryTextColor':'#000','primaryBorderColor':'#1565c0','lineColor':'#424242','secondaryColor':'#f3e5f5','tertiaryColor':'#e8f5e9'}}}%%
graph TB
    subgraph External["EXTERNAL WORLD"]
        Users["End Users<br/>Web Browsers<br/>Mobile Apps<br/>API Clients"]
        CI["CI/CD<br/>GitHub Actions<br/>Automated Testing<br/>Quality Checks"]
    end

    subgraph FastAPI["FASTAPI CHATBOT APPLICATION"]
        direction TB
        
        subgraph API["API Layer"]
            Router["REST Endpoints<br/>/conversations<br/>/health"]
            Middleware["Middleware<br/>CORS<br/>Error Handling<br/>Logging"]
            Validation["Pydantic<br/>Request Validation<br/>Response Serialization"]
        end
        
        subgraph Services["Business Services"]
            Chatbot["ChatbotService<br/>NLP Processing<br/>Response Matching<br/>Greeting Generation"]
            Session["SessionService<br/>Session Management<br/>History Tracking<br/>Validation"]
        end
        
        subgraph Data["Data Layer"]
            Repository["Repository<br/>Session Storage<br/>In-Memory"]
            KnowledgeBase["Knowledge Base<br/>Q&A Dialogues<br/>JSON Dataset"]
        end
    end

    subgraph ML["MACHINE LEARNING"]
        direction TB
        SKLearn["scikit-learn<br/>TF-IDF Vectorizer<br/>Cosine Similarity<br/>Text Processing"]
    end

    subgraph Tools["DEVELOPMENT TOOLS"]
        direction TB
        UV["uv<br/>Package Manager<br/>Virtual Environments"]
        Ruff["Ruff<br/>Linter & Formatter"]
        Pytest["pytest<br/>Testing Framework<br/>70%+ Coverage"]
    end

    %% User interactions
    Users -->|HTTP Requests, JSON| Router
    Router -->|Responses, JSON| Users
    
    %% API Flow
    Router --> Middleware
    Middleware --> Validation
    Validation --> Chatbot
    Validation --> Session
    
    %% Service to Data
    Chatbot --> KnowledgeBase
    Chatbot --> SKLearn
    Session --> Repository
    
    %% CI/CD
    CI -.->|Deploy| FastAPI
    CI -.->|Test| Tools
    
    %% Tool connections
    Tools -.->|Develop & Test| FastAPI
    
    %% Styling
    classDef externalClass fill:#e3f2fd,stroke:#1565c0,stroke-width:3px,color:#000
    classDef apiClass fill:#fff3e0,stroke:#e65100,stroke-width:2px,color:#000
    classDef serviceClass fill:#f3e5f5,stroke:#6a1b9a,stroke-width:2px,color:#000
    classDef dataClass fill:#e8f5e9,stroke:#2e7d32,stroke-width:2px,color:#000
    classDef mlClass fill:#ffebee,stroke:#c62828,stroke-width:2px,color:#000
    classDef toolClass fill:#fff9c4,stroke:#f57f17,stroke-width:2px,color:#000
    
    class Users,CI externalClass
    class Router,Middleware,Validation apiClass
    class Chatbot,Session serviceClass
    class Repository,KnowledgeBase dataClass
    class SKLearn mlClass
    class UV,Ruff,Pytest toolClass

System Architecture

%%{init: {'theme':'base', 'themeVariables': { 'fontSize':'16px', 'fontFamily':'arial'}}}%%
graph LR
    subgraph Client["CLIENT LAYER"]
        direction TB
        CL1["HTTP Client<br/>Browser/Postman/App"]
        CL2["REST API<br/>JSON over HTTP"]
    end

    subgraph API["PRESENTATION LAYER<br/>FastAPI Framework"]
        direction TB
        API1["FastAPI App<br/>CORS Middleware<br/>OpenAPI Docs<br/>Error Handling"]
        API2["Routes<br/>/conversations<br/>/health<br/>Versioned v1"]
        API3["Schemas<br/>Request Validation<br/>Response Models<br/>Pydantic"]
        API4["Dependencies<br/>Dependency Injection<br/>Service Factories"]
    end

    subgraph Domain["DOMAIN LAYER<br/>Business Logic - Pure Python"]
        direction TB
        DOM1["ChatbotService<br/>NLP Processing<br/>Response Matching<br/>TF-IDF Analysis"]
        DOM2["SessionService<br/>Session Management<br/>Business Rules<br/>Validation"]
        DOM3["Entities<br/>Session Model<br/>Domain Objects"]
        DOM4["Repository Interface<br/>Abstract Contracts<br/>No Implementation"]
    end

    subgraph Infra["INFRASTRUCTURE LAYER<br/>External Integrations"]
        direction TB
        INF1["Memory Repository<br/>In-Memory Storage<br/>Session Persistence"]
        INF2["Data Loaders<br/>JSON Data Source<br/>Q&A Knowledge Base"]
        INF3["NLP Engine<br/>scikit-learn<br/>TF-IDF Vectorizer<br/>Cosine Similarity"]
    end

    subgraph Core["CORE - Cross-Cutting"]
        direction TB
        CORE1["Configuration<br/>Environment Vars<br/>Settings Management"]
        CORE2["Logging<br/>Structured Logs<br/>Log Levels"]
        CORE3["Exceptions<br/>Custom Errors<br/>Error Handling"]
    end

    %% Client to API
    CL1 -->|HTTP Requests| API1
    CL2 -.->|JSON Payload| API2

    %% API to Domain
    API1 --> API2
    API2 --> API3
    API2 --> API4
    API4 -->|Inject Services| DOM1
    API4 -->|Inject Services| DOM2

    %% Domain Layer Internal
    DOM1 -.->|Uses| DOM3
    DOM2 -->|Uses| DOM3
    DOM2 -->|Depends on Interface| DOM4

    %% Domain to Infrastructure
    DOM1 -->|Loads Data| INF2
    DOM1 -->|NLP Processing| INF3
    DOM4 -.->|Implemented by| INF1
    DOM2 -->|Via Interface| INF1

    %% Core connections
    API1 -.->|Uses| CORE1
    API1 -.->|Uses| CORE2
    API2 -.->|Uses| CORE3
    DOM1 -.->|Logs| CORE2
    DOM2 -.->|Logs| CORE2
    INF1 -.->|Logs| CORE2

    %% Styling
    classDef clientStyle fill:#e3f2fd,stroke:#1565c0,stroke-width:3px,color:#000
    classDef apiStyle fill:#fff3e0,stroke:#e65100,stroke-width:3px,color:#000
    classDef domainStyle fill:#f3e5f5,stroke:#6a1b9a,stroke-width:3px,color:#000
    classDef infraStyle fill:#e8f5e9,stroke:#2e7d32,stroke-width:3px,color:#000
    classDef coreStyle fill:#fce4ec,stroke:#c2185b,stroke-width:3px,color:#000

    class CL1,CL2 clientStyle
    class API1,API2,API3,API4 apiStyle
    class DOM1,DOM2,DOM3,DOM4 domainStyle
    class INF1,INF2,INF3 infraStyle
    class CORE1,CORE2,CORE3 coreStyle

Key Architectural Principles

Dependency Rule: Dependencies point inward. Domain layer has no dependencies on outer layers
Interface Segregation: Domain defines interfaces, Infrastructure implements them
Single Responsibility: Each layer has a specific purpose and responsibility
Testability: Pure domain logic can be tested without external dependencies
Flexibility: Infrastructure can be swapped (e.g., Memory → Database) without changing domain

Conversation Flow Diagram

How a message flows through the system:

%%{init: {'theme':'base', 'themeVariables': {'fontSize':'14px'}}}%%
flowchart TD
    Start([User sends message]) --> API[<b>FastAPI Route</b><br/>/conversations/:id/messages]
    
    API --> Validate{<b>Validate</b><br/>Session exists?}
    Validate -->|No| Error1[404 Not Found]
    Validate -->|Yes| SessionSvc[<b>SessionService</b><br/>Add to conversation history]
    
    SessionSvc --> ChatbotSvc[<b>ChatbotService</b><br/>Process message]
    
    ChatbotSvc --> Vectorize[<b>TF-IDF Vectorization</b><br/>Convert text to vectors]
    Vectorize --> Similarity[<b>Cosine Similarity</b><br/>Compare with training data]
    
    Similarity --> Threshold{<b>Confidence</b><br/>> 0.3?}
    Threshold -->|No| Fallback[Fallback response<br/>"I don't understand"]
    Threshold -->|Yes| Match[Best match found]
    
    Match --> SelectAnswer[<b>Select Answer</b><br/>Random from matched responses]
    Fallback --> Store
    SelectAnswer --> Store[<b>Store in Session</b><br/>Update conversation history]
    
    Store --> Response([Return response to user])
    
    style Start fill:#e3f2fd,stroke:#1565c0,stroke-width:3px
    style API fill:#fff3e0,stroke:#e65100,stroke-width:2px
    style SessionSvc fill:#f3e5f5,stroke:#6a1b9a,stroke-width:2px
    style ChatbotSvc fill:#f3e5f5,stroke:#6a1b9a,stroke-width:2px
    style Vectorize fill:#e8f5e9,stroke:#2e7d32,stroke-width:2px
    style Similarity fill:#e8f5e9,stroke:#2e7d32,stroke-width:2px
    style Store fill:#e8f5e9,stroke:#2e7d32,stroke-width:2px
    style Response fill:#e3f2fd,stroke:#1565c0,stroke-width:3px
    style Error1 fill:#ffebee,stroke:#c62828,stroke-width:2px
    style Fallback fill:#fff9c4,stroke:#f57f17,stroke-width:2px
    style SelectAnswer fill:#c8e6c9,stroke:#388e3c,stroke-width:2px
    style Validate fill:#fff3e0,stroke:#f57c00,stroke-width:2px
    style Threshold fill:#fff3e0,stroke:#f57c00,stroke-width:2px
    style Match fill:#c8e6c9,stroke:#388e3c,stroke-width:2px

NLP Processing Pipeline

The chatbot uses scikit-learn for intelligent response matching:

%%{init: {'theme':'base'}}%%
graph LR
    A["<b>User Input</b><br/>'How are you?'"] --> B["<b>Preprocessing</b><br/>Lowercase<br/>Strip whitespace"]
    B --> C["<b>TF-IDF Vectorizer</b><br/>n-grams (1,2)<br/>Unicode normalization"]
    C --> D["<b>Vector Space</b><br/>Document-term matrix"]
    
    E["<b>Training Data</b><br/>Q&A Knowledge Base"] --> F["<b>Vectorization</b><br/>Pre-computed vectors"]
    F --> D
    
    D --> G["<b>Cosine Similarity</b><br/>Calculate similarity scores"]
    G --> H{"<b>Threshold</b><br/>Score > 0.3?"}
    
    H -->|Yes| I["<b>Best Match</b><br/>Highest score answer"]
    H -->|No| J["<b>Fallback</b><br/>Default response"]
    
    I --> K["<b>Response</b><br/>Return to user"]
    J --> K
    
    style A fill:#e3f2fd,stroke:#1565c0,stroke-width:3px
    style B fill:#fff3e0,stroke:#e65100,stroke-width:2px
    style C fill:#f3e5f5,stroke:#6a1b9a,stroke-width:2px
    style D fill:#e8f5e9,stroke:#2e7d32,stroke-width:2px
    style E fill:#fce4ec,stroke:#c2185b,stroke-width:2px
    style F fill:#f3e5f5,stroke:#6a1b9a,stroke-width:2px
    style G fill:#e8f5e9,stroke:#2e7d32,stroke-width:2px
    style H fill:#fff3e0,stroke:#f57c00,stroke-width:2px
    style I fill:#c8e6c9,stroke:#388e3c,stroke-width:2px
    style J fill:#ffecb3,stroke:#f57f17,stroke-width:2px
    style K fill:#e3f2fd,stroke:#1565c0,stroke-width:3px

Project Structure

Detailed view of the project organization:

app/
├── main.py                     # Application entry point
├── core/                       # Application configuration and utilities
│   ├── config.py               # Centralized settings management
│   ├── exceptions.py           # Custom exception definitions
│   └── logging.py              # Logging configuration
├── api/                        # HTTP layer (FastAPI routes and schemas)
│   ├── dependencies.py         # Dependency injection setup
│   └── v1/                     # API versioning
│       ├── routes/             # HTTP endpoints
│       │   ├── conversation.py # Conversation management endpoints
│       │   └── health.py       # Health check endpoints
│       └── schemas/            # Request/response models
│           ├── conversation.py # Conversation-related schemas
│           └── common.py       # Shared schemas
├── domain/                     # Business logic layer
│   ├── entities/               # Domain objects
│   │   └── session.py          # Session entity
│   ├── repositories/           # Repository interfaces
│   │   └── session.py          # Session repository interface
│   └── services/               # Business logic services
│       ├── chatbot.py          # Core chatbot logic
│       └── session.py          # Session management service
└── infrastructure/             # External concerns layer
    ├── data/                   # Data access and loading
    │   ├── datasets/           # Training data
    │   │   └── data-bot.json # Chatbot conversation data
    │   └── loaders/            # Data loading utilities
    │       └── chatbot_data.py # JSON data loader
    └── repositories/           # Repository implementations
        └── memory/             # In-memory implementations
            └── session.py      # In-memory session repository

Design Patterns & Architectural Principles

1. Clean Architecture (Hexagonal Architecture)

The project follows the Dependency Rule: dependencies point inward toward the domain layer.

Domain Layer (Core): Contains business logic, entities, and interfaces - has no external dependencies
Application Layer (API): Orchestrates use cases and handles HTTP concerns
Infrastructure Layer: Implements interfaces defined by the domain (repositories, data loaders, NLP)
Core Layer: Cross-cutting concerns (config, logging, exceptions)

Benefits: Testable, maintainable, framework-independent business logic.

2. Repository Pattern

Abstracts data access through interfaces, allowing easy swapping of storage implementations.

# Domain defines the contract
class SessionRepositoryInterface(ABC):
    @abstractmethod
    def create_session(self, language: str) -> str: ...
    
# Infrastructure provides implementations
class InMemorySessionRepository(SessionRepositoryInterface):
    def create_session(self, language: str) -> str:
        # Implementation details

Current: In-memory storage
Future: Can easily add PostgreSQL, Redis, or MongoDB implementations without changing business logic.

3. Dependency Injection (DI)

FastAPI's DI system provides loose coupling and testability.

# Dependencies module acts as a Service Locator
def get_session_service(
    repo: SessionRepositoryInterface = Depends(get_session_repository)
) -> SessionService:
    return SessionService(repo)

# Routes receive dependencies automatically
@router.post("/start")
async def start_conversation(
    session_service: SessionService = Depends(get_session_service)
):
    # Use service without knowing implementation details

Benefits: Easy mocking for tests, singleton management, clear dependencies.

4. Service Layer Pattern

Encapsulates business logic separate from HTTP/infrastructure concerns.

ChatbotService: NLP processing, response generation, confidence scoring
SessionService: Session validation, history management, language rules

Benefits: Business logic reusable across different interfaces (REST, GraphQL, CLI).

5. Strategy Pattern (implicit)

Different NLP strategies can be plugged in (currently TF-IDF, future: transformers, LLMs).

6. Factory Pattern

Centralized object creation in dependencies.py manages lifecycles and ensures singletons.

Layer Responsibilities

Layer	Responsibility	Examples
API	HTTP handling, validation, serialization	Routes, Pydantic schemas
Domain	Business rules, core logic	Services, entities, interfaces
Infrastructure	External systems, I/O	Repositories, data loaders, NLP
Core	Cross-cutting concerns	Config, logging, exceptions

Key Architectural Decisions

Stateless API: Sessions stored server-side, identified by UUID
In-Memory Storage: Fast prototyping; easily replaced with persistent storage
TF-IDF NLP: Lightweight, no external API dependencies, good for FAQ-style conversations
Pydantic Models: Type safety, automatic validation, OpenAPI documentation
Async/Await: Non-blocking I/O for scalability (ready for database operations)

API Documentation

POST /api/v1/conversations/start

Initiates a new conversation session with language selection.

Request:

POST /api/v1/conversations/start
Content-Type: application/json

{
  "language": "en"  // "en" or "nb"
}

Response:

{
  "session_id": "550e8400-e29b-41d4-a716-446655440000",
  "message": "Hello! I am a chatbot!",
  "success": true
}

Status Codes:

200 OK: Session created successfully
400 Bad Request: Invalid language code
500 Internal Server Error: Server error

POST /api/v1/conversations/{session_id}/messages

Sends a message to an existing conversation session.

Request:

POST /api/v1/conversations/{session_id}/messages
Content-Type: application/json

{
  "message": "Hello, how are you?"
}

Response:

{
  "session_id": "550e8400-e29b-41d4-a716-446655440000",
  "message": "I am fine, thank you for asking! What can I do for you today?",
  "success": true
}

Status Codes:

200 OK: Message processed successfully
404 Not Found: Session not found or expired
400 Bad Request: Invalid message format

GET /api/v1/health

Health check endpoint for monitoring and load balancers.

Response:

{
  "status": "healthy",
  "version": "1.0.1",
  "timestamp": "2026-02-18T12:00:00Z"
}

GET /api/v1/conversations/debug/sessions

Debug endpoint to list active sessions (development only).

Response Times:

Average: <100ms (excluding NLP processing)
NLP Processing: 50-200ms depending on query complexity
Total: 150-300ms typical end-to-end

Interactive Documentation

Explore the API interactively:

Swagger UI: http://localhost:8080/docs
ReDoc: http://localhost:8080/redoc
OpenAPI Schema: http://localhost:8080/openapi.json

Installation

Prerequisites

Ensure you have the following installed:

Python 3.11 or 3.12 (3.12 recommended for latest features)
uv - Ultra-fast Python package manager (installation guide)
git for cloning the repository
(Optional) Docker for containerized deployment

Local Development Setup

Option 1: Quick Setup with Makefile (Recommended)

# Clone the repository
git clone https://github.com/AlyonaCIA/FastAPI_Chatbot.git
cd FastAPI_Chatbot

# Install uv if not already installed
curl -LsSf https://astral.sh/uv/install.sh | sh

# Install all dependencies (including dev tools)
make install-dev

# Run the application
make run

# Or run in development mode with auto-reload
make dev

Option 2: Manual Setup with uv

# 1. Clone the repository
git clone https://github.com/AlyonaCIA/FastAPI_Chatbot.git
cd FastAPI_Chatbot

# 2. Install uv (if not already installed)
curl -LsSf https://astral.sh/uv/install.sh | sh

# 3. Create virtual environment with uv
uv venv

# 4. Activate virtual environment
source .venv/bin/activate  # Linux/macOS
# or
.venv\Scripts\activate  # Windows

# 5. Install production dependencies
uv pip install -e .

# 6. Install development dependencies (for contributors)
uv pip install -e ".[dev]"

# 7. Install pre-commit hooks (optional but recommended)
pre-commit install

# 8. Verify installation
pytest tests/

Environment Configuration

Copy the example environment file and customize as needed:

# Copy example file
cp .env.example .env

# Edit with your preferred editor
nano .env

Example .env configuration:

# Application Settings
DEBUG=false
LOG_LEVEL=INFO

# Server Configuration
HOST=0.0.0.0
PORT=8080

# Chatbot Settings
DEFAULT_LANGUAGE=en
CONFIDENCE_THRESHOLD=0.3
SESSION_TTL_HOURS=24
MAX_SESSIONS=1000

Running the Application

# Using Makefile (recommended)
make run              # Production mode
make dev              # Development mode with auto-reload

# Direct execution
python -m app.main

# Or with uvicorn directly
uvicorn app.main:app --reload --host 0.0.0.0 --port 8080

# Production mode
uvicorn app.main:app --host 0.0.0.0 --port 8080 --workers 4

Access the API:

API Base: http://localhost:8080
Swagger Docs: http://localhost:8080/docs
ReDoc: http://localhost:8080/redoc

Docker Deployment

For containerized deployment:

# Build Docker image
docker build -t fastapi-chatbot:latest .

# Run container
docker run -d \
  --name fastapi-chatbot \
  -p 8080:8080 \
  -e DEBUG=false \
  -e LOG_LEVEL=INFO \
  fastapi-chatbot:latest

# View logs
docker logs -f fastapi-chatbot

# Stop container
docker stop fastapi-chatbot

# Or use docker-compose
docker-compose up -d --build

# View docker-compose logs
docker-compose logs -f

Docker Features:

Production-ready multi-stage build
Minimal image size with alpine base
Non-root user for security
Health checks included
Structured logging to stdout

Development

Quick Development Commands

The project includes a comprehensive Makefile for common development tasks:

# See all available commands
make help

# Setup and Installation
make install          # Install production dependencies
make install-dev      # Install all dependencies including dev tools
make sync             # Sync dependencies from lock file
make update           # Update dependencies to latest versions

# Code Quality
make format           # Format code with black and ruff
make lint             # Run ruff linter
make type-check       # Run mypy type checking
make security         # Run bandit security checks
make pre-commit       # Run all pre-commit hooks

# Testing
make test             # Run all tests
make test-cov         # Run tests with coverage report

# CI/CD
make ci               # Run full CI pipeline locally

# Running
make run              # Run application
make dev              # Run in development mode with auto-reload

# Cleanup
make clean            # Remove build artifacts and cache files

Using Makefile (Recommended Workflow)

# Initial setup
make install-dev

# Development cycle
make format           # Format your code
make lint             # Check for issues
make test             # Run tests
make type-check       # Verify types

# Before committing
make ci               # Run full pipeline locally

Pre-commit Hooks

Install pre-commit hooks to automatically enforce code quality standards:

# Install hooks (one-time setup)
pre-commit install

# Manually run hooks on all files
pre-commit run --all-files

# Update hook versions
pre-commit autoupdate

Pre-commit checks include:

Trailing whitespace removal
End-of-file fixer
YAML syntax validation
Large file detection (>500KB)
Branch protection (prevents commits to dev/main)
Docstring formatting (88 char wrap)
Import sorting (isort)
Code linting (flake8)
Typo detection

Testing & Quality Assurance

Running Tests

# Run all tests with coverage
make test

# Or manually with pytest
pytest tests/ -v --cov=app --cov-report=html

# Run specific test categories
pytest tests/unit/           # Unit tests only
pytest tests/integration/    # Integration tests only

# Run with detailed output
pytest tests/ -vv --tb=short

# Run with uv
uv run pytest tests/ --cov=app

Expected output:

================= test session starts =================
platform darwin -- Python 3.11.7, pytest-8.0.0
collected 25 items

tests/unit/domain/test_chatbot_service.py ✓✓✓✓    16%
tests/unit/domain/test_session_service.py ✓✓✓✓✓   36%
tests/integration/api/test_conversation_api.py ✓✓✓ 48%
tests/integration/api/test_conversation_flow.py ✓✓ 56%

================= 25 passed in 2.3s =================

Coverage: 70%+

Test Structure

tests/
├── conftest.py             # Pytest fixtures and configuration
├── unit/                   # Unit tests (fast, isolated)
│   ├── domain/
│   │   ├── test_chatbot_service.py
│   │   └── test_session_service.py
│   └── infrastructure/
│       └── test_repositories.py
└── integration/            # Integration tests (API endpoints)
    └── api/
        ├── test_conversation_api.py
        └── test_conversation_flow.py

Test Coverage

Generate and view coverage reports:

# Generate HTML coverage report
make test

# View in browser
open htmlcov/index.html  # macOS
xdg-open htmlcov/index.html  # Linux

# Generate XML report (for CI/CD)
pytest --cov=app --cov-report=xml

Code Quality Checks

# Run all quality checks (format, lint, type-check, security)
make ci

# Individual checks
make format        # Format code with black and ruff
make lint          # Lint with ruff
make type-check    # Type check with mypy
make security      # Security scan with bandit

Configuration

The application uses environment variables for configuration:

# Create .env file (optional)
DEBUG=true
LOG_LEVEL=INFO
HOST=0.0.0.0
PORT=8080
DEFAULT_LANGUAGE=en
CONFIDENCE_THRESHOLD=0.3
SESSION_TTL_HOURS=24

Configuration Options

DEBUG: Enable debug mode (default: false)
LOG_LEVEL: Logging level (default: INFO)
HOST: Server host (default: 0.0.0.0)
PORT: Server port (default: 8080)
DEFAULT_LANGUAGE: Default conversation language (default: en)
CONFIDENCE_THRESHOLD: NLP confidence threshold (default: 0.3)
SESSION_TTL_HOURS: Session time-to-live (default: 24)

CI/CD Pipeline

The project includes a comprehensive GitHub Actions workflow that ensures code quality and reliability.

Pipeline Architecture

┌─────────────────────────────────────────────────────────────┐
│                    GitHub Actions Pipeline                   │
├─────────────────────────────────────────────────────────────┤
│                                                              │
│  ┌──────────────┐  ┌──────────────┐  ┌─────────────────┐  │
│  │ Code Quality │  │    Testing    │  │  Build & Deploy │  │
│  ├──────────────┤  ├──────────────┤  ├─────────────────┤  │
│  │ • Format ✓   │  │ • Python 3.11│  │ • Package Build │  │
│  │ • Lint ✓     │  │ • Python 3.12│  │ • Docker Image  │  │
│  │ • Type Check │  │ • Coverage   │  │ • Artifacts     │  │
│  │ • Security   │  │ • JUnit XML  │  │ • Reports       │  │
│  └──────────────┘  └──────────────┘  └─────────────────┘  │
│                                                              │
└─────────────────────────────────────────────────────────────┘

CI/CD Jobs

1. Code Quality & Security

Black & isort: Code formatting validation
flake8: PEP 8 compliance and linting
mypy: Static type checking
Safety: Dependency vulnerability scanning
Exit on failure: Pipeline stops if quality checks fail

2. Matrix Testing

Python Versions: 3.11, 3.12
Parallel Execution: Tests run simultaneously
Coverage Reports: HTML, XML, and terminal output
JUnit XML: Test result artifacts for analysis
Minimum Coverage: 70% threshold enforced

3. GitHub Actions Validation

Automated CI/CD: Runs on every PR and push to main
Full Pipeline: format → lint → test → security
Dependency Caching: Faster builds with uv cache
Artifact Upload: Coverage reports and test results

4. Integration Testing

API Workflow: Real conversation flow testing
Session Management: UUID tracking validation
Multi-language: English and Norwegian responses
Health Checks: Endpoint availability verification

5. Build Verification

Setup.py: Package building and installation
Dependency Resolution: Requirements validation
Import Testing: Module accessibility checks

Trigger Conditions

The pipeline runs automatically on:

Push to main, develop, or feature/* branches
Pull Requests targeting main or develop
Manual Dispatch: Workflow can be triggered manually

Caching Strategy

Optimized build times through intelligent caching:

uv Cache: Python packages cached per version
GitHub Actions: Workflow caching per commit
Cache Key: Based on pyproject.toml hash

Artifacts & Reports

Generated artifacts include:

Test Results: JUnit XML format
Coverage Reports: HTML and XML
Security Scan: JSON vulnerability report
Retention: 30 days for all artifacts

NLP and Chatbot Logic

Response Selection Algorithm

Greeting Detection: Checks for common greetings (hello, hi, hey)
Sample Matching: Uses TF-IDF vectorization and cosine similarity
Keyword Matching: Falls back to keyword-based responses
Fallback Response: Default response for unmatched inputs

Training Data

The chatbot uses structured JSON data (chatbot-data.json) with:

Greetings: Welcome messages in multiple languages
Dialogues: Sample-based conversations with replies
Keywords: Topic-based responses
Fallbacks: Default responses for unknown inputs

Multilingual Support

English (en) and Norwegian (nb) support
Language-specific responses and greetings
Automatic language detection and validation

Contributing

We welcome contributions from the community! Please follow these guidelines to ensure a smooth collaboration.

Development Guidelines

Follow Clean Architecture: Maintain separation between domain, infrastructure, and API layers
Write Tests First: TDD approach - write tests before implementing features
Maintain Coverage: Keep code coverage above 70%
Type Everything: Use type hints for all function parameters and return values
Document Code: Add docstrings to all classes and public methods
Conventional Commits: Use clear, descriptive commit messages
Run Quality Checks: Ensure all CI checks pass before submitting PR

Contribution Workflow

# 1. Fork the repository on GitHub
# 2. Clone your fork
git clone https://github.com/YOUR_USERNAME/FastAPI_Chatbot.git
cd FastAPI_Chatbot

# 3. Create a feature branch from main
git checkout main
git pull origin main
git checkout -b feature##/your-feature-name

# 4. Install development dependencies
./run_local.sh -i

# 5. Install pre-commit hooks
pre-commit install

# 6. Make your changes following TDD
# - Write tests first (tests/)
# - Implement feature
# - Run tests: make test

# 7. Run all quality checks locally
make ci
# or use individual commands
make format lint type-check test-cov security

# 8. Commit with conventional commit message
git add .
git commit -m "feat: add new feature description"

# 9. Push to your fork
git push origin feature##/your-feature-name

# 10. Open a Pull Request on GitHub
# - Target branch: main
# - Fill out PR template
# - Wait for CI to pass
# - Request review

Branch Naming Convention

Follow the repository's naming pattern:

feature##/description - New features (e.g., feature04/add-dual-license)
bugfix##/description - Bug fixes
docs##/description - Documentation updates
refactor##/description - Code refactoring

Commit Message Format

Use conventional commits for clear history:

<type>(<scope>): <description>

[optional body]

[optional footer]

Types:

feat: New feature
fix: Bug fix
docs: Documentation changes
style: Code style changes (formatting, etc.)
refactor: Code refactoring
test: Adding or updating tests
chore: Maintenance tasks

Examples:

feat(api): add conversation history endpoint
fix(chatbot): resolve TF-IDF vectorization edge case
docs(readme): update installation instructions
test(domain): add session service unit tests

Code Standards

Python Style

Line Length: 88 characters (Black default)
Import Sorting: isort with black profile
Docstring Style: Google-style docstrings
Naming Conventions:
- Classes: PascalCase
- Functions/Variables: snake_case
- Constants: UPPER_SNAKE_CASE
- Private members: _leading_underscore

Type Hints

from typing import List, Optional, Dict

def process_message(
    message: str,
    session_id: str,
    metadata: Optional[Dict[str, str]] = None
) -> List[str]:
    """Process incoming message."""
    pass

Testing Requirements

Unit tests for all business logic
Integration tests for API endpoints
Minimum 70% code coverage
Test file naming: test_*.py
Test function naming: test_<feature>_<scenario>()

Pull Request Checklist

Before submitting, ensure:

Code follows project style guide
All tests pass (make test)
Code coverage is maintained or improved
Type checking passes (make type-check)
Security scan passes (make security)
Documentation is updated (if needed)
Commit messages follow conventional format
Branch is up to date with main
No conflicts with main branch

Getting Help

Email: alenacivanovaa@gmail.com
Documentation: Check /docs folder
Bug Reports: Open an issue with detailed description
Feature Requests: Open an issue with use case explanation

Troubleshooting

Common Issues

Issue: ModuleNotFoundError: No module named 'app'

Solution: Ensure you're in the project root and virtual environment is activated
```
source .venv/bin/activate  # or use uv run
uv sync
```

Issue: Port 8080 already in use

Solution: Change port in .env file or kill the process using port 8080

# Find process
lsof -ti:8080
# Kill process
kill -9 $(lsof -ti:8080)
# Or use different port
export PORT=8081

Issue: Tests failing with import errors

Solution: Install development dependencies

uv sync --all-extras
# or
make install-dev

Issue: Slow API responses

Solution:
- Check if TF-IDF vectorizer is properly cached
- Verify training data is loaded correctly
- Monitor system resources (CPU, memory)
- Check logs for errors: make logs

Issue: Session not found (404)

Solution:
- Sessions expire after 24 hours (default TTL)
- Verify session ID is correct UUID format
- Check if session was created successfully
- Use debug endpoint: GET /api/v1/conversations/debug/sessions

Issue: Pre-commit hooks failing

Solution:

# Update pre-commit hooks
pre-commit autoupdate
# Run manually to see errors
pre-commit run --all-files
# Install missing tools
make install-dev

Issue: Docker build fails

Solution:
- Clear Docker cache: docker system prune -a
- Verify Dockerfile syntax
- Check network connectivity for package downloads
- Use docker build --no-cache for clean build

Issue: mypy type errors

Solution:

# Check specific file
uv run mypy app/specific/file.py
# Update type stubs
uv pip install types-all
# See mypy config
cat pyproject.toml | grep -A 20 "\[tool.mypy\]"

Debugging Tips

Enable Debug Logging: Set LOG_LEVEL=DEBUG in .env
Check Logs: Monitor application logs for errors
Use Interactive Docs: Test API at http://localhost:8080/docs
Run Tests in Verbose: pytest -vv --tb=long
Check Dependencies: uv pip list to verify installed packages

Performance Optimization

Increase Workers: For production, use multiple Uvicorn workers
```
uvicorn app.main:app --workers 4 --host 0.0.0.0 --port 8080
```
Enable Caching: Implement Redis for session storage
Database Connection Pool: Configure optimal pool size for PostgreSQL
Load Balancing: Use nginx or traefik for multiple instances

Performance Considerations

Singleton Pattern: Services are instantiated once per application lifecycle
Caching: TF-IDF vectors are pre-computed and cached
Session Management: Efficient in-memory storage with TTL cleanup
Async Support: Full async/await implementation for scalability
Connection Pooling: Optimized for database connections (future enhancement)

Security

Input Validation: Pydantic models validate all inputs
Dependency Scanning: Automated security vulnerability checks
CORS Configuration: Configurable cross-origin resource sharing
Environment Variables: Sensitive configuration externalized

License

This project is available under a dual licensing model to support both open-source community and commercial use:

Free License: AGPL-3.0

The following users can use this software FREE under the GNU Affero General Public License v3.0:

User Type	Use Case	Cost
Individual Developers	Personal projects, learning, portfolio	FREE
Educational Institutions	Universities, schools, research, teaching	FREE
Students & Researchers	Academic projects, thesis work	FREE
Non-Profit Organizations	Charitable work, community projects	FREE
Open Source Projects	AGPL-compatible projects, contributions	FREE

Requirements:

Source code modifications must be disclosed
Derivative works must use AGPL-3.0
Network use triggers copyleft (AGPL provision)
Attribution to original author required

Commercial License

For-profit companies and commercial entities must obtain a commercial license:

User Type	When Required
Companies	Using software in any commercial capacity
For-Profit Orgs	Revenue-generating products or services
SaaS Providers	Hosting as a service for customers
Product Integration	Embedding in proprietary software

Benefits:

No source code disclosure required
No copyleft obligations
Proprietary use allowed
Priority support & maintenance
Custom feature development available
Legal protection & indemnification

To obtain a commercial license:

Email: alenacivanovaa@gmail.com
Subject: "FastAPI Chatbot - Commercial License Request"
Include: Company name, use case, number of developers

Licensing FAQ

Q: I'm a freelancer building a client project. Which license?
A: If your client is a commercial entity, they need a commercial license. If you're doing non-profit work, AGPL-3.0 applies.

Q: Can I use this for my startup?
A: If your startup is a registered business or generating revenue, you need a commercial license.

Q: What if I modify the code?
A: Under AGPL-3.0, you must share modifications. Under commercial license, modifications are proprietary.

Q: University spin-off company?
A: Company = Commercial license. University research project = AGPL-3.0.

For complete legal terms, see the LICENSE file.

Questions? Contact: alenacivanovaa@gmail.com

Author

Alyona Carolina Ivanova Araujo

Software Engineer | Python Developer | API Architect

Email: alenacivanovaa@gmail.com
GitHub: @AlyonaCIA
Version: 1.0.1

Project Stats

Acknowledgments

This project was built with:

Modern Python development practices and tooling
Clean Architecture and Domain-Driven Design principles
Test-Driven Development methodology
Comprehensive CI/CD pipeline with GitHub Actions
Enterprise-grade code quality standards
Community-driven open source values

Special thanks to:

FastAPI Community for the excellent web framework
scikit-learn Team for powerful NLP tools
uv Team for revolutionizing Python package management
Open source contributors and maintainers

If you find this project useful, please consider giving it a ⭐ star!

Made with precision and care by Alyona Carolina Ivanova Araujo

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
.github		.github
app		app
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

FastAPI Chatbot

What It Does

Performance Metrics

Overview

Why This Project Stands Out

Key Features

Core Functionality

Architecture & Design

Development Excellence

DevOps & Deployment

Quick Start

Technology Stack

Core Framework & Runtime

NLP & Data Processing

Development & Quality Tools

Build & Deployment Tools

Architecture

High-Level System Overview

System Architecture

Key Architectural Principles

Conversation Flow Diagram

NLP Processing Pipeline

Project Structure

Design Patterns & Architectural Principles

1. Clean Architecture (Hexagonal Architecture)

2. Repository Pattern

3. Dependency Injection (DI)

4. Service Layer Pattern

5. Strategy Pattern (implicit)

6. Factory Pattern

Layer Responsibilities

Key Architectural Decisions

API Documentation

POST /api/v1/conversations/start

POST /api/v1/conversations/{session_id}/messages

GET /api/v1/health

GET /api/v1/conversations/debug/sessions

Interactive Documentation

Installation

Prerequisites

Local Development Setup

Option 1: Quick Setup with Makefile (Recommended)

Option 2: Manual Setup with uv

Environment Configuration

Running the Application

Docker Deployment

Development

Quick Development Commands

Using Makefile (Recommended Workflow)

Pre-commit Hooks

Testing & Quality Assurance

Running Tests

Test Structure

Test Coverage

Code Quality Checks

Configuration

Configuration Options

CI/CD Pipeline

Pipeline Architecture

CI/CD Jobs

1. Code Quality & Security

2. Matrix Testing

3. GitHub Actions Validation

4. Integration Testing

5. Build Verification

Trigger Conditions

Caching Strategy

Artifacts & Reports

NLP and Chatbot Logic

Response Selection Algorithm

Training Data

Multilingual Support

Contributing

Development Guidelines

Contribution Workflow

Branch Naming Convention

Packages