🛡️ Sentinel

Multi-Agent AI Cybersecurity Platform - Autonomous Vulnerability Detection & Security Analysis

Sentinel uses a centralized LLM powering specialized security agents to scan, simulate attacks, analyze threats, and generate actionable security recommendations—all orchestrated in a production-grade multi-agent pipeline.

🎯 Elevator Pitch

Sentinel is an autonomous cybersecurity platform that combines code scanning, attack simulation, and threat intelligence into a unified multi-agent system. One powerful LLM orchestrates five specialized agents that execute sequentially to deliver comprehensive security insights—from vulnerability discovery to patch recommendations—in minutes, not weeks.

🚀 Features

✨ Multi-Agent Architecture

5 specialized agents working in perfect harmony
Sequential workflow: Scanner → Threat → Attack → Patch → Report
Centralized LLM orchestration for consistency and token optimization

🔍 Scanner Agent

Detects 7+ vulnerability types (SQL Injection, XSS, RCE, etc.)
CWE/OWASP mappings with exploitability scoring
Realistic code analysis patterns (Bandit + Semgrep simulation)

⚔️ Attack Simulation Agent

Plans realistic attack scenarios with MITRE ATT&CK framework
Calculates success probability and impact scoring
Maps vulnerabilities to specific exploitation techniques

🎯 Threat Intelligence Agent

Classifies threats by severity and category
Maps findings to security frameworks
Generates exploitability metrics

🛠️ Patch Generation Agent

Creates remediation patches with code examples
Estimates complexity and risk of fixes
Supports auto-apply recommendations

📊 Report Generation Agent

Compiles executive summaries
Risk scoring (0-100 scale)
Patch coverage analysis
Remediation effort estimation

🎨 Modern Frontend

Next.js 15 with TypeScript
Real-time scan monitoring
Interactive dashboards
Protected routes with JWT authentication

📡 Production-Ready Backend

FastAPI with async/await architecture
SQLAlchemy ORM with async support
PostgreSQL + Redis ready
Complete REST API

🏗️ Architecture Overview

┌─────────────────────────────────────────────────────────────────┐
│                        Frontend (Next.js)                        │
│         Real-time Scan Monitoring & Dashboard UI                 │
└──────────────────────────┬──────────────────────────────────────┘
                           │
                    HTTP/REST API
                           │
┌──────────────────────────▼──────────────────────────────────────┐
│                   FastAPI Backend Server                         │
│  ┌────────────────────────────────────────────────────────────┐ │
│  │         API Gateway & Authentication                       │ │
│  └────────────────────┬─────────────────────────────────────┘ │
│                      │                                         │
│  ┌──────────────────▼──────────────────────────────────────┐  │
│  │        Orchestrator Service                         │  │
│  │    (Manages multi-agent workflow execution)              │  │
│  └──────┬─────┬─────┬─────┬─────────────────────────────────┘  │
│         │     │     │     │                                     │
│  ┌──────▼──┐  │     │     │                                     │
│  │ Scanner ├──┘     │     │                                     │
│  │ Agent   │        │     │                                     │
│  └─────────┘   ┌────▼──┐  │                                     │
│                │Threat ├──┘                                     │
│                │Agent  │        ┌──────────┐                   │
│                └───────┘        │  Attack  ├──┐                │
│                                 │  Agent   │  │                │
│                                 └──────────┘  │                │
│                                    ┌──────────▼────┐            │
│                                    │ Patch Agent   ├──┐         │
│                                    └───────────────┘  │         │
│                                       ┌───────────────▼──┐      │
│                                       │ Report Agent     │      │
│                                       └──────────────────┘      │
└──────────────────────────────────────────────────────────────────┘
                           │
                           │
┌──────────────────────────▼──────────────────────────────────────┐
│                    Database Layer                                │
│    ┌──────────────┐    ┌──────────────┐    ┌──────────────┐   │
│    │  PostgreSQL  │    │    Redis     │    │  SQLite      │   │
│    │ (Production) │    │    (Cache)   │    │ (Development)│   │
│    └──────────────┘    └──────────────┘    └──────────────┘   │
└──────────────────────────────────────────────────────────────────┘

🤖 Multi-Agent Workflow Explanation

Sequential Execution Pipeline

[1] SCANNER AGENT (Progress: 5% → 20%)
    ├─ Input: Repository URL
    ├─ Process: Code analysis (Bandit + Semgrep simulation)
    ├─ Output: 7+ Findings with CWE/OWASP mappings
    └─ Data: Saved to `findings` table

[2] THREAT AGENT (Progress: 20% → 40%)
    ├─ Input: Findings from Scanner
    ├─ Process: Threat classification & severity mapping
    ├─ Output: Threat types, exploitability scores
    └─ Data: Enhanced findings with threat metadata

[3] ATTACK AGENT (Progress: 40% → 60%)
    ├─ Input: Findings + Threats from previous agents
    ├─ Process: MITRE ATT&CK mapping, attack planning
    ├─ Output: Attack scenarios with success probability
    └─ Data: Saved to `attacks` table

[4] PATCH AGENT (Progress: 60% → 80%)
    ├─ Input: Findings from Scanner
    ├─ Process: Patch generation, complexity assessment
    ├─ Output: Code fixes with confidence scores
    └─ Data: Saved to `patches` table

[5] REPORT AGENT (Progress: 80% → 100%)
    ├─ Input: Findings, Attacks, Patches
    ├─ Process: Aggregation, risk scoring, metrics
    ├─ Output: Executive summary + recommendations
    └─ Data: Saved to `reports` table

Why This Architecture Works

Sequential Processing: Each agent builds on previous results
State Preservation: Database maintains context between agents
Error Resilience: One agent failure doesn't halt pipeline
Progress Tracking: Real-time frontend updates (0-100%)
Async Execution: Non-blocking background jobs
Token Optimization: Single LLM orchestrates multiple specialized tasks

🧠 How One LLM Powers Multiple Agents

Centralized Intelligence, Specialized Tasks

# Pseudo-code showing LLM routing
class Agent:
    def __init__(self, llm, system_prompt):
        self.llm = llm  # Shared instance
        self.system_prompt = system_prompt  # Agent-specific instructions
    
    async def execute(self, data):
        # LLM understands context from system_prompt + input data
        response = await self.llm.chat([
            {"role": "system", "content": self.system_prompt},
            {"role": "user", "content": data}
        ])
        return response

Benefits:

Cost Efficient: One LLM instance vs. five separate models
Consistent: Same underlying reasoning across all agents
Flexible: Easy to add/modify agent behaviors via prompts
Stateful: Context flows between agents via database
Scalable: Single LLM handles all reasoning

📋 Agent Details

1️⃣ Scanner Agent

Purpose: Detect vulnerabilities in source code

Inputs:

Repository URL
Scan ID

Process:

Bandit analysis (hardcoded secrets, dangerous functions)
Semgrep analysis (weak crypto, dangerous execution)
Deduplication to remove false positives
CWE/OWASP mapping

Outputs:

Finding records with:
- Vulnerability type
- Severity (Critical/High/Medium/Low)
- File path & line number
- Code snippet
- CWE ID & OWASP category
- Exploitability score (0-1)
- Confidence score (0-1)

Example Finding:

{
  "id": "finding-001",
  "vulnerability_type": "SQL Injection",
  "severity": "critical",
  "file_path": "src/database.py",
  "line_number": 45,
  "cwe_id": "CWE-89",
  "cwe_name": "Improper Neutralization of Special Elements used in an SQL Command",
  "owasp_category": "A03:2021 - Injection",
  "exploitability_score": 0.95,
  "code_snippet": "query = f\"SELECT * FROM users WHERE id = {user_id}\""
}

2️⃣ Threat Intelligence Agent

Purpose: Classify threats and assess impact

Inputs:

Findings from Scanner Agent
CWE/OWASP framework data

Process:

Match finding type to threat category
Calculate exploitability (0-1)
Calculate impact (1-10)
Assign threat classification

Outputs:

Finding enrichment with:
- Threat type (injection, auth, crypto, etc.)
- Exploitability metric
- Impact rating
- Threat metadata

3️⃣ Attack Simulation Agent

Purpose: Plan realistic attack scenarios

Inputs:

Findings with threat data
MITRE ATT&CK framework

Process:

Map vulnerabilities to attack templates
Calculate success probability (0-1)
Assign MITRE technique
Plan attack steps
Estimate impact

Outputs:

Attack records with:
- Attack type (SQL Injection, RCE, etc.)
- Attack vector
- Success probability (0-1)
- Impact score (0-10)
- MITRE technique (T1190, T1110, etc.)
- Attack steps (list)
- Prerequisites
- Mitigation strategies

Example Attack:

{
  "id": "attack-001",
  "attack_type": "SQL Injection",
  "success_probability": 0.85,
  "impact_score": 9.5,
  "mitre_technique": "T1190",
  "mitre_tactic": "Initial Access",
  "attack_steps": [
    "Identify SQL injection parameter",
    "Craft malicious SQL payload",
    "Execute against database",
    "Extract sensitive data"
  ]
}

4️⃣ Patch Generation Agent

Purpose: Generate security patches

Inputs:

Findings from Scanner
Vulnerability patterns

Process:

Match finding to patch template
Generate code fix
Estimate complexity (simple/moderate/complex)
Calculate risk (0-1)
Determine auto-applicability

Outputs:

Patch records with:
- Original code (vulnerable)
- Patched code (secure)
- Explanation
- Complexity rating
- Risk score
- Can auto-apply flag
- Confidence (0-1)

Example Patch:

{
  "id": "patch-001",
  "vulnerability_type": "SQL Injection",
  "original_code": "query = f\"SELECT * FROM users WHERE id = {user_id}\"",
  "patched_code": "query = \"SELECT * FROM users WHERE id = %s\"\ncursor.execute(query, (user_id,))",
  "explanation": "Use parameterized queries to prevent SQL injection",
  "apply_complexity": "simple",
  "can_auto_apply": true,
  "confidence": 0.98
}

5️⃣ Report Generation Agent

Purpose: Compile comprehensive security report

Inputs:

All findings, attacks, patches
Aggregated metrics

Process:

Aggregate findings by severity
Calculate risk score (0-100)
Estimate patch coverage
Calculate remediation effort
Generate executive summary
Create recommendations

Outputs:

Report record with:
- Overall risk score
- Severity breakdown (critical/high/medium/low)
- Patch coverage %
- Remediation effort (hours)
- Executive summary (text)
- Detailed findings (JSON)
- Recommendations (list)
- Metadata

🛠️ Tech Stack

Frontend

Framework: Next.js 15.5.18
Language: TypeScript (strict mode)
Styling: Tailwind CSS
UI Components: Custom + shadcn/ui patterns
State Management: React hooks + Context API
HTTP Client: Fetch API + async/await
Build: Webpack (Next.js default)

Backend

Framework: FastAPI 0.104.1
Language: Python 3.11+
Async: asyncio + FastAPI async routes
Database ORM: SQLAlchemy 2.0+ (async)
DB Driver: aiosqlite (dev), asyncpg (prod)
Auth: JWT tokens
Validation: Pydantic v2
API Docs: Swagger UI (auto-generated)

Database

Development: SQLite with aiosqlite
Production: PostgreSQL with asyncpg
Caching: Redis (optional)
ORM Models: SQLAlchemy with async support

AI/Agents

Orchestration: Custom ScanOrchestrator service
Agent Base: Abstract BaseAgent class
LLM Integration: Ready for OpenAI/Claude/Ollama
Frameworks: MITRE ATT&CK, CWE, OWASP mapping

DevOps

Container: Docker + Docker Compose
Port (Frontend): 3000
Port (Backend): 8000
Environment: .env configuration

📦 Installation Guide

Prerequisites

Python 3.11+
Node.js 18+
npm or yarn
PostgreSQL 12+ (production)
Redis (optional)

Step 1: Clone Repository

git clone https://github.com/yourusername/sentinel.git
cd sentinel

Step 2: Backend Setup

# Create Python virtual environment
cd backend
python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

# Create .env file
cp .env.example .env

# Edit .env with your configuration
nano .env

# Run database migrations (if applicable)
# python alembic upgrade head

# Start backend server
uvicorn main:app --reload --host 0.0.0.0 --port 8000

Step 3: Frontend Setup

# In new terminal, from project root
cd frontend

# Install dependencies
npm install

# Create .env.local file
cp .env.example .env.local

# Edit environment variables
nano .env.local

# Start frontend dev server
npm run dev

Step 4: Verify Installation

# Check backend health
curl http://localhost:8000/health

# Frontend should be accessible at
# http://localhost:3000

# Swagger API docs at
# http://localhost:8000/docs

🔐 Environment Variables

Backend (.env)

# Database
DATABASE_URL=postgresql+asyncpg://user:password@localhost:5432/sentinel
# For development: sqlite+aiosqlite://./sentinel.db

# API Configuration
API_HOST=0.0.0.0
API_PORT=8000
DEBUG=true

# JWT
JWT_SECRET=your-super-secret-key-change-this-in-production
JWT_ALGORITHM=HS256
JWT_EXPIRATION_HOURS=24

# LLM Configuration (when integrating)
OPENAI_API_KEY=sk-your-key-here
LLM_MODEL=gpt-4-turbo

# Redis (optional)
REDIS_URL=redis://localhost:6379/0

# CORS
CORS_ORIGINS=http://localhost:3000,http://localhost:8000

# Logging
LOG_LEVEL=INFO

Frontend (.env.local)

# API Configuration
NEXT_PUBLIC_API_URL=http://localhost:8000

# Feature Flags
NEXT_PUBLIC_ENABLE_DEMO=true

🚀 Running the Application

Option 1: Manual (Development)

Terminal 1 - Backend:

cd backend
source venv/bin/activate
uvicorn main:app --reload

Terminal 2 - Frontend:

cd frontend
npm run dev

Then visit: http://localhost:3000

Option 2: Docker Compose (Production)

# From project root
docker-compose up -d

# View logs
docker-compose logs -f

# Stop
docker-compose down

📡 API Endpoints

Authentication

POST   /api/auth/login        - Login with credentials
POST   /api/auth/register     - Register new account
GET    /api/auth/me           - Get current user

Scans

GET    /api/scans             - List all scans (paginated)
POST   /api/scans             - Create new scan (triggers orchestration)
GET    /api/scans/{scan_id}   - Get scan details with metrics

Findings

GET    /api/scans/{scan_id}/findings  - Get findings for scan
GET    /api/findings/{finding_id}     - Get specific finding

Attacks

GET    /api/scans/{scan_id}/attacks   - Get attack scenarios
GET    /api/attacks/{attack_id}       - Get specific attack

Patches

GET    /api/scans/{scan_id}/patches   - Get patches for scan
GET    /api/patches/{patch_id}        - Get specific patch

Reports

GET    /api/scans/{scan_id}/report    - Get comprehensive report

System

GET    /health                - Health check

📁 Folder Structure

sentinel/
├── frontend/                          # Next.js frontend
│   ├── src/
│   │   ├── app/                      # Pages (dashboard, scan, threat, etc.)
│   │   ├── components/               # React components
│   │   ├── hooks/                    # Custom React hooks
│   │   ├── lib/                      # Utilities
│   │   ├── services/                 # API service layer
│   │   └── types/                    # TypeScript interfaces
│   ├── public/                       # Static assets
│   ├── package.json
│   ├── next.config.ts
│   ├── tsconfig.json
│   ├── tailwind.config.js
│   └── .env.example
│
├── backend/                          # FastAPI backend
│   ├── app/
│   │   ├── agents/                   # Agent implementations
│   │   │   ├── base.py               # BaseAgent abstract class
│   │   │   ├── scanner_agent.py
│   │   │   ├── threat_agent.py
│   │   │   ├── attack_agent.py
│   │   │   ├── patch_agent.py
│   │   │   ├── report_agent.py
│   │   │   └── __init__.py
│   │   ├── api/                      # API routes
│   │   │   ├── scans.py
│   │   │   ├── findings.py
│   │   │   ├── attacks.py
│   │   │   ├── patches.py
│   │   │   ├── reports.py
│   │   │   ├── auth.py
│   │   │   └── __init__.py
│   │   ├── models/                   # Database models
│   │   │   ├── orm.py                # SQLAlchemy models
│   │   │   ├── schemas.py            # Pydantic schemas
│   │   │   └── __init__.py
│   │   ├── services/                 # Business logic
│   │   │   ├── orchestrator.py       # Multi-agent orchestration
│   │   │   ├── auth_service.py
│   │   │   └── __init__.py
│   │   ├── utils/                    # Utilities
│   │   │   ├── helpers.py
│   │   │   ├── cwe_mapping.py
│   │   │   ├── mitre_mapping.py
│   │   │   └── __init__.py
│   │   ├── database.py               # Database connection
│   │   ├── config.py                 # Configuration
│   │   └── __init__.py
│   ├── main.py                       # Entry point
│   ├── requirements.txt              # Python dependencies
│   ├── .env.example
│   └── pytest.ini
│
├── docs/                             # Documentation
│   ├── ARCHITECTURE.md               # System architecture
│   ├── INSTALLATION.md               # Setup guide
│   ├── API_DOCUMENTATION.md          # API reference
│   ├── MULTI_AGENT_EXPLANATION.md   # Agent details
│   └── JUDGE_VALIDATION_GUIDE.md    # Hackathon guide
│
├── screenshots/                      # UI screenshots
├── demo/                             # Demo scripts
├── docker-compose.yml                # Docker configuration
├── Dockerfile                        # Docker image
├── README.md                         # This file
├── .gitignore                        # Git ignore rules
├── .env.example                      # Environment template
└── LICENSE                           # MIT License

🔒 Security Notes

What Sentinel Does

✅ Scans code for vulnerabilities
✅ Simulates realistic attacks
✅ Generates patches
✅ Creates reports

What Sentinel Does NOT Do

❌ Execute actual attacks
❌ Modify source code without approval
❌ Store sensitive credentials
❌ Bypass authentication

Security Best Practices

Environment Variables: All secrets in .env (gitignored)
JWT Tokens: Short expiration + refresh tokens
Database: Use PostgreSQL with SSL in production
CORS: Restricted to trusted origins only
Input Validation: Pydantic schema validation on all inputs
API Keys: Rotate regularly, use scoped keys

For Judges

Run with test credentials (see .env.example)
No real data is harmed
Everything is reversible
See JUDGE_VALIDATION_GUIDE.md for details

🚀 Future Improvements

💥 Challenges Faced

1. Multi-Agent Coordination

Challenge: Ensuring agents execute in sequence while maintaining state Solution: Built ScanOrchestrator service with database-backed context

2. LLM Token Optimization

Challenge: Preventing runaway token usage with 5 agents Solution: Centralized LLM with specialized system prompts instead of separate models

3. Async Database Operations

Challenge: SQLAlchemy async with multiple agents running simultaneously Solution: AsyncSession management with proper connection pooling

4. Real-time Frontend Updates

Challenge: Tracking progress from background agent execution Solution: REST API with polling + WebSocket ready architecture

📚 What We Learned

Agent Orchestration: Proper sequencing matters more than parallel execution
LLM Efficiency: One powerful model beats five specialized ones
State Management: Database as agent communication layer works well
Async Architecture: Critical for handling multiple agents at scale
DevX: Good error messages and logging saves debugging time

Performance Optimizations

Async/await throughout (no blocking I/O)
Connection pooling (50+ concurrent requests)
Redis caching for frequent queries
Database indexing on scan_id, finding_id, status
Pagination on list endpoints

Production Deployment

Docker containers with health checks
Kubernetes orchestration ready
Auto-scaling based on scan queue
CDN for static frontend assets
Separate read replicas for reporting

✨ Why This Project Matters

For Security Teams:

Automated vulnerability discovery
Realistic attack simulation
Actionable remediation patches
Executive reporting

For Developers:

Easy to understand codebase
Extensible agent framework
Production-ready FastAPI backend
Modern React frontend

For Enterprises:

Scalable to thousands of scans
Compliant architecture (SOC 2 ready)
Cost-effective (single LLM vs. multiple tools)
Integrates with existing pipelines

👥 Meet the Team

Built collaboratively by:

Sambhav Jain
Yug Agrawal
Divi Chopra

Made with ❤️ for the Hackathon Community

Stars ⭐ are appreciated! Fork and contribute!

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
backend		backend
frontend		frontend
migrations		migrations
.gitignore		.gitignore
README.md		README.md
docker-compose.yml		docker-compose.yml

Folders and files

Latest commit

History

Repository files navigation

🛡️ Sentinel

🎯 Elevator Pitch

🚀 Features

🏗️ Architecture Overview

🤖 Multi-Agent Workflow Explanation

Sequential Execution Pipeline

Why This Architecture Works

🧠 How One LLM Powers Multiple Agents

Centralized Intelligence, Specialized Tasks

📋 Agent Details

1️⃣ Scanner Agent

2️⃣ Threat Intelligence Agent

3️⃣ Attack Simulation Agent

4️⃣ Patch Generation Agent

5️⃣ Report Generation Agent

🛠️ Tech Stack

Frontend

Backend

Database

AI/Agents

DevOps

📦 Installation Guide

Prerequisites

Step 1: Clone Repository

Step 2: Backend Setup

Step 3: Frontend Setup

Step 4: Verify Installation

🔐 Environment Variables

Backend (.env)

Frontend (.env.local)

🚀 Running the Application

Option 1: Manual (Development)

Option 2: Docker Compose (Production)

📡 API Endpoints

Authentication

Scans

Findings

Attacks

Patches

Reports

System

📁 Folder Structure

🔒 Security Notes

What Sentinel Does

What Sentinel Does NOT Do

Security Best Practices

For Judges

🚀 Future Improvements

💥 Challenges Faced

1. Multi-Agent Coordination

2. LLM Token Optimization

3. Async Database Operations

4. Real-time Frontend Updates

📚 What We Learned

Performance Optimizations

Production Deployment

✨ Why This Project Matters

👥 Meet the Team

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages