PersonaForge

"How do we know our voice agent won't fail in production?"

PersonaForge answers that question before your first customer call.

PersonaForge is a synthetic customer generation and reliability testing platform for conversational voice agents. It acts as the "GitHub Actions for Voice Agents," allowing developers to autonomously validate behavior, reliability, and compliance through thousands of simulated customer interactions.

The Vision

Traditional testing for voice agents is broken. It's manual, slow, and ignores voice-native failures like interruptions and latency.

The PersonaForge Way:

Build Agent -> 1,000 Synthetic Customers -> Failure Detection -> Deploy Safely

Key Features

Forge (Persona Engine): Goal-driven, emotionally consistent synthetic customers. They don't just generate text; they maintain memory, pursue subgoals, and react to agent behavior.
Runner (Execution Engine): High-concurrency voice-native conversation runner. Supports ElevenLabs Conversational AI with real-time audio streaming.
Judge (Evaluation Engine): Multi-stage LLM evaluation that detects:
- Hallucinations: Agent inventing policies or facts.
- Escalation Failures: Agent failing to hand off to a human when required.
- Compliance: Violations of safety or business rules.
- Voice Metrics: Interruption recovery and response latency.
CI/CD Integration: Built-in quality gates for your deployment pipeline.
Studio (Dashboard): Visualize regressions, explore failure clusters, and replay conversations turn-by-turn.

Documentation

Comprehensive user guides are available in the docs/ directory:

getting-started.md: Setup, prerequisites, and first test run guide.
configuration-guide.md: Structural parameter explanations for YAML/Markdown DSL configurations.
cli-commands.md: Complete CLI command options, syntax, and behaviors.
dashboard-studio.md: Startup commands for server and web services, metrics walkthrough, and database details.
ci-cd-integration.md: Steps to add secrets and design pipeline checks.
troubleshooting.md: Solutions for common database lock issues, WebSocket error codes, and rate limits.

Quick Start

1. Installation

You can install the CLI tool directly from PyPI:

pip install personaforge

Or install from source for development and web studio access:

git clone https://github.com/arjun-vegeta/personaforge.git
cd personaforge
pip install -r requirements.txt

2. Configure API Keys

Create a .env file in your project directory:

ELEVENLABS_API_KEY=your_key_here
GOOGLE_API_KEY=your_gemini_key_here

3. Initialize Project

personaforge init

4. Run a Scenario

personaforge run scenarios/telecom_refund.yaml

5. Check for Regressions (CI Mode)

personaforge ci --scenario scenarios/telecom_refund.yaml

Dashboard

The PersonaForge Studio provides a deep dive into your agent's health.

# Start the backend
uvicorn personaforge.backend.app.main:app --reload

# Start the frontend
cd personaforge/web
npm install
npm run dev

Visit http://localhost:3000 to view pass rates, failure clusters, and conversation replays.

Docker Support

You can run the entire PersonaForge stack (PostgreSQL, Redis, Backend, Frontend, and Worker) using Docker Compose:

# Create a .env file with your API keys
cp .env.example .env

# Start the services
docker-compose up --build

CI/CD Integration

PersonaForge is designed to be part of your development workflow. The repository includes a GitHub Action template in .github/workflows/ci.yml that:

Runs unit tests.
Initializes the PersonaForge environment.
Executes a CI quality gate check against your scenarios.

To use this, add GOOGLE_API_KEY and ELEVENLABS_API_KEY to your GitHub repository secrets.

Architecture

PersonaForge is built with a modular, provider-first architecture:

FastAPI / SQLModel: High-performance backend with PostgreSQL.
Gemini 3.1 Flash Lite: Ultra-low latency LLM reasoning for customer actions and judging.
ElevenLabs ConvAI: Direct WebSocket integration for voice interaction.
Redis / RQ: Asynchronous task processing for large-scale test suites.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Built for the future of Conversational AI.

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
.github/workflows		.github/workflows
docker		docker
docs		docs
personaforge		personaforge
personas		personas
policies		policies
scenarios		scenarios
tests		tests
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
github-action-template.yml		github-action-template.yml
personaforge.yaml		personaforge.yaml
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.sh		setup.sh
skills-lock.json		skills-lock.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PersonaForge

The Vision

Key Features

Documentation

Quick Start

1. Installation

2. Configure API Keys

3. Initialize Project

4. Run a Scenario

5. Check for Regressions (CI Mode)

Dashboard

Docker Support

CI/CD Integration

Architecture

License

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

PersonaForge

The Vision

Key Features

Documentation

Quick Start

1. Installation

2. Configure API Keys

3. Initialize Project

4. Run a Scenario

5. Check for Regressions (CI Mode)

Dashboard

Docker Support

CI/CD Integration

Architecture

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages