🤖 AI Partner

Autonomous multi-agent orchestration platform that executes complex goals end-to-end — without hand-holding.

Most AI tools work for you. AI Partner works as you.

Give it a goal in plain English. It researches, writes code, generates documents, and delivers results to your Telegram, Discord, or Slack — autonomously.

🌐 Website

⚡ Quick Start · ✨ Features · 🤖 Agent Profiles · 🔌 Integrations · 📐 Architecture · 🗺️ Ecosystem

🎬 See It In Action

▶ Goal → autonomous execution

▶ Browser automation & delivery

🎯 What Is AI Partner?

AI Partner is a self-hosted, autonomous AI agent platform you run on your own machine with Docker. You describe a goal — it decomposes it into tasks, executes them using real tools (web search, code execution, file generation, messaging), validates outcomes against measurable criteria, and delivers results to you automatically.

"Research the top 10 AI tools launched this week,
 write a PDF comparison report, and send it to my Telegram."

AI Partner will: search the web → extract data → analyse results → generate a PDF → send to Telegram → confirm delivery. You don't touch it again.

⚡ Quick Start

Requirements: Docker Desktop plus one LLM provider: an API key for OpenAI, Anthropic, Gemini, Groq, or DeepSeek, or a local Ollama install (no API key needed)

Mac / Linux — one command:

curl -fsSL https://raw.githubusercontent.com/AmitkrPaiwal/AI-Partner/main/setup.sh | bash

Windows — paste into PowerShell:

iwr -useb https://raw.githubusercontent.com/AmitkrPaiwal/AI-Partner/main/install.ps1 | iex

Or manually:

git clone https://github.com/AmitkrPaiwal/AI-Partner
cd AI-Partner
./setup.sh          # Mac/Linux
.\install.ps1       # Windows (PowerShell)

The installer walks you through choosing an LLM provider and entering your API key, then opens the browser UI automatically. First run takes 2–4 minutes (Docker image build).

✨ Features

🧠 Autonomous Goal Execution

Type a goal — AI Partner decomposes it, builds an execution plan, runs it with real tools, validates outcomes against measurable success criteria, and retries or replans on failure.

✅ Up to 3 concurrent goals, each with up to 5 parallel sub-agents
✅ ReAct loop: Reason → Act → Assess → Retry
✅ Self-correcting: if a script errors, it semantically repairs and re-runs
✅ Typed success criteria — agent proves completion, doesn't just say "done"
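
The Reason → Act → Assess → Retry loop and the typed success criteria can be pictured with a small sketch. Everything in it is an assumption made for illustration: the `Criterion` union, the in-memory `Workspace`, and `runGoal` are hypothetical names rather than AI Partner's actual API, and the reasoning step is reduced to passing the attempt number to the action.

```typescript
// Hypothetical types: none of these names come from the AI Partner codebase.
type Criterion =
  | { kind: "file_exists"; path: string }
  | { kind: "content_contains"; path: string; text: string };

interface Workspace {
  files: Map<string, string>; // path -> content, stands in for a real workspace
}

type GoalAction = (workspace: Workspace, attempt: number) => void;

// Check one typed criterion against the observable state.
function checkCriterion(c: Criterion, workspace: Workspace): boolean {
  const content = workspace.files.get(c.path);
  if (content === undefined) return false;
  return c.kind === "file_exists" ? true : content.includes(c.text);
}

// Act, assess every criterion, and retry until all pass or the cap is hit.
function runGoal(
  act: GoalAction,
  criteria: Criterion[],
  workspace: Workspace,
  maxAttempts = 3,
): { done: boolean; attempts: number } {
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    act(workspace, attempt); // Act
    const done = criteria.every((c) => checkCriterion(c, workspace)); // Assess
    if (done) return { done: true, attempts: attempt };
  }
  return { done: false, attempts: maxAttempts };
}

// Usage: the first attempt writes an incomplete report, the second fixes it.
const demoWorkspace: Workspace = { files: new Map() };
const flakyWriter: GoalAction = (w, attempt) => {
  w.files.set("report.md", attempt === 1 ? "# Draft" : "# Draft\nSummary: ok");
};
const goalResult = runGoal(
  flakyWriter,
  [
    { kind: "file_exists", path: "report.md" },
    { kind: "content_contains", path: "report.md", text: "Summary" },
  ],
  demoWorkspace,
);
```

The point of the typed criteria is that completion is decided by checking observable state, not by the agent's own claim; in this sketch the goal only reports `done` once the second attempt makes both checks pass.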

🤖 16 Specialist Agent Profiles

Pre-built agents with enforced tool whitelists, iteration caps, and automatic routing based on keywords.

Cluster	Agents
Research	Web Researcher, Fact Checker, Trend Spotter
Dev	Python Developer, Node.js Developer, Debugger, Shell Operator
Data	Financial Analyst, Data Analyst, Excel Builder
Content	Report Generator, Summarizer, Tech Writer, Prompt Architect, Task Planner
Delivery	Telegram Reporter

Invoke a profile directly with `@fin-analyst what is RELIANCE.NS today?`, or let keywords auto-route: typing "trending AI tools" fires `@trend-spotter` automatically.
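
As a rough illustration of the two routing paths, the sketch below gives an explicit `@mention` priority and otherwise picks the first profile whose keywords appear in the message. The profile handles are the ones mentioned above; the keyword lists, the `routeMessage` function, and the `"default"` fallback are assumptions, not the project's real routing code.

```typescript
// Illustrative only: keyword lists and the fallback id are assumptions.
interface RoutableProfile {
  id: string;
  keywords: string[];
}

const routableProfiles: RoutableProfile[] = [
  { id: "trend-spotter", keywords: ["trending", "trend"] },
  { id: "fin-analyst", keywords: ["stock", "share price"] },
];

// An explicit @mention wins; otherwise the first keyword match selects a profile.
function routeMessage(message: string, fallback = "default"): string {
  const mention = message.match(/^@([a-z0-9-]+)/i);
  if (mention) return mention[1].toLowerCase();

  const text = message.toLowerCase();
  const hit = routableProfiles.find((p) =>
    p.keywords.some((k) => text.includes(k)),
  );
  return hit ? hit.id : fallback;
}

const direct = routeMessage("@fin-analyst what is RELIANCE.NS today?"); // "fin-analyst"
const routed = routeMessage("trending AI tools"); // "trend-spotter"
```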

🌐 Live Browser Automation

Puppeteer-powered browser with live CDP screencasting visible in the UI. When a CAPTCHA appears, the agent pauses and shows a "Solve CAPTCHA — Take Control" button. You solve it, the agent resumes.

📬 Goal-Integrated Messaging Delivery

Delivery is part of the goal's success criteria, not an afterthought: a result that was generated but never sent does not count as done. The agent marks a task failed if messaging_send_file doesn't succeed.

Supports: Telegram · Discord · Slack · WhatsApp · Signal
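
One way to picture delivery as a success criterion is a check over the record of tool calls made during a task. The sketch below is hypothetical: only the tool name `messaging_send_file` comes from the text above, while the `ToolCallRecord` shape, the `generate_pdf` tool name, and `deliveryStatus` are assumptions.

```typescript
// Hypothetical log entry; AI Partner's real record format may differ.
interface ToolCallRecord {
  tool: string;
  ok: boolean;
}

// A task counts as completed only if at least one file send succeeded.
function deliveryStatus(log: ToolCallRecord[]): "completed" | "failed" {
  const delivered = log.some(
    (call) => call.tool === "messaging_send_file" && call.ok,
  );
  return delivered ? "completed" : "failed";
}

// The PDF was generated, but the send failed, so the task is marked failed.
const undelivered = deliveryStatus([
  { tool: "generate_pdf", ok: true },
  { tool: "messaging_send_file", ok: false },
]);
```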

🧠 Persistent Memory

Episodic memory — timestamped event log of every conversation and outcome
Vector search — semantic similarity across 4 embedding backends
Persona — biographic facts and preferences injected into every prompt
Knowledge base — upload PDFs/docs for RAG retrieval
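
Semantic vector search generally comes down to ranking stored embeddings by similarity to a query embedding. The following is a minimal sketch using cosine similarity; the toy 3-dimensional vectors stand in for real embeddings, and `MemoryRecord` and `searchMemory` are hypothetical names, since the README does not say which similarity measure or storage layout the embedding backends use.

```typescript
// Toy vectors stand in for real embeddings; both inputs must share a length.
interface MemoryRecord {
  text: string;
  embedding: number[];
}

function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  // Zero vectors have no direction, so treat them as dissimilar to everything.
  return normA === 0 || normB === 0 ? 0 : dot / Math.sqrt(normA * normB);
}

// Return the topK records most similar to the query, best match first.
function searchMemory(
  query: number[],
  records: MemoryRecord[],
  topK: number,
): MemoryRecord[] {
  return [...records]
    .sort(
      (x, y) =>
        cosineSimilarity(query, y.embedding) -
        cosineSimilarity(query, x.embedding),
    )
    .slice(0, topK);
}

const memoryRecords: MemoryRecord[] = [
  { text: "User prefers PDF reports", embedding: [0.9, 0.1, 0.0] },
  { text: "Telegram delivery succeeded", embedding: [0.1, 0.9, 0.1] },
  { text: "Weekly AI tools research", embedding: [0.0, 0.2, 0.9] },
];
const topHits = searchMemory([1, 0, 0], memoryRecords, 1);
```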

📄 Document Generation

PDF · Excel (xlsx) · PowerPoint (pptx) · Word (docx) · HTML — downloadable from the UI or sent via messaging.

📚 Skill Learning

After a successful goal, AI Partner generalises the solution into a reusable parameterised skill template. Skills are deduplicated by embedding similarity and can be promoted to first-class MCP tools.
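
To make "parameterised skill template" concrete, here is a hedged sketch of how a stored skill could be re-instantiated with new parameters. The `{{placeholder}}` syntax, the `SkillTemplate` shape, and `instantiateSkill` are assumptions for illustration; the README does not document the actual template format.

```typescript
// Hypothetical template shape; the real skill format is not documented here.
interface SkillTemplate {
  name: string;
  steps: string[]; // steps may contain {{placeholder}} tokens
}

// Fill every {{placeholder}} from params, failing loudly on a missing value.
function instantiateSkill(
  skill: SkillTemplate,
  params: Record<string, string>,
): string[] {
  return skill.steps.map((step) =>
    step.replace(/\{\{(\w+)\}\}/g, (_match: string, key: string) => {
      if (!Object.prototype.hasOwnProperty.call(params, key)) {
        throw new Error(`Missing parameter: ${key}`);
      }
      return params[key];
    }),
  );
}

const reportSkill: SkillTemplate = {
  name: "research-and-report",
  steps: [
    "Search the web for {{topic}}",
    "Write a {{format}} report about {{topic}}",
  ],
};
const concreteSteps = instantiateSkill(reportSkill, {
  topic: "AI tools",
  format: "PDF",
});
```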

⏰ Scheduler + Triggers

Cron expressions, webhook triggers, Google Calendar events, and incoming Gmail messages can all fire autonomous goal execution.
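
For readers unfamiliar with cron expressions, the sketch below shows how a five-field expression (minute, hour, day of month, month, day of week) can be tested against a timestamp. It is deliberately simplified and is not the scheduler's implementation: it supports only `*` and comma-separated numbers, it omits ranges and steps, and it requires all five fields to match, whereas standard cron applies an either-or rule when both day fields are restricted.

```typescript
// Simplified matcher: supports "*" and comma-separated numbers only.
function fieldMatches(field: string, value: number): boolean {
  if (field === "*") return true;
  return field
    .split(",")
    .some((part) => part !== "" && Number(part) === value);
}

// Fields: minute hour day-of-month month day-of-week (0 = Sunday).
function cronMatches(expr: string, date: Date): boolean {
  const fields = expr.trim().split(/\s+/);
  if (fields.length !== 5) throw new Error("Expected 5 cron fields");
  const [minute, hour, dayOfMonth, month, dayOfWeek] = fields;
  return (
    fieldMatches(minute, date.getMinutes()) &&
    fieldMatches(hour, date.getHours()) &&
    fieldMatches(dayOfMonth, date.getDate()) &&
    fieldMatches(month, date.getMonth() + 1) && // JS months are 0-based
    fieldMatches(dayOfWeek, date.getDay())
  );
}

// "0 9 * * 1" means 09:00 every Monday; 1 January 2024 was a Monday.
const mondayMorning = cronMatches("0 9 * * 1", new Date(2024, 0, 1, 9, 0));
```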

🤖 Agent Profiles

Each profile specifies:

Field	Description
Tool whitelist	Enforced — agent cannot use tools outside its list
Iteration cap	Prevents runaway loops
Auto-select keywords	Fires automatically when matched in chat
agentType	Determines exhaustion behaviour (`research / execution / delivery / synthesis`)
Handoff instructions	Baked into every system prompt

Profiles are editable from the UI (Settings → Agent Profiles) or by editing server/src/agents/seedProfiles.ts.
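
The fields in the table could be modelled along the following lines. This is a sketch under stated assumptions, not the contents of `seedProfiles.ts`: the property names other than `agentType`, the example profile, and the `createGuard` helper are all hypothetical. It shows how a whitelist and an iteration cap can be enforced before any tool call runs.

```typescript
type AgentType = "research" | "execution" | "delivery" | "synthesis";

// Hypothetical shape; only agentType and its values appear in the README.
interface AgentProfile {
  id: string;
  agentType: AgentType;
  toolWhitelist: string[];
  maxIterations: number;
  autoSelectKeywords: string[];
  handoffInstructions: string;
}

// Returns a function that must be called before each tool use.
// It throws if the tool is not whitelisted or the iteration cap is spent.
function createGuard(profile: AgentProfile): (tool: string) => void {
  let iterations = 0;
  return (tool: string): void => {
    if (!profile.toolWhitelist.includes(tool)) {
      throw new Error(`${profile.id} may not use ${tool}`);
    }
    if (iterations >= profile.maxIterations) {
      throw new Error(`${profile.id} hit its iteration cap`);
    }
    iterations++;
  };
}

const researcherProfile: AgentProfile = {
  id: "web-researcher",
  agentType: "research",
  toolWhitelist: ["web_search", "browser_automation"],
  maxIterations: 2,
  autoSelectKeywords: ["research", "find out"],
  handoffInstructions: "Pass findings to the report generator.",
};
const authorize = createGuard(researcherProfile);
```

Calling `authorize("web_search")` succeeds twice and throws on the third call, and `authorize("code_executor")` throws immediately because that tool is outside the whitelist.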

🔌 Integrations

Add any key to .env — the agent automatically gains those tools.

Service	Env Var	Tools Unlocked
GitHub	`GITHUB_TOKEN`	search repos, list issues, create issues, get files, list PRs, add comments, search code
Notion	`NOTION_API_KEY`	search, read page, create page, query database, append blocks
Gmail	`GMAIL_USER` + `GMAIL_APP_PASSWORD`	send, search, read, list inbox
Google Calendar	`GOOGLE_CALENDAR_ACCESS_TOKEN`	list events, create event, check availability, delete event
Google Drive	`GOOGLE_DRIVE_ACCESS_TOKEN`	search, get file, list folder, create file
Twitter/X	`TWITTER_BEARER_TOKEN`	search tweets, read timeline (+ OAuth keys for posting)
Trello	`TRELLO_API_KEY` + `TRELLO_TOKEN`	list boards/cards, create card, move card, add comment
Spotify	`SPOTIFY_ACCESS_TOKEN`	search, play, pause, skip, queue, create playlist
Apify	`APIFY_API_TOKEN`	residential proxy scraping for CAPTCHA-protected sites
Image Gen	`OPENAI_API_KEY` or `STABILITY_API_KEY`	DALL-E 3 / Stability AI image generation

Messaging platforms: Telegram · Discord · Slack · WhatsApp · Signal
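
The "add a key, gain the tools" behaviour can be sketched as a lookup from service to required environment variables. The variable names below are taken from the table above; the `integrationEnv` map, the service ids, and `enabledIntegrations` are assumptions, and the sketch takes a plain object instead of reading the real environment so that it stays self-contained.

```typescript
// Env var names come from the table; the map itself is illustrative.
const integrationEnv: Record<string, string[]> = {
  github: ["GITHUB_TOKEN"],
  notion: ["NOTION_API_KEY"],
  gmail: ["GMAIL_USER", "GMAIL_APP_PASSWORD"],
  trello: ["TRELLO_API_KEY", "TRELLO_TOKEN"],
};

// A service is enabled only when every variable it needs is non-empty.
function enabledIntegrations(
  env: Record<string, string | undefined>,
): string[] {
  return Object.keys(integrationEnv).filter((service) =>
    integrationEnv[service].every((key) => (env[key] ?? "").trim() !== ""),
  );
}

// Gmail stays disabled here because GMAIL_APP_PASSWORD is missing.
const activeServices = enabledIntegrations({
  GITHUB_TOKEN: "example-token",
  GMAIL_USER: "me@example.com",
});
```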

🧠 LLM Providers

At least one required. Add the key to .env:

Provider	Env Var	Notes
Anthropic	`ANTHROPIC_API_KEY`	Claude 3.5 / 4 family
OpenAI	`OPENAI_API_KEY`	GPT-4o, GPT-4o-mini
Google	`GOOGLE_API_KEY`	Gemini 2.0 Flash
Groq	`GROQ_API_KEY`	Free tier, very fast (Llama, Mistral)
DeepSeek	`DEEPSEEK_API_KEY`	Low cost, strong at coding
Mistral	`MISTRAL_API_KEY`	European-hosted option
Together AI	`TOGETHER_API_KEY`	Wide open-source model selection
Ollama	`OLLAMA_HOST`	Local models, no API key needed
Perplexity	`PERPLEXITY_API_KEY`	Search-grounded LLM with citations

Switch models any time from Settings → Models in the UI.

📐 Architecture

flowchart TD
    U([👤 User]) -->|"Natural language goal"| AO[AgentOrchestrator\nChat OODA loop]
    AO -->|Goal detected| GE[GoalOrientedExecutor]
    GE --> GX[GoalExtractor\nTyped success criteria]
    GX --> RR[ReActReasoner\nReason → Act → Assess loop]
    RR --> EE[ExecutionEngine]
    EE --> MCP[MCP Tool Servers\n17 servers]
    EE --> DS[Docker Sandbox\ncode execution]
    RR --> SC[SelfCorrector\nSemantic script repair]
    SC --> RR
    RR --> GV[GoalValidator\nChecks file / content / messaging]
    GV -->|All criteria pass| SL[SkillLearner\nGeneralise + store for reuse]
    GV -->|Criteria fail| RR

    style U fill:#6366f1,color:#fff
    style SL fill:#10b981,color:#fff
    style GV fill:#f59e0b,color:#000

Concurrency: Up to 3 concurrent goals, each with up to 5 parallel sub-agents via delegate_parallel.
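
A cap such as "up to 5 parallel sub-agents" is commonly implemented with a small worker pool. The sketch below is one such pattern, not the implementation behind `delegate_parallel`: `runWithLimit`, `MAX_SUB_AGENTS`, and the counters are hypothetical, and `Promise.resolve()` stands in for real tool calls.

```typescript
// Run tasks with at most `limit` in flight; results keep the input order.
async function runWithLimit<T>(
  tasks: Array<() => Promise<T>>,
  limit: number,
): Promise<T[]> {
  const results: T[] = new Array(tasks.length);
  let next = 0;
  async function worker(): Promise<void> {
    while (next < tasks.length) {
      const index = next++; // claim a task before awaiting anything
      results[index] = await tasks[index]();
    }
  }
  const workerCount = Math.min(limit, tasks.length);
  await Promise.all(Array.from({ length: workerCount }, () => worker()));
  return results;
}

// Usage sketch: eight sub-agent tasks, never more than five at once.
const MAX_SUB_AGENTS = 5;
let activeTasks = 0;
let peakTasks = 0;
const subAgentTasks = [1, 2, 3, 4, 5, 6, 7, 8].map((n) => async () => {
  activeTasks++;
  peakTasks = Math.max(peakTasks, activeTasks);
  await Promise.resolve(); // stand-in for real tool calls
  activeTasks--;
  return n * 10;
});
const pendingResults = runWithLimit(subAgentTasks, MAX_SUB_AGENTS);
```

Because each worker claims its next index synchronously, no task runs twice, and the number of workers bounds how many tasks are in flight at any moment.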

MCP Tool Servers (17): web_search · browser_automation · code_executor · file_system · gmail · google_calendar · google_drive · github · notion · twitter · trello · spotify · apify · messaging (Telegram/Discord/Slack/WhatsApp/Signal) · image_generator · document_builder · memory

🗺️ Ecosystem Landscape

The open-source self-hosted agent space has several strong projects, each built around a different design philosophy. Here's how they are positioned:

Project	Primary Design Focus	Best Suited For
AI Partner	End-to-end goal execution with validated outcomes, specialist agents, document generation, and messaging delivery	Users who want to hand off a complete goal and receive a finished, delivered result — with no babysitting
Hermes Agent	Autonomous agent with 68 built-in tools, 15+ chat platforms, NL scheduling, and RL training via Atropos	Users who want wide messaging platform coverage and self-improving skills out of the box
OpenHuman	Desktop-native personal agent with 118+ OAuth integrations, auto-sync every 20 min, Memory Tree + Obsidian vault	Users who want zero server ops and automatic context from all their connected accounts
OpenClaw	Personal, always-on AI assistant with 22+ messaging platforms and a modular skills registry (ClawHub)	Users who want the widest messaging reach — LINE, WeChat, QQ, iMessage, Matrix, and more
Agent Zero	OS-level autonomy with dynamic tool creation at runtime; runs in an isolated Docker terminal	Power users who want an agent that can build its own tools and interact deeply with the operating system
OpenHands	Enterprise-grade autonomous software engineering; multi-agent collaboration and audit trails	Engineering teams automating code review, bug fixing, or large-scale software development workflows
CrewAI	Multi-agent crew pipelines with defined roles, tasks, and handoffs; best-documented orchestration framework	Teams building structured multi-agent workflows with clear role boundaries

Where AI Partner fits: Among the projects in this table, it is the only one that treats goal delivery, not just task execution, as a first-class requirement. The agent proves outcomes via typed success criteria, validates messaging delivery, and self-corrects on failure. The planned v2 (still in development) will add human proxy capabilities, acting as you across email, meetings, and phone, which no other self-hosted tool in this table offers.

⚙️ Configuration

Key files — editable without redeploying:

File	Purpose
`server/prompts/agent.system.md`	Agent core identity
`server/prompts/profiles/`	Per-profile LLM prompts
`server/prompts/reasoner-reason.md`	ReAct reasoning prompt
`server/prompts/reasoner-decide.md`	ReAct action-selection prompt
`server/config/blocked-domains.json`	Domains blocked from browser navigation
`server/config/data-api-hints.json`	API fallback hints injected when search fails
`server/templates/workspace/HEARTBEAT.md`	Proactive agenda tasks
`server/templates/workspace/SOUL.md`	Agent persona + quiet hours

Environment variables — see .env.example for the full annotated list.

🐳 Docker Commands

# Start
docker compose up -d

# View logs
docker compose logs -f app

# Stop
docker compose down

# Update to latest
./setup.sh --update        # Mac/Linux
.\install.ps1 -Update      # Windows

# Wipe all data and start fresh
./setup.sh --reset         # Mac/Linux
.\install.ps1 -Reset       # Windows

🛠️ Development

# Hot-reload dev mode
docker compose -f docker-compose.dev.yml up

# Run unit tests (145 tests)
cd server && npm run test:unit

# TypeScript check
cd server && npx tsc --noEmit

🚀 What's Coming in v2

v2 shifts the platform from working for you to working as you — acting on your behalf across email, meetings, phone, and messaging with your identity and your voice, gated by a policy you control.

Capability	Status
Human proxy — reply to email, Slack DMs, Telegram DMs in your voice (not a bot)	🔨 Building
Meeting proxy — join Teams / Zoom / Meet as a participant, speak via voice clone	🔨 Building
Phone proxy — voicemail intake, outbound calls via Twilio	🔨 Building
AuthorityPolicy — declarative auto / draft-and-ask / block per action + contact class	🔨 Building
CounterpartyStore — unified identity graph across email, Slack, Telegram, and meetings	🔨 Building
Proactive heartbeat — LLM decides when to act autonomously, no prompt needed	🔨 Building
Voice clone — speaks in your voice via ElevenLabs / MiniMax / OpenAI	🔨 Building
41 MCP servers (up from 17) — Stripe, HubSpot, S3, Sentry, Microsoft 365 added	🔨 Building
18+ LLM providers (up from 9) — OpenRouter, Cerebras, NVIDIA NIM, MiniMax added	🔨 Building
5-layer memory — adds biographic facts + counterparty graph to the existing stack	🔨 Building
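
Since AuthorityPolicy is described as a declarative auto / draft-and-ask / block decision per action and contact class, one plausible shape is a nested lookup table with a restrictive default. The sketch below is speculative: v2 is unreleased, so the action names, contact classes, rule table, and `decideAuthority` function are all invented for illustration, as is the choice to default to `"block"`.

```typescript
type Decision = "auto" | "draft-and-ask" | "block";

// Hypothetical action and contact vocabularies; v2 may define them differently.
type ProxyAction = "reply_email" | "reply_dm" | "place_call";
type ContactClass = "family" | "colleague" | "unknown";

type PolicyRules = Partial<
  Record<ProxyAction, Partial<Record<ContactClass, Decision>>>
>;

const policyRules: PolicyRules = {
  reply_email: { colleague: "draft-and-ask", family: "auto" },
  reply_dm: { family: "auto" },
};

// Anything without an explicit rule is blocked (an assumed safe default).
function decideAuthority(action: ProxyAction, contact: ContactClass): Decision {
  return policyRules[action]?.[contact] ?? "block";
}

const emailToColleague = decideAuthority("reply_email", "colleague"); // "draft-and-ask"
const callToStranger = decideAuthority("place_call", "unknown"); // "block"
```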

⭐ Star this repo and watch its releases to be notified when v2 ships. Follow progress at aipartner.amitpaliwal.com.

🤝 Contributing

Contributions are welcome! Here are some good places to start:

Browse good first issues — beginner-friendly tasks
Open a Discussion to propose features or ask questions
Submit a PR — all improvements are reviewed within 48 hours

Ideas for new contributors:

New MCP tool server integrations (Linear, Jira, Confluence, Airtable)
Additional LLM provider adapters
Browser automation improvements
UI/UX enhancements
Documentation and tutorials

📄 License

MIT — free to use, modify, and distribute.

Built with TypeScript · Express · React · Puppeteer · Docker · SQLite · MCP

⭐ If AI Partner saves you time, please star this repo — it helps others find it.

Star ⭐ · Fork 🍴 · Issues 🐛 · Discussions 💬
