Amitkrpaliwal/AI-Partner

🤖 AI Partner

Autonomous multi-agent orchestration platform that executes complex goals end-to-end — without hand-holding.

Most AI tools work for you. AI Partner works as you.

Give it a goal in plain English. It researches, writes code, generates documents, and delivers results to your Telegram, Discord, or Slack — autonomously.

🌐 Website · v1 live · v2 coming soon



⚡ Quick Start · ✨ Features · 🤖 Agent Profiles · 🔌 Integrations · 📐 Architecture · 🗺️ Ecosystem


🎬 See It In Action

[Demo: Goal → autonomous execution]
[Demo: Browser automation & delivery]

🎯 What Is AI Partner?

AI Partner is a self-hosted, autonomous AI agent platform you run on your own machine with Docker. You describe a goal — it decomposes it into tasks, executes them using real tools (web search, code execution, file generation, messaging), validates outcomes against measurable criteria, and delivers results to you automatically.

"Research the top 10 AI tools launched this week,
 write a PDF comparison report, and send it to my Telegram."

AI Partner will: search the web → extract data → analyse results → generate a PDF → send to Telegram → confirm delivery. You don't touch it again.


⚡ Quick Start

Requirements: Docker Desktop + one LLM API key (OpenAI, Anthropic, Gemini, Groq, DeepSeek, or local Ollama)

Mac / Linux — one command:

curl -fsSL https://raw.githubusercontent.com/Amitkrpaliwal/AI-Partner/main/setup.sh | bash

Windows — paste into PowerShell:

iwr -useb https://raw.githubusercontent.com/Amitkrpaliwal/AI-Partner/main/install.ps1 | iex

Or manually:

git clone https://github.com/Amitkrpaliwal/AI-Partner
cd AI-Partner
./setup.sh          # Mac/Linux
.\install.ps1       # Windows (PowerShell)

The installer walks you through choosing an LLM provider and entering your API key, then opens the browser UI automatically. The first run takes 2–4 minutes while the Docker image builds.


✨ Features

🧠 Autonomous Goal Execution

Type a goal — AI Partner decomposes it, builds an execution plan, runs it with real tools, validates outcomes against measurable success criteria, and retries or replans on failure.

  • ✅ Up to 3 concurrent goals, each with up to 5 parallel sub-agents
  • ✅ ReAct loop: Reason → Act → Assess → Retry
  • ✅ Self-correcting: if a script errors, it semantically repairs and re-runs
  • ✅ Typed success criteria — agent proves completion, doesn't just say "done"
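
The loop described above can be sketched roughly as follows. This is a minimal illustration, not AI Partner's actual implementation: the `Criterion` shapes, the in-memory `Workspace`, and the `runGoal` function are all hypothetical names chosen for the example.

```typescript
// Hypothetical types for illustration; not taken from the AI Partner codebase.
type Criterion =
  | { kind: "file_exists"; path: string }
  | { kind: "content_contains"; path: string; text: string };

interface Workspace {
  files: Map<string, string>; // path -> file contents (in-memory stand-in)
}

// A criterion passes only if the workspace state proves it.
function check(c: Criterion, ws: Workspace): boolean {
  const body = ws.files.get(c.path);
  if (body === undefined) return false;
  return c.kind === "file_exists" ? true : body.includes(c.text);
}

// The caller-supplied step sees which criteria still fail and acts on them.
type Step = (ws: Workspace, stillFailing: Criterion[]) => void;

function runGoal(criteria: Criterion[], step: Step, maxIterations: number) {
  const ws: Workspace = { files: new Map() };
  let failing = criteria;
  for (let i = 1; i <= maxIterations; i++) {
    step(ws, failing); // Reason + Act
    failing = criteria.filter((c) => !check(c, ws)); // Assess
    if (failing.length === 0) return { done: true, iterations: i };
    // Otherwise fall through and retry with the updated failing list.
  }
  return { done: false, iterations: maxIterations };
}
```

The point of the design is that "done" is derived from checks against observable state, never from the agent's own claim, and the iteration cap bounds how long a failing goal can retry.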

🤖 16 Specialist Agent Profiles

Pre-built agents with enforced tool whitelists, iteration caps, and automatic routing based on keywords.

| Cluster | Agents |
| --- | --- |
| Research | Web Researcher, Fact Checker, Trend Spotter |
| Dev | Python Developer, Node.js Developer, Debugger, Shell Operator |
| Data | Financial Analyst, Data Analyst, Excel Builder |
| Content | Report Generator, Summarizer, Tech Writer, Prompt Architect, Task Planner |
| Delivery | Telegram Reporter |

Invoke directly: @fin-analyst what is RELIANCE.NS today? Or let keywords auto-route: typing "trending AI tools" fires @trend-spotter automatically.
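
As a rough sketch of how the two routing paths could work, the function below checks for an explicit `@handle` first and falls back to keyword matching. The `Profile` shape, the `route` function, and the `"stock price"` keyword are assumptions made for the example; only the two handles and the "trending" trigger come from the text above.

```typescript
// Hypothetical profile shape; the real profiles live in seedProfiles.ts.
interface Profile {
  handle: string;
  keywords: string[];
}

const profiles: Profile[] = [
  { handle: "fin-analyst", keywords: ["stock price"] }, // keyword is invented
  { handle: "trend-spotter", keywords: ["trending"] },
];

// Returns the handle of the profile that should take the message, or null.
function route(message: string, all: Profile[]): string | null {
  // 1. An explicit @mention at the start of the message wins.
  const mention = message.match(/^@([a-z0-9-]+)\b/i);
  if (mention) {
    const handle = mention[1].toLowerCase();
    return all.some((p) => p.handle === handle) ? handle : null;
  }
  // 2. Otherwise the first profile with a matching keyword is selected.
  const lower = message.toLowerCase();
  const hit = all.find((p) => p.keywords.some((k) => lower.includes(k)));
  return hit ? hit.handle : null;
}
```

For instance, `route("trending AI tools", profiles)` selects `trend-spotter`, while a message with no mention and no keyword match returns `null` and would be left to the default handling.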

🌐 Live Browser Automation

Puppeteer-powered browser with live CDP screencasting visible in the UI. When a CAPTCHA appears, the agent pauses and shows a "Solve CAPTCHA — Take Control" button. You solve it, the agent resumes.

📬 Goal-Integrated Messaging Delivery

Results aren't just saved — they're validated delivery goals. The agent marks a task failed if messaging_send_file doesn't succeed.

Supports: Telegram · Discord · Slack · WhatsApp · Signal

🧠 Persistent Memory

  • Episodic memory — timestamped event log of every conversation and outcome
  • Vector search — semantic similarity across 4 embedding backends
  • Persona — biographic facts and preferences injected into every prompt
  • Knowledge base — upload PDFs/docs for RAG retrieval

📄 Document Generation

PDF · Excel (xlsx) · PowerPoint (pptx) · Word (docx) · HTML — downloadable from the UI or sent via messaging.

📚 Skill Learning

After a successful goal, AI Partner generalises the solution into a reusable, parameterised skill template. Skills are deduplicated by embedding similarity and can be promoted to first-class MCP tools.
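
Deduplication by embedding similarity can be sketched as a cosine-similarity check before insertion. The `Skill` shape, the `addSkill` function, and the 0.9 threshold are assumptions for illustration; the README does not state which similarity metric or cutoff the project uses.

```typescript
// Cosine similarity between two equal-length vectors.
function cosine(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return normA === 0 || normB === 0 ? 0 : dot / Math.sqrt(normA * normB);
}

// Hypothetical skill record: a name plus the embedding of its description.
interface Skill {
  name: string;
  embedding: number[];
}

// Stores the candidate only if no existing skill is too similar.
// Returns true when the skill was added, false when it was a duplicate.
function addSkill(store: Skill[], candidate: Skill, threshold = 0.9): boolean {
  const duplicate = store.some(
    (s) => cosine(s.embedding, candidate.embedding) >= threshold,
  );
  if (!duplicate) store.push(candidate);
  return !duplicate;
}
```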

⏰ Scheduler + Triggers

Cron-expression scheduling, webhook triggers, Google Calendar events, Gmail arrival — all fire autonomous goal execution.
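
To make the cron trigger concrete, here is a simplified matcher for standard five-field expressions (minute, hour, day of month, month, day of week). It is a sketch under stated assumptions, not the scheduler AI Partner ships: it supports only `*`, `*/n`, single values, ranges, and comma lists, and it requires all five fields to match, whereas classic cron treats restricted day-of-month and day-of-week fields as alternatives.

```typescript
// Does one cron field (e.g. "*/15", "1-5", "0,30") accept this value?
function fieldMatches(field: string, value: number): boolean {
  return field.split(",").some((part) => {
    if (part === "*") return true;
    const step = part.match(/^\*\/(\d+)$/);
    if (step) return value % Number(step[1]) === 0;
    const range = part.match(/^(\d+)-(\d+)$/);
    if (range) return value >= Number(range[1]) && value <= Number(range[2]);
    return Number(part) === value;
  });
}

// True when the date (in local time) satisfies every field of the expression.
function cronMatches(expr: string, d: Date): boolean {
  const fields = expr.trim().split(/\s+/);
  if (fields.length !== 5) throw new Error("expected 5 cron fields");
  const values = [
    d.getMinutes(),
    d.getHours(),
    d.getDate(),
    d.getMonth() + 1, // cron months are 1-12
    d.getDay(), // 0 = Sunday
  ];
  return fields.every((f, i) => fieldMatches(f, values[i]));
}
```

A scheduler built on this would call `cronMatches` once per minute and start the associated goal on each match; `"0 9 * * 1-5"`, for example, matches 09:00 on weekdays.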


🤖 Agent Profiles

Each profile specifies:

| Field | Description |
| --- | --- |
| Tool whitelist | Enforced: the agent cannot use tools outside its list |
| Iteration cap | Prevents runaway loops |
| Auto-select keywords | Fires automatically when matched in chat |
| agentType | Determines exhaustion behaviour (research / execution / delivery / synthesis) |
| Handoff instructions | Baked into every system prompt |

Profiles are editable from the UI (Settings → Agent Profiles) or by editing server/src/agents/seedProfiles.ts.
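
The sketch below shows one way a whitelist and an iteration cap could be enforced at the point where a tool is called. The `AgentProfile` fields and the `createToolGate` helper are hypothetical; the actual schema is whatever `seedProfiles.ts` defines.

```typescript
// Hypothetical subset of a profile; field names are illustrative only.
interface AgentProfile {
  name: string;
  toolWhitelist: string[];
  maxIterations: number;
}

// Returns a function that must be called before every tool invocation.
// It throws if the tool is not whitelisted or the iteration cap is spent.
function createToolGate(profile: AgentProfile): (tool: string) => void {
  let iterations = 0;
  return (tool: string): void => {
    if (!profile.toolWhitelist.includes(tool)) {
      throw new Error(`${profile.name} may not use tool "${tool}"`);
    }
    if (iterations >= profile.maxIterations) {
      throw new Error(`${profile.name} reached its iteration cap`);
    }
    iterations++;
  };
}
```

Putting both checks in a single gate means a profile cannot bypass either limit, regardless of what the model asks for.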


🔌 Integrations

Add any key to .env — the agent automatically gains those tools.

| Service | Env Var | Tools Unlocked |
| --- | --- | --- |
| GitHub | GITHUB_TOKEN | search repos, list issues, create issues, get files, list PRs, add comments, search code |
| Notion | NOTION_API_KEY | search, read page, create page, query database, append blocks |
| Gmail | GMAIL_USER + GMAIL_APP_PASSWORD | send, search, read, list inbox |
| Google Calendar | GOOGLE_CALENDAR_ACCESS_TOKEN | list events, create event, check availability, delete event |
| Google Drive | GOOGLE_DRIVE_ACCESS_TOKEN | search, get file, list folder, create file |
| Twitter/X | TWITTER_BEARER_TOKEN | search tweets, read timeline (+ OAuth keys for posting) |
| Trello | TRELLO_API_KEY + TRELLO_TOKEN | list boards/cards, create card, move card, add comment |
| Spotify | SPOTIFY_ACCESS_TOKEN | search, play, pause, skip, queue, create playlist |
| Apify | APIFY_API_TOKEN | residential proxy scraping for CAPTCHA-protected sites |
| Image Gen | OPENAI_API_KEY or STABILITY_API_KEY | DALL-E 3 / Stability AI image generation |
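
A plausible way to implement "add a key, gain the tools" is to derive the enabled integrations from the environment at startup. The snippet is a sketch with a hypothetical `enabledServices` function; the variable names come from the table above, but the registry structure is an assumption, and only four services are listed to keep it short.

```typescript
// Each integration lists the env vars that must ALL be set for it to load.
const integrations: { service: string; envVars: string[] }[] = [
  { service: "github", envVars: ["GITHUB_TOKEN"] },
  { service: "notion", envVars: ["NOTION_API_KEY"] },
  { service: "gmail", envVars: ["GMAIL_USER", "GMAIL_APP_PASSWORD"] },
  { service: "trello", envVars: ["TRELLO_API_KEY", "TRELLO_TOKEN"] },
];

// Returns the services whose required variables are all present and non-empty.
// The environment is passed in explicitly so the function is easy to test.
function enabledServices(env: Record<string, string | undefined>): string[] {
  return integrations
    .filter((i) => i.envVars.every((v) => (env[v] ?? "").trim() !== ""))
    .map((i) => i.service);
}
```

In a Node.js process this would typically be called as `enabledServices(process.env)`. Note that a service needing two variables, such as Gmail, stays disabled until both are set.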

Messaging platforms: Telegram · Discord · Slack · WhatsApp · Signal


🧠 LLM Providers

At least one required. Add the key to .env:

| Provider | Env Var | Notes |
| --- | --- | --- |
| Anthropic | ANTHROPIC_API_KEY | Claude 3.5 / 4 family |
| OpenAI | OPENAI_API_KEY | GPT-4o, GPT-4o-mini |
| Google | GOOGLE_API_KEY | Gemini 2.0 Flash |
| Groq | GROQ_API_KEY | Free tier, very fast (Llama, Mistral) |
| DeepSeek | DEEPSEEK_API_KEY | Low cost, strong at coding |
| Mistral | MISTRAL_API_KEY | European-hosted option |
| Together AI | TOGETHER_API_KEY | Wide open-source model selection |
| Ollama | OLLAMA_HOST | Local models, no API key needed |
| Perplexity | PERPLEXITY_API_KEY | Search-grounded LLM with citations |

Switch models any time from Settings → Models in the UI.


📐 Architecture

flowchart TD
    U([👤 User]) -->|"Natural language goal"| AO[AgentOrchestrator\nChat OODA loop]
    AO -->|Goal detected| GE[GoalOrientedExecutor]
    GE --> GX[GoalExtractor\nTyped success criteria]
    GX --> RR[ReActReasoner\nReason → Act → Assess loop]
    RR --> EE[ExecutionEngine]
    EE --> MCP[MCP Tool Servers\n17 servers]
    EE --> DS[Docker Sandbox\ncode execution]
    RR --> SC[SelfCorrector\nSemantic script repair]
    SC --> RR
    RR --> GV[GoalValidator\nChecks file / content / messaging]
    GV -->|All criteria pass| SL[SkillLearner\nGeneralise + store for reuse]
    GV -->|Criteria fail| RR

    style U fill:#6366f1,color:#fff
    style SL fill:#10b981,color:#fff
    style GV fill:#f59e0b,color:#000

Concurrency: Up to 3 concurrent goals, each with up to 5 parallel sub-agents via delegate_parallel.
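
The two limits can be pictured as nested concurrency limiters: one shared limiter with 3 slots for goals, and one limiter with 5 slots per goal for its sub-agents. The `createLimiter` helper below is a generic sketch of that idea, not the code behind `delegate_parallel`.

```typescript
// Returns a function that runs async tasks with at most `max` in flight.
function createLimiter(max: number) {
  let active = 0;
  const waiting: (() => void)[] = [];

  return async function run<T>(task: () => Promise<T>): Promise<T> {
    if (active >= max) {
      // Wait until a finishing task hands its slot to us.
      await new Promise<void>((resolve) => waiting.push(resolve));
    } else {
      active++;
    }
    try {
      return await task();
    } finally {
      const next = waiting.shift();
      if (next) next(); // pass this slot directly to the next waiter
      else active--; // nobody waiting, so free the slot
    }
  };
}

// Illustrative wiring: 3 goals at once, 5 sub-agents within each goal.
const goalLimiter = createLimiter(3);

function runGoalWithSubAgents(subTasks: (() => Promise<string>)[]) {
  return goalLimiter(() => {
    const subAgentLimiter = createLimiter(5);
    return Promise.all(subTasks.map((t) => subAgentLimiter(t)));
  });
}
```

Handing the slot straight to the next waiter, rather than decrementing and re-incrementing the counter, avoids a window in which a newly submitted task could slip in and push the number of running tasks above the limit.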

MCP Tool Servers (17): web_search · browser_automation · code_executor · file_system · gmail · google_calendar · google_drive · github · notion · twitter · trello · spotify · apify · messaging (Telegram/Discord/Slack/WhatsApp/Signal) · image_generator · document_builder · memory


🗺️ Ecosystem Landscape

The open-source self-hosted agent space has several strong projects, each built around a different design philosophy. Here's how they are positioned:

| Project | Primary Design Focus | Best Suited For |
| --- | --- | --- |
| AI Partner | End-to-end goal execution with validated outcomes, specialist agents, document generation, and messaging delivery | Users who want to hand off a complete goal and receive a finished, delivered result with no babysitting |
| Hermes Agent | Autonomous agent with 68 built-in tools, 15+ chat platforms, NL scheduling, and RL training via Atropos | Users who want wide messaging platform coverage and self-improving skills out of the box |
| OpenHuman | Desktop-native personal agent with 118+ OAuth integrations, auto-sync every 20 min, Memory Tree + Obsidian vault | Users who want zero server ops and automatic context from all their connected accounts |
| OpenClaw | Personal, always-on AI assistant with 22+ messaging platforms and a modular skills registry (ClawHub) | Users who want the widest messaging reach: LINE, WeChat, QQ, iMessage, Matrix, and more |
| Agent Zero | OS-level autonomy with dynamic tool creation at runtime; runs in an isolated Docker terminal | Power users who want an agent that can build its own tools and interact deeply with the operating system |
| OpenHands | Enterprise-grade autonomous software engineering; multi-agent collaboration and audit trails | Engineering teams automating code review, bug fixing, or large-scale software development workflows |
| CrewAI | Multi-agent crew pipelines with defined roles, tasks, and handoffs; best-documented orchestration framework | Teams building structured multi-agent workflows with clear role boundaries |

Where AI Partner fits: It is the only project in this space that treats goal delivery — not just task execution — as a first-class requirement. The agent proves outcomes via typed success criteria, validates messaging delivery, and self-corrects on failure. v2 adds human proxy capabilities — acting as you across email, meetings, and phone — which no other self-hosted tool in this table does.


⚙️ Configuration

Key files — editable without redeploying:

| File | Purpose |
| --- | --- |
| server/prompts/agent.system.md | Agent core identity |
| server/prompts/profiles/ | Per-profile LLM prompts |
| server/prompts/reasoner-reason.md | ReAct reasoning prompt |
| server/prompts/reasoner-decide.md | ReAct action-selection prompt |
| server/config/blocked-domains.json | Domains blocked from browser navigation |
| server/config/data-api-hints.json | API fallback hints injected when search fails |
| server/templates/workspace/HEARTBEAT.md | Proactive agenda tasks |
| server/templates/workspace/SOUL.md | Agent persona + quiet hours |
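
As an illustration of how a domain blocklist might be applied before browser navigation, the sketch below assumes `blocked-domains.json` holds a plain array of lowercase hostnames. That format is a guess, as is the `isBlocked` helper; check the actual file before relying on either.

```typescript
// Assumed file contents, e.g. ["example.com", "tracker.test"].
// A URL is blocked if its host equals a listed domain or is a subdomain of it.
function isBlocked(url: string, blockedDomains: string[]): boolean {
  const host = new URL(url).hostname.toLowerCase();
  return blockedDomains.some(
    (domain) => host === domain || host.endsWith("." + domain),
  );
}
```

Matching on the `"." + domain` suffix rather than a bare suffix keeps `notexample.com` from being caught by a rule for `example.com`.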

Environment variables — see .env.example for the full annotated list.


🐳 Docker Commands

# Start
docker compose up -d

# View logs
docker compose logs -f app

# Stop
docker compose down

# Update to latest
./setup.sh --update        # Mac/Linux
.\install.ps1 -Update      # Windows

# Wipe all data and start fresh
./setup.sh --reset
.\install.ps1 -Reset

🛠️ Development

# Hot-reload dev mode
docker compose -f docker-compose.dev.yml up

# Run unit tests (145 tests)
cd server && npm run test:unit

# TypeScript check
cd server && npx tsc --noEmit

🚀 What's Coming in v2

v2 shifts the platform from working for you to working as you — acting on your behalf across email, meetings, phone, and messaging with your identity and your voice, gated by a policy you control.

| Capability | Status |
| --- | --- |
| Human proxy: reply to email, Slack DMs, Telegram DMs in your voice (not a bot) | 🔨 Building |
| Meeting proxy: join Teams / Zoom / Meet as a participant, speak via voice clone | 🔨 Building |
| Phone proxy: voicemail intake, outbound calls via Twilio | 🔨 Building |
| AuthorityPolicy: declarative auto / draft-and-ask / block per action + contact class | 🔨 Building |
| CounterpartyStore: unified identity graph across email, Slack, Telegram, and meetings | 🔨 Building |
| Proactive heartbeat: LLM decides when to act autonomously, no prompt needed | 🔨 Building |
| Voice clone: speaks in your voice via ElevenLabs / MiniMax / OpenAI | 🔨 Building |
| 41 MCP servers (up from 17): Stripe, HubSpot, S3, Sentry, Microsoft 365 added | 🔨 Building |
| 18+ LLM providers (up from 9): OpenRouter, Cerebras, NVIDIA NIM, MiniMax added | 🔨 Building |
| 5-layer memory: adds biographic facts + counterparty graph to the existing stack | 🔨 Building |
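
Since AuthorityPolicy is still being built, its real format is not yet public. Purely as an illustration of what "auto / draft-and-ask / block per action + contact class" could mean in practice, the sketch below uses invented rule fields, action names, and contact classes, and assumes that anything without an explicit rule is blocked.

```typescript
type Decision = "auto" | "draft-and-ask" | "block";

// Hypothetical rule shape; not the project's actual policy schema.
interface PolicyRule {
  action: string; // e.g. "reply_email" (invented name)
  contactClass: string; // e.g. "colleague" (invented name)
  decision: Decision;
}

// Looks up the decision for an action toward a class of contact.
// Assumption: no matching rule means the action is blocked.
function decide(
  rules: PolicyRule[],
  action: string,
  contactClass: string,
): Decision {
  const rule = rules.find(
    (r) => r.action === action && r.contactClass === contactClass,
  );
  return rule ? rule.decision : "block";
}
```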

⭐ Star this repo to be notified when v2 ships. Follow progress at aipartner.amitpaliwal.com.


🤝 Contributing

Contributions are welcome! Here are some good places to start:

  • Browse good first issues — beginner-friendly tasks
  • Open a Discussion to propose features or ask questions
  • Submit a PR — all improvements are reviewed within 48 hours

Ideas for new contributors:

  • New MCP tool server integrations (Linear, Jira, Confluence, Airtable)
  • Additional LLM provider adapters
  • Browser automation improvements
  • UI/UX enhancements
  • Documentation and tutorials

📄 License

MIT — free to use, modify, and distribute.


Built with TypeScript · Express · React · Puppeteer · Docker · SQLite · MCP


If AI Partner saves you time, please star this repo — it helps others find it.

Star ⭐ · Fork 🍴 · Issues 🐛 · Discussions 💬

About

Self-hosted AI agent that executes goals autonomously — web search, code execution, document generation, browser automation, delivered to Telegram/Discord/Slack. Docker, 18+ LLMs, 17 MCP servers. v2 adding human proxy + meeting join.
