Summary
Create a collection of intentionally vulnerable AI agent projects that demonstrate g0's scanning capabilities and help developers understand agent security risks hands-on.
Motivation
There's no "try it yourself" experience for g0. A developer running g0 scan on their own code may get zero findings (good code) or be overwhelmed (many findings). Sample repos provide:
- Onboarding: "Run g0 on this and see what it finds"
- Demos: Conference talks, sales demos, blog posts
- Testing: Regression testing for g0's detection engine
- Education: Teach developers about agent security risks
Proposed Repos
1. vuln-banking-agent (Python/LangChain)
- Shared memory between users
- SQL injection via unvalidated tool
- System prompt with no boundaries
- Hardcoded API keys
- No human oversight for transfers
- Expected: ~15 findings, score ~45
2. vuln-support-bot (TypeScript/Vercel AI)
- PII leakage in responses
- No rate limiting on endpoint
- Code execution without sandbox
- Memory persists across sessions
- Expected: ~12 findings, score ~55
3. vuln-mcp-server (TypeScript/MCP)
- Tool descriptions with injection payloads
- No input schema validation
- Hardcoded credentials in config
- Overly broad tool permissions
- No version pinning
- Expected: ~10 findings, score ~50
4. vuln-multi-agent (Python/CrewAI)
- Unrestricted agent delegation
- No privilege boundaries between agents
- Shared tools without access control
- No audit trail
- Expected: ~18 findings, score ~35
5. secure-agent-reference (Python/LangChain)
- Same use case as vuln-banking-agent but properly secured
- Shows all the fixes applied
- Expected: 0-2 findings, score ~95
- Paired with vuln-banking-agent for before/after comparison
Per Repo Structure
vuln-banking-agent/
├── README.md # What this is, how to scan it
├── main.py # Agent code
├── tools.py # Tools with vulnerabilities
├── .g0.yaml # g0 config
├── FINDINGS.md # Expected findings with explanations
└── requirements.txt
Hosting
- Separate public repos under the guard0-ai org
- Or a single
guard0-ai/vuln-agents monorepo with subdirectories
- Referenced from g0 README and docs/getting-started.md
Acceptance Criteria
Summary
Create a collection of intentionally vulnerable AI agent projects that demonstrate g0's scanning capabilities and help developers understand agent security risks hands-on.
Motivation
There's no "try it yourself" experience for g0. A developer running
g0 scanon their own code may get zero findings (good code) or be overwhelmed (many findings). Sample repos provide:Proposed Repos
1.
vuln-banking-agent(Python/LangChain)2.
vuln-support-bot(TypeScript/Vercel AI)3.
vuln-mcp-server(TypeScript/MCP)4.
vuln-multi-agent(Python/CrewAI)5.
secure-agent-reference(Python/LangChain)Per Repo Structure
Hosting
guard0-ai/vuln-agentsmonorepo with subdirectoriesAcceptance Criteria
npx @guard0/g0 scan .out of the box