Agent Browser Guide

A Claude Code skill that forces browser automation tasks to use the Vercel Agent Browser CLI instead of MCP tools or Playwright.

Keywords: claude-code, agent-skills, browser-automation, playwright-alternative, mcp-replacement, vercel, agent-browser, headless-browser, chrome-automation, cli-tools, web-scraping, screenshot, form-automation

Why

By default, Claude Code may use Playwright MCP or other browser MCP tools for web automation. This skill enforces a CLI-first approach using the agent-browser command, which:

Works immediately after installation (no MCP config, no restart needed)
Is more token-efficient than MCP-based browser tools
Supports screenshots, snapshots, form filling, clicking, and JavaScript evaluation

Installation

npx skills add heart-wcl/agent-browser-guide

Or install globally:

npx skills add heart-wcl/agent-browser-guide -g

Prerequisites

Node.js >= 18
npm

The skill includes an auto-installer that downloads and sets up agent-browser CLI and Chrome for Testing automatically.

Agent Browser CLI vs Playwright MCP

Factor	Agent Browser CLI	Playwright MCP
Setup	Install → use immediately	Configure → restart Claude
Token usage	Compact `@eN` refs (~7K for 10 steps)	Full DOM snapshots (~114K for 10 steps)
Screenshot	Save to file, AI reads	Depends on MCP implementation
Speed	Direct Rust CLI execution	JSON-RPC overhead
Chrome download	~150MB, auto-install	Requires separate Playwright install

Supported Agents

This skill works with any agent that supports the Agent Skills specification, including:

Claude Code (primary target)
Codex
Cursor
OpenCode
Kiro CLI
And 50+ more agents

Use Cases

Web scraping — Extract data from websites programmatically
UI testing — Screenshot and verify frontend changes
Form automation — Fill forms, click buttons, navigate workflows
Price monitoring — Check e-commerce sites for price changes
Content verification — Verify web pages load correctly after deployments
Research — Search and summarize web content

Usage

Once installed, any browser-related request (navigate, screenshot, click, fill form, etc.) will automatically trigger this skill and use agent-browser CLI via Bash.

Examples

# Navigate
agent-browser open https://example.com

# Screenshot
agent-browser screenshot ./result.png

# Get interactive elements
agent-browser snapshot

# Click element by @eN ref
agent-browser click @e5

# Fill form
agent-browser fill @e2 "search text"

# Wait for element
agent-browser wait "#results" 10000

Files

File	Purpose
`SKILL.md`	Skill definition with rules and workflows
`scripts/install.js`	Auto-installer for agent-browser CLI
`scripts/browser.js`	JSON wrapper for batch browser operations
`references/cli-reference.md`	Complete CLI command reference

Rules

HARD RULE: NEVER use MCP tools for browser tasks
HARD RULE: Once agent-browser is selected, never switch to other tools
Install agent-browser automatically if not present
Retry up to 2 times on network failure before asking user

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
references		references
scripts		scripts
LICENSE		LICENSE
README.md		README.md
README.zh-CN.md		README.zh-CN.md
SKILL.md		SKILL.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agent Browser Guide

Why

Installation

Prerequisites

Agent Browser CLI vs Playwright MCP

Supported Agents

Use Cases

Usage

Examples

Files

Rules

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Agent Browser Guide

Why

Installation

Prerequisites

Agent Browser CLI vs Playwright MCP

Supported Agents

Use Cases

Usage

Examples

Files

Rules

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages