Installation#

Installs the native Rust binary for maximum performance:

npm install -g agent-browser
agent-browser install  # Download Chrome from Chrome for Testing (first time)

This is the fastest option -- commands run through the native Rust CLI directly with sub-millisecond parsing overhead.

Quick start (no install)#

npx agent-browser install   # Download Chrome (first time only)
npx agent-browser open example.com

Project installation (local dependency)#

For projects that want to pin the version in package.json:

npm install agent-browser
npx agent-browser install  # Download Chrome (first time)

Then use via npx or package.json scripts.

Homebrew (macOS)#

brew install agent-browser
agent-browser install  # Download Chrome (first time)

Cargo (Rust)#

cargo install agent-browser
agent-browser install  # Download Chrome (first time)

Compiles from source (~2-3 min). Requires the Rust toolchain (rustup.rs).

From source#

git clone https://github.com/vercel-labs/agent-browser
cd agent-browser
pnpm install
pnpm build
pnpm build:native
./bin/agent-browser install
pnpm link --global

Linux dependencies#

On Linux, install system dependencies:

agent-browser install --with-deps

Updating#

Upgrade to the latest version:

agent-browser upgrade

Detects your installation method (npm, Homebrew, or Cargo) and runs the appropriate update command automatically. Displays the version change on success, or informs you if you are already on the latest version.

Doctor#

doctor diagnoses your install and auto-cleans stale daemon files. Run it whenever something stops working unexpectedly, or after upgrades:

agent-browser doctor                     # Full diagnosis
agent-browser doctor --offline --quick   # Local-only, fastest (~<1s)
agent-browser doctor --fix               # Also run destructive repairs
agent-browser doctor --json              # Structured output

It checks:

CategoryWhat it checks
EnvironmentCLI version, platform, home directory, state and socket dirs, free disk space
ChromeChrome install path and version, cache dir, Puppeteer fallback, user-data dir and profile count, optional lightpanda engine
DaemonsRunning daemons per session, stale .sock / .pid / .version / .stream files (auto-cleaned), version mismatch with the CLI, dashboard process liveness
Config~/.agent-browser/config.json, ./agent-browser.json, and any file at AGENT_BROWSER_CONFIG parse as valid JSON
SecurityEncryption key env var or ~/.agent-browser/.encryption-key (with 0600 permissions on unix), state file count and age vs AGENT_BROWSER_STATE_EXPIRE_DAYS, action policy file
ProvidersEnv vars for Browserless, Browserbase, Browser Use, Kernel, AgentCore (AWS creds), Appium (for --provider ios), and AI_GATEWAY_API_KEY for chat
NetworkReachability of the Chrome for Testing CDN, AI Gateway (if configured), and any currently selected provider endpoint (skipped under --offline)
Launch testSpawns a scratch session, launches headless Chrome, navigates to about:blank, then closes. Measures wall time (skipped under --quick)

Stale sidecar files are always cleaned. Destructive actions are opt-in via --fix:

CheckWhat --fix does
Chrome missingRuns agent-browser install
Version-mismatched daemonsSends close to each and cleans files
Old state filesDeletes state files older than AGENT_BROWSER_STATE_EXPIRE_DAYS (default 30)
Missing encryption keyGenerates a new key at ~/.agent-browser/.encryption-key (0600, unix); never overwrites an existing key

Exit code is 0 if all checks pass (warnings are fine), 1 if any fail.

Custom browser#

Use a custom browser executable instead of bundled Chromium:

  • Serverless - Use @sparticuz/chromium (~50MB vs ~684MB)
  • System browser - Use existing Chrome installation
  • Custom builds - Use modified browser builds
# Via flag
agent-browser --executable-path /path/to/chromium open example.com

# Via environment variable
AGENT_BROWSER_EXECUTABLE_PATH=/path/to/chromium agent-browser open example.com

Serverless example#

Use @sparticuz/chromium or similar to obtain a Chromium executable path, then pass it via --executable-path or AGENT_BROWSER_EXECUTABLE_PATH.

AI agent setup#

agent-browser works with any AI agent out of the box. For richer context:

Install the skill for your AI coding assistant:

npx skills add vercel-labs/agent-browser

This works with Claude Code, Codex, Cursor, Gemini CLI, GitHub Copilot, Goose, OpenCode, and Windsurf. The skill is fetched from the repository and stays up to date automatically.

Do not copy SKILL.md from node_modules -- it will become stale as new features are added. Always use npx skills add or reference the repository version.

AGENTS.md / CLAUDE.md#

Add to your instructions file:

## Browser Automation

Use `agent-browser` for web automation. Run `agent-browser --help` for all commands.

Core workflow:
1. `agent-browser open <url>` - Navigate to page
2. `agent-browser snapshot -i` - Get interactive elements with refs (@e1, @e2)
3. `agent-browser click @e1` / `fill @e2 "text"` - Interact using refs
4. Re-snapshot after page changes