Changelog

v0.8.0

January 2026

New Features

  • Kernel cloud browser provider - Connect to Kernel (kernel.sh) for remote browser infrastructure with stealth mode and persistent profiles
    # Via -p flag
    agent-browser -p kernel open https://example.com
    
    # Via environment variable
    export AGENT_BROWSER_PROVIDER=kernel
    export KERNEL_API_KEY=your-api-key
    agent-browser open https://example.com
    
    # With persistent profile
    export KERNEL_PROFILE_NAME=my-profile
    agent-browser open https://example.com
  • Ignore HTTPS certificate errors - New flag for working with self-signed certificates and development environments
    agent-browser --ignore-https-errors open https://localhost:3000
  • Enhanced cookie management - Extended cookies set command with additional flags for setting cookies before page load
    agent-browser cookies set session_id "abc123" --url https://app.example.com --httpOnly --secure
    agent-browser cookies set token "xyz" --domain .example.com --path /api --expires 1735689600

Bug Fixes

  • Fixed tab list command not recognizing new pages opened via clicks or target="_blank" links
  • Fixed check command hanging indefinitely
  • Fixed set device not applying deviceScaleFactor - HiDPI screenshots now work correctly
  • Fixed state load and profile persistence not working in v0.7.6
  • Screenshots now save to temp directory when no path is provided

Security

  • Daemon and stream server now reject cross-origin connections

v0.7.1

January 2026

Bug Fixes

  • Fix native binary distribution - Native binaries for all platforms (Linux x64/arm64, macOS x64/arm64, Windows x64) are now included in the npm package. Previously, the release workflow published to npm before building binaries, causing "No binary found" errors on installation.

v0.7.0

January 2026

New Features

  • Cloud browser providers - Connect to Browserbase or Browser Use for remote browser infrastructure
    # Via -p flag (recommended)
    agent-browser -p browserbase open https://example.com
    agent-browser -p browseruse open https://example.com
    
    # Via environment variable
    export AGENT_BROWSER_PROVIDER=browserbase
    agent-browser open https://example.com
  • Persistent browser profiles - Store cookies, localStorage, and login sessions across browser restarts
    agent-browser --profile ~/.myapp-profile open myapp.com
    # Login persists across restarts
  • Remote CDP WebSocket URLs - Connect to remote browser services via WebSocket
    agent-browser --cdp "wss://browser-service.com/cdp?token=..." snapshot
  • download command - Trigger downloads and wait for completion
    agent-browser download @e1 ./file.pdf
    agent-browser wait --download ./output.zip --timeout 30000
  • Browser launch configuration - Fine-grained control over browser startup
    agent-browser --args "--disable-gpu,--no-sandbox" open example.com
    agent-browser --user-agent "Custom UA" open example.com
    agent-browser --proxy-bypass "localhost,*.internal" open example.com
  • Enhanced skills - Hierarchical structure with references and templates for Claude Code

Bug Fixes

  • Screenshot command now supports refs and has improved error messages
  • WebSocket URLs work in connect command
  • Fixed socket file location (uses ~/.agent-browser instead of TMPDIR)
  • Windows binary path fix (.exe extension)
  • State load and path-based actions now show correct output messages

Documentation

  • Added Claude Code marketplace plugin installation instructions
  • Updated skill documentation with references and templates
  • Improved error documentation

v0.6.0

January 2026

New Features

  • Video recording - Record browser sessions to WebM using Playwright's native recording
    agent-browser record start ./demo.webm
    agent-browser click @e1
    agent-browser record stop
  • connect command - Connect to a browser via CDP and persist the connection for subsequent commands
    agent-browser connect 9222
    agent-browser snapshot  # No --cdp needed after connect
  • --proxy flag - Configure browser proxy with optional authentication
    agent-browser --proxy http://user:pass@proxy.com:8080 open example.com
  • get styles command - Extract computed styles from elements
    agent-browser get styles "button"
  • Claude marketplace plugin - Added .claude-plugin/marketplace.json for Claude Code integration
  • Enhanced network output - network requests now shows method, URL, and resource type
  • --version flag - Display CLI version

Bug Fixes

  • Fix Windows daemon startup and port calculation
  • Support libasound2t64 on newer Ubuntu versions (24.04+)
  • Prevent CDP timeout on empty URL tabs
  • Output screenshot as base64 when no path provided
  • Resolve refs in get value command
  • Support URL parameter in tab new command
  • Allow about:, data:, and file: URL schemes
  • Detect stale unix socket by attempting connection
  • Respect AGENT_BROWSER_HEADED environment variable
  • Handle SIGPIPE to prevent panic when piping to head/tail
  • Fix null path validation in screenshot command

Protocol Alignment

These changes align the CLI with the daemon protocol for consistency:

  • select command now uses values field (supports multiple selections)
  • frame main uses mainframe action
  • mouse wheel uses wheel action
  • set media uses emulatemedia action
  • Console output uses messages field

Documentation

  • Expanded SKILL.md with comprehensive command reference
  • Updated README with new commands and options
  • Updated CDP mode documentation with connect workflow