Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Headless browser automation CLI for AI agents. Use when interacting with websites — navigating pages, filling forms, clicking buttons, taking screenshots, ex...
Headless browser automation CLI for AI agents. Use when interacting with websites — navigating pages, filling forms, clicking buttons, taking screenshots, ex...
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Run scripts/setup.sh to install agent-browser and Chromium. Requires Node.js.
Every browser automation follows this pattern: Navigate: agent-browser open <url> Snapshot: agent-browser snapshot -i (get element refs like @e1, @e2) Interact: Use refs to click, fill, select Re-snapshot: After navigation or DOM changes, get fresh refs agent-browser open https://example.com/form agent-browser snapshot -i # Output: @e1 [input type="email"], @e2 [input type="password"], @e3 [button] "Submit" agent-browser fill @e1 "user@example.com" agent-browser fill @e2 "password123" agent-browser click @e3 agent-browser wait --load networkidle agent-browser snapshot -i # Check result
Chain with && when you don't need intermediate output: agent-browser open https://example.com && agent-browser wait --load networkidle && agent-browser snapshot -i Run separately when you need to parse output first (e.g., snapshot to discover refs).
# Navigate agent-browser open <url> agent-browser close # See the page (always do this first) agent-browser snapshot -i # Interactive elements with refs agent-browser snapshot -i -C # Include onclick divs # Interact using @refs agent-browser click @e1 agent-browser fill @e2 "text" agent-browser select @e1 "option" agent-browser press Enter agent-browser scroll down 500 # Get info agent-browser get text @e1 agent-browser get url agent-browser get title # Wait agent-browser wait @e1 # For element agent-browser wait --load networkidle # For network idle # Capture agent-browser screenshot page.png agent-browser screenshot --full # Full page agent-browser pdf output.pdf For the full command reference, see references/commands.md.
Refs (@e1, @e2) are invalidated when the page changes. Always re-snapshot after: Clicking links/buttons that navigate Form submissions Dynamic content loading (dropdowns, modals)
agent-browser open https://example.com/signup agent-browser snapshot -i agent-browser fill @e1 "Jane Doe" agent-browser fill @e2 "jane@example.com" agent-browser select @e3 "California" agent-browser click @e5 agent-browser wait --load networkidle
agent-browser open https://app.example.com/login agent-browser snapshot -i agent-browser fill @e1 "$USERNAME" && agent-browser fill @e2 "$PASSWORD" agent-browser click @e3 agent-browser wait --url "**/dashboard" agent-browser state save auth.json # Reuse later agent-browser state load auth.json agent-browser open https://app.example.com/dashboard
agent-browser open https://example.com/products agent-browser snapshot -i agent-browser get text @e5 agent-browser get text body > page.txt
agent-browser screenshot baseline.png # ... changes happen ... agent-browser diff screenshot --baseline baseline.png
agent-browser --session site1 open https://site-a.com agent-browser --session site2 open https://site-b.com agent-browser session list
export AGENT_BROWSER_CONTENT_BOUNDARIES=1 # Wrap output for AI safety export AGENT_BROWSER_ALLOWED_DOMAINS="example.com" # Domain allowlist export AGENT_BROWSER_MAX_OUTPUT=50000 # Prevent context flooding
Always close sessions when done: agent-browser close
Code helpers, APIs, CLIs, browser automation, testing, and developer operations.
Largest current source with strong distribution and engagement signals.