Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Build and run Gemini 2.5 Computer Use browser-control agents with Playwright. Use when a user wants to automate web browser tasks via the Gemini Computer Use model, needs an agent loop (screenshot → function_call → action → function_response), or asks to integrate safety confirmation for risky UI actions.
Build and run Gemini 2.5 Computer Use browser-control agents with Playwright. Use when a user wants to automate web browser tasks via the Gemini Computer Use model, needs an agent loop (screenshot → function_call → action → function_response), or asks to integrate safety confirmation for risky UI actions.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Source the env file and set your API key: cp env.example env.sh $EDITOR env.sh source env.sh Create a virtual environment and install dependencies: python -m venv .venv source .venv/bin/activate pip install google-genai playwright playwright install chromium Run the agent script with a prompt: python scripts/computer_use_agent.py \ --prompt "Find the latest blog post title on example.com" \ --start-url "https://example.com" \ --turn-limit 6
Default: Playwright's bundled Chromium (no env vars required). Choose a channel (Chrome/Edge) with COMPUTER_USE_BROWSER_CHANNEL. Use a custom Chromium-based executable (e.g., Brave) with COMPUTER_USE_BROWSER_EXECUTABLE. If both are set, COMPUTER_USE_BROWSER_EXECUTABLE takes precedence.
Capture a screenshot and send the user goal + screenshot to the model. Parse function_call actions in the response. Execute each action in Playwright. If a safety_decision is require_confirmation, prompt the user before executing. Send function_response objects containing the latest URL + screenshot. Repeat until the model returns only text (no actions) or you hit the turn limit.
Run in a sandboxed browser profile or container. Use --exclude to block risky actions you do not want the model to take. Keep the viewport at 1440x900 unless you have a reason to change it.
Script: scripts/computer_use_agent.py Reference notes: references/google-computer-use.md Env template: env.example
Code helpers, APIs, CLIs, browser automation, testing, and developer operations.
Largest current source with strong distribution and engagement signals.