Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
KasmVNC-based virtual desktop for headless Linux with AI-first automation and human handoff. Use when most steps are automated but a user must manually inter...
KasmVNC-based virtual desktop for headless Linux with AI-first automation and human handoff. Use when most steps are automated but a user must manually inter...
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Use this when the workflow is: AI runs browser automation most of the time Captcha / risk-control / MFA appears User takes over remotely for 1-3 minutes AI continues automatically This version replaces x11vnc+noVNC with KasmVNC and keeps computer-use style action scripts for AI control.
bash /home/ubuntu/.openclaw/workspace/skills/virtual-remote-desktop/scripts/install_kasmvnc.sh This installer also prepares required runtime tools: fluxbox (lightweight desktop) xdotool + scrot (computer-use actions) xauth
Before starting, always confirm these user requirements: takeover device: 手机 or 电脑 website render mode: 手机网页 or 桌面网页 access mode: 本地隧道 (127.0.0.1) or 临时公网 (0.0.0.0) network quality: 弱网 / 普通 / 良好 Use guided script (interactive Q&A): bash /home/ubuntu/.openclaw/workspace/skills/virtual-remote-desktop/scripts/start_vrd_guided.sh Preview config without starting: bash /home/ubuntu/.openclaw/workspace/skills/virtual-remote-desktop/scripts/start_vrd_guided.sh --dry-run
bash /home/ubuntu/.openclaw/workspace/skills/virtual-remote-desktop/scripts/start_vrd.sh Important env vars: AUTO_LAUNCH_URL (optional): open target page automatically KASM_BIND (default 127.0.0.1, safer) AUTO_STOP_IDLE_SECS (default 900) BROWSER_MOBILE_MODE=1 (launch browser with mobile emulation) BROWSER_DEVICE=iphone14pro|pixel7|ipad Example: AUTO_LAUNCH_URL="https://example.com/login" \ AUTO_STOP_IDLE_SECS=1200 \ bash /home/ubuntu/.openclaw/workspace/skills/virtual-remote-desktop/scripts/start_vrd.sh Mobile-friendly VNC stream example (better phone takeover UX): MOBILE_MODE=1 MOBILE_PRESET=phone \ AUTO_STOP_IDLE_SECS=900 \ bash /home/ubuntu/.openclaw/workspace/skills/virtual-remote-desktop/scripts/start_vrd.sh Mobile stream options: MOBILE_MODE=1 enables mobile defaults MOBILE_PRESET=phone|tablet sets default resolution (960x540 / 1280x720) KASM_MAX_FPS can be lowered further (e.g. 18) on weak networks Browser mobile emulation (website renders as mobile page): AUTO_LAUNCH_URL="https://example.com" \ BROWSER_MOBILE_MODE=1 BROWSER_DEVICE=iphone14pro \ bash /home/ubuntu/.openclaw/workspace/skills/virtual-remote-desktop/scripts/start_vrd.sh Notes: Browser mobile emulation changes UA + viewport + touch behavior. It is different from MOBILE_MODE (which optimizes VNC stream size).
bash /home/ubuntu/.openclaw/workspace/skills/virtual-remote-desktop/scripts/status_vrd.sh bash /home/ubuntu/.openclaw/workspace/skills/virtual-remote-desktop/scripts/health_vrd.sh
bash /home/ubuntu/.openclaw/workspace/skills/virtual-remote-desktop/scripts/stop_vrd.sh
All actions run on the active virtual display from pids.env. # screenshot (base64) bash scripts/action_screenshot.sh # click / type / key / scroll bash scripts/action_click.sh 500 420 left bash scripts/action_type.sh "hello" bash scripts/action_key.sh "ctrl+l" bash scripts/action_scroll.sh down 4 # helpers bash scripts/action_mouse_move.sh 800 300 bash scripts/action_cursor_position.sh bash scripts/action_wait.sh 2 Recommended loop: action_screenshot.sh analyze action_click/type/key/... screenshot verify repeat
When captcha/risk-control appears: AI sends user the KasmVNC URL + username/password User manually solves challenge User replies “done” AI runs screenshot + validation step AI resumes automation This avoids full manual operation while keeping recovery fast.
KASM_BIND=127.0.0.1 user connects via SSH tunnel best for security
KASM_BIND=0.0.0.0 short AUTO_STOP_IDLE_SECS (e.g. 300) use only for urgent remote intervention
KasmVNC uses HTTPS + per-user auth (username/password). This skill stores runtime files in ~/.openclaw/vrd-data by default. Browser profile persistence is controlled by CHROME_PROFILE_DIR. If captcha frequency is high, keep one long-lived profile to reduce repeated challenges.
Messaging, meetings, inboxes, CRM, and teammate communication surfaces.
Largest current source with strong distribution and engagement signals.