Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Control a Linux X11 desktop by taking screenshots and moving/clicking/typing via xdotool + scrot.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
This skill provides a small, scriptable desktop GUI control helper for Linux X11. It's intended for "vision loop" automation:
- take a screenshot
- decide where to click
- move/click/type
- repeat
Under the hood it wraps:
- scrot for screenshots
- xdotool for mouse/keyboard/window control
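The vision loop described above can be sketched in a few lines of Python. This is only an illustration, not part of the skill: `vision_loop`, `take_screenshot`, `decide`, and `click` are hypothetical names, and the three callbacks are supplied by the caller because the package text does not say where `desktopctl.py screenshot` writes its image or how a click target should be chosen.

```python
from typing import Callable, List, Optional, Tuple

# Hypothetical callback shapes; none of these names come from the skill itself.
TakeScreenshot = Callable[[], str]                    # returns a path to the captured image
Decide = Callable[[str], Optional[Tuple[int, int]]]   # returns (x, y) to click, or None to stop
Click = Callable[[int, int], None]


def vision_loop(
    take_screenshot: TakeScreenshot,
    decide: Decide,
    click: Click,
    max_steps: int = 10,
) -> List[Tuple[int, int]]:
    """Repeat screenshot -> decide -> click until decide() returns None
    or max_steps iterations have run. Returns the clicks that were made."""
    clicks: List[Tuple[int, int]] = []
    for _ in range(max_steps):
        image_path = take_screenshot()
        target = decide(image_path)
        if target is None:
            break  # the decision step reports that nothing is left to do
        x, y = target
        click(x, y)
        clicks.append((x, y))
    return clicks
```

The `max_steps` cap is a design choice for safety: since the loop drives a real desktop session, it should not be able to run forever if the decision step never returns `None`.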
desktopctl.py: the CLI script
- Linux running X11 (not Wayland-only)
- python3
- xdotool
- scrot
Ubuntu/Debian:
    sudo apt-get update
    sudo apt-get install -y xdotool scrot
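Before running the script, it may help to confirm that the two external tools are on PATH. The following is a minimal sketch using the standard library's `shutil.which`; the function name `missing_tools` is hypothetical and is not something the skill ships.

```python
import shutil
from typing import Callable, Iterable, List, Optional


def missing_tools(
    tools: Iterable[str] = ("xdotool", "scrot"),
    which: Callable[[str], Optional[str]] = shutil.which,
) -> List[str]:
    """Return the names of required tools that cannot be found on PATH."""
    return [tool for tool in tools if which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing tools:", ", ".join(missing))
    else:
        print("xdotool and scrot were both found on PATH")
```

The `which` parameter exists only so the lookup can be swapped out in tests; in normal use the default is enough.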
From this skill directory:
    python3 desktopctl.py screenshot
    python3 desktopctl.py click 500 300
    python3 desktopctl.py type "hello"
    python3 desktopctl.py key ctrl+l
    python3 desktopctl.py windows
    python3 desktopctl.py activate "Chromium"
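When calling these commands from another Python program, one option is to build the argument list explicitly and pass it to `subprocess.run`, which avoids shell quoting issues with text such as window titles. This is a sketch under assumptions: `build_argv` and `run` are hypothetical helper names, and the script is assumed to sit in the current working directory, as in the commands above.

```python
import subprocess
import sys
from typing import List

SCRIPT = "desktopctl.py"  # assumed to be in the current working directory


def build_argv(*args: object, script: str = SCRIPT, python: str = sys.executable) -> List[str]:
    """Build the argument list for one desktopctl.py invocation.
    Every argument is converted to a string; no shell is involved."""
    return [python, script, *[str(a) for a in args]]


def run(*args: object, **kwargs) -> subprocess.CompletedProcess:
    """Run one desktopctl.py subcommand and raise CalledProcessError on a non-zero exit."""
    return subprocess.run(build_argv(*args), check=True, **kwargs)


# Example calls, mirroring the commands listed above (commented out so that
# importing this sketch never touches the desktop):
# run("click", 500, 300)
# run("type", "hello")
# run("activate", "Chromium")
```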
If you're running from a daemon/headless shell where DISPLAY isn't set:
    DISPLAY=:0 XAUTHORITY=$HOME/.Xauthority python3 desktopctl.py screenshot
Or use flags:
    python3 desktopctl.py --display :0 --xauthority $HOME/.Xauthority screenshot
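The same environment setup can be done from Python when launching the script as a subprocess. The sketch below fills in DISPLAY and XAUTHORITY only when they are absent, using the `:0` and `~/.Xauthority` values from the example above as defaults. The helper name `x11_env` is hypothetical, and the right display number depends on your session.

```python
import os
from typing import Dict, Mapping, Optional


def x11_env(
    display: str = ":0",
    xauthority: Optional[str] = None,
    base: Optional[Mapping[str, str]] = None,
) -> Dict[str, str]:
    """Return a copy of the environment with DISPLAY and XAUTHORITY filled in
    when they are missing. Values that are already set are left untouched."""
    env = dict(os.environ if base is None else base)
    env.setdefault("DISPLAY", display)
    default_xauth = os.path.join(os.path.expanduser("~"), ".Xauthority")
    env.setdefault("XAUTHORITY", xauthority or default_xauth)
    return env


# Example (commented out so importing this sketch has no side effects):
# import subprocess, sys
# subprocess.run([sys.executable, "desktopctl.py", "screenshot"], env=x11_env(), check=True)
```

Leaving existing values alone means the helper is harmless in an interactive shell where DISPLAY is already correct.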
This can click/type into your real desktop session. Use carefully.
0.1.0: Initial published skill.