Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Pixel art desktop lobster that lip-syncs to OpenClaw TTS speech. Use when: (1) user wants a visual avatar for their AI agent, (2) user wants a desktop overla...
Pixel art desktop lobster that lip-syncs to OpenClaw TTS speech. Use when: (1) user wants a visual avatar for their AI agent, (2) user wants a desktop overla...
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
A transparent desktop overlay featuring a pixel art lobster that animates when your OpenClaw agent speaks. Powered by envelope data from your local TTS server β the lobster's mouth only moves during AI speech, not music or system audio. The app is fully bundled inside this skill β no external repository clone required.
Node.js 18+ with npx available A running TTS server exposing GET /audio/envelope (XTTS on port 8787, or any OpenAI-compatible TTS via the OpenClaw TTS proxy) Windows or Linux desktop (macOS not supported)
The app is included in this skill at <skill_dir>/app/. Install dependencies once: cd <skill_dir>/app npm install
Edit <skill_dir>/app/config.json before launching. Key settings: KeyDefaultDescriptionaudioMode"tts""tts" reacts only to TTS speech; "system" captures all audio outputttsUrl"http://127.0.0.1:8787"Base URL of your TTS servermonitor"primary""primary", "secondary", "left", "right", or display indexlobsterScale4Sprite scale (4 = 480px tall lobster)clickThroughfalseStart with click-through mode on so the lobster doesn't block clicksswimEnabledtrueEnable swimming animation
cd <skill_dir>/app npx electron . Or use the included helper script (handles first-run npm install automatically): bash <skill_dir>/scripts/launch.sh
KeyActionF8Move window to next monitor (cycles through all displays)F9Toggle click-through modeF12Toggle DevTools
With OpenClaw and a local XTTS server, set audioMode to "tts" and point ttsUrl at your XTTS instance. The lobster polls the envelope endpoint at 45ms intervals during active speech and 500ms when idle β no perceptible CPU cost. If you use the OpenClaw TTS proxy (port 8788), point ttsUrl at port 8787 (the XTTS server directly), not the proxy β the envelope endpoint is on the TTS server, not the proxy layer.
If the mouth movement is ahead of or behind the audio: Mouth moves too early: increase ttsPlayStartOffsetMs (default 1100ms) Mouth moves too late: decrease ttsPlayStartOffsetMs The default is tuned for PowerShell MediaPlayer on Windows. Other playback methods may need adjustment.
Six visemes drive natural speech animation: A β wide open "ah" B β wide grin "ee" C β round "oh" D β small pucker "oo" E β medium "eh" F β teeth "ff" Plus X (closed) for silence and pauses. Spring physics and variety enforcement prevent robotic repetition.
Agent frameworks, memory systems, reasoning layers, and model-native orchestration.
Largest current source with strong distribution and engagement signals.