Requirements
- Target platform: OpenClaw
- Install method: Manual import
- Extraction: Extract archive
- Prerequisites: OpenClaw
- Primary doc: SKILL.md
Interactive AI avatar with Simli video rendering and ElevenLabs TTS
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Then review README.md for any prerequisites, environment setup, or post-install checks. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Then review README.md for any prerequisites, environment setup, or post-install checks. Summarize what changed and any follow-up checks I should run.
Interactive AI avatar interface for OpenClaw with real-time lip-synced video and text-to-speech.
- Voice Responses: Speaks conversational summaries using ElevenLabs TTS
- Visual Avatar: Realistic lip-synced video via Simli
- Detail Panel: Shows formatted markdown alongside spoken responses
- Multi-language: Supports multiple languages for speech and TTS
- Slack/Email: Forward responses to Slack DMs or email (when configured)
- Stream Deck: Optional hardware control with Elgato Stream Deck
1. Get API keys:
   - Simli: Avatar rendering
   - ElevenLabs: Text-to-speech
2. Set environment variables:
       export SIMLI_API_KEY=your-key
       export ELEVENLABS_API_KEY=your-key
3. Start the avatar:
       openclaw-avatar
4. Open http://localhost:5173
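Before starting the avatar, it can help to confirm that both keys are actually visible to the process. The following is a minimal sketch, not part of the package: the helper name `missing_keys` is hypothetical, and only the two variable names are taken from the setup steps above.

```python
import os

# Variable names come from the setup steps; everything else here is illustrative.
REQUIRED_VARS = ("SIMLI_API_KEY", "ELEVENLABS_API_KEY")


def missing_keys(env=None):
    """Return the required variable names that are unset or blank."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name, "").strip()]


# Report the current shell's state without exiting, so this is safe to import.
missing = missing_keys()
if missing:
    print("Missing environment variables: " + ", ".join(missing))
else:
    print("Both API keys are set.")
```

Run it in the same shell where the `export` lines were executed; a variable exported in a different terminal session will not be visible.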
When responding to avatar queries, use this format:
<spoken>
A short conversational summary (1-3 sentences). NO markdown, NO formatting. Plain speech only.
</spoken>
<detail>
Full detailed response with markdown formatting (bullet points, headers, bold, etc.).
</detail>
- spoken: Brief, natural, conversational. This is read aloud.
- detail: Comprehensive information with proper markdown.
Always include both sections.
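To make the two-section format concrete, here is a small sketch of how a client might build and split such a response. It is an assumption about how the tags could be handled, not the avatar's actual parser; `AvatarReply`, `format_reply`, and `parse_reply` are hypothetical names.

```python
import re
from typing import NamedTuple


class AvatarReply(NamedTuple):
    spoken: str  # plain text, read aloud
    detail: str  # markdown, shown in the detail panel


def format_reply(spoken: str, detail: str) -> str:
    """Wrap the two parts in the <spoken> and <detail> tags."""
    return f"<spoken>\n{spoken.strip()}\n</spoken>\n<detail>\n{detail.strip()}\n</detail>"


def parse_reply(text: str) -> AvatarReply:
    """Extract both sections; raise ValueError if either is missing or empty."""
    parts = {}
    for tag in ("spoken", "detail"):
        # DOTALL lets the detail section span multiple lines.
        match = re.search(rf"<{tag}>(.*?)</{tag}>", text, re.DOTALL)
        if match is None or not match.group(1).strip():
            raise ValueError(f"missing or empty <{tag}> section")
        parts[tag] = match.group(1).strip()
    return AvatarReply(**parts)


reply = parse_reply(format_reply("Two tasks are due today.", "- **Report**: due 5pm\n- Review PR"))
print(reply.spoken)
print(reply.detail)
```

Rejecting a response with a missing section, rather than guessing, mirrors the rule that both sections are always required.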
Avatar responses use session key: agent:main:avatar