Requirements
- Target platform: OpenClaw
- Install method: Manual import
- Extraction: Extract archive
- Prerequisites: OpenClaw
- Primary doc: SKILL.md
Text-to-speech, sound effects, music generation, voice management, and quota checks via the ElevenLabs API. Use when generating audio with ElevenLabs or managing voices.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Core tools for interacting with the ElevenLabs API for sound generation, music, and voice management.
See SETUP.md for prerequisites and setup instructions.
| Model | ID | Use Case |
|---|---|---|
| Eleven v3 | eleven_v3 | ⭐ Best for expressive/creative audio. Supports audio tags (square brackets): [laughs], [sighs], [whispers], [excited], [grumpy voice], [clears throat], etc. Use for storytelling, characters, demos. |
| Multilingual v2 | eleven_multilingual_v2 | Stable multilingual. No audio tags. Good for straightforward narration. |
| Turbo v2.5 | eleven_turbo_v2_5 | Low-latency, good for non-English (German TTS). Required for realtime/conversational. |
| Flash v2.5 | eleven_flash_v2_5 | Fastest, lowest cost. |
    [laughs], [chuckles], [sighs], [clears throat], [whispers], [shouts]
    [excited], [sad], [angry], [warmly], [deadpan], [sarcastic]
    [grumpy voice], [philosophical], [whiny voice], [resigned]
    [laughs hard], [sighs deeply], [pause]

Tags can be placed anywhere in the text and combined freely. v3 understands emotional context deeply.
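Because only Eleven v3 is listed as supporting audio tags, text written for v3 may need its tags removed before it is sent to another model. The following is a minimal sketch, assuming a tag is a short square-bracketed phrase of letters and spaces. `strip_audio_tags` and `prepare_text` are hypothetical helper names and are not part of the skill's scripts.

```python
import re

# Assumption: an audio tag is a square-bracketed run of letters and spaces,
# e.g. [laughs] or [grumpy voice]. Other bracketed text is left untouched.
AUDIO_TAG = re.compile(r"\[[A-Za-z][A-Za-z ]*\]")

# Model ID taken from the model list above; it is the only one listed as tag-aware.
TAG_AWARE_MODELS = {"eleven_v3"}


def strip_audio_tags(text: str) -> str:
    """Remove audio tags and collapse the whitespace they leave behind."""
    without_tags = AUDIO_TAG.sub(" ", text)
    return re.sub(r"\s+", " ", without_tags).strip()


def prepare_text(text: str, model_id: str) -> str:
    """Keep tags for tag-aware models, strip them for every other model."""
    return text if model_id in TAG_AWARE_MODELS else strip_audio_tags(text)
```

With this sketch, the same script text can be reused across models: `prepare_text(script, "eleven_v3")` keeps the tags, while any other model ID receives plain text.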
All scripts support multiple output formats via --format:

| Format | Description |
|---|---|
| mp3_44100_128 | MP3, 44.1kHz, 128kbps (default) |
| mp3_44100_192 | MP3, 44.1kHz, 192kbps |
| mp3_44100_96 | MP3, 44.1kHz, 96kbps |
| mp3_44100_64 | MP3, 44.1kHz, 64kbps |
| mp3_44100_32 | MP3, 44.1kHz, 32kbps |
| mp3_24000_48 | MP3, 24kHz, 48kbps |
| mp3_22050_32 | MP3, 22.05kHz, 32kbps |
| opus_48000_192 | Opus, 48kHz, 192kbps ⭐ best for AirPlay |
| opus_48000_128 | Opus, 48kHz, 128kbps |
| opus_48000_96 | Opus, 48kHz, 96kbps |
| opus_48000_64 | Opus, 48kHz, 64kbps |
| opus_48000_32 | Opus, 48kHz, 32kbps |
| pcm_16000 | Raw PCM, 16kHz |
| pcm_22050 | Raw PCM, 22.05kHz |
| pcm_24000 | Raw PCM, 24kHz |
| alaw_8000 | A-law, 8kHz (telephony) |
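The format identifiers above follow a `codec_samplerate[_bitrate]` pattern, so they can be decoded mechanically. The sketch below illustrates that naming scheme only. `AudioFormat` and `parse_format` are hypothetical names, and the sketch does not check whether a given identifier is accepted by the scripts or the API.

```python
from typing import NamedTuple, Optional


class AudioFormat(NamedTuple):
    codec: str
    sample_rate_hz: int
    bitrate_kbps: Optional[int]  # None for identifiers without a bitrate field


def parse_format(name: str) -> AudioFormat:
    """Split an identifier such as 'mp3_44100_128' into its parts."""
    parts = name.split("_")
    if len(parts) == 2:
        # e.g. pcm_16000 or alaw_8000: codec and sample rate only
        codec, rate = parts
        return AudioFormat(codec, int(rate), None)
    if len(parts) == 3:
        # e.g. mp3_44100_128: codec, sample rate, bitrate
        codec, rate, bitrate = parts
        return AudioFormat(codec, int(rate), int(bitrate))
    raise ValueError(f"unrecognised format identifier: {name!r}")
```

A caller could use the `codec` field to pick a matching file extension for the `-o` argument, for example `.mp3` for `mp3_*` formats.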
Text-to-speech using ElevenLabs voices.

    # Basic usage
    python3 {baseDir}/scripts/speech.py "Hello world" -v <voice_id> -o output.mp3

    # With format option
    python3 {baseDir}/scripts/speech.py "Hello world" -v <voice_id> -o output.pcm --format pcm_44100

    # With voice settings
    python3 {baseDir}/scripts/speech.py "Hello" -v <voice_id> -o out.mp3 --stability 0.7 --similarity 0.8
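When speech.py is driven from another Python program, building the argument list explicitly avoids shell-quoting problems with the text to be spoken. This is a minimal sketch that uses only the flags shown in the examples above. `build_speech_command` is a hypothetical helper, and `base_dir` stands in for the `{baseDir}` placeholder.

```python
from typing import List, Optional


def build_speech_command(
    base_dir: str,
    text: str,
    voice_id: str,
    output: str,
    audio_format: Optional[str] = None,
    stability: Optional[float] = None,
    similarity: Optional[float] = None,
) -> List[str]:
    """Assemble the argument list for scripts/speech.py without running it."""
    cmd = ["python3", f"{base_dir}/scripts/speech.py", text, "-v", voice_id, "-o", output]
    # Optional flags are appended only when a value was supplied.
    if audio_format is not None:
        cmd += ["--format", audio_format]
    if stability is not None:
        cmd += ["--stability", str(stability)]
    if similarity is not None:
        cmd += ["--similarity", str(similarity)]
    return cmd
```

The returned list can be passed to `subprocess.run(cmd, check=True)`, which hands each argument to the script unchanged, so quotes or brackets inside the text need no escaping.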
Generate sound effects and short audio clips.

    # Generate a sound
    python3 {baseDir}/scripts/sfx.py "Cinematic boom" -o boom.mp3

    # Generate a loop
    python3 {baseDir}/scripts/sfx.py "Lo-fi hip hop beat" --duration 10 --loop -o beat.mp3

    # Different format
    python3 {baseDir}/scripts/sfx.py "Whoosh" -o whoosh.pcm --format pcm_44100
Generate full musical compositions or instrumental tracks.

    # Generate instrumental intro
    python3 {baseDir}/scripts/music.py --prompt "Upbeat 6s news intro sting, instrumental" --length-ms 6000 -o intro.mp3

    # Generate background bed
    python3 {baseDir}/scripts/music.py --prompt "Soft ambient synth pad" --length-ms 30000 -o bed.mp3

    # High quality MP3
    python3 {baseDir}/scripts/music.py --prompt "Jazz piano" --length-ms 10000 -o jazz.mp3 --output-format mp3_44100_192
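music.py takes its length in milliseconds, while cue sheets are usually written in seconds. The sketch below shows one way to do the conversion when assembling the command. `music_command` is a hypothetical helper, `base_dir` stands in for `{baseDir}`, and only the `--prompt`, `--length-ms`, and `-o` flags from the examples above are used.

```python
from typing import List


def music_command(base_dir: str, prompt: str, seconds: float, output: str) -> List[str]:
    """Build a music.py argument list, converting seconds to --length-ms."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    length_ms = int(round(seconds * 1000))  # e.g. 6 s -> 6000 ms
    return [
        "python3", f"{base_dir}/scripts/music.py",
        "--prompt", prompt,
        "--length-ms", str(length_ms),
        "-o", output,
    ]
```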
List available voices and their IDs.

    # List voices
    python3 {baseDir}/scripts/voices.py

    # JSON output
    python3 {baseDir}/scripts/voices.py --json
Create instant voice clones from audio samples.

Security: by default this script will only read files from ~/.openclaw/elevenlabs/voiceclone-samples/. Copy your samples there (or pass --sample-dir). Reading files outside the sample directory is blocked.

    # Clone from audio files (put samples into ~/.openclaw/elevenlabs/voiceclone-samples)
    python3 {baseDir}/scripts/voiceclone.py --name "MyVoice" --files sample1.mp3 sample2.mp3

    # Use a custom sample dir
    python3 {baseDir}/scripts/voiceclone.py --name "Andi" --sample-dir ./samples --files a.m4a b.m4a --language de --gender male

    # With description and noise removal
    python3 {baseDir}/scripts/voiceclone.py --name "Andi" --files a.m4a b.m4a --description "German male" --denoise
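The sample-directory restriction described above is a path-containment check. The sketch below shows how such a check is commonly written in Python. It is an illustration of the technique, not the code in voiceclone.py, and `resolve_samples` is a hypothetical name.

```python
from pathlib import Path
from typing import Iterable, List

# Default location taken from the text above.
DEFAULT_SAMPLE_DIR = Path.home() / ".openclaw" / "elevenlabs" / "voiceclone-samples"


def resolve_samples(files: Iterable[str], sample_dir: Path = DEFAULT_SAMPLE_DIR) -> List[Path]:
    """Resolve each file against sample_dir and reject anything outside it."""
    root = sample_dir.expanduser().resolve()
    resolved = []
    for name in files:
        # resolve() normalises '..' segments and follows symlinks, so the
        # containment test below sees the real target path.
        candidate = (root / name).resolve()
        try:
            candidate.relative_to(root)
        except ValueError:
            raise PermissionError(
                f"{name!r} is outside the sample directory {root}"
            ) from None
        resolved.append(candidate)
    return resolved
```

Resolving before comparing is the important design choice: a plain string prefix check would accept `../` traversal and absolute paths that merely look like they start inside the directory.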
Check subscription quota and usage statistics.

    # Show current quota
    python3 {baseDir}/scripts/quota.py

    # Include usage breakdown by voice
    python3 {baseDir}/scripts/quota.py --usage

    # Last 7 days usage
    python3 {baseDir}/scripts/quota.py --usage --days 7

    # JSON output
    python3 {baseDir}/scripts/quota.py --json

Example output, shown without the progress bar and the IVC/PVC status marks:

    ElevenLabs Quota
    =======================================
    Plan:       pro (active), annual
    Characters: 66.6K / 500.0K (13.3%)
    Resets:     2026-02-18 (29 days)
    Voices:     22 / 160
    Pro Voice:  0 / 1
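The Characters line in the sample output is a used/limit pair with a percentage. The sketch below reproduces that arithmetic. `human_count` and `characters_line` are hypothetical names, the raw count 66,600 is an assumed value chosen to match the rounded sample, and the "M" suffix is an assumption, since the sample only shows "K".

```python
def human_count(n: int) -> str:
    """Abbreviate a count with one decimal place, e.g. 66600 -> '66.6K'."""
    if n >= 1_000_000:
        return f"{n / 1_000_000:.1f}M"  # assumption: not shown in the sample output
    if n >= 1_000:
        return f"{n / 1_000:.1f}K"
    return str(n)


def characters_line(used: int, limit: int) -> str:
    """Format character usage as 'used / limit (percent%)'."""
    percent = 100 * used / limit if limit else 0.0
    return f"Characters: {human_count(used)} / {human_count(limit)} ({percent:.1f}%)"


print(characters_line(66_600, 500_000))
# Characters: 66.6K / 500.0K (13.3%)
```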