Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Local Spanish TTS using Microsoft VibeVoice. Generate natural voice audio from text, optimized for WhatsApp voice messages.
Local Spanish TTS using Microsoft VibeVoice. Generate natural voice audio from text, optimized for WhatsApp voice messages.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Then review README.md for any prerequisites, environment setup, or post-install checks. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Then review README.md for any prerequisites, environment setup, or post-install checks. Summarize what changed and any follow-up checks I should run.
Local text-to-speech using Microsoft's VibeVoice model. Generates natural Spanish voice audio, perfect for WhatsApp voice messages.
# Basic usage {baseDir}/scripts/vv.sh "Hola, esto es una prueba" -o /tmp/audio.ogg # From file {baseDir}/scripts/vv.sh -f texto.txt -o /tmp/audio.ogg # Different voice {baseDir}/scripts/vv.sh "Texto" -v en-Wayne -o /tmp/audio.ogg # Adjust speed (0.5-2.0) {baseDir}/scripts/vv.sh "Texto" -s 1.2 -o /tmp/audio.ogg
SettingDefaultDescriptionVoicesp-Spk1_manSpanish male voice (slight Mexican accent)Speed1.1515% faster than normalFormat.oggOpus codec for WhatsApp
Spanish: sp-Spk1_man - Male, slight Mexican accent (default) English: en-Wayne - Male en-Denise - Female Other voices in ~/VibeVoice/demo/voices/streaming_model/
.ogg - Opus codec (WhatsApp compatible, recommended) .mp3 - MP3 format .wav - Uncompressed WAV
Always use .ogg format with asVoice=true in the message tool: # Generate {baseDir}/scripts/vv.sh "Tu mensaje aquí" -o /tmp/mensaje.ogg # Send via message tool message action=send channel=whatsapp to="+34XXXXXXXXX" filePath=/tmp/mensaje.ogg asVoice=true
GPU: NVIDIA with ~2GB VRAM VibeVoice: Installed at ~/VibeVoice ffmpeg: For audio conversion Python 3.10+: With torch, torchaudio
RTF: ~0.24x (generates faster than realtime) 1 minute of audio ≈ 15 seconds to generate
First run loads model (~10s), subsequent runs are faster Audio rule: Only send voice if user requests it or speaks via audio Keep text under 1500 chars for best quality
Messaging, meetings, inboxes, CRM, and teammate communication surfaces.
Largest current source with strong distribution and engagement signals.