Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Transcribe audio files using ElevenLabs Speech-to-Text (Scribe v2).
Transcribe audio files using ElevenLabs Speech-to-Text (Scribe v2).
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Then review README.md for any prerequisites, environment setup, or post-install checks. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Then review README.md for any prerequisites, environment setup, or post-install checks. Summarize what changed and any follow-up checks I should run.
Transcribe audio files using ElevenLabs' Scribe v2 model. Supports 90+ languages with speaker diarization.
# Basic transcription {baseDir}/scripts/transcribe.sh /path/to/audio.mp3 # With speaker diarization {baseDir}/scripts/transcribe.sh /path/to/audio.mp3 --diarize # Specify language (improves accuracy) {baseDir}/scripts/transcribe.sh /path/to/audio.mp3 --lang en # Full JSON output with timestamps {baseDir}/scripts/transcribe.sh /path/to/audio.mp3 --json
FlagDescription--diarizeIdentify different speakers--lang CODEISO language code (e.g., en, pt, es)--jsonOutput full JSON with word timestamps--eventsTag audio events (laughter, music, etc.)
All major audio/video formats: mp3, m4a, wav, ogg, webm, mp4, etc.
Set ELEVENLABS_API_KEY environment variable, or configure in clawdbot.json: { skills: { entries: { "elevenlabs-stt": { apiKey: "sk_..." } } } }
# Transcribe a WhatsApp voice note {baseDir}/scripts/transcribe.sh ~/Downloads/voice_note.ogg # Meeting recording with multiple speakers {baseDir}/scripts/transcribe.sh meeting.mp3 --diarize --lang en # Get JSON for processing {baseDir}/scripts/transcribe.sh podcast.mp3 --json > transcript.json
Agent frameworks, memory systems, reasoning layers, and model-native orchestration.
Largest current source with strong distribution and engagement signals.