Requirements
- Target platform: OpenClaw
- Install method: Manual import
- Extraction: Extract archive
- Prerequisites: OpenClaw
- Primary doc: SKILL.md
Transcribe audio files with speaker diarization (who speaks when). Supports 100+ languages, automatic language detection, and timestamps. Use it for meetings, interviews, podcasts, or voice messages. Requires an AssemblyAI API key.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Then review README.md for any prerequisites, environment setup, or post-install checks. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Then review README.md for any prerequisites, environment setup, or post-install checks. Summarize what changed and any follow-up checks I should run.
Transcribe audio files with speaker diarization (who speaks when).
- Transcription in 100+ languages
- Speaker diarization (Speaker A, B, C...)
- Timestamps per utterance
- Automatic language detection
- Supports MP3, WAV, M4A, FLAC, OGG, WEBM
- Create an AssemblyAI account: https://www.assemblyai.com/
- Get an API key (free tier: 100 min/month)
- Set the environment variable:
  export ASSEMBLYAI_API_KEY="your-api-key"
- Or save the key to a config file:
  // ~/.assemblyai_config.json
  { "api_key": "YOUR_API_KEY" }
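The two ways of supplying the key can be sketched in Python as follows. This is a minimal illustration, not the code in `scripts/transcribe.py`: the helper name `load_api_key` is hypothetical, and checking the environment variable before the config file is an assumed order of precedence.

```python
import json
import os
from pathlib import Path
from typing import Optional


def load_api_key(config_path: Optional[Path] = None) -> Optional[str]:
    """Return the AssemblyAI API key, or None if none is configured.

    Assumption: the environment variable wins over the config file.
    """
    key = os.environ.get("ASSEMBLYAI_API_KEY")
    if key:
        return key

    # Fall back to the JSON config file in the user's home directory.
    path = config_path or Path.home() / ".assemblyai_config.json"
    try:
        with open(path, encoding="utf-8") as fh:
            data = json.load(fh)
    except (OSError, json.JSONDecodeError):
        return None  # Missing or unreadable config file

    if isinstance(data, dict):
        return data.get("api_key")
    return None
```

Returning `None` instead of raising lets the caller decide how to report a missing key.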
python3 scripts/transcribe.py /path/to/recording.mp3
python3 scripts/transcribe.py https://example.com/meeting.mp3
python3 scripts/transcribe.py audio.mp3 --no-diarization  # Skip speaker labels
python3 scripts/transcribe.py audio.mp3 --json  # Raw JSON output
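If you want to call the script from other Python code, a thin wrapper along these lines may help. It is a sketch only: `build_command` and `transcribe` are hypothetical helper names, the script path assumes you run from the extracted skill folder, and whether `--no-diarization` and `--json` can be combined is an assumption.

```python
import subprocess
import sys
from typing import List


def build_command(source: str, diarization: bool = True,
                  raw_json: bool = False) -> List[str]:
    """Build the argument list for scripts/transcribe.py.

    `source` may be a local path or a URL, as in the examples above.
    """
    cmd = [sys.executable, "scripts/transcribe.py", source]
    if not diarization:
        cmd.append("--no-diarization")  # Skip speaker labels
    if raw_json:
        cmd.append("--json")  # Raw JSON output
    return cmd


def transcribe(source: str, **options: bool) -> str:
    """Run the script and return whatever it prints to stdout."""
    result = subprocess.run(
        build_command(source, **options),
        capture_output=True,
        text=True,
        check=True,  # Raise CalledProcessError on a non-zero exit code
    )
    return result.stdout
```

For example, `transcribe("audio.mp3", raw_json=True)` would return the raw JSON text, provided the script and API key are set up.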
## Transcript
*Language: EN*
*Duration: 05:32*
**Speaker A** [00:00]: Hello everyone, welcome to the meeting.
**Speaker B** [00:03]: Thanks! Happy to be here.
**Speaker A** [00:06]: Let's start with the first item...
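To show how diarized utterances could map onto this layout, here is a rough sketch. It assumes each utterance is a dict with `speaker`, `start` (in milliseconds), and `text` keys; the function names are hypothetical, and the actual script may build its output differently.

```python
from typing import Dict, Iterable, List


def format_timestamp(ms: int) -> str:
    """Convert milliseconds to an MM:SS string, e.g. 3000 -> '00:03'."""
    minutes, seconds = divmod(ms // 1000, 60)
    return f"{minutes:02d}:{seconds:02d}"


def render_transcript(utterances: Iterable[Dict], language: str,
                      duration_ms: int) -> str:
    """Render utterances in the Markdown layout shown above."""
    lines: List[str] = [
        "## Transcript",
        f"*Language: {language.upper()}*",
        f"*Duration: {format_timestamp(duration_ms)}*",
    ]
    for utt in utterances:
        lines.append(
            f"**Speaker {utt['speaker']}** "
            f"[{format_timestamp(utt['start'])}]: {utt['text']}"
        )
    return "\n".join(lines)


sample = [
    {"speaker": "A", "start": 0, "text": "Hello everyone, welcome to the meeting."},
    {"speaker": "B", "start": 3000, "text": "Thanks! Happy to be here."},
]
print(render_transcript(sample, "en", 332000))
```

Recordings longer than 99 minutes would still render, only with a wider minutes field.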
- Free tier: 100 minutes/month
- After that: ~$0.01/minute
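For budgeting, the figures above translate into a simple estimate. This sketch treats the approximate $0.01/minute rate as exact and assumes the free minutes are deducted before billing starts; the function name is hypothetical, and actual AssemblyAI billing may differ.

```python
FREE_MINUTES_PER_MONTH = 100   # Free tier quoted above
RATE_PER_MINUTE_USD = 0.01     # Approximate rate quoted above


def estimate_monthly_cost(minutes_transcribed: float) -> float:
    """Estimate the monthly cost in USD for a given number of audio minutes."""
    billable = max(0.0, minutes_transcribed - FREE_MINUTES_PER_MONTH)
    return round(billable * RATE_PER_MINUTE_USD, 2)


print(estimate_monthly_cost(90))    # Within the free tier
print(estimate_monthly_cost(250))   # 150 billable minutes
```

Under these assumptions, 250 minutes in a month leaves 150 billable minutes, or about $1.50.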
- For best speaker diarization: clear speaker changes, minimal overlap
- Background noise is filtered well
- Multi-language auto-detection works reliably

Author: xenofex7 | Version: 1.1.0