Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Install and use whisper.cpp (local, free/offline speech-to-text) with OpenClaw. Supports downloading different ggml model sizes (tiny/base/small/medium/large...
Install and use whisper.cpp (local, free/offline speech-to-text) with OpenClaw. Supports downloading different ggml model sizes (tiny/base/small/medium/large...
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
This skill sets up local whisper.cpp STT for inbound Telegram voice notes.
You need build tools (git, cmake, compiler toolchain) + curl and ffmpeg (to decode Telegram OGG/Opus β WAV).
From this skill directory: bash scripts/install_whisper_cpp.sh bash scripts/download_models.sh bash scripts/install_wrapper.sh bash scripts/patch_openclaw_audio.sh Send a Telegram voice note to test.
This setup uses ggml Whisper models stored in ~/.cache/whisper. Common model names you can download: tiny, base, small, medium large-v1, large-v2, large-v3 (bigger/slower, usually more accurate) By default we download: base + small. To download specific models: bash scripts/download_models.sh tiny base small For the OpenClaw wrapper, you can select: OPENCLAW_WHISPER_MODEL=small openclaw-whisper-stt /path/to/audio Default language: auto-detect (OPENCLAW_WHISPER_LANG=auto) Force a language (example): OPENCLAW_WHISPER_LANG=en openclaw-whisper-stt /path/to/audio Models are stored in: ~/.cache/whisper.
After install (whisper-cli + libs are in ~/.local/): bash scripts/cleanup_build.sh
Confirm OpenClaw is using the wrapper: which openclaw-whisper-stt openclaw config get tools.media.audio.models Test the wrapper directly: openclaw-whisper-stt /path/to/audio.ogg OPENCLAW_WHISPER_MODEL=small openclaw-whisper-stt /path/to/audio.ogg Follow gateway logs while sending a Telegram voice note: openclaw logs --follow
Wrapper source: bin/openclaw-whisper-stt.sh (linked to ~/.local/bin/openclaw-whisper-stt) OpenClaw config patcher: scripts/patch_openclaw_audio.sh
Agent frameworks, memory systems, reasoning layers, and model-native orchestration.
Largest current source with strong distribution and engagement signals.