← All skills
Tencent SkillHub · Communication & Collaboration

VoiceClaw

Local voice I/O for OpenClaw agents. Transcribe inbound audio/voice messages using local Whisper (whisper.cpp) and generate voice replies using local Piper T...

skill openclawclawhub Free
0 Downloads
0 Stars
0 Installs
0 Score
High Signal

Local voice I/O for OpenClaw agents. Transcribe inbound audio/voice messages using local Whisper (whisper.cpp) and generate voice replies using local Piper T...

⬇ 0 downloads ★ 0 stars Unverified but indexed

Install for OpenClaw

Quick setup
  1. Download the package from Yavira.
  2. Extract the archive and review SKILL.md first.
  3. Import or place the package into your OpenClaw setup.

Requirements

Target platform
OpenClaw
Install method
Manual import
Extraction
Extract archive
Prerequisites
OpenClaw
Primary doc
SKILL.md

Package facts

Download mode
Yavira redirect
Package format
ZIP package
Source platform
Tencent SkillHub
What's included
README.md, LICENSE, SECURITY.md, .clawhubsafe, SKILL.md, scripts/transcribe.sh

Validation

  • Use the Yavira download entry.
  • Review SKILL.md after the package is downloaded.
  • Confirm the extracted package contains the expected setup assets.

Install with your agent

Agent handoff

Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.

  1. Download the package from Yavira.
  2. Extract it into a folder your agent can access.
  3. Paste one of the prompts below and point your agent at the extracted folder.
New install

I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Then review README.md for any prerequisites, environment setup, or post-install checks. Tell me what you changed and call out any manual steps you could not complete.

Upgrade existing

I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Then review README.md for any prerequisites, environment setup, or post-install checks. Summarize what changed and any follow-up checks I should run.

Trust & source

Release facts

Source
Tencent SkillHub
Verification
Indexed source record
Version
1.0.6

Documentation

ClawHub primary doc Primary doc: SKILL.md 10 sections Open source page

VoiceClaw

Local-only voice I/O for OpenClaw agents. STT: transcribe.sh — converts audio to text via local Whisper binary TTS: speak.sh — converts text to speech via local Piper binary Network calls: none — both scripts run fully offline No cloud APIs, no API keys required

Prerequisites

The following must be installed on the system before using this skill: RequirementPurposewhisper binarySpeech-to-text inferenceggml-base.en.bin model fileWhisper STT modelpiper binaryText-to-speech synthesis*.onnx voice model filesPiper TTS voicesffmpegAudio format conversion See README.md for installation and setup instructions.

Environment Variables

VariableDefaultPurposeWHISPER_BINauto-detected via whichPath to whisper binaryWHISPER_MODEL~/.cache/whisper/ggml-base.en.binPath to Whisper model filePIPER_BINauto-detected via whichPath to piper binaryVOICECLAW_VOICES_DIR~/.local/share/piper/voicesDirectory containing .onnx voice model files

Verify Setup

which whisper && echo "STT binary: OK" which piper && echo "TTS binary: OK" which ffmpeg && echo "ffmpeg: OK" ls "${WHISPER_MODEL:-$HOME/.cache/whisper/ggml-base.en.bin}" && echo "STT model: OK" ls "${VOICECLAW_VOICES_DIR:-$HOME/.local/share/piper/voices}"/*.onnx 2>/dev/null | head -1 && echo "TTS voices: OK"

Inbound Voice: Transcribe

# Transcribe audio → text (supports ogg, mp3, m4a, wav, flac) TRANSCRIPT=$(bash scripts/transcribe.sh /path/to/audio.ogg) Override model path: WHISPER_MODEL=/path/to/ggml-base.en.bin bash scripts/transcribe.sh audio.ogg

Outbound Voice: Speak

# Step 1: Generate WAV (local Piper — no network) WAV=$(bash scripts/speak.sh "Your response here." /tmp/reply.wav en_US-lessac-medium) # Step 2: Convert to OGG Opus (Telegram voice requirement) ffmpeg -i "$WAV" -c:a libopus -b:a 32k /tmp/reply.ogg -y -loglevel error # Step 3: Send via message tool (filePath=/tmp/reply.ogg) Override voice directory: VOICECLAW_VOICES_DIR=/path/to/voices bash scripts/speak.sh "Hello." /tmp/reply.wav

Available Voices

VoiceStyleen_US-lessac-mediumNeutral American (default)en_US-amy-mediumWarm American femaleen_US-joe-mediumAmerican maleen_US-kusal-mediumExpressive American maleen_US-danny-lowDeep American male (fast)en_GB-alba-mediumBritish femaleen_GB-northern_english_male-mediumNorthern British male

Agent Behavior Rules

Voice in → Voice + Text out. Always respond with both a voice reply and a text reply when a voice message is received. Include the transcript. Show "🎙️ I heard: [transcript]" at the top of every text reply to a voice message. Keep voice responses concise. Piper TTS works best under ~200 words — summarize for audio, include full detail in text. Local only. Never use a cloud TTS/STT API. Only the local whisper and piper binaries. Send voice before text. Send the audio file first, then follow with the text reply.

Full Example

# 1. Transcribe inbound voice message TRANSCRIPT=$(bash path/to/voiceclaw/scripts/transcribe.sh /path/to/voice.ogg) # 2. Compose reply and generate audio RESPONSE="Deployment complete. All checks passed." WAV=$(bash path/to/voiceclaw/scripts/speak.sh "$RESPONSE" /tmp/reply_$$.wav) ffmpeg -i "$WAV" -c:a libopus -b:a 32k /tmp/reply_$$.ogg -y -loglevel error # 3. Send voice + text # message(action=send, filePath=/tmp/reply_$$.ogg, ...) # reply: "🎙️ I heard: $TRANSCRIPT\n\n$RESPONSE"

Troubleshooting

IssueFixwhisper: command not foundEnsure whisper binary is installed and in PATHWhisper model not foundSet WHISPER_MODEL=/path/to/ggml-base.en.binpiper: command not foundEnsure piper binary is installed and in PATHVoice model missingSet VOICECLAW_VOICES_DIR=/path/to/voices/OGG won't play on TelegramEnsure -c:a libopus flag in ffmpeg command

Category context

Messaging, meetings, inboxes, CRM, and teammate communication surfaces.

Source: Tencent SkillHub

Largest current source with strong distribution and engagement signals.

Package contents

Included in package
3 Docs2 Files1 Scripts
  • SKILL.md Primary doc
  • README.md Docs
  • SECURITY.md Docs
  • scripts/transcribe.sh Scripts
  • .clawhubsafe Files
  • LICENSE Files