Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Send voice messages across chat channels (Telegram, Discord, Feishu/Lark, Signal, WhatsApp, and others) using edge-tts for text-to-speech and ffmpeg for audi...
Send voice messages across chat channels (Telegram, Discord, Feishu/Lark, Signal, WhatsApp, and others) using edge-tts for text-to-speech and ffmpeg for audi...
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Send text as voice messages to any chat channel.
edge-tts — Microsoft Edge TTS (pip install edge-tts) ffmpeg / ffprobe — audio conversion and duration detection
Chinese: zh-CN-XiaoxiaoNeural English: en-US-JennyNeural Other languages: see references/voices.md
Use scripts/gen_voice.sh to convert text to an ogg/opus file: scripts/gen_voice.sh "你好" /tmp/voice.ogg scripts/gen_voice.sh "Hello" /tmp/voice.ogg en-US-JennyNeural Arguments: <text> <output.ogg> [voice] If voice is omitted, defaults to zh-CN-XiaoxiaoNeural.
Use the message tool directly: action=send, asVoice=true, filePath=/tmp/voice.ogg This works for most channels. Telegram confirmed working.
⚠️ Feishu does NOT support asVoice=true via the message tool. You must use the dedicated script. Use scripts/send_feishu_voice.sh: scripts/send_feishu_voice.sh /tmp/voice.ogg <receive_id> <tenant_access_token> [receive_id_type] receive_id_type: open_id (default), chat_id, user_id, union_id, email The script handles upload (as opus with duration) and sends as audio message type to produce a voice bubble. To get tenant_access_token, use the Feishu tenant token API with your app credentials.
Discord voice messages require a waveform and special flags. Generate ogg with scripts/gen_voice.sh Generate waveform: python3 scripts/gen_waveform.py /tmp/voice.ogg Outputs JSON: {"duration_secs": 4.2, "waveform": "base64..."} Send via Discord API with flags: 8192 (IS_VOICE_MESSAGE) and the waveform/duration in attachments metadata. Missing waveform/duration causes error 50161.
If asVoice=true does not produce a voice bubble on a channel: Try sending via the platform's native API If native API unavailable, send as audio file attachment
Messaging, meetings, inboxes, CRM, and teammate communication surfaces.
Largest current source with strong distribution and engagement signals.