← All skills
Tencent SkillHub · AI

飞书语音

实现飞书语音消息的上传下载、语音转文字及文字转语音,支持与 ElevenLabs 语音服务集成。

skill openclawclawhub Free
0 Downloads
0 Stars
0 Installs
0 Score
High Signal

实现飞书语音消息的上传下载、语音转文字及文字转语音,支持与 ElevenLabs 语音服务集成。

⬇ 0 downloads ★ 0 stars Unverified but indexed

Install for OpenClaw

Quick setup
  1. Download the package from Yavira.
  2. Extract the archive and review SKILL.md first.
  3. Import or place the package into your OpenClaw setup.

Requirements

Target platform
OpenClaw
Install method
Manual import
Extraction
Extract archive
Prerequisites
OpenClaw
Primary doc
SKILL.md

Package facts

Download mode
Yavira redirect
Package format
ZIP package
Source platform
Tencent SkillHub
What's included
SKILL.md, _meta.json, skill.json

Validation

  • Use the Yavira download entry.
  • Review SKILL.md after the package is downloaded.
  • Confirm the extracted package contains the expected setup assets.

Install with your agent

Agent handoff

Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.

  1. Download the package from Yavira.
  2. Extract it into a folder your agent can access.
  3. Paste one of the prompts below and point your agent at the extracted folder.
New install

I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.

Upgrade existing

I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.

Trust & source

Release facts

Source
Tencent SkillHub
Verification
Indexed source record
Version
1.0.0

Documentation

ClawHub primary doc Primary doc: SKILL.md 15 sections Open source page

概述

本技能用于实现飞书与 ElevenLabs 的语音交互,包括: 语音转文字(用户发语音 → 识别内容) 文字转语音(生成语音回复用户) 飞书语音消息的收发

1.1 ElevenLabs API Key

export ELEVENLABS_API_KEY="你的API Key"

1.2 FFmpeg 安装

apt-get update && apt-get install -y ffmpeg

2.1 下载飞书语音

用户发送语音时,收到的是 file_key,需要通过以下步骤下载: TOKEN=$(curl -s -X POST "https://open.feishu.cn/open-apis/auth/v3/tenant_access_token/internal" \ -H "Content-Type: application/json; charset=utf-8" \ -d '{"app_id":"你的app_id","app_secret":"你的app_secret"}' | grep -o '"tenant_access_token":"[^"]*"' | cut -d'"' -f4) # 下载语音文件 curl -s "https://open.feishu.cn/open-apis/im/v1/messages/{message_id}/resources/{file_key}?type=file" \ -H "Authorization: Bearer $TOKEN" -o /path/to/voice.ogg

2.2 ElevenLabs 语音转文字

curl -s -X POST "https://api.elevenlabs.io/v1/speech-to-text?enable_logging=true" \ -H "xi-api-key: ${ELEVENLABS_API_KEY}" \ -F model_id="scribe_v1" \ -F file=@/path/to/voice.ogg 返回结果包含 text 字段,即识别出的文字内容。

3.1 ElevenLabs TTS 生成

curl -s -X POST "https://api.elevenlabs.io/v1/text-to-speech/pNInz6obpgDQGcFmaJgB" \ -H "Content-Type: application/json" \ -H "xi-api-key: ${ELEVENLABS_API_KEY}" \ -d '{ "text": "要转换的文字", "model_id": "eleven_multilingual_v2" }' -o /path/to/output.mp3

3.2 转换为飞书兼容格式

飞书语音需要 Ogg/Opus 格式,需要用 FFmpeg 转换: ffmpeg -i input.mp3 -ar 16000 -ac 1 -acodec libopus output.ogg -y

4.1 Node.js 实现

const { Client } = require('@larksuiteoapi/node-sdk'); const fs = require('fs'); const client = new Client({ appId: '你的appId', appSecret: '你的appSecret', }); async function sendVoice(filePath, durationMs, receiveId) { // 1. 上传语音文件 const uploadRes = await client.im.file.create({ data: { file_type: 'opus', file_name: 'voice.ogg', file: fs.createReadStream(filePath), duration: durationMs } }); const fileKey = uploadRes.file_key; // 2. 发送语音消息 const sendRes = await client.im.message.create({ params: { receive_id_type: 'open_id' }, data: { receive_id: receiveId, msg_type: 'audio', content: JSON.stringify({ file_key: fileKey, duration: durationMs }) } }); return sendRes; }

5.1 语音下载失败

错误: "The app is not the resource sender" 原因: 飞书安全限制,机器人只能下载自己发送的文件 解决: 用户需将语音转发给机器人(转发后机器人成为发送者)

5.2 TTS 生成文件为空

检查: 确认 ELEVENLABS_API_KEY 已设置且有余额

5.3 语音无法播放

检查: 文件格式是否为 Ogg/Opus duration 参数是否正确 文件是否在允许的目录(workspace 目录)

5.4 消息太长被拦截

钉钉:单条消息超过约7000字符会被拦截,需要拆分多条发送 飞书:同样有限制

6. 飞书权限配置

需要以下权限: im:message - 消息收发 im:resource - 文件/媒体资源 im:resource:download - 下载消息资源

7. 完整流程示例

用户发送语音 ↓ 1. 获取 message_id 和 file_key 2. 下载语音文件 (type=file) 3. ElevenLabs 语音转文字 → 理解内容 4. 生成回复内容 5. ElevenLabs TTS 生成语音 6. FFmpeg 转为 Ogg 格式 7. 上传并发送语音消息给用户

8. 相关文件位置

临时语音文件: /root/.openclaw/workspace/ TTS 转换: 需要 ffmpeg 支持 最后更新: 2026-02-23

Category context

Agent frameworks, memory systems, reasoning layers, and model-native orchestration.

Source: Tencent SkillHub

Largest current source with strong distribution and engagement signals.

Package contents

Included in package
2 Config1 Docs
  • SKILL.md Primary doc
  • _meta.json Config
  • skill.json Config