# Send Whisper Stt to your agent
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
## Fast path
- Download the package from Yavira.
- Extract it into a folder your agent can access.
- Paste one of the prompts below and point your agent at the extracted folder.
## Suggested prompts
### New install

```text
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Then review README.md for any prerequisites, environment setup, or post-install checks. Tell me what you changed and call out any manual steps you could not complete.
```
### Upgrade existing

```text
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Then review README.md for any prerequisites, environment setup, or post-install checks. Summarize what changed and any follow-up checks I should run.
```
## Machine-readable fields
```json
{
  "schemaVersion": "1.0",
  "item": {
    "slug": "openclaw-skill-whisper-stt",
    "name": "Whisper Stt",
    "source": "tencent",
    "type": "skill",
    "category": "AI 智能",
    "sourceUrl": "https://clawhub.ai/QiaoTuCodes/openclaw-skill-whisper-stt",
    "canonicalUrl": "https://clawhub.ai/QiaoTuCodes/openclaw-skill-whisper-stt",
    "targetPlatform": "OpenClaw"
  },
  "install": {
    "downloadUrl": "/downloads/openclaw-skill-whisper-stt",
    "sourceDownloadUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=openclaw-skill-whisper-stt",
    "sourcePlatform": "tencent",
    "targetPlatform": "OpenClaw",
    "packageFormat": "ZIP package",
    "primaryDoc": "SKILL.md",
    "includedAssets": [
      "README-CN.md",
      "README.md",
      "SKILL.md",
      "transcribe.py"
    ],
    "downloadMode": "redirect",
    "sourceHealth": {
      "source": "tencent",
      "status": "healthy",
      "reason": "direct_download_ok",
      "recommendedAction": "download",
      "checkedAt": "2026-04-30T16:55:25.780Z",
      "expiresAt": "2026-05-07T16:55:25.780Z",
      "httpStatus": 200,
      "finalUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=network",
      "contentType": "application/zip",
      "probeMethod": "head",
      "details": {
        "probeUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=network",
        "contentDisposition": "attachment; filename=\"network-1.0.0.zip\"",
        "redirectLocation": null,
        "bodySnippet": null
      },
      "scope": "source",
      "summary": "Source download looks usable.",
      "detail": "Yavira can redirect you to the upstream package for this source.",
      "primaryActionLabel": "Download for OpenClaw",
      "primaryActionHref": "/downloads/openclaw-skill-whisper-stt"
    },
    "validation": {
      "installChecklist": [
        "Use the Yavira download entry.",
        "Review SKILL.md after the package is downloaded.",
        "Confirm the extracted package contains the expected setup assets."
      ],
      "postInstallChecks": [
        "Confirm the extracted package includes the expected docs or setup files.",
        "Validate the skill or prompts are available in your target agent workspace.",
        "Capture any manual follow-up steps the agent could not complete."
      ]
    }
  },
  "links": {
    "detailUrl": "https://openagent3.xyz/skills/openclaw-skill-whisper-stt",
    "downloadUrl": "https://openagent3.xyz/downloads/openclaw-skill-whisper-stt",
    "agentUrl": "https://openagent3.xyz/skills/openclaw-skill-whisper-stt/agent",
    "manifestUrl": "https://openagent3.xyz/skills/openclaw-skill-whisper-stt/agent.json",
    "briefUrl": "https://openagent3.xyz/skills/openclaw-skill-whisper-stt/agent.md"
  }
}
```
## Documentation

### Whisper 语音转文字技能

将音频/语音文件识别并转换为文字。

### 自动处理（推荐）

当用户发送语音/音频文件时，自动：

识别语音内容
转换为文字
发送文字到飞书

### 手动命令

用户发送音频文件后，技能自动处理。

### 支持格式

MP3, WAV, M4A, OGG, FLAC, WebM
任何音频格式（FFmpeg支持即可）

### 模型选择

模型大小速度精度tiny~1GB最快基础base~1GB快一般small~2GB中等较好medium~5GB较慢很好large~10GB最慢最佳turbo~6GB快接近large

### 技术实现

import whisper

# 加载模型（首次使用会下载）
model = whisper.load_model("base")  # 可选: tiny/base/small/medium/large/turbo

# 识别语音
result = model.transcribe("audio.mp3")

# 输出文字
print(result["text"])

### 依赖

Python 3.8+
PyTorch
openai-whisper
ffmpeg

### 注意事项

首次使用会下载模型（1-10GB）
大模型需要较多内存
中文识别效果很好
## Trust
- Source: tencent
- Verification: Indexed source record
- Publisher: QiaoTuCodes
- Version: 0.1.0
## Source health
- Status: healthy
- Source download looks usable.
- Yavira can redirect you to the upstream package for this source.
- Health scope: source
- Reason: direct_download_ok
- Checked at: 2026-04-30T16:55:25.780Z
- Expires at: 2026-05-07T16:55:25.780Z
- Recommended action: Download for OpenClaw
## Links
- [Detail page](https://openagent3.xyz/skills/openclaw-skill-whisper-stt)
- [Send to Agent page](https://openagent3.xyz/skills/openclaw-skill-whisper-stt/agent)
- [JSON manifest](https://openagent3.xyz/skills/openclaw-skill-whisper-stt/agent.json)
- [Markdown brief](https://openagent3.xyz/skills/openclaw-skill-whisper-stt/agent.md)
- [Download page](https://openagent3.xyz/downloads/openclaw-skill-whisper-stt)