Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
A RESTful service for high-quality text-to-speech using Qwen3 and specialized voice cloning. Optimized for reusing a specific voice prompt to avoid re-computation.
A RESTful service for high-quality text-to-speech using Qwen3 and specialized voice cloning. Optimized for reusing a specific voice prompt to avoid re-computation.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
This skill provides a FastAPI-based REST service for Qwen3 TTS, specifically configured for reusing a high-quality reference audio prompt for efficient and consistent voice cloning. This service is packaged as an installable CLI.
Prerequisites: python >= 3.10. pip install -e .
The service runs on port 9090 by default. # Start the server (runs in foreground, use & for background or a separate terminal) # Optional: Uudate to your own reference audio and text for voice cloning chichi-speech --port 9090 --host 127.0.0.1 --ref-audio "https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen3-TTS-Repo/clone_2.wav" --ref-text "Okay. Yeah. I resent you. I love you. I respect you. But you know what? You blew it! And thanks to you."
Check the health/docs: curl http://localhost:9090/docs
Use cURL: curl -X POST "http://localhost:9090/synthesize" \ -H "Content-Type: application/json" \ -d '{ "text": "Nice to meet you", "language": "English" }' \ --output output/nice_to_meet.wav
Endpoint: POST /synthesize Default Port: 9090 Voice Cloning: Uses a pre-computed voice prompt from reference files to ensure the cloned voice is consistent and generation is fast.
Python 3.10+ qwen-tts (Qwen3 model library) Access to a reference audio file for voice cloning. By default, it uses public sample audio from Qwen3. CRITICAL: You can provide your own reference audio using the --ref-audio and --ref-text flags.
Agent frameworks, memory systems, reasoning layers, and model-native orchestration.
Largest current source with strong distribution and engagement signals.