Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Voice design workflows with Alibaba Cloud Model Studio Qwen TTS VD models. Use when creating custom synthetic voices from text descriptions and using them for speech synthesis.
Voice design workflows with Alibaba Cloud Model Studio Qwen TTS VD models. Use when creating custom synthetic voices from text descriptions and using them for speech synthesis.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Use voice design models to create controllable synthetic voices from natural language descriptions.
Use one of these exact model strings: qwen3-tts-vd-2026-01-26 qwen3-tts-vd-realtime-2026-01-15
Install SDK in a virtual environment: python3 -m venv .venv . .venv/bin/activate python -m pip install dashscope Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.
voice_prompt (string, required) target voice description text (string, required) stream (bool, optional)
audio_url (string) or streaming PCM chunks voice_id (string) request_id (string)
Write voice prompts with tone, pace, emotion, and timbre constraints. Build a reusable voice prompt library for product consistency. Validate generated voice in short utterances before long scripts.
Prepare a normalized request JSON and validate response schema: .venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/prepare_voice_design_request.py \ --voice-prompt "A warm female host voice, clear articulation, medium pace" \ --text "This is a voice-design demo"
Default output: output/ai-audio-tts-voice-design/audio/ Override base dir with OUTPUT_DIR.
mkdir -p output/alicloud-ai-audio-tts-voice-design for f in skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/*.py; do python3 -m py_compile "$f" done echo "py_compile_ok" > output/alicloud-ai-audio-tts-voice-design/validate.txt Pass criteria: command exits 0 and output/alicloud-ai-audio-tts-voice-design/validate.txt is generated.
Save artifacts, command outputs, and API response summaries under output/alicloud-ai-audio-tts-voice-design/. Include key parameters (region/resource id/time range) in evidence files for reproducibility.
Confirm user intent, region, identifiers, and whether the operation is read-only or mutating. Run one minimal read-only query first to verify connectivity and permissions. Execute the target operation with explicit parameters and bounded scope. Verify results and save output/evidence files.
references/sources.md
Agent frameworks, memory systems, reasoning layers, and model-native orchestration.
Largest current source with strong distribution and engagement signals.