Requirements
- Target platform: OpenClaw
- Install method: Manual import
- Extraction: Extract archive
- Prerequisites: OpenClaw
- Primary doc: SKILL.md
Generate human-like speech audio with Model Studio DashScope Qwen TTS models (qwen3-tts-flash, qwen3-tts-instruct-flash). Use when converting text to speech,...
Hand the extracted package to your coding agent with a concrete install brief instead of working through it manually.

Install brief:
"I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete."

Upgrade brief:
"I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run."
Validation:

```bash
mkdir -p output/alicloud-ai-audio-tts
python -m py_compile skills/ai/audio/alicloud-ai-audio-tts/scripts/generate_tts.py && echo "py_compile_ok" > output/alicloud-ai-audio-tts/validate.txt
```

Pass criteria: the command exits 0 and output/alicloud-ai-audio-tts/validate.txt is generated.
Save generated audio links, sample audio files, and request payloads to output/alicloud-ai-audio-tts/. Keep one validation log per execution.
Use one of the recommended models:
- qwen3-tts-flash
- qwen3-tts-instruct-flash
- qwen3-tts-instruct-flash-2026-01-26
Install the SDK (a venv is recommended to avoid PEP 668 restrictions):

```bash
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
```

Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials (the environment variable takes precedence).
Request parameters:
- text (string, required)
- voice (string, required)
- language_type (string, optional; default Auto)
- instruction (string, optional; recommended for instruct models)
- stream (bool, optional; default false)
Response fields:
- audio_url (string, when stream=false)
- audio_base64_pcm (string, when stream=true)
- sample_rate (int, 24000)
- format (string, wav or pcm depending on mode)
Example (non-streaming call):

```python
import os

import dashscope

# Prefer env var for auth: export DASHSCOPE_API_KEY=...
# Or use ~/.alibabacloud/credentials with dashscope_api_key under [default].
# Beijing region; for Singapore use: https://dashscope-intl.aliyuncs.com/api/v1
dashscope.base_http_api_url = "https://dashscope.aliyuncs.com/api/v1"

text = "Hello, this is a short voice line."
response = dashscope.MultiModalConversation.call(
    model="qwen3-tts-instruct-flash",
    api_key=os.getenv("DASHSCOPE_API_KEY"),
    text=text,
    voice="Cherry",
    language_type="English",
    instruction="Warm and calm tone, slightly slower pace.",
    stream=False,
)
audio_url = response.output.audio.url
print(audio_url)
```
stream=True returns Base64-encoded PCM chunks at 24 kHz. Decode the chunks and play them as they arrive, or concatenate them into a single PCM buffer. The response contains finish_reason == "stop" when the stream ends. A sketch for assembling decoded chunks into a playable file follows.
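As a minimal standard-library sketch: it assumes 16-bit mono PCM, which is common for 24 kHz TTS output but is not stated in this skill's docs, so verify against references/api_reference.md before relying on it.

```python
import base64
import wave

def pcm_chunks_to_wav(chunks_b64: list[str], path: str, sample_rate: int = 24000) -> None:
    """Join Base64-encoded PCM chunks into a single playable WAV file."""
    pcm = b"".join(base64.b64decode(chunk) for chunk in chunks_b64)
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(1)            # assumption: mono output
        wav_file.setsampwidth(2)            # assumption: 16-bit signed PCM
        wav_file.setframerate(sample_rate)  # 24000 per the response fields above
        wav_file.writeframes(pcm)
```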
Best practices:
- Keep requests concise; split long text into multiple calls if you hit size or timeout errors.
- Use a language_type consistent with the text to improve pronunciation.
- Use instruction only when you need explicit style/tone control.
- Cache by (text, voice, language_type) to avoid repeat costs (see the sketch after this list).
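One way to implement the caching tip; this helper is illustrative, not part of the skill, and the key fields simply follow the tuple named above:

```python
import hashlib
import json

def tts_cache_key(text: str, voice: str, language_type: str = "Auto") -> str:
    """Deterministic key so identical (text, voice, language_type) requests reuse cached audio."""
    payload = json.dumps(
        {"text": text, "voice": voice, "language_type": language_type},
        sort_keys=True,
        ensure_ascii=False,
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()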
Default output: output/alicloud-ai-audio-tts/audio/
Override the base directory with OUTPUT_DIR.
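A sketch of how a script might honor that override, assuming OUTPUT_DIR is an environment variable (the skill's own generate_tts.py may resolve it differently):

```python
import os
from pathlib import Path

# Resolve the audio output directory, honoring an OUTPUT_DIR override.
base_dir = Path(os.environ.get("OUTPUT_DIR", "output/alicloud-ai-audio-tts"))
audio_dir = base_dir / "audio"
audio_dir.mkdir(parents=True, exist_ok=True)
```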
Workflow:
1. Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
2. Run one minimal read-only query first to verify connectivity and permissions.
3. Execute the target operation with explicit parameters and bounded scope.
4. Verify results and save output/evidence files.
Related resources:
- references/api_reference.md for parameter mapping and the streaming example.
- Realtime mode: skills/ai/audio/alicloud-ai-audio-tts-realtime/
- Voice cloning/design: skills/ai/audio/alicloud-ai-audio-tts-voice-clone/ and skills/ai/audio/alicloud-ai-audio-tts-voice-design/
- Source list: references/sources.md