# Send qwenspeak to your agent

Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.

## Fast path

- Download the package from Yavira.
- Extract it into a folder your agent can access.
- Paste one of the prompts below and point your agent at the extracted folder.

## Suggested prompts

### New install

```text
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
```

### Upgrade existing

```text
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
```

## Machine-readable fields

```json
{
  "schemaVersion": "1.0",
  "item": {
    "slug": "qwenspeak",
    "name": "qwenspeak",
    "source": "tencent",
    "type": "skill",
    "category": "AI 智能",
    "sourceUrl": "https://clawhub.ai/psyb0t/qwenspeak",
    "canonicalUrl": "https://clawhub.ai/psyb0t/qwenspeak",
    "targetPlatform": "OpenClaw"
  },
  "install": {
    "downloadUrl": "/downloads/qwenspeak",
    "sourceDownloadUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=qwenspeak",
    "sourcePlatform": "tencent",
    "targetPlatform": "OpenClaw",
    "packageFormat": "ZIP package",
    "primaryDoc": "SKILL.md",
    "includedAssets": [
      "SKILL.md",
      "references/setup.md",
      "scripts/qwenspeak.sh"
    ],
    "downloadMode": "redirect",
    "sourceHealth": {
      "source": "tencent",
      "status": "healthy",
      "reason": "direct_download_ok",
      "recommendedAction": "download",
      "checkedAt": "2026-04-30T16:55:25.780Z",
      "expiresAt": "2026-05-07T16:55:25.780Z",
      "httpStatus": 200,
      "finalUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=network",
      "contentType": "application/zip",
      "probeMethod": "head",
      "details": {
        "probeUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=network",
        "contentDisposition": "attachment; filename=\"network-1.0.0.zip\"",
        "redirectLocation": null,
        "bodySnippet": null
      },
      "scope": "source",
      "summary": "Source download looks usable.",
      "detail": "Yavira can redirect you to the upstream package for this source.",
      "primaryActionLabel": "Download for OpenClaw",
      "primaryActionHref": "/downloads/qwenspeak"
    },
    "validation": {
      "installChecklist": [
        "Use the Yavira download entry.",
        "Review SKILL.md after the package is downloaded.",
        "Confirm the extracted package contains the expected setup assets."
      ],
      "postInstallChecks": [
        "Confirm the extracted package includes the expected docs or setup files.",
        "Validate the skill or prompts are available in your target agent workspace.",
        "Capture any manual follow-up steps the agent could not complete."
      ]
    }
  },
  "links": {
    "detailUrl": "https://openagent3.xyz/skills/qwenspeak",
    "downloadUrl": "https://openagent3.xyz/downloads/qwenspeak",
    "agentUrl": "https://openagent3.xyz/skills/qwenspeak/agent",
    "manifestUrl": "https://openagent3.xyz/skills/qwenspeak/agent.json",
    "briefUrl": "https://openagent3.xyz/skills/qwenspeak/agent.md"
  }
}
```

## Documentation

### qwenspeak

YAML-driven text-to-speech over SSH using Qwen3-TTS models. For installation and deployment, see `references/setup.md`.

### SSH Wrapper

Use `scripts/qwenspeak.sh` for all commands. It handles host, port, and host key acceptance via the `QWENSPEAK_HOST` and `QWENSPEAK_PORT` env vars.

```shell
scripts/qwenspeak.sh [args]
scripts/qwenspeak.sh < input_file
scripts/qwenspeak.sh > output_file
```

### TTS Generation

Submit YAML, get a job UUID back immediately, then poll for progress. Jobs run sequentially: one at a time, while the rest queue up.
```shell
# Get the YAML template
scripts/qwenspeak.sh "tts print-yaml" > job.yaml

# Submit job
scripts/qwenspeak.sh "tts" < job.yaml
# {"id": "550e8400-...", "status": "queued", "total_steps": 3, "total_generations": 7}

# Check progress
scripts/qwenspeak.sh "tts get-job 550e8400"

# Follow job log
scripts/qwenspeak.sh "tts get-job-log 550e8400 -f"

# Download result
scripts/qwenspeak.sh "get hello.wav" > hello.wav
```

### YAML Structure

A job file holds global settings plus a list of steps. Each step loads a model, runs all its generations, then unloads. Settings cascade: global > step > generation. In the example below, the first step sets `speaker: Ryan`, and its second generation sets `speaker: Vivian` for that one line.

```yaml
steps:
  - mode: custom-voice
    model_size: 1.7b
    speaker: Ryan
    language: English
    generate:
      - text: "Hello world"
        output: hello.wav
      - text: "I cannot believe this!"
        speaker: Vivian
        instruct: "Speak angrily"
        output: angry.wav

  - mode: voice-design
    generate:
      - text: "Welcome to our store."
        instruct: "A warm, friendly young female voice with a cheerful tone"
        output: welcome.wav

  - mode: voice-clone
    model_size: 1.7b
    ref_audio: ref.wav
    ref_text: "Transcript of reference"
    generate:
      - text: "First line in cloned voice"
        output: clone1.wav
      - text: "Second line"
        output: clone2.wav
```

### Modes

- `custom-voice`: Pick from 9 preset speakers. 1.7B supports emotion/style via `instruct`.
- `voice-design`: Describe the voice in natural language via `instruct`. 1.7B only.
- `voice-clone`: Clone from reference audio. Set `ref_audio` and `ref_text` at step level to reuse them across generations. `x_vector_only: true` skips the transcript.

### Emotion trick for cloned voices

Upload reference recordings with different emotions, then use a separate step for each:

```shell
scripts/qwenspeak.sh "create-dir refs"
scripts/qwenspeak.sh "put refs/happy.wav" < me_happy.wav
scripts/qwenspeak.sh "put refs/angry.wav" < me_angry.wav
```

```yaml
steps:
  - mode: voice-clone
    ref_audio: refs/happy.wav
    ref_text: "transcript of happy ref"
    generate:
      - text: "Great news everyone!"
        output: happy1.wav

  - mode: voice-clone
    ref_audio: refs/angry.wav
    ref_text: "transcript of angry ref"
    generate:
      - text: "This is unacceptable"
        output: angry1.wav
```

### Job Management

```shell
scripts/qwenspeak.sh "tts list-jobs"         # list all
scripts/qwenspeak.sh "tts list-jobs --json"  # JSON output
scripts/qwenspeak.sh "tts get-job "          # job details
scripts/qwenspeak.sh "tts get-job-log "      # view log
scripts/qwenspeak.sh "tts get-job-log -f"    # follow log
scripts/qwenspeak.sh "tts cancel-job "       # cancel
```

The per-job commands take the job UUID as their argument, as in `tts get-job 550e8400`. UUID prefixes work (e.g. the first 8 chars).

Statuses: `queued → running → completed | failed | cancelled`

Completed jobs are cleaned up automatically after 1 day, and all jobs after 1 week.

### File Operations

All paths are relative to the work directory. Path traversal is blocked. Path arguments follow the command name, as in `get hello.wav` or `create-dir refs`.

| Command | Description |
| --- | --- |
| `put` | Upload file from stdin |
| `get` | Download file to stdout |
| `list-files [--json]` | List directory |
| `remove-file` | Delete a file |
| `create-dir` | Create directory |
| `remove-dir` | Remove empty directory |
| `move-file` | Move or rename |
| `copy-file` | Copy a file |
| `file-exists` | Check if file exists (true/false) |
| `search-files` | Glob search (`**` recursive) |

### Speakers

| Speaker | Gender | Language | Description |
| --- | --- | --- | --- |
| Vivian | Female | Chinese | Bright, slightly edgy young voice |
| Serena | Female | Chinese | Warm, gentle young voice |
| Uncle_Fu | Male | Chinese | Seasoned, low mellow timbre |
| Dylan | Male | Chinese | Youthful Beijing dialect, clear natural timbre |
| Eric | Male | Chinese | Lively Chengdu/Sichuan dialect, slightly husky |
| Ryan | Male | English | Dynamic with strong rhythmic drive |
| Aiden | Male | English | Sunny American, clear midrange |
| Ono_Anna | Female | Japanese | Playful, light nimble timbre |
| Sohee | Female | Korean | Warm with rich emotion |

### YAML Options

All settings cascade: global > step > generation.
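
The cascade can be sketched as a small job file written from the shell. This is an illustration under stated assumptions, not something taken from the package: it assumes global settings are top-level keys beside `steps` (the source mentions global settings but does not show where they go), that `temperature` may also be set at step level, and that the more specific level wins. The file name `cascade-job.yaml` and the output names are hypothetical.

```shell
# Write a hypothetical job file that sets values at all three levels.
cat > cascade-job.yaml <<'EOF'
# Global level (assumed placement: top-level keys beside `steps`)
temperature: 0.9
language: English

steps:
  - mode: custom-voice
    speaker: Ryan        # step level: default for every generation in this step
    temperature: 0.7     # assumed to override the global 0.9 for this step
    generate:
      - text: "Spoken with the step's settings."
        output: cascade_step.wav
      - text: "Spoken with a different speaker."
        speaker: Vivian  # generation level: assumed to replace Ryan for this line only
        output: cascade_generation.wav
EOF
```

If those assumptions hold, the file can be submitted with the documented `scripts/qwenspeak.sh "tts" < cascade-job.yaml`. Comparing it against the template printed by `tts print-yaml` is probably the safest way to confirm where global keys belong.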

| Field | Default | Description |
| --- | --- | --- |
| `dtype` | float32 | float32, float16, bfloat16 (float16/bfloat16 GPU only) |
| `flash_attn` | auto | FlashAttention-2: auto-detects, auto-switches float32→bfloat16 |
| `temperature` | 0.9 | Sampling temperature |
| `top_k` | 50 | Top-k sampling |
| `top_p` | 1.0 | Top-p / nucleus sampling |
| `repetition_penalty` | 1.05 | Repetition penalty |
| `max_new_tokens` | 2048 | Max codec tokens to generate |
| `no_sample` | false | Greedy decoding |
| `streaming` | false | Streaming mode (lower latency) |
| `mode` | required | Step only: custom-voice, voice-design, or voice-clone |
| `model_size` | 1.7b | Step only: 1.7b or 0.6b |
| `text` | required | Text to synthesize |
| `output` | required | Output file path |
| `speaker` | Vivian | custom-voice: speaker name |
| `language` | Auto | Language for synthesis |
| `instruct` | - | custom-voice: emotion/style; voice-design: voice description |
| `ref_audio` | - | voice-clone: reference audio file path |
| `ref_text` | - | voice-clone: transcript of reference audio |
| `x_vector_only` | false | voice-clone: use speaker embedding only |

## Trust

- Source: tencent
- Verification: Indexed source record
- Publisher: psyb0t
- Version: 1.5.0

## Source health

- Status: healthy
- Source download looks usable.
- Yavira can redirect you to the upstream package for this source.
- Health scope: source
- Reason: direct_download_ok
- Checked at: 2026-04-30T16:55:25.780Z
- Expires at: 2026-05-07T16:55:25.780Z
- Recommended action: Download for OpenClaw

## Links

- [Detail page](https://openagent3.xyz/skills/qwenspeak)
- [Send to Agent page](https://openagent3.xyz/skills/qwenspeak/agent)
- [JSON manifest](https://openagent3.xyz/skills/qwenspeak/agent.json)
- [Markdown brief](https://openagent3.xyz/skills/qwenspeak/agent.md)
- [Download page](https://openagent3.xyz/downloads/qwenspeak)