Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Create AI-powered podcasts with text-to-speech, music, and audio editing. Tools: Kokoro TTS, DIA TTS, Chatterbox, AI music generation, media merger. Capabili...
Create AI-powered podcasts with text-to-speech, music, and audio editing. Tools: Kokoro TTS, DIA TTS, Chatterbox, AI music generation, media merger. Capabili...
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Create AI-powered podcasts and audio content via inference.sh CLI.
curl -fsSL https://cli.inference.sh | sh && infsh login # Generate podcast segment infsh app run infsh/kokoro-tts --input '{ "text": "Welcome to the AI Frontiers podcast. Today we explore the latest developments in generative AI.", "voice": "am_michael" }' Install note: The install script only detects your OS/architecture, downloads the matching binary from dist.inference.sh, and verifies its SHA-256 checksum. No elevated permissions or background processes. Manual install & verification available.
Voice IDDescriptionBest Foraf_sarahAmerican female, warmHost, narratoraf_nicoleAmerican female, professionalNews, businessam_michaelAmerican male, authoritativeDocumentary, techam_adamAmerican male, conversationalCasual podcastbf_emmaBritish female, refinedAudiobooksbm_georgeBritish male, classicFormal content
Voice IDDescriptionBest Fordia-conversationalNatural conversationDialogue, interviews
Voice IDDescriptionBest Forchatterbox-defaultExpressiveCasual, entertainment
# Single voice podcast segment infsh app run infsh/kokoro-tts --input '{ "text": "Your podcast script here. Make it conversational and engaging. Add natural pauses with punctuation.", "voice": "am_michael" }'
# Host introduction infsh app run infsh/kokoro-tts --input '{ "text": "Welcome back to Tech Talk. Today I have a special guest to discuss AI developments.", "voice": "am_michael" }' > host_intro.json # Guest response infsh app run infsh/kokoro-tts --input '{ "text": "Thanks for having me. I am excited to share what we have been working on.", "voice": "af_sarah" }' > guest_response.json # Merge into conversation infsh app run infsh/media-merger --input '{ "audio_files": ["<host-url>", "<guest-url>"], "crossfade_ms": 500 }'
# 1. Generate script with Claude infsh app run openrouter/claude-sonnet-45 --input '{ "prompt": "Write a 5-minute podcast script about the impact of AI on creative work. Format as a two-person dialogue between HOST and GUEST. Include natural conversation, questions, and insights." }' > script.json # 2. Generate intro music infsh app run infsh/ai-music --input '{ "prompt": "Podcast intro music, upbeat, modern, tech feel, 15 seconds" }' > intro_music.json # 3. Generate host segments infsh app run infsh/kokoro-tts --input '{ "text": "<host-lines>", "voice": "am_michael" }' > host.json # 4. Generate guest segments infsh app run infsh/kokoro-tts --input '{ "text": "<guest-lines>", "voice": "af_sarah" }' > guest.json # 5. Generate outro music infsh app run infsh/ai-music --input '{ "prompt": "Podcast outro music, matching intro style, fade out, 10 seconds" }' > outro_music.json # 6. Merge everything infsh app run infsh/media-merger --input '{ "audio_files": [ "<intro-music>", "<host>", "<guest>", "<outro-music>" ], "crossfade_ms": 1000 }'
Generate podcast-style discussions from documents. # 1. Extract key points infsh app run openrouter/claude-sonnet-45 --input '{ "prompt": "Read this document and create a podcast script where two hosts discuss the key points in an engaging, conversational way. Include questions, insights, and natural dialogue.\n\nDocument:\n<your-document-content>" }' > discussion_script.json # 2. Generate Host A infsh app run infsh/kokoro-tts --input '{ "text": "<host-a-lines>", "voice": "am_michael" }' > host_a.json # 3. Generate Host B infsh app run infsh/kokoro-tts --input '{ "text": "<host-b-lines>", "voice": "af_sarah" }' > host_b.json # 4. Interleave and merge infsh app run infsh/media-merger --input '{ "audio_files": ["<host-a-1>", "<host-b-1>", "<host-a-2>", "<host-b-2>"], "crossfade_ms": 300 }'
# Long-form narration infsh app run infsh/kokoro-tts --input '{ "text": "Chapter One. It was a dark and stormy night when the first AI achieved consciousness...", "voice": "bf_emma", "speed": 0.9 }'
# 1. Generate podcast audio infsh app run infsh/kokoro-tts --input '{ "text": "<podcast-script>", "voice": "am_michael" }' > podcast.json # 2. Generate ambient music infsh app run infsh/ai-music --input '{ "prompt": "Soft ambient background music for podcast, subtle, non-distracting, loopable" }' > background.json # 3. Mix with lower background volume infsh app run infsh/media-merger --input '{ "audio_files": ["<podcast-url>"], "background_audio": "<background-url>", "background_volume": 0.15 }'
# Transition sounds between segments infsh app run infsh/ai-music --input '{ "prompt": "Short podcast transition sound, whoosh, 2 seconds" }' > transition.json
infsh app run openrouter/claude-sonnet-45 --input '{ "prompt": "Write a podcast script with these requirements: - Topic: [YOUR TOPIC] - Duration: 5 minutes (about 750 words) - Format: Two hosts (HOST_A and HOST_B) - Tone: Conversational, informative, engaging - Include: Hook intro, 3 main points, call to action - Mark speaker changes clearly Make it sound natural, not scripted. Add verbal fillers like \"you know\" and \"I mean\" occasionally." }'
HOST: Introduction and welcome GUEST: Thank you, happy to be here HOST: First question about background GUEST: Response with story HOST: Follow-up question GUEST: Deeper insight ... continue pattern ... HOST: Closing question GUEST: Final thoughts HOST: Thank you and outro
Introduction with hook Topic overview Point 1 with examples Point 2 with examples Point 3 with examples Summary and takeaways Call to action Outro
Intro music Welcome and date Story 1: headline + details Story 2: headline + details Story 3: headline + details Analysis/opinion segment Outro
Natural punctuation - Use commas and periods for pacing Short sentences - Easier to speak and listen Varied voices - Different speakers prevent monotony Background music - Subtle, at 10-15% volume Crossfades - Smooth transitions between segments Edit scripts - Remove filler before generating
# Text-to-speech models npx skills add inference-sh/skills@text-to-speech # AI music generation npx skills add inference-sh/skills@ai-music-generation # LLM for scripts npx skills add inference-sh/skills@llm-models # Content pipelines npx skills add inference-sh/skills@ai-content-pipeline # Full platform skill npx skills add inference-sh/skills@inference-sh Browse all apps: infsh app list --category audio
Messaging, meetings, inboxes, CRM, and teammate communication surfaces.
Largest current source with strong distribution and engagement signals.