Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Build multi-step AI content creation pipelines combining image, video, audio, and text. Workflow examples: generate image -> animate -> add voiceover -> merg...
Build multi-step AI content creation pipelines combining image, video, audio, and text. Workflow examples: generate image -> animate -> add voiceover -> merg...
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Build multi-step content creation pipelines via inference.sh CLI.
curl -fsSL https://cli.inference.sh | sh && infsh login # Simple pipeline: Generate image -> Animate to video infsh app run falai/flux-dev --input '{"prompt": "portrait of a woman smiling"}' > image.json infsh app run falai/wan-2-5 --input '{"image_url": "<url-from-previous>"}' Install note: The install script only detects your OS/architecture, downloads the matching binary from dist.inference.sh, and verifies its SHA-256 checksum. No elevated permissions or background processes. Manual install & verification available.
[FLUX Image] -> [Wan 2.5 Video] -> [Foley Sound]
[LLM Script] -> [Kokoro TTS] -> [OmniHuman Avatar]
[Tavily Search] -> [Claude Summary] -> [FLUX Visual] -> [Twitter Post]
Create a complete short-form video from a topic. # 1. Generate script with Claude infsh app run openrouter/claude-sonnet-45 --input '{ "prompt": "Write a 30-second script about the future of AI. Make it engaging and conversational. Just the script, no stage directions." }' > script.json # 2. Generate voiceover with Kokoro infsh app run infsh/kokoro-tts --input '{ "text": "<script-text>", "voice": "af_sarah" }' > voice.json # 3. Generate background image with FLUX infsh app run falai/flux-dev --input '{ "prompt": "Futuristic city skyline at sunset, cyberpunk aesthetic, 4K wallpaper" }' > background.json # 4. Animate image to video with Wan infsh app run falai/wan-2-5 --input '{ "image_url": "<background-url>", "prompt": "slow camera pan across cityscape, subtle movement" }' > video.json # 5. Add captions (manually or with another tool) # 6. Merge video with audio infsh app run infsh/media-merger --input '{ "video_url": "<video-url>", "audio_url": "<voice-url>" }'
Create an AI avatar presenting content. # 1. Write the script infsh app run openrouter/claude-sonnet-45 --input '{ "prompt": "Write a 1-minute explainer script about quantum computing for beginners." }' > script.json # 2. Generate speech infsh app run infsh/kokoro-tts --input '{ "text": "<script>", "voice": "am_michael" }' > speech.json # 3. Generate or use a portrait image infsh app run falai/flux-dev --input '{ "prompt": "Professional headshot of a friendly tech presenter, neutral background, looking at camera" }' > portrait.json # 4. Create talking head video infsh app run bytedance/omnihuman-1-5 --input '{ "image_url": "<portrait-url>", "audio_url": "<speech-url>" }' > talking_head.json
Create a product showcase video. # 1. Generate product image infsh app run falai/flux-dev --input '{ "prompt": "Sleek wireless earbuds on white surface, studio lighting, product photography" }' > product.json # 2. Animate product reveal infsh app run falai/wan-2-5 --input '{ "image_url": "<product-url>", "prompt": "slow 360 rotation, smooth motion" }' > product_video.json # 3. Upscale video quality infsh app run falai/topaz-video-upscaler --input '{ "video_url": "<product-video-url>" }' > upscaled.json # 4. Add background music infsh app run infsh/media-merger --input '{ "video_url": "<upscaled-url>", "audio_url": "https://your-music.mp3", "audio_volume": 0.3 }'
Convert written content to video format. # 1. Summarize blog post infsh app run openrouter/claude-haiku-45 --input '{ "prompt": "Summarize this blog post into 5 key points for a video script: <blog-content>" }' > summary.json # 2. Generate images for each point for i in 1 2 3 4 5; do infsh app run falai/flux-dev --input "{ \"prompt\": \"Visual representing point $i: <point-text>\" }" > "image_$i.json" done # 3. Animate each image for i in 1 2 3 4 5; do infsh app run falai/wan-2-5 --input "{ \"image_url\": \"<image-$i-url>\" }" > "video_$i.json" done # 4. Generate voiceover infsh app run infsh/kokoro-tts --input '{ "text": "<full-script>", "voice": "bf_emma" }' > narration.json # 5. Merge all clips infsh app run infsh/media-merger --input '{ "videos": ["<video1>", "<video2>", "<video3>", "<video4>", "<video5>"], "audio_url": "<narration-url>", "transition": "crossfade" }'
StepAppPurposeScriptopenrouter/claude-sonnet-45Write contentResearchtavily/search-assistantGather informationSummaryopenrouter/claude-haiku-45Condense content
StepAppPurposeImagefalai/flux-devGenerate imagesImagegoogle/imagen-3Alternative image genUpscalefalai/topaz-image-upscalerEnhance quality
StepAppPurposeI2Vfalai/wan-2-5Animate imagesT2Vgoogle/veo-3-1-fastGenerate from textAvatarbytedance/omnihuman-1-5Talking heads
StepAppPurposeTTSinfsh/kokoro-ttsVoice narrationMusicinfsh/ai-musicBackground musicFoleyinfsh/hunyuanvideo-foleySound effects
StepAppPurposeUpscalefalai/topaz-video-upscalerEnhance videoMergeinfsh/media-mergerCombine mediaCaptioninfsh/caption-videoAdd subtitles
Plan the pipeline first - Map out each step before running Save intermediate results - Store outputs for iteration Use appropriate quality - Fast models for drafts, quality for finals Match resolutions - Keep consistent aspect ratios throughout Test each step - Verify outputs before proceeding
# Video generation models npx skills add inference-sh/skills@ai-video-generation # Image generation npx skills add inference-sh/skills@ai-image-generation # Text-to-speech npx skills add inference-sh/skills@text-to-speech # LLM models for scripts npx skills add inference-sh/skills@llm-models # Full platform skill npx skills add inference-sh/skills@inference-sh Browse all apps: infsh app list
Content Pipeline Example - Official pipeline guide Building Workflows - Workflow best practices
Writing, remixing, publishing, visual generation, and marketing content production.
Largest current source with strong distribution and engagement signals.