Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Original music, fully yours. 5 seconds to 10 minutes using frontier music generation models. Instrumental and vocal tracks with perfect vocals. Cinematic scores, background tracks, podcast intros, game soundtracks, ambient soundscapes, jingles, lo-fi beats, orchestral compositions, songs with lyrics.
Original music, fully yours. 5 seconds to 10 minutes using frontier music generation models. Instrumental and vocal tracks with perfect vocals. Cinematic scores, background tracks, podcast intros, game soundtracks, ambient soundscapes, jingles, lo-fi beats, orchestral compositions, songs with lyrics.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Original music, fully yours. No licensing, no attribution, no fees. 5 seconds to 10 minutes using frontier music generation models. Instrumental and vocal tracks with perfect vocals. Every track generated is royalty-free and 100% yours to use commercially โ YouTube, podcasts, apps, games, ads, films, streaming. No strings attached.
This skill requires the cellcog skill for SDK setup and API calls. clawhub install cellcog Read the cellcog skill first for SDK setup. This skill shows you what's possible. Quick pattern (v1.0+): result = client.create_chat( prompt="[your music request]", notify_session_key="agent:main:main", task_label="music-creation", chat_mode="agent" )
Just describe what you want. The frontier model handles the rest โ genre, arrangement, instrumentation, dynamics, and even lyrics: "Compose a 90-second cinematic score. Start with solo piano, layer in strings at 30 seconds, build to a full orchestral swell, then resolve softly. Mood: bittersweet turning hopeful." "Create a 3-minute lo-fi hip-hop track with soft piano, vinyl crackle, and mellow drums. 75 BPM. Study vibes." "Write a 2-minute upbeat pop song with female vocals about starting fresh on a Monday morning. Catchy chorus, feel-good energy." The model is exceptionally sophisticated โ it handles any genre, genre fusion, songs with lyrics, complex arrangements, and mood transitions from a simple description.
Only use this when you need exact section durations โ for example, syncing music to specific video segments or presentation slides: "I need music that syncs with my video: Intro: exactly 10 seconds, soft ambient Build: exactly 20 seconds, energy rising Climax: exactly 15 seconds, full orchestra Outro: exactly 10 seconds, gentle fade" This mode gives precise timing control per section but should only be used when timing accuracy matters for syncing with other media.
TypeExampleCinematic scoresEpic orchestral, tense thriller, emotional piano, sci-fi ambientBackground tracksLo-fi beats, corporate background, cafe jazz, ambient soundscapesPodcast intros/outros5-10 second branded stings, transitions, bumpersGame soundtracksBattle themes, exploration music, boss fights, menu themesJinglesAd jingles, notification sounds, reveal stingersAmbientMeditation, nature soundscapes, focus music
CellCog generates songs with perfect AI vocals โ just describe the lyrical theme: TypeExamplePop songsCatchy hooks, verse-chorus structure, radio-readyBalladsEmotional, piano-driven, storytellingHip-hop/RapRhythmic vocals, beats, flowRockGuitar-driven, powerful vocalsR&B/SoulSmooth, melodic, groove
ParameterRangeDuration5 seconds to 10 minutesOutputMP3 (44.1kHz, 128kbps)VocalsInstrumental or with AI vocalsLicensingRoyalty-free, fully yours, no attribution
Use chat_mode="agent" for music generation. Music executes well in agent mode.
Cinematic score: "Compose a 2-minute cinematic score for a nature documentary finale. Begin with solo cello (melancholic), layer in strings and piano at 40 seconds, build to a hopeful orchestral swell, resolve with gentle piano. Think Planet Earth meets Interstellar." Lo-fi background: "Create 5 minutes of lo-fi study beats. Soft piano, mellow drums, vinyl crackle, gentle bass. 75 BPM. Warm and unobtrusive โ good for focus." Podcast intro + outro: "Create a podcast intro (8 seconds) and outro (6 seconds). Show is a tech startup podcast. Intro: energetic, modern electronic with a hook. Outro: same vibe but mellower wind-down. Should feel like the same show." Song with vocals: "Write a 3-minute upbeat indie pop song with female vocals. Theme: the excitement of moving to a new city. Catchy chorus, acoustic guitar foundation, builds with drums and synth. Feel-good, sing-along energy." Game soundtrack: "Compose a 2-minute boss battle theme for a fantasy RPG. Intense orchestral with choir, driving percussion, escalating tension. Think Dark Souls meets Final Fantasy."
Describe the feeling, not just the genre: "Music that makes a startup pitch feel like the future" works better than "electronic music." Specify duration: "45 seconds" vs "3 minutes" changes composition structure significantly. Reference moods, not copyrighted songs: "Hans Zimmer-style epic" and "ChilledCow lo-fi vibes" work well. Do not reference specific copyrighted songs. For vocals: Set the lyrical theme and mood. The model writes lyrics that fit. Or provide specific lyrics you want sung. Energy arc matters: "Starts quiet, builds at midpoint, resolves softly" gives clear compositional structure. For video background music: If the music is for a CellCog video, mention it in your video prompt instead โ CellCog handles music as part of video production automatically.
Writing, remixing, publishing, visual generation, and marketing content production.
Largest current source with strong distribution and engagement signals.