Requirements
- Target platform: OpenClaw
- Install method: Manual import
- Extraction: Extract archive
- Prerequisites: OpenClaw
- Primary doc: SKILL.md
Music songwriting guide for ACE-Step. Provides professional knowledge on writing captions, lyrics, choosing BPM/key/duration, and structuring songs. Use this skill when users want to create, write, or plan a song before generating it with ACE-Step.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Professional music creation knowledge for writing captions, lyrics, and choosing music parameters for ACE-Step.
After using this guide, produce the following for the acestep skill:
- Caption (-c): Style/genre/instruments/emotion description
- Lyrics (-l): Complete structured lyrics with tags
- Parameters: --duration, --bpm, --key, --time-signature, --language
Caption is the most important factor affecting generated music. Supports multiple formats: simple style words, comma-separated tags, complex natural language descriptions.
| Dimension | Examples |
| --- | --- |
| Style/Genre | pop, rock, jazz, electronic, hip-hop, R&B, folk, classical, lo-fi, synthwave |
| Emotion/Atmosphere | melancholic, uplifting, energetic, dreamy, dark, nostalgic, euphoric, intimate |
| Instruments | acoustic guitar, piano, synth pads, 808 drums, strings, brass, electric bass |
| Timbre Texture | warm, bright, crisp, muddy, airy, punchy, lush, raw, polished |
| Era Reference | 80s synth-pop, 90s grunge, 2010s EDM, vintage soul, modern trap |
| Production Style | lo-fi, high-fidelity, live recording, studio-polished, bedroom pop |
| Vocal Characteristics | female vocal, male vocal, breathy, powerful, falsetto, raspy, choir |
| Speed/Rhythm | slow tempo, mid-tempo, fast-paced, groovy, driving, laid-back |
| Structure Hints | building intro, catchy chorus, dramatic bridge, fade-out ending |
- Specific beats vague: "sad piano ballad with female breathy vocal" > "a sad song"
- Combine multiple dimensions: style + emotion + instruments + timbre anchors the direction precisely
- Use references well: "in the style of 80s synthwave" conveys a complex aesthetic quickly
- Texture words are useful: warm, crisp, airy, punchy influence mixing and timbre
- Don't pursue perfection: the Caption is a starting point; iterate based on results
- Granularity determines freedom: less detail = more model creativity; more detail = more control
- Avoid conflicting words: "classical strings" + "hardcore metal" degrades output
  - Fix (repetition reinforcement): repeat the elements you want more of
  - Fix (conflict to evolution): "Start with soft strings, middle becomes metal rock, end turns to hip-hop"
- Don't put BPM/key/tempo in the Caption: use the dedicated parameters instead
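The caption guidelines above can be sketched in a few lines of Python. This is only an illustration: `build_caption` and `find_parameter_leaks` are hypothetical helpers, not part of ACE-Step, and the regular expression is a rough heuristic that flags numeric BPM values and key names while leaving descriptive words such as "slow tempo" alone.

```python
import re

# Assumed heuristic: explicit BPM values ("72 bpm") and key names
# ("A minor", "C# major") belong in dedicated parameters, not the caption.
_PARAM_PATTERN = re.compile(
    r"\b\d{2,3}\s*bpm\b|\b[A-G][#b]?\s+(?:major|minor)\b",
    re.IGNORECASE,
)


def build_caption(**dimensions: str) -> str:
    """Join non-empty dimension values into a comma-separated caption."""
    parts = [value.strip() for value in dimensions.values() if value.strip()]
    return ", ".join(parts)


def find_parameter_leaks(caption: str) -> list[str]:
    """Return caption fragments that look like BPM or key settings."""
    return [match.group(0) for match in _PARAM_PATTERN.finditer(caption)]


caption = build_caption(
    style="piano ballad",
    emotion="melancholic",
    vocal="female breathy vocal",
    timbre="warm",
)
print(caption)
print(find_parameter_leaks("sad piano ballad, 72 bpm, A minor"))
```

Keyword arguments keep each dimension visible at the call site, which makes it easy to see which dimensions a caption leaves to the model.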
Lyrics control how the music unfolds over time. They carry:
- The lyric text itself
- Structure tags ([Verse], [Chorus], [Bridge]...)
- Vocal style hints ([raspy vocal], [whispered]...)
- Instrumental sections ([guitar solo], [drum break]...)
- Energy changes ([building energy], [explosive drop]...)
| Category | Tag | Description |
| --- | --- | --- |
| Basic Structure | [Intro] | Opening, establish atmosphere |
| | [Verse] / [Verse 1] | Verse, narrative progression |
| | [Pre-Chorus] | Pre-chorus, build energy |
| | [Chorus] | Chorus, emotional climax |
| | [Bridge] | Bridge, transition or elevation |
| | [Outro] | Ending, conclusion |
| Dynamic Sections | [Build] | Energy gradually rising |
| | [Drop] | Electronic music energy release |
| | [Breakdown] | Reduced instrumentation, space |
| Instrumental | [Instrumental] | Pure instrumental, no vocals |
| | [Guitar Solo] | Guitar solo |
| | [Piano Interlude] | Piano interlude |
| Special | [Fade Out] | Fade out ending |
| | [Silence] | Silence |
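To make the role of structure tags concrete, here is a minimal sketch that splits a lyrics string into tagged sections. The `split_sections` helper is hypothetical and assumes each tag sits alone on its own line in square brackets; it says nothing about how ACE-Step itself parses lyrics.

```python
import re

# A tag line is assumed to be a single [bracketed] label on its own line.
SECTION_TAG = re.compile(r"^\[([^\]]+)\]\s*$")


def split_sections(lyrics: str) -> list[tuple[str, list[str]]]:
    """Group lyric lines under the most recent structure tag."""
    sections: list[tuple[str, list[str]]] = []
    for raw in lyrics.splitlines():
        line = raw.strip()
        if not line:
            continue  # blank lines only separate sections
        match = SECTION_TAG.match(line)
        if match:
            sections.append((match.group(1), []))
        elif sections:
            sections[-1][1].append(line)
        else:
            sections.append(("", [line]))  # text before any tag
    return sections


sample = """[Verse 1]
Walking home

[Chorus - anthemic]
WE RISE
We rise together (together)
[Guitar Solo]
"""
for tag, lines in split_sections(sample):
    print(tag, len(lines))
```

A section with zero lines, such as `[Guitar Solo]` here, corresponds to an instrumental passage.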
Use - for finer control, but keep it concise:
- Good: [Chorus - anthemic]
- Avoid: [Chorus - anthemic - stacked harmonies - high energy - powerful - epic]
Put complex style descriptions in the Caption, not in tags.
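A quick way to catch overloaded tags is to count the modifiers after the section name. The sketch below uses a hypothetical `overloaded_tags` helper; the default limit of one modifier is an assumption chosen to match the concise example above, not a documented ACE-Step rule.

```python
import re

TAG_PATTERN = re.compile(r"\[([^\]]+)\]")


def overloaded_tags(lyrics: str, max_modifiers: int = 1) -> list[str]:
    """Return tags carrying more than `max_modifiers` ' - ' separated modifiers."""
    flagged = []
    for match in TAG_PATTERN.finditer(lyrics):
        # Split on " - " so hyphenated names like Pre-Chorus stay intact.
        parts = [part.strip() for part in match.group(1).split(" - ")]
        if len(parts) - 1 > max_modifiers:
            flagged.append(match.group(0))
    return flagged


lyrics = (
    "[Pre-Chorus]\n"
    "[Chorus - anthemic]\n"
    "[Chorus - anthemic - stacked harmonies - high energy - powerful - epic]\n"
)
print(overloaded_tags(lyrics))
```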
Models are not good at resolving conflicts, so keep the Caption and the Lyrics consistent. Checklist:
- Instruments in Caption should match instrumental section tags in Lyrics
- Emotion in Caption should match energy tags in Lyrics
- Vocal description in Caption should match vocal control tags in Lyrics
| Tag | Effect |
| --- | --- |
| [raspy vocal] | Raspy, textured vocals |
| [whispered] | Whispered |
| [falsetto] | Falsetto |
| [powerful belting] | Powerful, high-pitched singing |
| [spoken word] | Rap/recitation |
| [harmonies] | Layered harmonies |
| [call and response] | Call and response |
| [ad-lib] | Improvised embellishments |
| Tag | Effect |
| --- | --- |
| [high energy] | High energy, passionate |
| [low energy] | Low energy, restrained |
| [building energy] | Increasing energy |
| [explosive] | Explosive energy |
| [melancholic] | Melancholic |
| [euphoric] | Euphoric |
| [dreamy] | Dreamy |
| [aggressive] | Aggressive |
- 6-10 syllables per line: the model aligns syllables to beats; keep similar counts for lines in the same position (±1-2)
- Uppercase = stronger intensity: WE ARE THE CHAMPIONS! (shouting) vs walking through the streets (normal)
- Parentheses = background vocals: We rise together (together)
- Extend vowels: Feeeling so aliiive (use cautiously, effects unstable)
- Clear section separation: blank lines between sections
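The 6-10 syllable guideline can be checked roughly in code. The sketch below is an assumption-laden approximation: `count_syllables` counts vowel groups and drops a silent final "e", which works only loosely and only for English, and all function names are hypothetical. Parenthesised background vocals are excluded from the count.

```python
import re


def count_syllables(word: str) -> int:
    """Rough English estimate: vowel groups, minus a silent final 'e'."""
    word = re.sub(r"[^a-z]", "", word.lower())
    if not word:
        return 0
    count = len(re.findall(r"[aeiouy]+", word))
    if word.endswith("e") and not word.endswith("le") and count > 1:
        count -= 1
    return max(count, 1)


def line_syllables(line: str) -> int:
    """Count syllables in the lead vocal, ignoring (background vocals)."""
    lead = re.sub(r"\([^)]*\)", "", line)
    return sum(count_syllables(word) for word in lead.split())


def out_of_range(lines: list[str], low: int = 6, high: int = 10) -> list[tuple[str, int]]:
    """Return (line, syllable_count) pairs outside the low..high range."""
    flagged = []
    for line in lines:
        total = line_syllables(line)
        if not low <= total <= high:
            flagged.append((line, total))
    return flagged


verse = ["Walking through the empty streets tonight", "Hold on"]
print(out_of_range(verse))
```

Treat the counts as a prompt to re-read a line aloud, not as a verdict; a heuristic counter will miscount some words.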
| Red Flag | Description |
| --- | --- |
| Adjective stacking | "neon skies, electric hearts, endless dreams": vague imagery filler |
| Rhyme chaos | Inconsistent patterns or forced rhymes breaking meaning |
| Blurred boundaries | Lyric content crosses structure tags |
| No breathing room | Lines too long to sing in one breath |
| Mixed metaphors | Water → fire → flying: listeners can't anchor |

Metaphor discipline: one core metaphor per song; explore its multiple aspects.
Most of the time, let the LM auto-infer. Only set parameters manually when you have clear requirements.

| Parameter | Range | Description |
| --- | --- | --- |
| bpm | 30-300 | Slow 60-80, mid 90-120, fast 130-180 |
| keyscale | Key | e.g. C Major, Am. Common keys (C, G, D, Am, Em) are most stable |
| timesignature | Time sig | 4/4 (most common), 3/4 (waltz), 6/8 (swing) |
| vocal_language | Language | Usually auto-detected from lyrics |
| duration | Seconds | See duration calculation below |
| Scenario | Set |
| --- | --- |
| Daily generation | Let LM auto-infer |
| Clear tempo requirement | bpm |
| Specific style (waltz) | timesignature=3/4 |
| Match other material | bpm + duration |
| Specific key color | keyscale |
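One way to respect "auto-infer unless you have a reason" is to model every parameter as optional and pass along only the ones that were set. The sketch below is illustrative: the `MusicParams` class and `tempo_band` function are hypothetical, the field names are taken from the parameter table above, and the numeric limits are the ones listed there.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class MusicParams:
    """Hypothetical container; None means 'let the LM auto-infer'."""
    bpm: Optional[int] = None
    keyscale: Optional[str] = None
    timesignature: Optional[str] = None
    vocal_language: Optional[str] = None
    duration: Optional[int] = None

    def validate(self) -> None:
        # The 30-300 range comes from the parameter table.
        if self.bpm is not None and not 30 <= self.bpm <= 300:
            raise ValueError(f"bpm {self.bpm} is outside 30-300")
        if self.duration is not None and self.duration <= 0:
            raise ValueError("duration must be a positive number of seconds")

    def explicit(self) -> dict:
        """Return only the parameters that were set manually."""
        return {key: value for key, value in asdict(self).items() if value is not None}


def tempo_band(bpm: int) -> str:
    """Classify a BPM using the slow/mid/fast bands from the table."""
    if 60 <= bpm <= 80:
        return "slow"
    if 90 <= bpm <= 120:
        return "mid"
    if 130 <= bpm <= 180:
        return "fast"
    return "between bands"


waltz = MusicParams(timesignature="3/4")
waltz.validate()
print(waltz.explicit())
print(tempo_band(72))
```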
- Intro/Outro: 5-10 seconds each
- Instrumental sections: 5-15 seconds each

Typical structures:
- 2 verses + 2 choruses: 120-150s minimum
- 2 verses + 2 choruses + bridge: 180-240s minimum
- Full song with intro/outro: 210-270s (3.5-4.5 min)
- Slower BPM (60-80): need MORE duration for the same lyrics
- Medium BPM (100-130): standard duration
- Faster BPM (150-180): can fit more lyrics, but still need breathing room

Rule of thumb: when in doubt, estimate longer. A song that is too short feels rushed.

Note: Lyrics tags (piano, powerful, whispered) are consistent with Caption (piano ballad, building to powerful chorus, intimate).
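The relationship between BPM and duration can be made explicit with a little arithmetic. The sketch below is a back-of-the-envelope estimate, not a formula from ACE-Step: it assumes each lyric line occupies two bars of 4/4 (both values are adjustable guesses), and it uses the upper end of the intro/outro and instrumental ranges above, in line with "estimate longer". The `estimate_duration` function is hypothetical.

```python
def estimate_duration(
    lyric_lines: int,
    bpm: float,
    *,
    instrumental_sections: int = 0,
    has_intro: bool = True,
    has_outro: bool = True,
    bars_per_line: int = 2,   # assumption: one lyric line spans two bars
    beats_per_bar: int = 4,   # assumption: 4/4 time
) -> float:
    """Estimate song length in seconds from line count and tempo."""
    # One beat lasts 60 / bpm seconds.
    sung = lyric_lines * bars_per_line * beats_per_bar * 60 / bpm
    intro_outro = 10 * (int(has_intro) + int(has_outro))  # upper end of 5-10 s
    instrumental = 15 * instrumental_sections             # upper end of 5-15 s
    return sung + intro_outro + instrumental


# 2 verses + 2 choruses of 8 lines each = 32 lines.
print(estimate_duration(32, 120))
print(round(estimate_duration(32, 70)))
```

Under these assumptions the same 32 lines need noticeably more time at 70 BPM than at 120 BPM, which is the point of the slower-BPM guideline.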