โ† All skills
Tencent SkillHub ยท Content Creation

acestep-songwriting

Music songwriting guide for ACE-Step. Provides professional knowledge on writing captions, lyrics, choosing BPM/key/duration, and structuring songs. Use this skill when users want to create, write, or plan a song before generating it with ACE-Step.

skill openclawclawhub Free
0 Downloads
0 Stars
0 Installs
0 Score
High Signal

Music songwriting guide for ACE-Step. Provides professional knowledge on writing captions, lyrics, choosing BPM/key/duration, and structuring songs. Use this skill when users want to create, write, or plan a song before generating it with ACE-Step.

โฌ‡ 0 downloads โ˜… 0 stars Unverified but indexed

Install for OpenClaw

Quick setup
  1. Download the package from Yavira.
  2. Extract the archive and review SKILL.md first.
  3. Import or place the package into your OpenClaw setup.

Requirements

Target platform
OpenClaw
Install method
Manual import
Extraction
Extract archive
Prerequisites
OpenClaw
Primary doc
SKILL.md

Package facts

Download mode
Yavira redirect
Package format
ZIP package
Source platform
Tencent SkillHub
What's included
SKILL.md

Validation

  • Use the Yavira download entry.
  • Review SKILL.md after the package is downloaded.
  • Confirm the extracted package contains the expected setup assets.

Install with your agent

Agent handoff

Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.

  1. Download the package from Yavira.
  2. Extract it into a folder your agent can access.
  3. Paste one of the prompts below and point your agent at the extracted folder.
New install

I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.

Upgrade existing

I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.

Trust & source

Release facts

Source
Tencent SkillHub
Verification
Indexed source record
Version
1.0.0

Documentation

ClawHub primary doc Primary doc: SKILL.md 17 sections Open source page

ACE-Step Songwriting Guide

Professional music creation knowledge for writing captions, lyrics, and choosing music parameters for ACE-Step.

Output Format

After using this guide, produce two things for the acestep skill: Caption (-c): Style/genre/instruments/emotion description Lyrics (-l): Complete structured lyrics with tags Parameters: --duration, --bpm, --key, --time-signature, --language

Caption: The Most Important Input

Caption is the most important factor affecting generated music. Supports multiple formats: simple style words, comma-separated tags, complex natural language descriptions.

Common Dimensions

DimensionExamplesStyle/Genrepop, rock, jazz, electronic, hip-hop, R&B, folk, classical, lo-fi, synthwaveEmotion/Atmospheremelancholic, uplifting, energetic, dreamy, dark, nostalgic, euphoric, intimateInstrumentsacoustic guitar, piano, synth pads, 808 drums, strings, brass, electric bassTimbre Texturewarm, bright, crisp, muddy, airy, punchy, lush, raw, polishedEra Reference80s synth-pop, 90s grunge, 2010s EDM, vintage soul, modern trapProduction Stylelo-fi, high-fidelity, live recording, studio-polished, bedroom popVocal Characteristicsfemale vocal, male vocal, breathy, powerful, falsetto, raspy, choirSpeed/Rhythmslow tempo, mid-tempo, fast-paced, groovy, driving, laid-backStructure Hintsbuilding intro, catchy chorus, dramatic bridge, fade-out ending

Caption Writing Principles

Specific beats vague โ€” "sad piano ballad with female breathy vocal" > "a sad song" Combine multiple dimensions โ€” style+emotion+instruments+timbre anchors direction precisely Use references well โ€” "in the style of 80s synthwave" conveys complex aesthetic quickly Texture words are useful โ€” warm, crisp, airy, punchy influence mixing and timbre Don't pursue perfection โ€” Caption is a starting point, iterate based on results Granularity determines freedom โ€” Less detail = more model creativity; more detail = more control Avoid conflicting words โ€” "classical strings" + "hardcore metal" degrades output Fix: Repetition reinforcement โ€” Repeat the elements you want more Fix: Conflict to evolution โ€” "Start with soft strings, middle becomes metal rock, end turns to hip-hop" Don't put BPM/key/tempo in Caption โ€” Use dedicated parameters instead

Lyrics: The Temporal Script

Lyrics controls how music unfolds over time. It carries: Lyric text itself Structure tags ([Verse], [Chorus], [Bridge]...) Vocal style hints ([raspy vocal], [whispered]...) Instrumental sections ([guitar solo], [drum break]...) Energy changes ([building energy], [explosive drop]...)

Structure Tags

CategoryTagDescriptionBasic Structure[Intro]Opening, establish atmosphere[Verse] / [Verse 1]Verse, narrative progression[Pre-Chorus]Pre-chorus, build energy[Chorus]Chorus, emotional climax[Bridge]Bridge, transition or elevation[Outro]Ending, conclusionDynamic Sections[Build]Energy gradually rising[Drop]Electronic music energy release[Breakdown]Reduced instrumentation, spaceInstrumental[Instrumental]Pure instrumental, no vocals[Guitar Solo]Guitar solo[Piano Interlude]Piano interludeSpecial[Fade Out]Fade out ending[Silence]Silence

Combining Tags

Use - for finer control, but keep it concise: โœ… [Chorus - anthemic] โŒ [Chorus - anthemic - stacked harmonies - high energy - powerful - epic] Put complex style descriptions in Caption, not in tags.

Caption-Lyrics Consistency

Models are not good at resolving conflicts. Checklist: Instruments in Caption โ†” Instrumental section tags in Lyrics Emotion in Caption โ†” Energy tags in Lyrics Vocal description in Caption โ†” Vocal control tags in Lyrics

Vocal Control Tags

TagEffect[raspy vocal]Raspy, textured vocals[whispered]Whispered[falsetto]Falsetto[powerful belting]Powerful, high-pitched singing[spoken word]Rap/recitation[harmonies]Layered harmonies[call and response]Call and response[ad-lib]Improvised embellishments

Energy and Emotion Tags

TagEffect[high energy]High energy, passionate[low energy]Low energy, restrained[building energy]Increasing energy[explosive]Explosive energy[melancholic]Melancholic[euphoric]Euphoric[dreamy]Dreamy[aggressive]Aggressive

Lyric Writing Tips

6-10 syllables per line โ€” Model aligns syllables to beats; keep similar counts for lines in same position (ยฑ1-2) Uppercase = stronger intensity โ€” WE ARE THE CHAMPIONS! (shouting) vs walking through the streets (normal) Parentheses = background vocals โ€” We rise together (together) Extend vowels โ€” Feeeling so aliiive (use cautiously, effects unstable) Clear section separation โ€” Blank lines between sections

Avoiding "AI-flavored" Lyrics

Red FlagDescriptionAdjective stacking"neon skies, electric hearts, endless dreams" โ€” vague imagery fillerRhyme chaosInconsistent patterns or forced rhymes breaking meaningBlurred boundariesLyric content crosses structure tagsNo breathing roomLines too long to sing in one breathMixed metaphorsWater โ†’ fire โ†’ flying โ€” listeners can't anchor Metaphor discipline: One core metaphor per song, explore its multiple aspects.

Music Metadata

Most of the time, let LM auto-infer. Only set manually when you have clear requirements. ParameterRangeDescriptionbpm30โ€“300Slow 60โ€“80, mid 90โ€“120, fast 130โ€“180keyscaleKeye.g. C Major, Am. Common keys (C, G, D, Am, Em) most stabletimesignatureTime sig4/4 (most common), 3/4 (waltz), 6/8 (swing)vocal_languageLanguageUsually auto-detected from lyricsdurationSecondsSee duration calculation below

When to Set Manually

ScenarioSetDaily generationLet LM auto-inferClear tempo requirementbpmSpecific style (waltz)timesignature=3/4Match other materialbpm + durationSpecific key colorkeyscale

Estimation Method

Intro/Outro: 5-10 seconds each Instrumental sections: 5-15 seconds each Typical structures: 2 verses + 2 choruses: 120-150s minimum 2 verses + 2 choruses + bridge: 180-240s minimum Full song with intro/outro: 210-270s (3.5-4.5 min)

BPM and Duration Relationship

Slower BPM (60-80): Need MORE duration for same lyrics Medium BPM (100-130): Standard duration Faster BPM (150-180): Can fit more lyrics, but still need breathing room Rule of thumb: When in doubt, estimate longer. A song too short feels rushed. Note: Lyrics tags (piano, powerful, whispered) are consistent with Caption (piano ballad, building to powerful chorus, intimate).

Category context

Writing, remixing, publishing, visual generation, and marketing content production.

Source: Tencent SkillHub

Largest current source with strong distribution and engagement signals.

Package contents

Included in package
1 Docs
  • SKILL.md Primary doc