Requirements
- Target platform: OpenClaw
- Install method: Manual import
- Extraction: Extract archive
- Prerequisites: OpenClaw
- Primary doc: SKILL.md
Create music with MiniMax music models (e.g., music-2.5). Use when generating songs or instrumental tracks from lyrics and style prompts, or when integrating...
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Use this skill to generate music with MiniMax's Music Generation API. All usage and outputs are designed for music-2.5 unless specified otherwise.
Set the API key:
    export MINIMAX_MUSIC_API_KEY="your_api_key"
Generate music from lyrics (and optional prompt):
    python scripts/generate_music.py \
      --lyrics "[Verse]\n...\n[Chorus]\n..." \
      --prompt "indie folk, melancholic, introspective" \
      --output ./output.mp3
- scripts/generate_music.py: main generator (lyrics + prompt → audio)
- scripts/utils_audio.py: hex decoding + save helpers
Read references/minimax_music_api.md for:
- Endpoint, auth header, payload schema
- Required/optional fields
- Output formats (hex/url) and constraints
This skill supports 3 generation modes:
- Standard Song: the user provides or wants lyrics plus music. Use structured lyrics with a prompt for style.
- Pure Music: the user requests "纯音乐" (pure music), "纯音乐,无人声" (pure music, no vocals), "pure music", or "instrumental". No lyrics, just an instrumental arrangement. See the "Pure Music Generation" section below.
- Melodic Chanting: the user requests "哼唱" (humming), "吟唱" (chanting), "humming", or "chanting". Music with vocal syllables instead of full lyrics. See the "Melodic Chanting Generation" section below.
lyrics is required by the API. prompt is optional for music-2.5. output_format defaults to hex (inline audio). Use url if you prefer a download URL. URLs expire (24h). Download immediately if using url.
Core Principles: Describe the music you want rather than issuing commands, and keep the prompt structured, clear, and parseable.
- Genre/Subgenre (with era or region)
- Mood/Emotion (2-3 emotional descriptors)
- Tempo/BPM (specify BPM if possible)
- Key Instruments (3-5 key instruments/timbres)
- Vocals (vocal type, processing, or instrumental)
- Use case (purpose/scene)
- Structure (section structure)
- References (1-2 style references)
- Avoid/Negative (exclusions)
Basic Template (Beginner)
    [Genre], [Mood], [Tempo/BPM], [Key Instruments], [Vocal Style]
Standard Template (Recommended)
    Genre: [Specific genre + era]
    Mood: [2-3 descriptors]
    Tempo: [BPM or speed]
    Instruments: [3-5 key instruments]
    Vocals: [Type or instrumental only]
    Use case: [scene/usage]
    Avoid: [unwanted elements]
    References: [1-2 artists/songs]
Advanced Template (Production Brief)
    Genre: … | Era: …
    BPM: … | Key: …
    Mood: ... (can include emotional arc, e.g., "from restrained to explosive")
    Lead: …
    Rhythm: …
    Bass: …
    Texture: …
    Vocals: …
    Structure: Intro / Verse / Chorus / Bridge / Outro
    Avoid: …
    Reference: …
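The Standard Template can be assembled programmatically. The following is a minimal sketch, not the logic inside scripts/generate_music.py: the function name `build_prompt` and its parameters are hypothetical, chosen to mirror the template's field names.

```python
from typing import Optional, Sequence


def build_prompt(
    genre: str,
    mood: Sequence[str],
    tempo: str,
    instruments: Sequence[str],
    vocals: str,
    use_case: Optional[str] = None,
    avoid: Sequence[str] = (),
    references: Sequence[str] = (),
) -> str:
    """Assemble a Standard Template prompt, one 'Field: value' line per field.

    Hypothetical helper for illustration; not part of the skill package.
    """
    lines = [
        f"Genre: {genre}",
        f"Mood: {', '.join(mood)}",
        f"Tempo: {tempo}",
        f"Instruments: {', '.join(instruments)}",
        f"Vocals: {vocals}",
    ]
    # Optional fields are skipped when empty so the prompt stays compact.
    if use_case:
        lines.append(f"Use case: {use_case}")
    if avoid:
        lines.append(f"Avoid: {', '.join(avoid)}")
    if references:
        lines.append(f"References: {', '.join(references)}")
    return "\n".join(lines)


prompt = build_prompt(
    genre="1980s synthwave",
    mood=["nostalgic", "energetic"],
    tempo="120 BPM",
    instruments=["analog synths", "drum machine", "bass guitar"],
    vocals="female vocals",
    use_case="retro game trailer",
    avoid=["acoustic guitar"],
    references=["The Midnight", "FM-84"],
)
print(prompt)
```

The resulting string can be passed to the script's --prompt option.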
[Intro] [Verse] [Pre-Chorus] [Chorus] [Bridge] [Outro] [Instrumental]
- No vocals / Instrumental only
- Avoid autotune
- No distorted guitars
- Avoid heavy reverb
- No trap hi-hats
- Style inaccurate: Add "era + sub-genre + instrument anchors + reference artists"
- Rhythm wrong: Specify BPM + rhythm description (e.g., four-on-the-floor)
- Intro too generic: Write [Intro] in lyrics and describe the opening
- Vocals off: Move Vocals to front of prompt and add negative constraints
- Contains: Genre / Mood / BPM / Instruments / Vocals / Use case
- Check for conflicts (e.g., "very slow + high energy")
- Has Avoid items to filter unwanted elements
- Is structured (avoid long prose paragraphs)
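Parts of this checklist can be automated. The sketch below is an illustration only: `check_prompt` is a hypothetical helper, it assumes the one-field-per-line layout of the Standard Template, and its conflict check covers just the single example pair named in the checklist.

```python
from typing import Dict, List

REQUIRED_FIELDS = ("Genre", "Mood", "Instruments", "Vocals", "Use case")
TEMPO_FIELDS = ("Tempo", "BPM")  # either name satisfies the tempo requirement


def check_prompt(prompt: str) -> List[str]:
    """Return a list of checklist problems; an empty list means the prompt passes."""
    fields: Dict[str, str] = {}
    for line in prompt.splitlines():
        if ":" in line:
            name, value = line.split(":", 1)
            fields[name.strip().lower()] = value.strip()

    problems = []
    for name in REQUIRED_FIELDS:
        if not fields.get(name.lower()):
            problems.append(f"missing field: {name}")
    if not any(fields.get(name.lower()) for name in TEMPO_FIELDS):
        problems.append("missing field: Tempo/BPM")
    if not fields.get("avoid"):
        problems.append("no Avoid items")

    # Only the checklist's own example conflict is detected here.
    text = prompt.lower()
    if "very slow" in text and "high energy" in text:
        problems.append("conflict: very slow + high energy")
    return problems


print(check_prompt("Genre: ambient\nMood: very slow, high energy"))
```

A real checker would need a broader list of conflicting descriptors; which pairs count as conflicts is a judgment call the source does not spell out.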
Use this mode when user wants instrumental tracks without lyrics.
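- 纯音乐 (pure music), 纯音乐无人声 (pure music without vocals), 无人声 (no vocals)
- pure music, instrumental, no lyrics
- 背景音乐 (background music), 轻音乐 (light music), 器乐 (instrumental music)
- 无歌词 (no lyrics)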
pure music, [scene/style description], no lyrics
Use placeholder tags: [intro] [outro]
⚠️ Duration: Pure music tracks are typically shorter (1-2 minutes) because there are no lyrics to guide the musical progression.
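| # | Prompt | Scene |
|---|--------|-------|
| 1 | pure music, coffee shop, no lyrics | Coffee shop ambience |
| 2 | pure music, rainy night, no lyrics | Rainy night |
| 3 | pure music, morning sunshine, no lyrics | Morning light |
| 4 | pure music, jazz bar, no lyrics | Jazz bar |
| 5 | pure music, city walk, no lyrics | City walk |
| 6 | pure music, night drive, no lyrics | Night drive |
| 7 | pure music, piano solo, no lyrics | Piano solo |
| 8 | pure music, lofi chill, no lyrics | Lo-fi chill |
| 9 | pure music, bookstore, no lyrics | Bookstore |
| 10 | pure music, cafe closing time, no lyrics | Cafe closing time |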
Generate pure music for coffee shop:
    python scripts/generate_music.py \
      --lyrics "[intro] [outro]" \
      --prompt "pure music, coffee shop, no lyrics" \
      --output ./coffee_shop.mp3
Generate rainy night atmosphere:
    python scripts/generate_music.py \
      --lyrics "[intro] [outro]" \
      --prompt "pure music, rainy night, no lyrics" \
      --output ./rainy_night.mp3
Generate piano solo:
    python scripts/generate_music.py \
      --lyrics "[intro] [outro]" \
      --prompt "pure music, piano solo, no lyrics" \
      --output ./piano_solo.mp3
Use this mode when user wants music with vocal syllables instead of full lyrics (humming, chanting).
- Humming, chanting, vocalizing
- humming, chanting
- Healing, vocal accompaniment
- Melodic harmony, ooh ah la
pure music, [style/scene description], no lyrics
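Use vocal syllables instead of words:

| Syllable Pattern | Vibe |
|------------------|------|
| ah, ah, ah, ah... | Soft chanting |
| la, la, la, la... | Melodic humming |
| mmm, mmm, mmm... | Healing hum |
| ooh, ooh, ooh... | Ethereal chanting |
| hum, hum, hum... | Sustained humming |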
[intro] [verse] [chorus] [verse] [outro]
Fill each section with the chosen syllable pattern.
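Chant lyrics of this shape can be generated from a section list and a syllable. The sketch below uses a hypothetical helper, `build_chant_lyrics`. It assumes, based on this document's own examples, that every section except the closing [outro] carries the syllable pattern.

```python
from typing import Sequence


def build_chant_lyrics(
    syllable: str,
    sections: Sequence[str] = ("intro", "verse", "chorus", "verse", "outro"),
    repeats: int = 3,
) -> str:
    """Build a lyrics string where sections carry a repeated vocal syllable.

    Hypothetical helper: the outro stays a bare tag, matching the examples
    in this document; all other sections get the syllable pattern.
    """
    chant = ", ".join([syllable] * repeats) + "..."
    parts = []
    for name in sections:
        tag = f"[{name}]"
        parts.append(tag if name == "outro" else f"{tag} {chant}")
    return " ".join(parts)


print(build_chant_lyrics("mmm", sections=("intro", "verse", "outro")))
```

The returned string is meant to be passed as the --lyrics argument alongside a "pure music, ..., no lyrics" prompt.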
| Scene | Prompt | Lyrics |
|-------|--------|--------|
| Healing | pure music, healing, relaxing, no lyrics | [intro] mmm, mmm, mmm... [verse] mmm, mmm, mmm... [outro] |
| Meditation | pure music, meditation, calming, no lyrics | [intro] ooh, ooh, ooh... [verse] ooh, ooh, ooh... [outro] |
| Happy | pure music, happy, uplifting, no lyrics | [intro] la, la, la, la... [chorus] la, la, la, la... [outro] |
| Mystical | pure music, mystical, ethereal, no lyrics | [intro] ah, ah, ah... [verse] ah, ah, ah... [outro] |
Generate healing chant music:
    python scripts/generate_music.py \
      --lyrics "[intro] mmm, mmm, mmm... [verse] mmm, mmm, mmm... [chorus] mmm, mmm, mmm... [outro]" \
      --prompt "pure music, healing, relaxing, no lyrics" \
      --output ./healing_chant.mp3
Generate meditation chant:
    python scripts/generate_music.py \
      --lyrics "[intro] ooh, ooh, ooh... [verse] ooh, ooh, ooh... [outro]" \
      --prompt "pure music, meditation, calming, no lyrics" \
      --output ./meditation.mp3
Generate happy humming:
    python scripts/generate_music.py \
      --lyrics "[intro] la, la, la... [verse] la, la, la... [chorus] la, la, la... [outro]" \
      --prompt "pure music, happy, uplifting, no lyrics" \
      --output ./happy_humming.mp3
When the user requests music, detect the mode:
- Standard Song → User provides lyrics OR asks for a "song with lyrics"
- Pure Music → Keywords: "纯音乐" (pure music), "pure music", "instrumental", "no lyrics", "无人声" (no vocals)
- Chanting/Humming → Keywords: "哼唱" (humming), "吟唱" (chanting), "humming", "chanting", "治愈系" (healing style)
User Request
    ↓
Contains "纯音乐"/"pure music"/"instrumental"?
├─ Yes → Is "哼唱"/"humming" also mentioned?
│   ├─ Yes → Melodic Chanting Mode
│   └─ No → Pure Music Mode
└─ No → Standard Song Mode (requires lyrics)
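Mode detection can be sketched as simple keyword matching. The function `detect_mode` and its return values are hypothetical. Note one deliberate deviation: the tree as drawn only reaches chanting mode when a pure-music keyword is also present, whereas this sketch checks chanting keywords first so that a request mentioning only "humming" is not routed to Standard Song. That ordering is an assumption about the intent, not something the source states.

```python
PURE_KEYWORDS = ("纯音乐", "无人声", "pure music", "instrumental", "no lyrics")
CHANT_KEYWORDS = ("哼唱", "吟唱", "humming", "chanting", "治愈系")


def detect_mode(request: str) -> str:
    """Classify a user request into one of the three generation modes.

    Hypothetical helper; the mode names returned here are illustrative.
    """
    text = request.lower()
    # Chanting is checked first (an assumption, see the note above).
    if any(keyword in text for keyword in CHANT_KEYWORDS):
        return "chanting"
    if any(keyword in text for keyword in PURE_KEYWORDS):
        return "pure_music"
    return "standard_song"


for example in (
    "Make me some pure music for a cafe",
    "纯音乐,带一点哼唱",
    "Write a song with lyrics about rain",
):
    print(example, "->", detect_mode(example))
```

Plain substring matching is crude: it cannot tell "no instrumental breaks, please" from a request for an instrumental track, so an agent should still confirm ambiguous requests with the user.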
Generate MP3 (hex response → file):
    python scripts/generate_music.py \
      --lyrics "[Intro]\n..." \
      --prompt "cinematic, uplifting" \
      --output ./music.mp3 \
      --format mp3 \
      --bitrate 256000 \
      --sample-rate 44100
Generate with structured prompt fields (auto-build):
    python scripts/generate_music.py \
      --lyrics "[Verse]\n..." \
      --genre "1980s synthwave" \
      --mood "nostalgic, energetic" \
      --bpm 120 \
      --instruments "analog synths, drum machine, bass guitar" \
      --vocals "female vocals" \
      --use-case "retro game trailer" \
      --avoid "no acoustic guitar" \
      --references "The Midnight, FM-84" \
      --output ./music.mp3
Generate and download from URL:
    python scripts/generate_music.py \
      --lyrics "[Verse]\n..." \
      --prompt "lofi, rainy night" \
      --output ./music.mp3 \
      --output-format url \
      --download
For detailed API documentation (endpoints, authentication, request/response formats), see: references/minimax_music_api.md
Key points for all modes:
- Endpoint: POST https://api.minimaxi.com/v1/music_generation
- Auth: Authorization: Bearer <token>
- Model: music-2.5
- lyrics: Required field. Use [intro] [outro] for pure music, or syllable patterns for chanting.
Standard Song:
    {
      "model": "music-2.5",
      "prompt": "indie folk, melancholic, introspective",
      "lyrics": "[verse]\n...\n[chorus]\n..."
    }
Pure Music:
    {
      "model": "music-2.5",
      "prompt": "pure music, coffee shop, no lyrics",
      "lyrics": "[intro] [outro]"
    }
Melodic Chanting:
    {
      "model": "music-2.5",
      "prompt": "pure music, healing, relaxing, no lyrics",
      "lyrics": "[intro] mmm, mmm, mmm... [verse] mmm, mmm, mmm... [outro]"
    }
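For integrations that call the API directly instead of using the script, a request with one of these payloads might be prepared as follows. This is a sketch using only the standard library: `build_request` is a hypothetical helper, the JSON Content-Type header is an assumption, and the request is constructed but not sent. Consult references/minimax_music_api.md for the authoritative schema.

```python
import json
import os
import urllib.request
from typing import Optional

API_URL = "https://api.minimaxi.com/v1/music_generation"


def build_request(
    lyrics: str,
    prompt: Optional[str] = None,
    api_key: Optional[str] = None,
) -> urllib.request.Request:
    """Prepare (but do not send) a POST request for music-2.5.

    Hypothetical helper; falls back to MINIMAX_MUSIC_API_KEY when no key is given.
    """
    key = api_key or os.environ["MINIMAX_MUSIC_API_KEY"]
    payload = {"model": "music-2.5", "lyrics": lyrics}
    if prompt:  # prompt is optional for music-2.5
        payload["prompt"] = prompt
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {key}",
            "Content-Type": "application/json",  # assumed, not stated in the source
        },
        method="POST",
    )


req = build_request(
    "[intro] [outro]",
    prompt="pure music, coffee shop, no lyrics",
    api_key="your_api_key",
)
print(req.get_method(), req.full_url)
```

Sending the request would then be a matter of passing `req` to `urllib.request.urlopen` and parsing the JSON body.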
- data.audio: hex string (default) or URL (valid 24 hours)
- data.status: generation status
- extra_info: duration, sample rate, channels, bitrate, size
- base_resp.status_code: 0 on success
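Given a parsed response with these fields, the hex audio can be turned into a file roughly as shown below. This is an illustrative sketch, not the code in scripts/utils_audio.py: `save_audio` is a hypothetical name, and the sample response is made up to exercise the function (its three bytes spell "ID3", not a playable track).

```python
from pathlib import Path


def save_audio(response: dict, output_path: str) -> int:
    """Decode data.audio from hex and write it to output_path.

    Hypothetical helper. Returns the number of bytes written.
    """
    status = response.get("base_resp", {}).get("status_code")
    if status != 0:
        raise RuntimeError(f"generation failed, status_code={status}")

    audio = response["data"]["audio"]
    if audio.startswith(("http://", "https://")):
        # With the url output format the field holds a link, not hex data.
        raise ValueError("data.audio is a URL; download it instead of decoding")

    raw = bytes.fromhex(audio)
    Path(output_path).write_bytes(raw)
    return len(raw)


sample = {"base_resp": {"status_code": 0}, "data": {"audio": "494433"}}
print(save_audio(sample, "sample_output.bin"))
```

When the url format is used, download the file promptly, since the link expires after 24 hours.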