Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Process, enhance, and convert audio files with noise removal, normalization, format conversion, transcription, and podcast workflows.
Process, enhance, and convert audio files with noise removal, normalization, format conversion, transcription, and podcast workflows.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Required: ffmpeg / ffprobe โ core audio processing Optional (for advanced features): sox โ additional noise reduction whisper โ local transcription (or use API) demucs โ stem separation
SituationLoadFFmpeg commands by taskcommands.mdLoudness standards by platformloudness.mdPodcast production workflowpodcast.mdTranscription workflowtranscription.md
TaskMethodConvert formatsFFmpeg (-acodec)Remove noiseFFmpeg filters or SoXNormalize loudnessffmpeg-normalize or -af loudnormTranscribeWhisper โ text, SRT, VTTSeparate stemsDemucs (vocals, drums, bass, other)
Clarify goal โ What format? What loudness? What platform? Analyze source โ ffprobe for codec, sample rate, channels, duration Process โ FFmpeg/SoX for transformation Verify โ Check output plays, meets specs, sounds correct Deliver โ Provide file to user
User saysAgent does"Convert to MP3"-acodec libmp3lame -q:a 2"Remove background noise"Apply highpass/lowpass or dedicated denoiser"Normalize for podcast"-af loudnorm=I=-16:TP=-1.5:LRA=11"Transcribe this"Whisper โ output SRT/VTT/TXT"Extract audio from video"-vn -acodec copy or re-encode"Make it smaller"Lower bitrate: -b:a 128k or -b:a 96k"Speed up 1.5x"-af atempo=1.5
FormatUse CaseQualityWAVMaster, editingLosslessFLACArchive, audiophileLossless compressedMP3Universal sharingLossy, 128-320 kbpsAAC/M4AApple, podcastsLossy, efficientOGG/OpusWhatsApp, DiscordLossy, very efficient
Podcast: -16 LUFS (Spotify), -19 LUFS (Apple) Music: -14 LUFS (Spotify), -16 LUFS (Apple Music) MP3 quality: VBR -q:a 2 (~190 kbps) or CBR -b:a 192k Sample rate: 44.1kHz for music, 48kHz for video sync
This skill: Processes audio files user explicitly provides Runs FFmpeg commands on user request Does NOT access cloud services without user knowing Does NOT store files persistently (user manages their files)
Writing, remixing, publishing, visual generation, and marketing content production.
Largest current source with strong distribution and engagement signals.