Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Fast, affordable automatic speech-to-text transcription supporting 100 languages, speaker diarization, word timestamps, and customizable output formats.
Fast, affordable automatic speech-to-text transcription supporting 100 languages, speaker diarization, word timestamps, and customizable output formats.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Then review README.md for any prerequisites, environment setup, or post-install checks. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Then review README.md for any prerequisites, environment setup, or post-install checks. Summarize what changed and any follow-up checks I should run.
Fast, accurate, and incredibly inexpensive automatic speech-to-text transcription service.
Disruptive Pricing: $0.06 - $0.12 per hour (2-15x cheaper than Deepgram or OpenAI). Extreme Speed: 100 minutes of audio transcribes in ~1 minute. Multilingual: Supports 100 languages with auto-detection. Agent-Ready: Designed for high-volume, automated pipelines.
Sign up at speechischeap.com. Use code CH5 for $5 off.
This skill looks for your API key in the SIC_API_KEY environment variable. Add this to your .env or agent config: SIC_API_KEY=your_key_here
When this skill is installed, you can transcribe any URL from an OpenClaw session and get the JSON results immediately by running: ./skills/asr/scripts/asr.sh transcribe --url "https://example.com/audio.mp3"
# Basic transcription ./skills/asr/scripts/asr.sh transcribe --url "https://example.com/audio.mp3" # Advanced transcription with options ./skills/asr/scripts/asr.sh transcribe --url "https://example.com/audio.mp3" \ --speakers --words --labels \ --language "en" \ --format "srt" \ --private
Perfect for processing audio already on your disk. This handles the upload automatically. # Upload and transcribe local media ./skills/asr/scripts/asr.sh transcribe --file "./local-audio.wav" # Upload with webhook callback ./skills/asr/scripts/asr.sh transcribe --file "./local-audio.wav" --webhook "https://mysite.com/callback" # Note: For local files, the skill handles the multi-part upload to # https://upload.speechischeap.com before starting the transcription.
--speakers: Enable speaker diarization --words: Enable word-level timestamps --labels: Enable audio labeling (music, noise, etc.) --stream: Enable streaming output --private: Do not store audio/transcript (privacy mode) --language <code>: ISO language code (e.g., 'en', 'es') --confidence <float>: Minimum confidence threshold (default 0.5) --format <fmt>: Output format (json, srt, vtt, webvtt) --webhook <url>: URL to receive job completion payload --segment-duration <n>: Segment duration in seconds (default 30)
./skills/asr/scripts/asr.sh status "job-id-here"
The asr.sh command-line tool returns JSON by default when successful, making it easy to pipe into other tools or parse directly. If the SIC_API_KEY is missing, the tool will provide a clear error message and a direct link to the signup page.
Agent frameworks, memory systems, reasoning layers, and model-native orchestration.
Largest current source with strong distribution and engagement signals.