Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Analyze and summarize videos from 1000+ sites using Google Gemini AI, providing transcripts, descriptions, summaries, and answers to questions.
Analyze and summarize videos from 1000+ sites using Google Gemini AI, providing transcripts, descriptions, summaries, and answers to questions.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Analyze videos using Google Gemini's multimodal video understanding. Supports 1000+ video sources via yt-dlp.
yt-dlp β brew install yt-dlp / pip install yt-dlp ffmpeg β brew install ffmpeg (for merging video+audio streams) GEMINI_API_KEY environment variable
Returns structured JSON: transcript β Verbatim transcript with [MM:SS] timestamps description β Visual description (people, setting, UI, text on screen, flow) summary β 2-3 sentence summary duration_seconds β Estimated duration speakers β Identified speakers
uv run {baseDir}/scripts/analyze_video.py "<video-url>"
uv run {baseDir}/scripts/analyze_video.py "<video-url>" -q "What product is shown?"
uv run {baseDir}/scripts/analyze_video.py "<video-url>" -p "Custom prompt" --raw
uv run {baseDir}/scripts/analyze_video.py "<video-url>" --download-only -o video.mp4
FlagDescriptionDefault-q / --questionQuestion to answer (added to default fields)none-p / --promptOverride entire prompt (ignores -q)structured JSON-m / --modelGemini modelgemini-2.5-flash-o / --outputSave output to filestdout--keepKeep downloaded video filefalse--download-onlyDownload only, skip analysisfalse--max-sizeMax file size in MB500--rawRaw text output instead of JSONfalse
YouTube URLs β Passed directly to Gemini (no download needed) All other URLs β Downloaded via yt-dlp β uploaded to Gemini File API β poll until processed Gemini analyzes video with structured prompt β returns JSON Temp files and Gemini uploads cleaned up automatically
Any URL supported by yt-dlp: Loom, YouTube, TikTok, Vimeo, Twitter/X, Instagram, Dailymotion, Twitch, and 1000+ more.
Use -q for targeted questions on top of the full analysis YouTube is fastest (no download step) Large videos (10min+) work fine β Gemini File API supports up to 2GB (free) / 20GB (paid) The script auto-installs Python dependencies via uv
Agent frameworks, memory systems, reasoning layers, and model-native orchestration.
Largest current source with strong distribution and engagement signals.