YouTube Transcription Generator

Use VLM Run (vlmrun) to generate transcriptions from YouTube videos. Download a video with yt-dlp, then run vlmrun to transcribe with optional timestamps. VLMRUN_API_KEY must be in .env; follow vlmrun-cli-skill for CLI setup and options.

Install for OpenClaw

Quick setup

Download the package from Yavira.
Extract the archive and review SKILL.md first.
Import or place the package into your OpenClaw setup.

Requirements

Target platform: OpenClaw
Install method: Manual import
Extraction: Extract archive
Prerequisites: OpenClaw
Primary doc: SKILL.md

Package facts

Download mode: Yavira redirect
Package format: ZIP package
Source platform: Tencent SkillHub
What's included: SKILL.md

Validation

Use the Yavira download entry.
Review SKILL.md after the package is downloaded.
Confirm the extracted package contains the expected setup assets.

Install with your agent

Agent handoff

Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.

Download the package from Yavira.
Extract it into a folder your agent can access.
Paste one of the prompts below and point your agent at the extracted folder.

New install

I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.

Upgrade existing

I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.

Trust & source

Release facts

Source: Tencent SkillHub
Verification: Indexed source record
Version: 0.1.0

Provenance

Publisher: MehediAhamed

Documentation

Primary doc: SKILL.md (9 sections)

YouTube Transcription Generator (VLM Run)

Generate transcriptions from YouTube videos using vlmrun for speech-to-text and optional timestamps.

This skill:
Downloads the YouTube video (or audio) with yt-dlp.
Transcribes the video with vlmrun (Orion visual AI).
Saves the transcript to a file (plain text or with timestamps).

Refer to vlmrun-cli-skill for vlmrun CLI setup, environment variables, and all vlmrun chat options.

How the assistant should use this skill

Check .env for API key: Ensure .env (or .env.local) contains VLMRUN_API_KEY. If missing, instruct the user to set it before running any vlmrun commands.

Use vlmrun for transcription only: For transcription (and optional timestamps), use the vlmrun CLI with a video file as input (-i <video>). vlmrun accepts video files (e.g. .mp4). For YouTube, the skill first downloads the video with yt-dlp, then passes the file to vlmrun.

Workflow:
1. User provides a YouTube URL (and optionally output path).
2. Download the video (or audio-only for faster/smaller) with yt-dlp.
3. Run: vlmrun chat "Transcribe this video with timestamps for each section. Output the full transcript in a clear, readable format." -i <downloaded_file> -o <output_dir>
4. Capture vlmrun’s response and save it as the transcript file (e.g. transcript.txt).

Prerequisites

Python 3.10+
VLMRUN_API_KEY (required for vlmrun)
vlmrun CLI (vlmrun[cli])
yt-dlp (for downloading YouTube videos)

See vlmrun-cli-skill for detailed vlmrun usage and examples (including video transcription).

Installation & Setup

From the youtube-transcription-generator directory:

Windows (PowerShell):

    cd path\to\youtube-transcription-generator
    uv venv
    .venv\Scripts\Activate.ps1
    uv pip install -r requirements.txt

macOS/Linux:

    cd path/to/youtube-transcription-generator
    uv venv
    source .venv/bin/activate
    uv pip install -r requirements.txt

Copy .env_template to .env and set VLMRUN_API_KEY.

Option A: Run the script (recommended)

    # From youtube-transcription-generator directory, with venv activated
    python scripts/run_transcription.py "https://www.youtube.com/watch?v=VIDEO_ID" -o ./output

This will:
Download the video with yt-dlp into the output directory.
Run vlmrun to transcribe the video.
Save the transcript as output/transcript.txt (and keep artifacts in output/).

Option B: Manual vlmrun (after downloading the video yourself)

    # 1) Download with yt-dlp
    yt-dlp -f "bv*[ext=mp4]+ba/best[ext=mp4]/best" -o video.mp4 "https://www.youtube.com/watch?v=VIDEO_ID"

    # 2) Transcribe with vlmrun (see vlmrun-cli-skill for options)
    vlmrun chat "Transcribe this video with timestamps for each section. Output the full transcript in a clear, readable format." -i video.mp4 -o ./output

Capture the vlmrun stdout and save it as your transcript, or use --json if you need structured output.
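The two manual steps can also be driven from Python. The following is a minimal sketch, assuming yt-dlp and vlmrun are on PATH and that vlmrun prints the transcript to stdout as described above. `build_commands` and `transcribe_youtube` are hypothetical helper names, and this is not the bundled scripts/run_transcription.py; it only reuses the flags shown in the manual commands.

```python
import subprocess
from pathlib import Path

PROMPT = (
    "Transcribe this video with timestamps for each section. "
    "Output the full transcript in a clear, readable format."
)


def build_commands(url: str, output_dir: Path) -> tuple[list[str], list[str]]:
    """Build the yt-dlp and vlmrun argument lists without running them."""
    video = output_dir / "video.mp4"
    download = [
        "yt-dlp", "-f", "bv*[ext=mp4]+ba/best[ext=mp4]/best",
        "-o", str(video), url,
    ]
    transcribe = ["vlmrun", "chat", PROMPT, "-i", str(video), "-o", str(output_dir)]
    return download, transcribe


def transcribe_youtube(url: str, output_dir: Path) -> Path:
    """Download the video, run vlmrun, and save its stdout as transcript.txt."""
    output_dir.mkdir(parents=True, exist_ok=True)
    download, transcribe = build_commands(url, output_dir)
    subprocess.run(download, check=True)  # raises CalledProcessError on failure
    result = subprocess.run(transcribe, check=True, capture_output=True, text=True)
    transcript = output_dir / "transcript.txt"
    transcript.write_text(result.stdout, encoding="utf-8")
    return transcript
```

Passing arguments as a list avoids shell quoting problems with the format selector and the prompt text.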

Prompt variants for vlmrun

With timestamps: "Transcribe this video with timestamps for each section. Output the full transcript in a clear, readable format."
Plain transcript only: "Transcribe everything said in this video. Output only the spoken text, no timestamps."
Structured (e.g. JSON): Use --json and ask for a structured format in the prompt (e.g. list of { "time": "...", "text": "..." }).
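If you ask for the structured format, the reply can be turned back into a readable transcript. The sketch below assumes you have already extracted the model's reply as a JSON string holding a list of { "time", "text" } objects; how vlmrun's --json output wraps that reply is not covered here, and `render_segments` is a hypothetical helper name.

```python
import json


def render_segments(reply: str) -> str:
    """Render a JSON list of {"time", "text"} objects as transcript lines."""
    segments = json.loads(reply)
    lines = []
    for segment in segments:
        time = str(segment.get("time", "")).strip()
        text = str(segment.get("text", "")).strip()
        if not text:
            continue  # skip empty segments
        # Prefix the timestamp only when the segment has one.
        lines.append(f"[{time}] {text}" if time else text)
    return "\n".join(lines)
```

The model is not guaranteed to return valid JSON, so wrap the call in a try/except for `json.JSONDecodeError` and fall back to saving the raw reply.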

Workflow checklist

Confirm vlmrun is installed and VLMRUN_API_KEY is set (see vlmrun-cli-skill).
Install dependencies: uv pip install -r requirements.txt (includes vlmrun[cli] and yt-dlp).
Run python scripts/run_transcription.py <youtube_url> -o ./output or download + vlmrun manually.
Find transcript in the output directory (e.g. output/transcript.txt).

Troubleshooting

vlmrun not found: Activate the venv and run: uv pip install "vlmrun[cli]". See vlmrun-cli-skill.
Authentication errors: Verify VLMRUN_API_KEY in .env or the current shell.
yt-dlp fails: Update yt-dlp: uv pip install -U yt-dlp. Check the URL is a valid public YouTube video.
Large or long videos: Use audio-only download in the script (e.g. -f bestaudio) to reduce size and speed up transcription.
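Several of these failures come down to a missing tool or key, which can be caught before any download starts. This is a small preflight sketch using only the standard library; `missing_tools` and `api_key_in_shell` are illustrative names, and the check covers only the current shell environment, not the .env file.

```python
import os
import shutil


def missing_tools(names: tuple[str, ...] = ("vlmrun", "yt-dlp")) -> list[str]:
    """Return the command names that cannot be found on PATH."""
    return [name for name in names if shutil.which(name) is None]


def api_key_in_shell() -> bool:
    """Report whether VLMRUN_API_KEY is set to a non-blank value."""
    return bool(os.environ.get("VLMRUN_API_KEY", "").strip())


if __name__ == "__main__":
    for name in missing_tools():
        print(f"{name} not found on PATH; activate the venv and install it")
    if not api_key_in_shell():
        print("VLMRUN_API_KEY is not set in the current shell; check .env")
```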

Category context

Code helpers, APIs, CLIs, browser automation, testing, and developer operations.
