Requirements
- Target platform: OpenClaw
- Install method: Manual import
- Extraction: Extract archive
- Prerequisites: OpenClaw
- Primary doc: SKILL.md
Parse PDFs, Word docs, PPTs, and images into clean Markdown using MinerU's VLM engine. Use when: (1) Converting PDF/Word/PPT/image to Markdown, (2) Extractin...
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Convert PDF, Word, PPT, and images to clean Markdown using MinerU's VLM engine — LaTeX formulas, tables, and images all preserved.
Get a free API token at https://mineru.net/user-center/api-token and export it:
export MINERU_TOKEN="your-token-here"
Limits: 2000 pages/day · 200 MB per file · 600 pages per file
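The limits above can be checked locally before a file is submitted. The following is a minimal sketch, not part of the skill package: `preflight` is a hypothetical helper, the page count and daily usage are assumed to be supplied by the caller, and "200 MB" is assumed to mean 200 MiB because the source does not say.

```python
import os

# Limits quoted in the text above.
MAX_FILE_BYTES = 200 * 1024 * 1024  # assumption: "200 MB" is read as 200 MiB
MAX_PAGES_PER_FILE = 600
MAX_PAGES_PER_DAY = 2000


def preflight(size_bytes, page_count, pages_used_today=0, token=None):
    """Return a list of problems; an empty list means the file looks acceptable.

    Hypothetical helper: the caller supplies the page count and today's usage.
    """
    problems = []
    # Fall back to the MINERU_TOKEN environment variable when no token is passed.
    if token is None:
        token = os.environ.get("MINERU_TOKEN", "")
    if not token:
        problems.append("MINERU_TOKEN is not set")
    if size_bytes > MAX_FILE_BYTES:
        problems.append("file exceeds 200 MB")
    if page_count > MAX_PAGES_PER_FILE:
        problems.append("file exceeds 600 pages")
    if pages_used_today + page_count > MAX_PAGES_PER_DAY:
        problems.append("daily quota of 2000 pages would be exceeded")
    return problems
```

A caller might run `preflight(os.path.getsize(path), pages)` and skip any file for which the returned list is not empty.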
| Type | Formats |
|------|---------|
| 📕 PDF | .pdf (papers, textbooks, scanned docs) |
| 📝 Word | .docx (reports, manuscripts) |
| 📊 PPT | .pptx (slides, presentations) |
| 🖼️ Image | .jpg, .jpeg, .png (OCR extraction) |
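To select only the supported formats from a mixed folder before a batch run, a filter along these lines may help. It is a sketch under assumptions: `is_supported` and `collect_inputs` are hypothetical names, and recursing into subfolders is a choice made here, not documented behavior of the bundled scripts.

```python
from pathlib import Path

# Extensions listed in the supported-formats table above.
SUPPORTED = {".pdf", ".docx", ".pptx", ".jpg", ".jpeg", ".png"}


def is_supported(path):
    """True when the file extension (case-insensitive) is in the supported set."""
    return Path(path).suffix.lower() in SUPPORTED


def collect_inputs(directory):
    """Return supported files under directory, sorted for stable ordering.

    Assumption: subfolders are searched recursively.
    """
    return sorted(
        p for p in Path(directory).rglob("*") if p.is_file() and is_supported(p)
    )
```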
python3 scripts/mineru_v2.py --file ./document.pdf --output ./output/
python3 scripts/mineru_v2.py \
  --dir ./docs/ \
  --output ./output/ \
  --workers 10 \
  --resume
# $HOME is used because the shell does not expand ~ inside double quotes
python3 scripts/mineru_v2.py \
  --dir ./pdfs/ \
  --output "$HOME/Library/Mobile Documents/com~apple~CloudDocs/Obsidian/VaultName/" \
  --resume
python3 scripts/mineru_v2.py --dir ./papers/ --output ./output/ --language ch
python3 scripts/mineru_v2.py --file ./paper.pdf --output ./output/ --model vlm
--dir PATH        Input directory (PDF/Word/PPT/images)
--file PATH       Single file
--output PATH     Output directory (default: ./output/)
--workers N       Concurrent workers (default: 5, max: 15)
--resume          Skip already processed files
--model MODEL     Model version: pipeline | vlm | MinerU-HTML (default: vlm)
--language LANG   Document language: auto | en | ch (default: auto)
--no-formula      Disable formula recognition
--no-table        Disable table extraction
--token TOKEN     API token (overrides MINERU_TOKEN env var)
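When the script is driven from another program, the options above can be assembled and validated before the process is launched. The sketch below uses only the flags and value ranges listed above. `build_command` is a hypothetical wrapper, and two rules in it are assumptions: exactly one of `--file` and `--dir` is passed, and at least one worker is required.

```python
MODELS = {"pipeline", "vlm", "MinerU-HTML"}
LANGUAGES = {"auto", "en", "ch"}


def build_command(output, file=None, directory=None, workers=5,
                  resume=False, model="vlm", language="auto"):
    """Build an argument list for scripts/mineru_v2.py (hypothetical wrapper)."""
    # Assumption: the script expects exactly one input source.
    if (file is None) == (directory is None):
        raise ValueError("pass exactly one of file or directory")
    # The documented maximum is 15; the lower bound of 1 is an assumption.
    if not 1 <= workers <= 15:
        raise ValueError("workers must be between 1 and 15")
    if model not in MODELS:
        raise ValueError(f"unknown model: {model}")
    if language not in LANGUAGES:
        raise ValueError(f"unknown language: {language}")

    cmd = ["python3", "scripts/mineru_v2.py"]
    cmd += ["--file", file] if file is not None else ["--dir", directory]
    cmd += ["--output", output, "--workers", str(workers),
            "--model", model, "--language", language]
    if resume:
        cmd.append("--resume")
    return cmd
```

The returned list can be passed to `subprocess.run(cmd, check=True)`. Passing a list instead of a shell string avoids quoting problems with paths that contain spaces.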
| Model | Speed | Accuracy | Best For |
|-------|-------|----------|----------|
| pipeline | ⚡ Fast | High | Standard docs, most use cases |
| vlm | 🐢 Slow | Highest | Complex layouts, multi-column, mixed text+figures |
| MinerU-HTML | ⚡ Fast | High | Web-style output, HTML-ready content |
| Script | Use When |
|--------|----------|
| mineru_v2.py | Default: async parallel (up to 15 workers) |
| mineru_async.py | Fast network, need maximum throughput |
| mineru_stable.py | Unstable network: sequential, max retry |
output/
├── document-name/
│   ├── document-name.md    # Main Markdown
│   ├── images/             # Extracted images
│   └── content.json        # Metadata
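Given the layout above, the converted documents can be located by looking for a Markdown file named after its folder. This is a minimal sketch that assumes every result follows that layout exactly; `find_results` is a hypothetical helper, not part of the package.

```python
from pathlib import Path


def find_results(output_dir):
    """Map each document name to its main Markdown file.

    Assumes the layout output/<name>/<name>.md shown above; folders
    without a matching Markdown file are skipped.
    """
    results = {}
    for doc_dir in sorted(Path(output_dir).iterdir()):
        main_md = doc_dir / f"{doc_dir.name}.md"
        if doc_dir.is_dir() and main_md.is_file():
            results[doc_dir.name] = main_md
    return results
```

Folders left without a Markdown file, for example by an interrupted run, are absent from the returned mapping, which makes them easy to spot against the input list.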
| Workers | Speed |
|---------|-------|
| 1 (sequential) | 1.2 files/min |
| 5 | 3.1 files/min |
| 15 | 5.6 files/min |
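The throughput figures above give a rough way to estimate batch duration and to see that the scaling is sublinear. The sketch below uses only the three measured points from the table; actual speed will depend on file size and network conditions, and the function names are illustrative.

```python
# Measured throughput from the table above, in files per minute.
THROUGHPUT = {1: 1.2, 5: 3.1, 15: 5.6}


def estimated_minutes(n_files, workers):
    """Rough wall-clock estimate for a batch at one of the measured settings."""
    return n_files / THROUGHPUT[workers]


def speedup(workers):
    """Throughput relative to the sequential (1 worker) baseline."""
    return THROUGHPUT[workers] / THROUGHPUT[1]


if __name__ == "__main__":
    for w in sorted(THROUGHPUT):
        print(f"{w:>2} workers: 100 files in about {estimated_minutes(100, w):.1f} min, "
              f"speedup {speedup(w):.2f}x")
```

By these figures, a 100-file batch takes about 83 minutes sequentially, about 32 minutes with 5 workers, and about 18 minutes with 15. Tripling the workers from 5 to 15 raises throughput by less than a factor of two.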
- 5x auto-retry with exponential backoff
- Use --resume to continue interrupted batches
- Failed files listed at end of run
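The retry behavior described above follows a common pattern, which can be sketched as below. This is not the script's actual implementation. The 1-second base delay, the doubling factor, and the reading of "5x" as one initial attempt plus up to five retries are all assumptions.

```python
import time


def retry_with_backoff(func, max_retries=5, base_delay=1.0, sleep=time.sleep):
    """Call func(); on failure wait base_delay * 2**attempt seconds and retry.

    Assumptions: one initial attempt plus up to max_retries retries, and a
    delay that doubles each time (1s, 2s, 4s, ... with the defaults).
    The sleep function is injectable so the logic can be tested without waiting.
    """
    for attempt in range(max_retries + 1):
        try:
            return func()
        except Exception:
            if attempt == max_retries:
                raise  # retries exhausted: surface the last error
            sleep(base_delay * 2 ** attempt)
```

A batch runner could wrap each upload in `retry_with_backoff` and catch the final exception to record the file as failed, which matches the failed-files summary at the end of a run.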
For detailed API documentation, see references/api_reference.md.