Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
PDF document parsing tool based on local MinerU, supports converting PDF to Markdown, JSON, and other machine-readable formats.
PDF document parsing tool based on local MinerU, supports converting PDF to Markdown, JSON, and other machine-readable formats.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Convert PDF documents to Markdown format, preserving document structure, formulas, tables, and images. Description: Use MinerU to parse PDF documents and output in Markdown format, supporting OCR, formula recognition, table extraction, and other features. Parameters: file_path (string, required): Absolute path to the PDF file output_dir (string, required): Absolute path to the output directory backend (string, optional): Parsing backend, options: hybrid-auto-engine (default), pipeline, vlm-auto-engine language (string, optional): OCR language code, such as en (English), ch (Chinese), ja (Japanese), etc., defaults to auto-detection enable_formula (boolean, optional): Whether to enable formula recognition, defaults to true enable_table (boolean, optional): Whether to enable table extraction, defaults to true start_page (integer, optional): Start page number (starting from 0), defaults to 0 end_page (integer, optional): End page number (starting from 0), defaults to -1 meaning parse all pages Return Value: { "success": true, "output_path": "/path/to/output", "markdown_content": "Converted Markdown content...", "images": ["List of image paths"], "tables": ["List of table information"], "formula_count": 10 } Examples: python .claude/skills/pdf-process/script/pdf_parser.py \ '{"name": "pdf_to_markdown", "arguments": {"file_path": "/path/to/document.pdf", "output_dir": "/path/to/output"}}' # Use specific backend python .claude/skills/pdf-process/script/pdf_parser.py \ '{"name": "pdf_to_markdown", "arguments": {"file_path": "/path/to/document.pdf", "output_dir": "/path/to/output", "backend": "pipeline"}}' # Parse specific pages python .claude/skills/pdf-process/script/pdf_parser.py \ '{"name": "pdf_to_markdown", "arguments": {"file_path": "/path/to/document.pdf", "output_dir": "/path/to/output", "start_page": 0, "end_page": 5}}'
Convert PDF documents to JSON format, including detailed layout and structural information. Description: Use MinerU to parse PDF documents and output in JSON format, containing structured information such as text blocks, images, tables, formulas, etc. Parameters: file_path (string, required): Absolute path to the PDF file output_dir (string, required): Absolute path to the output directory backend (string, optional): Parsing backend, options: hybrid-auto-engine (default), pipeline, vlm-auto-engine language (string, optional): OCR language code, such as en (English), ch (Chinese), ja (Japanese), etc., defaults to auto-detection enable_formula (boolean, optional): Whether to enable formula recognition, defaults to true enable_table (boolean, optional): Whether to enable table extraction, defaults to true start_page (integer, optional): Start page number (starting from 0), defaults to 0 end_page (integer, optional): End page number (starting from 0), defaults to -1 meaning parse all pages Return Value: { "success": true, "output_path": "/path/to/output.json", "pages": [ { "page_no": 0, "page_size": [595, 842], "blocks": [ { "type": "text", "text": "Text content", "bbox": [x, y, x, y] } ], "images": [], "tables": [], "formulas": [] } ], "metadata": { "total_pages": 10, "author": "Author", "title": "Title" } } Examples: python .claude/skills/pdf-process/script/pdf_parser.py \ '{"name": "pdf_to_json", "arguments": {"file_path": "/path/to/document.pdf", "output_dir": "/path/to/output"}}' # Use specific backend and language python .claude/skills/pdf-process/script/pdf_parser.py \ '{"name": "pdf_to_json", "arguments": {"file_path": "/path/to/document.pdf", "output_dir": "/path/to/output", "backend": "hybrid-auto-engine", "language": "ch"}}'
# Update pip and install uv pip install --upgrade pip pip install uv # Install MinerU (including all features) uv pip install -U "mineru[all]"
# Check if MinerU is installed successfully mineru --version # Test basic functionality mineru --help
Python Version: 3.10-3.13 Operating System: Linux / Windows / macOS 14.0+ Memory: Using pipeline backend: minimum 16GB, recommended 32GB+ Using hybrid/vlm backend: minimum 16GB, recommended 32GB+ Disk Space: minimum 20GB (SSD recommended) GPU (optional): pipeline backend: supports CPU-only hybrid/vlm backend: requires NVIDIA GPU (Volta architecture and above) or Apple Silicon
Academic Paper Parsing: Extract structured content such as formulas, tables, and images Technical Document Conversion: Convert PDF documents to Markdown for version control and online publishing OCR Processing: Process scanned PDFs and garbled PDFs Multilingual Documents: Supports OCR recognition for 109 languages Batch Processing: Batch convert multiple PDF documents
hybrid-auto-engine (default): Balanced accuracy and speed, suitable for most scenarios pipeline: Suitable for CPU-only environments, best compatibility vlm-auto-engine: Highest accuracy, requires GPU acceleration
File Paths: All paths must be absolute paths Output Directory: Non-existent directories will be created automatically Performance: Using GPU can significantly improve parsing speed Page Numbers: Page numbers start counting from 0 Memory: Processing large documents may consume more memory
Installation Failure: Ensure using Python 3.10-3.13 Windows only supports Python 3.10-3.12 (ray does not support 3.13) Using uv pip install can resolve most dependency conflicts Insufficient Memory: Use pipeline backend Limit parsing pages: start_page and end_page Reduce virtual memory allocation Slow Parsing Speed: Enable GPU acceleration Use hybrid-auto-engine backend Disable unnecessary features (formulas, tables) Low OCR Accuracy: Specify the correct document language Ensure the backend supports OCR (use pipeline or hybrid-*)
MinerU Official Documentation: https://opendatalab.github.io/MinerU/ MinerU GitHub: https://github.com/opendatalab/MinerU Online Demo: https://mineru.net/
Code helpers, APIs, CLIs, browser automation, testing, and developer operations.
Largest current source with strong distribution and engagement signals.