# Send pdf-parser-mineru to your agent
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
## Fast path
- Download the package from Yavira.
- Extract it into a folder your agent can access.
- Paste one of the prompts below and point your agent at the extracted folder.
## Suggested prompts
### New install

```text
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
```
### Upgrade existing

```text
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
```
## Machine-readable fields
```json
{
  "schemaVersion": "1.0",
  "item": {
    "slug": "pdf-parser-mineru",
    "name": "pdf-parser-mineru",
    "source": "tencent",
    "type": "skill",
    "category": "Developer Tools",
    "sourceUrl": "https://clawhub.ai/baokui/pdf-parser-mineru",
    "canonicalUrl": "https://clawhub.ai/baokui/pdf-parser-mineru",
    "targetPlatform": "OpenClaw"
  },
  "install": {
    "downloadUrl": "/downloads/pdf-parser-mineru",
    "sourceDownloadUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=pdf-parser-mineru",
    "sourcePlatform": "tencent",
    "targetPlatform": "OpenClaw",
    "packageFormat": "ZIP package",
    "primaryDoc": "SKILL.md",
    "includedAssets": [
      "install.sh",
      "SKILL.md",
      "SKILL_zh.md",
      "script/pdf_parser.py"
    ],
    "downloadMode": "redirect",
    "sourceHealth": {
      "source": "tencent",
      "status": "healthy",
      "reason": "direct_download_ok",
      "recommendedAction": "download",
      "checkedAt": "2026-05-07T17:22:31.273Z",
      "expiresAt": "2026-05-14T17:22:31.273Z",
      "httpStatus": 200,
      "finalUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=afrexai-annual-report",
      "contentType": "application/zip",
      "probeMethod": "head",
      "details": {
        "probeUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=afrexai-annual-report",
        "contentDisposition": "attachment; filename=\"afrexai-annual-report-1.0.0.zip\"",
        "redirectLocation": null,
        "bodySnippet": null
      },
      "scope": "source",
      "summary": "Source download looks usable.",
      "detail": "Yavira can redirect you to the upstream package for this source.",
      "primaryActionLabel": "Download for OpenClaw",
      "primaryActionHref": "/downloads/pdf-parser-mineru"
    },
    "validation": {
      "installChecklist": [
        "Use the Yavira download entry.",
        "Review SKILL.md after the package is downloaded.",
        "Confirm the extracted package contains the expected setup assets."
      ],
      "postInstallChecks": [
        "Confirm the extracted package includes the expected docs or setup files.",
        "Validate the skill or prompts are available in your target agent workspace.",
        "Capture any manual follow-up steps the agent could not complete."
      ]
    }
  },
  "links": {
    "detailUrl": "https://openagent3.xyz/skills/pdf-parser-mineru",
    "downloadUrl": "https://openagent3.xyz/downloads/pdf-parser-mineru",
    "agentUrl": "https://openagent3.xyz/skills/pdf-parser-mineru/agent",
    "manifestUrl": "https://openagent3.xyz/skills/pdf-parser-mineru/agent.json",
    "briefUrl": "https://openagent3.xyz/skills/pdf-parser-mineru/agent.md"
  }
}
```
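The `includedAssets` list above can be used to confirm that an extracted package is complete before handing it to an agent. The following is a minimal sketch, assuming the manifest has already been loaded into a dict (for example with `json.load` on a saved copy); `missing_assets` is a hypothetical helper and not part of the package.

```python
import tempfile
from pathlib import Path


def missing_assets(manifest, package_root):
    """Return the includedAssets entries that do not exist under package_root."""
    root = Path(package_root)
    assets = manifest["install"]["includedAssets"]
    return [rel for rel in assets if not (root / rel).is_file()]


# Only the part of the manifest that this check needs.
manifest = {
    "install": {
        "includedAssets": ["install.sh", "SKILL.md", "SKILL_zh.md", "script/pdf_parser.py"]
    }
}

# Demo against a throwaway folder that contains only SKILL.md.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "SKILL.md").write_text("# demo\n")
    demo_missing = missing_assets(manifest, tmp)

print(demo_missing)
```

In real use, `package_root` would be the folder the ZIP was extracted into, and an empty result means every listed asset was found.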
## Documentation

### 1. pdf_to_markdown

Convert PDF documents to Markdown format, preserving document structure, formulas, tables, and images.

Description: Use MinerU to parse PDF documents and output in Markdown format, supporting OCR, formula recognition, table extraction, and other features.

Parameters:

- `file_path` (string, required): Absolute path to the PDF file
- `output_dir` (string, required): Absolute path to the output directory
- `backend` (string, optional): Parsing backend; one of `hybrid-auto-engine` (default), `pipeline`, `vlm-auto-engine`
- `language` (string, optional): OCR language code, such as `en` (English), `ch` (Chinese), or `ja` (Japanese); defaults to auto-detection
- `enable_formula` (boolean, optional): Whether to enable formula recognition; defaults to `true`
- `enable_table` (boolean, optional): Whether to enable table extraction; defaults to `true`
- `start_page` (integer, optional): Start page number (zero-based); defaults to `0`
- `end_page` (integer, optional): End page number (zero-based); defaults to `-1`, meaning parse all pages

Return Value:

```json
{
  "success": true,
  "output_path": "/path/to/output",
  "markdown_content": "Converted Markdown content...",
  "images": ["List of image paths"],
  "tables": ["List of table information"],
  "formula_count": 10
}
```

Examples:

```bash
python .claude/skills/pdf-process/script/pdf_parser.py \
  '{"name": "pdf_to_markdown", "arguments": {"file_path": "/path/to/document.pdf", "output_dir": "/path/to/output"}}'

# Use specific backend
python .claude/skills/pdf-process/script/pdf_parser.py \
  '{"name": "pdf_to_markdown", "arguments": {"file_path": "/path/to/document.pdf", "output_dir": "/path/to/output", "backend": "pipeline"}}'

# Parse specific pages
python .claude/skills/pdf-process/script/pdf_parser.py \
  '{"name": "pdf_to_markdown", "arguments": {"file_path": "/path/to/document.pdf", "output_dir": "/path/to/output", "start_page": 0, "end_page": 5}}'
```
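Hand-writing the JSON argument is error-prone once optional parameters are involved. The sketch below builds the same command from Python; it assumes the script path used in the examples above, and `build_command` is a hypothetical helper. How the script reports its result (stdout, files, exit code) is not specified here, so the sketch stops at constructing the command.

```python
import json
import shlex

# Assumption: same script location as in the examples above; adjust to your install.
SCRIPT = ".claude/skills/pdf-process/script/pdf_parser.py"


def build_command(tool_name, file_path, output_dir, **options):
    """Return the argv list for one tool call, mirroring the documented examples."""
    arguments = {"file_path": file_path, "output_dir": output_dir, **options}
    payload = json.dumps({"name": tool_name, "arguments": arguments})
    return ["python", SCRIPT, payload]


cmd = build_command(
    "pdf_to_markdown",
    "/path/to/document.pdf",
    "/path/to/output",
    start_page=0,
    end_page=5,
)

# Print a shell-quoted version for inspection.
print(shlex.join(cmd))
# To execute it: subprocess.run(cmd, check=True)
```

Passing the list to `subprocess.run` avoids shell quoting problems with the embedded JSON.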

### 2. pdf_to_json

Convert PDF documents to JSON format, including detailed layout and structural information.

Description: Use MinerU to parse PDF documents and output in JSON format, containing structured information such as text blocks, images, tables, formulas, etc.

Parameters:

- `file_path` (string, required): Absolute path to the PDF file
- `output_dir` (string, required): Absolute path to the output directory
- `backend` (string, optional): Parsing backend; one of `hybrid-auto-engine` (default), `pipeline`, `vlm-auto-engine`
- `language` (string, optional): OCR language code, such as `en` (English), `ch` (Chinese), or `ja` (Japanese); defaults to auto-detection
- `enable_formula` (boolean, optional): Whether to enable formula recognition; defaults to `true`
- `enable_table` (boolean, optional): Whether to enable table extraction; defaults to `true`
- `start_page` (integer, optional): Start page number (zero-based); defaults to `0`
- `end_page` (integer, optional): End page number (zero-based); defaults to `-1`, meaning parse all pages

Return Value:

```json
{
  "success": true,
  "output_path": "/path/to/output.json",
  "pages": [
    {
      "page_no": 0,
      "page_size": [595, 842],
      "blocks": [
        {
          "type": "text",
          "text": "Text content",
          "bbox": [x, y, x, y]
        }
      ],
      "images": [],
      "tables": [],
      "formulas": []
    }
  ],
  "metadata": {
    "total_pages": 10,
    "author": "Author",
    "title": "Title"
  }
}
```

Examples:

```bash
python .claude/skills/pdf-process/script/pdf_parser.py \
  '{"name": "pdf_to_json", "arguments": {"file_path": "/path/to/document.pdf", "output_dir": "/path/to/output"}}'

# Use specific backend and language
python .claude/skills/pdf-process/script/pdf_parser.py \
  '{"name": "pdf_to_json", "arguments": {"file_path": "/path/to/document.pdf", "output_dir": "/path/to/output", "backend": "hybrid-auto-engine", "language": "ch"}}'
```
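Once a `pdf_to_json` result is available as a Python dict, the page and block structure shown in the return value can be walked directly. This is a minimal sketch, assuming the result follows that documented shape; `collect_text` is a hypothetical helper, and the sample data is made up for illustration.

```python
def collect_text(result):
    """Join the text of all 'text' blocks, in page order, from a pdf_to_json-style result."""
    if not result.get("success"):
        raise ValueError("parse did not succeed")
    lines = []
    for page in result.get("pages", []):
        for block in page.get("blocks", []):
            if block.get("type") == "text":
                lines.append(block.get("text", ""))
    return "\n".join(lines)


# Illustrative sample shaped like the documented return value.
sample = {
    "success": True,
    "pages": [
        {"page_no": 0, "blocks": [{"type": "text", "text": "Text content", "bbox": [0, 0, 10, 10]}]},
        {"page_no": 1, "blocks": [{"type": "text", "text": "More text", "bbox": [0, 0, 10, 10]}]},
    ],
}

print(collect_text(sample))
```

Using `.get` with defaults keeps the walk tolerant of pages that have no blocks.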

### 1. Install MinerU

```bash
# Update pip and install uv
pip install --upgrade pip
pip install uv

# Install MinerU (including all features)
uv pip install -U "mineru[all]"
```

### 2. Verify Installation

```bash
# Check if MinerU is installed successfully
mineru --version

# Test basic functionality
mineru --help
```
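The same verification can be scripted, which is useful when an agent needs to report whether the install succeeded. The sketch below only relies on the `mineru --version` command shown above; `cli_version` is a hypothetical helper.

```python
import shutil
import subprocess


def cli_version(executable="mineru"):
    """Return the output of `<executable> --version`, or None if it is not on PATH."""
    path = shutil.which(executable)
    if path is None:
        return None
    done = subprocess.run([path, "--version"], capture_output=True, text=True, check=False)
    # Some CLIs print the version to stderr, so fall back to it.
    return (done.stdout or done.stderr).strip()


version = cli_version()
print(version if version is not None else "mineru not found on PATH")
```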

### 3. System Requirements

- Python Version: 3.10-3.13
- Operating System: Linux / Windows / macOS 14.0+
- Memory:
  - pipeline backend: minimum 16GB, recommended 32GB+
  - hybrid/vlm backend: minimum 16GB, recommended 32GB+
- Disk Space: minimum 20GB (SSD recommended)
- GPU (optional):
  - pipeline backend: supports CPU-only
  - hybrid/vlm backend: requires NVIDIA GPU (Volta architecture and above) or Apple Silicon
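Two of these requirements, the Python version and free disk space, can be checked with the standard library before installing. This is a rough sketch rather than an official preflight: the helper names are hypothetical, and the Windows cap at Python 3.12 is taken from the Common Issues section of this document.

```python
import shutil
import sys

MIN_FREE_GB = 20  # "minimum 20GB" from the requirements above


def python_supported(version, platform=sys.platform):
    """True if (major, minor) is within 3.10-3.13; Windows is capped at 3.12."""
    major_minor = tuple(version[:2])
    upper = (3, 12) if platform.startswith("win") else (3, 13)
    return (3, 10) <= major_minor <= upper


def free_disk_gb(path="."):
    """Free space, in GiB, on the filesystem that holds `path`."""
    return shutil.disk_usage(path).free / 1024**3


print("Python OK:", python_supported(sys.version_info))
print("Disk OK:", free_disk_gb() >= MIN_FREE_GB)
```

Memory and GPU checks are left out because they need platform-specific or third-party tooling.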

### Use Cases

- Academic Paper Parsing: Extract structured content such as formulas, tables, and images
- Technical Document Conversion: Convert PDF documents to Markdown for version control and online publishing
- OCR Processing: Process scanned PDFs and garbled PDFs
- Multilingual Documents: Supports OCR recognition for 109 languages
- Batch Processing: Batch convert multiple PDF documents
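For the batch use case, one approach is to generate one tool-call payload per PDF and give each document its own output folder. The sketch below only builds the payloads; `batch_payloads` and the per-document folder layout are assumptions for illustration, not behavior provided by the package.

```python
import json
import tempfile
from pathlib import Path


def batch_payloads(input_dir, output_root, tool_name="pdf_to_markdown"):
    """Build one JSON payload per *.pdf in input_dir, each with its own output folder."""
    payloads = []
    for pdf in sorted(Path(input_dir).resolve().glob("*.pdf")):
        arguments = {
            "file_path": str(pdf),  # resolved, so always absolute
            "output_dir": str(Path(output_root).resolve() / pdf.stem),
        }
        payloads.append(json.dumps({"name": tool_name, "arguments": arguments}))
    return payloads


# Demo with empty placeholder files in a temporary folder.
with tempfile.TemporaryDirectory() as tmp:
    for name in ("a.pdf", "b.pdf", "notes.txt"):
        (Path(tmp) / name).touch()
    demo = batch_payloads(tmp, Path(tmp) / "out")

print(len(demo))
```

Each payload string can then be passed as the single argument to `pdf_parser.py`, one run per document.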

### Backend Selection Recommendations

- hybrid-auto-engine (default): Balanced accuracy and speed, suitable for most scenarios
- pipeline: Suitable for CPU-only environments, best compatibility
- vlm-auto-engine: Highest accuracy, requires GPU acceleration
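These recommendations reduce to a small decision rule. The sketch below encodes one reading of them; `choose_backend` and its two flags are hypothetical, and how you detect a usable GPU is left to the caller.

```python
def choose_backend(has_gpu, prefer_accuracy=False):
    """Pick a backend name following the recommendations above."""
    if not has_gpu:
        # CPU-only environments: best compatibility.
        return "pipeline"
    if prefer_accuracy:
        # Highest accuracy, needs GPU acceleration.
        return "vlm-auto-engine"
    # Balanced default for most scenarios.
    return "hybrid-auto-engine"


print(choose_backend(has_gpu=False))
print(choose_backend(has_gpu=True))
print(choose_backend(has_gpu=True, prefer_accuracy=True))
```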

### Notes

- File Paths: All paths must be absolute paths
- Output Directory: Non-existent directories will be created automatically
- Performance: Using GPU can significantly improve parsing speed
- Page Numbers: Page numbers start counting from 0
- Memory: Processing large documents may consume more memory
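The path and page-number notes above can be checked before a call is made. This is a minimal sketch of such a pre-check; `validate_arguments` is a hypothetical helper, and the rule that `end_page` must not be smaller than `start_page` is an assumption rather than something the package documents.

```python
import os


def validate_arguments(arguments):
    """Return a list of problems found in a tool-call arguments dict (empty if none)."""
    problems = []
    for key in ("file_path", "output_dir"):
        value = arguments.get(key)
        if not value:
            problems.append(f"{key} is required")
        elif not os.path.isabs(value):
            problems.append(f"{key} must be an absolute path: {value}")
    start = arguments.get("start_page", 0)
    end = arguments.get("end_page", -1)
    if start < 0:
        problems.append("start_page is zero-based and cannot be negative")
    # Assumption: apart from the -1 sentinel, end_page should not precede start_page.
    if end != -1 and end < start:
        problems.append("end_page must be -1 (all pages) or >= start_page")
    return problems


print(validate_arguments({"file_path": "document.pdf", "output_dir": "/path/to/output"}))
```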

### Common Issues

- Installation Failure:
  - Ensure you are using Python 3.10-3.13
  - On Windows, only Python 3.10-3.12 is supported (ray does not support 3.13)
  - Installing with `uv pip install` can resolve most dependency conflicts
- Insufficient Memory:
  - Use the pipeline backend
  - Limit the pages parsed with `start_page` and `end_page`
  - Reduce virtual memory allocation
- Slow Parsing Speed:
  - Enable GPU acceleration
  - Use the hybrid-auto-engine backend
  - Disable unnecessary features (formulas, tables)
- Low OCR Accuracy:
  - Specify the correct document language
  - Ensure the backend supports OCR (use pipeline or hybrid-*)
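For the insufficient-memory advice about limiting pages, a large document can be split into smaller `start_page`/`end_page` windows and parsed one window at a time. The sketch below generates such windows; `page_windows` is a hypothetical helper, and it assumes `end_page` is inclusive, which this document does not state, so check that against `pdf_parser.py` before relying on it.

```python
def page_windows(total_pages, window):
    """Yield (start_page, end_page) pairs covering pages 0..total_pages-1.

    Assumption: end_page is inclusive and zero-based.
    """
    if window <= 0:
        raise ValueError("window must be positive")
    for start in range(0, total_pages, window):
        yield start, min(start + window, total_pages) - 1


print(list(page_windows(10, 4)))
```

For a 10-page document and a window of 4, this produces the ranges 0-3, 4-7, and 8-9.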

### Related Resources

- [MinerU Official Documentation](https://opendatalab.github.io/MinerU/)
- [MinerU GitHub](https://github.com/opendatalab/MinerU)
- [Online Demo](https://mineru.net/)
## Trust
- Source: tencent
- Verification: Indexed source record
- Publisher: baokui
- Version: 1.0.2
## Source health
- Status: healthy
- Source download looks usable.
- Yavira can redirect you to the upstream package for this source.
- Health scope: source
- Reason: direct_download_ok
- Checked at: 2026-05-07T17:22:31.273Z
- Expires at: 2026-05-14T17:22:31.273Z
- Recommended action: Download for OpenClaw
## Links
- [Detail page](https://openagent3.xyz/skills/pdf-parser-mineru)
- [Send to Agent page](https://openagent3.xyz/skills/pdf-parser-mineru/agent)
- [JSON manifest](https://openagent3.xyz/skills/pdf-parser-mineru/agent.json)
- [Markdown brief](https://openagent3.xyz/skills/pdf-parser-mineru/agent.md)
- [Download page](https://openagent3.xyz/downloads/pdf-parser-mineru)