# Send MinerU PDF Parser to your agent
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
## Fast path
- Download the package from Yavira.
- Extract it into a folder your agent can access.
- Paste one of the prompts below and point your agent at the extracted folder.
## Suggested prompts
### New install

```text
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
```
### Upgrade existing

```text
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
```
## Machine-readable fields
```json
{
  "schemaVersion": "1.0",
  "item": {
    "slug": "mineru",
    "name": "MinerU PDF Parser",
    "source": "tencent",
    "type": "skill",
    "category": "开发工具",
    "sourceUrl": "https://clawhub.ai/EasonAI-5589/mineru",
    "canonicalUrl": "https://clawhub.ai/EasonAI-5589/mineru",
    "targetPlatform": "OpenClaw"
  },
  "install": {
    "downloadUrl": "/downloads/mineru",
    "sourceDownloadUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=mineru",
    "sourcePlatform": "tencent",
    "targetPlatform": "OpenClaw",
    "packageFormat": "ZIP package",
    "primaryDoc": "SKILL.md",
    "includedAssets": [
      "SKILL.md"
    ],
    "downloadMode": "redirect",
    "sourceHealth": {
      "source": "tencent",
      "slug": "mineru",
      "status": "healthy",
      "reason": "direct_download_ok",
      "recommendedAction": "download",
      "checkedAt": "2026-05-05T14:29:04.901Z",
      "expiresAt": "2026-05-12T14:29:04.901Z",
      "httpStatus": 200,
      "finalUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=mineru",
      "contentType": "application/zip",
      "probeMethod": "head",
      "details": {
        "probeUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=mineru",
        "contentDisposition": "attachment; filename=\"mineru-1.0.1.zip\"",
        "redirectLocation": null,
        "bodySnippet": null,
        "slug": "mineru"
      },
      "scope": "item",
      "summary": "Item download looks usable.",
      "detail": "Yavira can redirect you to the upstream package for this item.",
      "primaryActionLabel": "Download for OpenClaw",
      "primaryActionHref": "/downloads/mineru"
    },
    "validation": {
      "installChecklist": [
        "Use the Yavira download entry.",
        "Review SKILL.md after the package is downloaded.",
        "Confirm the extracted package contains the expected setup assets."
      ],
      "postInstallChecks": [
        "Confirm the extracted package includes the expected docs or setup files.",
        "Validate the skill or prompts are available in your target agent workspace.",
        "Capture any manual follow-up steps the agent could not complete."
      ]
    }
  },
  "links": {
    "detailUrl": "https://openagent3.xyz/skills/mineru",
    "downloadUrl": "https://openagent3.xyz/downloads/mineru",
    "agentUrl": "https://openagent3.xyz/skills/mineru/agent",
    "manifestUrl": "https://openagent3.xyz/skills/mineru/agent.json",
    "briefUrl": "https://openagent3.xyz/skills/mineru/agent.md"
  }
}
```
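
An agent reading this manifest may want to confirm that the health probe is still current before it downloads anything. The sketch below shows one way to do that; `parse_iso_utc` and `health_is_usable` are hypothetical helper names, and the check only reads the `status` and `expiresAt` fields shown in the JSON above.

```python
import json
from datetime import datetime, timezone


def parse_iso_utc(stamp: str) -> datetime:
    """Parse timestamps such as 2026-05-05T14:29:04.901Z into aware datetimes."""
    parsed = datetime.strptime(stamp, "%Y-%m-%dT%H:%M:%S.%fZ")
    return parsed.replace(tzinfo=timezone.utc)


def health_is_usable(manifest: dict, now: datetime) -> bool:
    """True when the source is marked healthy and the probe has not expired."""
    health = manifest["install"]["sourceHealth"]
    return health["status"] == "healthy" and now < parse_iso_utc(health["expiresAt"])


# A trimmed copy of the manifest, keeping only the fields this check reads.
manifest = json.loads("""
{"install": {"sourceHealth": {
    "status": "healthy",
    "checkedAt": "2026-05-05T14:29:04.901Z",
    "expiresAt": "2026-05-12T14:29:04.901Z"}}}
""")
```

Passing `now` in explicitly, for example `datetime.now(timezone.utc)`, keeps the check easy to test against a fixed date.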
## Documentation

### 📄 MinerU - Document Parsing Powerhouse

Built by OpenDataLab

PDF/Word/PPT/images → structured Markdown, with formulas and tables preserved.

### 🔗 Resource links

| Resource | Link |
| --- | --- |
| Website | https://mineru.net/ |
| API docs | https://mineru.net/apiManage/docs |
| GitHub | https://github.com/opendatalab/MinerU |

### Supported file types

| Type | Format |
| --- | --- |
| 📕 PDF | Papers, books, scanned documents |
| 📝 Word | .docx |
| 📊 PPT | .pptx |
| 🖼️ Images | .jpg, .png (OCR) |

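### Core strengths

- Formulas preserved: output in LaTeX format
- Table structure recognition: handles complex tables
- Multilingual OCR: handles mixed Chinese and English text
- Layout analysis: multi-column and mixed text-and-image layouts are handled automatically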

### Authentication

    # Header authentication
    Authorization: Bearer {YOUR_API_KEY}

### Single-file parsing

    # 1. Submit the task
    curl -X POST "https://mineru.net/api/v4/extract/task" \
      -H "Authorization: Bearer $MINERU_TOKEN" \
      -H "Content-Type: application/json" \
      -d '{
        "url": "https://arxiv.org/pdf/2410.17247",
        "enable_formula": true,
        "enable_table": true,
        "layout_model": "doclayout_yolo",
        "language": "en"
      }'

    # Returns: {"task_id": "xxx", "status": "pending"}

    # 2. Poll for the result (replace {task_id} with the task_id returned in step 1)
    curl "https://mineru.net/api/v4/extract/task/{task_id}" \
      -H "Authorization: Bearer $MINERU_TOKEN"

    # Returns: {"status": "done", "result": {...}}
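
The two curl calls above can also be prepared from Python. The following is a minimal sketch using only the standard library; the helper names (`build_submit_request`, `build_status_request`, `read_task_id`) are hypothetical, and the response shape is assumed to match the examples in this document, so check it against the official API docs before relying on it. The functions only build requests and do not touch the network.

```python
import json
import urllib.request

API_BASE = "https://mineru.net/api/v4"  # base URL taken from the curl examples


def build_submit_request(token: str, file_url: str, **options) -> urllib.request.Request:
    """Build (but do not send) the POST that submits one extraction task."""
    body = {"url": file_url, **options}
    return urllib.request.Request(
        f"{API_BASE}/extract/task",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def build_status_request(token: str, task_id: str) -> urllib.request.Request:
    """Build (but do not send) the GET that polls a task's status."""
    return urllib.request.Request(
        f"{API_BASE}/extract/task/{task_id}",
        headers={"Authorization": f"Bearer {token}"},
        method="GET",
    )


def read_task_id(response_body: bytes) -> str:
    """Extract task_id, assuming the response shape shown in this document."""
    return json.loads(response_body)["task_id"]
```

To actually send a request, pass it to `urllib.request.urlopen(request)` and hand the bytes from `.read()` to `read_task_id`.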

### Batch parsing

    # 1. Request upload URLs
    curl -X POST "https://mineru.net/api/v4/file-urls/batch" \
      -H "Authorization: Bearer $MINERU_TOKEN" \
      -H "Content-Type: application/json" \
      -d '{"file_names": ["paper1.pdf", "paper2.pdf"]}'

    # 2. Upload the files to the returned presigned URLs

    # 3. Submit the tasks as a batch
    curl -X POST "https://mineru.net/api/v4/extract/task/batch" \
      -H "Authorization: Bearer $MINERU_TOKEN" \
      -H "Content-Type: application/json" \
      -d '{"files": [{"url": "...", "name": "paper1.pdf"}, ...]}'
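
The three batch steps above can be sketched as request builders. This is an illustration under stated assumptions, not the service's documented client: the helper names are hypothetical, and using HTTP `PUT` for the presigned upload is an assumption based on how presigned URLs commonly work, since the document does not name the method.

```python
import urllib.request
from pathlib import PurePath


def upload_url_request_body(paths: list[str]) -> dict:
    """Body for step 1: ask for one upload URL per file name."""
    return {"file_names": [PurePath(path).name for path in paths]}


def build_upload_request(presigned_url: str, content: bytes) -> urllib.request.Request:
    """Step 2: build (but do not send) the upload to a presigned URL.

    PUT is an assumption; confirm the method in the official API docs.
    """
    return urllib.request.Request(presigned_url, data=content, method="PUT")


def batch_submit_body(uploaded: dict[str, str]) -> dict:
    """Body for step 3; `uploaded` maps each file name to its readable URL."""
    return {"files": [{"url": url, "name": name} for name, url in uploaded.items()]}
```

Keeping the bodies as plain dictionaries makes it easy to log or inspect them before serializing with `json.dumps`.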

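### ⚙️ Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| url | string | File URL (http/https supported) |
| enable_formula | bool | Enable formula recognition (default true) |
| enable_table | bool | Enable table recognition (default true) |
| layout_model | string | doclayout_yolo (fast) / layoutlmv3 (accurate) |
| language | string | en / ch / auto |
| model_version | string | pipeline / vlm / MinerU-HTML |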
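
Validating options locally can catch a typo before a task is submitted. The sketch below encodes the parameters listed above as a dataclass; `ExtractOptions` is a hypothetical name, and because this document gives no defaults for `layout_model`, `language`, or `model_version`, those fields are left out of the payload unless set.

```python
from dataclasses import dataclass, fields
from typing import Optional

LAYOUT_MODELS = {"doclayout_yolo", "layoutlmv3"}
LANGUAGES = {"en", "ch", "auto"}
MODEL_VERSIONS = {"pipeline", "vlm", "MinerU-HTML"}


@dataclass(frozen=True)
class ExtractOptions:
    url: str
    enable_formula: bool = True          # documented default
    enable_table: bool = True            # documented default
    layout_model: Optional[str] = None   # no documented default; omitted when None
    language: Optional[str] = None
    model_version: Optional[str] = None

    def __post_init__(self) -> None:
        if not self.url.startswith(("http://", "https://")):
            raise ValueError("url must start with http:// or https://")
        choices = (
            ("layout_model", LAYOUT_MODELS),
            ("language", LANGUAGES),
            ("model_version", MODEL_VERSIONS),
        )
        for name, allowed in choices:
            value = getattr(self, name)
            if value is not None and value not in allowed:
                raise ValueError(f"{name} must be one of {sorted(allowed)}, got {value!r}")

    def to_payload(self) -> dict:
        """Return the JSON-ready body, dropping fields that were never set."""
        payload = {f.name: getattr(self, f.name) for f in fields(self)}
        return {key: value for key, value in payload.items() if value is not None}
```

For example, `ExtractOptions(url="https://arxiv.org/pdf/2410.17247", language="en").to_payload()` yields a body with `url`, both `enable_*` flags, and `language`.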

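### Model version comparison

| Version | Speed | Accuracy | Best for |
| --- | --- | --- | --- |
| pipeline | ⚡ Fast | High | Regular documents |
| vlm | 🐢 Slow | Highest | Complex layouts |
| MinerU-HTML | ⚡ Fast | High | Web-style output |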

### 📂 Output structure

The ZIP you download after parsing completes contains:

    output/
    ├── full.md           # Full Markdown
    ├── content_list.json # Structured content
    ├── images/           # Extracted images
    └── layout.json       # Layout analysis result
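
After downloading the result, a quick check that the archive holds the files listed above can save a confusing failure later. This is a minimal sketch; `missing_outputs` is a hypothetical helper, and it matches on base names because this document does not confirm whether the files sit under an `output/` prefix inside the ZIP. The `images/` folder is not checked, since a document without figures may not produce one.

```python
import io
import zipfile

EXPECTED_FILES = ("full.md", "content_list.json", "layout.json")


def missing_outputs(zip_bytes: bytes) -> list[str]:
    """Return the expected result files that are absent from the archive."""
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        names = archive.namelist()
    # Compare base names so the check works with or without a folder prefix.
    base_names = {name.rsplit("/", 1)[-1] for name in names}
    return [expected for expected in EXPECTED_FILES if expected not in base_names]
```

An empty list means every expected file was found.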

### Paper parsing workflow

    # 1. Create a directory for the paper
    mkdir -p "./paper-reading/[CVPR 2025] NewPaper"
    cd "./paper-reading/[CVPR 2025] NewPaper"

    # 2. Submit the parsing task
    TASK_ID=$(curl -s -X POST "https://mineru.net/api/v4/extract/task" \
      -H "Authorization: Bearer $MINERU_TOKEN" \
      -H "Content-Type: application/json" \
      -d '{"url": "https://arxiv.org/pdf/XXXX.XXXXX"}' | jq -r '.task_id')

    # 3. Wait for completion and download
    # (poll status until it is done, then download result.zip)

    # 4. Extract
    unzip result.zip -d .
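
Step 3 is left as a comment in the workflow above. One possible polling loop is sketched below; `wait_until_done` is a hypothetical helper that takes any callable returning the parsed status JSON, so it can be exercised without network access. The `"failed"` status is an assumption, as this document only shows `pending` and `done`.

```python
import time
from typing import Callable


def wait_until_done(
    fetch_status: Callable[[], dict],
    interval: float = 5.0,
    max_attempts: int = 60,
    failure_statuses: tuple = ("failed",),  # assumed value, not from the docs
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Call fetch_status until it reports "done", then return that payload."""
    for _ in range(max_attempts):
        payload = fetch_status()
        status = payload.get("status")
        if status == "done":
            return payload
        if status in failure_statuses:
            raise RuntimeError(f"task ended with status {status!r}")
        sleep(interval)  # wait before asking again
    raise TimeoutError(f"task not done after {max_attempts} attempts")
```

In real use, `fetch_status` would issue the GET request from the polling step and decode the JSON body; the `sleep` parameter exists so tests can skip the waiting.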

### Environment variables

Set this in `~/.bashrc` or in your OpenClaw config:

    export MINERU_TOKEN="your_api_key_here"
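
A script that calls the API can read the token from the environment and fail early if it is missing. The sketch below assumes the `MINERU_TOKEN` variable and the `Bearer` header format described in this document; `load_token` and `auth_headers` are hypothetical helper names.

```python
import os
from typing import Mapping, Optional

PLACEHOLDER = "your_api_key_here"  # the sample value from the export line


def load_token(env: Optional[Mapping[str, str]] = None) -> str:
    """Return MINERU_TOKEN, raising if it is unset or still the placeholder."""
    source = os.environ if env is None else env
    token = source.get("MINERU_TOKEN", "").strip()
    if not token or token == PLACEHOLDER:
        raise RuntimeError("MINERU_TOKEN is not set to a real API key")
    return token


def auth_headers(token: str) -> dict:
    """Build the Authorization header used by every request."""
    return {"Authorization": f"Bearer {token}"}
```

Accepting an explicit mapping keeps the lookup testable without changing the real environment.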

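### ⚠️ Limits

| Limit | Value |
| --- | --- |
| Max file size | 200 MB |
| Max pages per file | 600 pages |
| Concurrent tasks | Depends on your plan |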
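
Checking a file against these limits before submitting avoids a wasted request. This is a rough sketch: `limit_violations` is a hypothetical helper, the page count must come from whatever PDF tool you already use, and "200 MB" is read as decimal megabytes, which is an assumption chosen because it is the stricter interpretation.

```python
from typing import Optional

MAX_FILE_BYTES = 200 * 1000 * 1000  # assumes decimal megabytes (stricter reading)
MAX_PAGES = 600


def limit_violations(size_bytes: int, page_count: Optional[int] = None) -> list[str]:
    """Return human-readable problems; an empty list means the file fits."""
    problems = []
    if size_bytes > MAX_FILE_BYTES:
        problems.append(f"file is {size_bytes} bytes, limit is {MAX_FILE_BYTES}")
    if page_count is not None and page_count > MAX_PAGES:
        problems.append(f"file has {page_count} pages, limit is {MAX_PAGES}")
    return problems
```

For a local file, `os.path.getsize(path)` supplies `size_bytes`.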

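### 💡 Usage tips

- For arXiv papers, pass the URL directly, for example https://arxiv.org/pdf/2410.17247
- For Chinese papers, set `language: ch`
- For complex tables, use the `vlm` model
- Batch processing saves quota: submitting several files at once is more efficient than submitting them one by one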
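
The tips above translate into a small option-picking helper. This is only an illustration: `arxiv_pdf_url` and `suggest_options` are hypothetical names, and the arXiv URL pattern is inferred from the single example in this document.

```python
def arxiv_pdf_url(arxiv_id: str) -> str:
    """Build a PDF URL following the arXiv example used in this document."""
    return f"https://arxiv.org/pdf/{arxiv_id}"


def suggest_options(is_chinese: bool = False, has_complex_tables: bool = False) -> dict:
    """Map the tips to request options; unset options are left to the API."""
    options = {}
    if is_chinese:
        options["language"] = "ch"
    if has_complex_tables:
        options["model_version"] = "vlm"
    return options
```

The returned dictionary can be merged into the request body alongside `url`.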

### 📚 Related resources

Paper Banana Skill - figure generation for papers

No more manual copy-paste when parsing papers! 📖
## Trust
- Source: tencent
- Verification: Indexed source record
- Publisher: EasonAI-5589
- Version: 1.0.1
## Source health
- Status: healthy
- Item download looks usable.
- Yavira can redirect you to the upstream package for this item.
- Health scope: item
- Reason: direct_download_ok
- Checked at: 2026-05-05T14:29:04.901Z
- Expires at: 2026-05-12T14:29:04.901Z
- Recommended action: Download for OpenClaw
## Links
- [Detail page](https://openagent3.xyz/skills/mineru)
- [Send to Agent page](https://openagent3.xyz/skills/mineru/agent)
- [JSON manifest](https://openagent3.xyz/skills/mineru/agent.json)
- [Markdown brief](https://openagent3.xyz/skills/mineru/agent.md)
- [Download page](https://openagent3.xyz/downloads/mineru)