Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction.
Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction.
This item's current download entry is known to bounce back to a listing or homepage instead of returning a package file.
Use the source page and any available docs to guide the install because the item currently does not return a direct package file.
I tried to install a skill package from Yavira, but the item currently does not return a direct package file. Inspect the source page and any extracted docs, then tell me what you can confirm and any manual steps still required. Then review README.md for any prerequisites, environment setup, or post-install checks.
I tried to upgrade a skill package from Yavira, but the item currently does not return a direct package file. Compare the source page and any extracted docs with my current installation, then summarize what changed and what manual follow-up I still need. Then review README.md for any prerequisites, environment setup, or post-install checks.
Parse a PDF locally with MinerU (CPU). Default output is Markdown + JSON. Use tables/images only when requested.
# Run from the skill directory ./scripts/mineru_parse.sh /path/to/file.pdf Optional examples: ./scripts/mineru_parse.sh /path/to/file.pdf --format json ./scripts/mineru_parse.sh /path/to/file.pdf --tables --images
If flags differ from your wrapper or you need advanced defaults (backend/method/device/threads/format mapping), read: references/mineru-cli.md
Output root defaults to ./mineru-output/. MinerU creates the per-document subfolder under the output root (e.g., ./mineru-output/<basename>/...).
Default is single-PDF parsing. Only implement batch folder parsing if explicitly requested.
Code helpers, APIs, CLIs, browser automation, testing, and developer operations.
Largest current source with strong distribution and engagement signals.