Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Parse documents using PaddleOCR's API. Supports both sync and async modes for images and PDFs.
Parse documents using PaddleOCR's API. Supports both sync and async modes for images and PDFs.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Parse images and PDF files using PaddleOCR's API. Supports both synchronous and asynchronous parsing modes with structured output.
ResourceLinkOfficial Websitehttps://www.paddleocr.comAPI Documentationhttps://ai.baidu.com/ai-doc/AISTUDIO/Cmkz2m0maGitHubhttps://github.com/PaddlePaddle/PaddleOCR
Multi-format support: PDF and image files (JPG, PNG, BMP, TIFF) Two parsing modes: Sync mode: Fast response for small files (<600s timeout) Async mode: For large files with progress polling Layout analysis: Automatic detection of text blocks, tables, formulas Multi-language: Support for 110+ languages Structured output: Markdown format with preserved document structure
Visit PaddleOCR to obtain your API credentials Set environment variables: export PADDLEOCR_ACCESS_TOKEN="your_token_here" export PADDLEOCR_API_URL="https://your-endpoint.aistudio-app.com/layout-parsing" # Optional: For async mode export PADDLEOCR_JOB_URL="https://your-job-endpoint.aistudio-app.com/api/v2/ocr/jobs" export PADDLEOCR_MODEL="PaddleOCR-VL-1.5"
For small files and quick processing: # Parse local image {baseDir}/paddleocr_parse.sh document.jpg # Parse PDF {baseDir}/paddleocr_parse.sh -t pdf document.pdf # Parse from URL {baseDir}/paddleocr_parse.sh https://example.com/document.jpg # Save output to file {baseDir}/paddleocr_parse.sh -o result.json document.jpg # Verbose output {baseDir}/paddleocr_parse.sh -v document.jpg
For large files with progress tracking: # Parse large PDF with async mode {baseDir}/paddleocr_parse.sh --async large-document.pdf # Parse from URL with async mode {baseDir}/paddleocr_parse.sh --async -t pdf https://example.com/doc.pdf # Save async result to file {baseDir}/paddleocr_parse.sh --async -o result.json document.pdf
# Sync mode python3 {baseDir}/paddleocr_parse.py document.jpg # Async mode python3 {baseDir}/paddleocr_parse.py --async-mode document.pdf # With output file python3 {baseDir}/paddleocr_parse.py -o result.json --async-mode document.pdf
{ "logId": "unique_request_id", "errorCode": 0, "errorMsg": "Success", "result": { "layoutParsingResults": [ { "prunedResult": [...], "markdown": { "text": "# Document Title\n\nParagraph content...", "images": {} }, "outputImages": [...], "inputImage": "http://input-image" } ], "dataInfo": {...} } } Important Fields: prunedResult - Contains detailed layout element information including positions, categories, etc. markdown - Stores the document content converted to Markdown format with preserved structure and formatting.
Use CaseRecommended ModeSmall images (< 10MB)SyncSingle page PDFsSyncLarge PDFs (> 10MB)AsyncMulti-page documentsAsyncBatch processingAsyncQuick text extractionSync
The script will exit with code 1 and print error message for: Missing required environment variables File not found API authentication failures Invalid JSON responses API error codes (non-zero)
See official documentation: https://ai.baidu.com/ai-doc/AISTUDIO/Xmjclapam
Code helpers, APIs, CLIs, browser automation, testing, and developer operations.
Largest current source with strong distribution and engagement signals.