Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
OpenClaw agent skill for converting documents to Markdown. Documentation and utilities for Microsoft's MarkItDown library. Supports PDF, Word, PowerPoint, Excel, images (OCR), audio (transcription), HTML, YouTube.
OpenClaw agent skill for converting documents to Markdown. Documentation and utilities for Microsoft's MarkItDown library. Supports PDF, Word, PowerPoint, Excel, images (OCR), audio (transcription), HTML, YouTube.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Then review README.md for any prerequisites, environment setup, or post-install checks. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Then review README.md for any prerequisites, environment setup, or post-install checks. Summarize what changed and any follow-up checks I should run.
Documentation and utilities for converting documents to Markdown using Microsoft's MarkItDown library. Note: This skill provides documentation and a batch script. The actual conversion is done by the markitdown CLI/library installed via pip.
Use markitdown for: π Fetching documentation (README, API docs) π Converting web pages to markdown π Document analysis (PDFs, Word, PowerPoint) π¬ YouTube transcripts πΌοΈ Image text extraction (OCR) π€ Audio transcription
# Convert file to markdown markitdown document.pdf -o output.md # Convert URL markitdown https://example.com/docs -o docs.md
FormatFeaturesPDFText extraction, structureWord (.docx)Headings, lists, tablesPowerPointSlides, textExcelTables, sheetsImagesOCR + EXIF metadataAudioSpeech transcriptionHTMLStructure preservationYouTubeVideo transcription
The skill requires Microsoft's markitdown CLI: pip install 'markitdown[all]' Or install specific formats only: pip install 'markitdown[pdf,docx,pptx]'
markitdown https://github.com/user/repo/blob/main/README.md -o readme.md
markitdown document.pdf -o document.md
# Using included script python ~/.openclaw/skills/markitdown/scripts/batch_convert.py docs/*.pdf -o markdown/ -v # Or shell loop for file in docs/*.pdf; do markitdown "$file" -o "${file%.pdf}.md" done
from markitdown import MarkItDown md = MarkItDown() result = md.convert("document.pdf") print(result.text_content)
pip install 'markitdown[all]'
# Ubuntu/Debian sudo apt-get install tesseract-ocr # macOS brew install tesseract
ComponentSourcemarkitdown CLIMicrosoft's pip packagemarkitdown Python APIMicrosoft's pip packagescripts/batch_convert.pyThis skill (utility)DocumentationThis skill
USAGE-GUIDE.md - Detailed examples reference.md - Full API reference Microsoft MarkItDown - Upstream library
Agent frameworks, memory systems, reasoning layers, and model-native orchestration.
Largest current source with strong distribution and engagement signals.