Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Fetch any webpage and convert its main content into clean Markdown format, preserving image links and resolving relative URLs.
Fetch any webpage and convert its main content into clean Markdown format, preserving image links and resolving relative URLs.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
抓取任意网页,转换为干净的 Markdown 格式,保留图片链接。
webfetch-md url="https://example.com"
# 基本使用(输出 JSON 格式) npx webfetch-md https://example.com # 或使用 --url 参数 npx webfetch-md --url https://example.com # 提取 Markdown 内容(配合 jq) npx webfetch-md https://example.com | jq -r '.markdown' # 保存到文件 npx webfetch-md https://example.com | jq -r '.markdown' > article.md
CLI 和工具都输出统一的 JSON 格式: { "success": true, "title": "文章标题", "markdown": "# 文章标题\n\n正文内容...", "images": ["https://example.com/img1.png"], "imageCount": 1, "contentLength": 1523 }
const { fetchAsMarkdown } = require('./index'); const result = await fetchAsMarkdown('https://example.com'); console.log(result.markdown);
✅ 抓取任意网页 HTML ✅ 智能提取正文内容(过滤导航、广告等) ✅ 保留图片链接(转换为  格式) ✅ 自动转换相对路径为绝对路径 ✅ 输出干净的 Markdown
turndown: HTML to Markdown 转换 cheerio: HTML 解析和提取
网页抓取:使用 fetch API 获取 HTML,模拟浏览器 User-Agent HTML解析:使用 cheerio 加载和解析 HTML 内容 内容提取:智能识别正文区域,过滤无关元素 URL处理:将相对路径转换为绝对路径 Markdown转换:使用 turndown 转换为标准 Markdown 格式
按优先级选择正文容器: article 标签 main 标签 [role="main"] 属性 .post-content / .entry-content 类 .content / .post 类 #content / #main ID 回退到 body 标签
脚本和样式标签 导航、页眉、页脚 侧边栏和广告区域 评论区
工具返回统一的 JSON 格式,包含 success 字段标识操作状态: { "success": false, "error": "错误信息" }
webfetch-md/ ├── index.js # 核心功能模块 ├── cli.js # CLI 和 OpenClaw 工具入口 ├── package.json # 依赖配置 ├── test.js # 测试脚本 └── SKILL.md # 技能文档
# 运行测试 npm test # 或直接测试 node test.js https://example.com
v1.1.0 (当前): 统一 CLI 和 OpenClaw 工具入口,优化错误处理 v1.0.1: 基础功能实现,支持网页抓取和 Markdown 转换 v1.0.0: 初始版本发布
Code helpers, APIs, CLIs, browser automation, testing, and developer operations.
Largest current source with strong distribution and engagement signals.