← All skills
Tencent SkillHub · Developer Tools

Amazon Scraper

Scrape Amazon product listings with Playwright stealth browser automation. Use when the user asks to scrape, crawl, or extract product data from Amazon (sear...

skill openclawclawhub Free
0 Downloads
0 Stars
0 Installs
0 Score
High Signal

Scrape Amazon product listings with Playwright stealth browser automation. Use when the user asks to scrape, crawl, or extract product data from Amazon (sear...

⬇ 0 downloads ★ 0 stars Unverified but indexed

Install for OpenClaw

Quick setup
  1. Download the package from Yavira.
  2. Extract the archive and review SKILL.md first.
  3. Import or place the package into your OpenClaw setup.

Requirements

Target platform
OpenClaw
Install method
Manual import
Extraction
Extract archive
Prerequisites
OpenClaw
Primary doc
SKILL.md

Package facts

Download mode
Yavira redirect
Package format
ZIP package
Source platform
Tencent SkillHub
What's included
SKILL.md, assets/amazon_handler.js, assets/main_handler.js, assets/youtube_handler.js, package.json

Validation

  • Use the Yavira download entry.
  • Review SKILL.md after the package is downloaded.
  • Confirm the extracted package contains the expected setup assets.

Install with your agent

Agent handoff

Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.

  1. Download the package from Yavira.
  2. Extract it into a folder your agent can access.
  3. Paste one of the prompts below and point your agent at the extracted folder.
New install

I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.

Upgrade existing

I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.

Trust & source

Release facts

Source
Tencent SkillHub
Verification
Indexed source record
Version
3.0.0

Documentation

ClawHub primary doc Primary doc: SKILL.md 9 sections Open source page

Deep Scraper

Docker容器化爬虫,支持穿透反爬,三种模式自动识别。

前置要求

Docker已安装并运行 镜像已构建: docker build -t clawd-crawlee skills/deep-scraper/

1. Amazon模式 (amazon_handler.js)

自动触发条件: URL包含 amazon.com,或用户提到亚马逊/Amazon/ASIN/BSR/选品/竞品/畅销榜/类目分析等关键词 根据URL自动识别页面类型: URL特征页面类型可获取字段/zgbs/ 或 /bestsellers/畅销榜rank, title, asin, price, rating, reviews, image, url/zg/new-releases/新品榜同上/zg/movers-and-shakers/飙升榜同上/s?k= 或 /s/搜索结果title, asin, price, rating, reviews, image, url, boughtPastMonth, sponsored/dp/ 或 /gp/product/产品详情title, asin, price, rating, reviews, brand, bsr, boughtPastMonth, dateFirstAvailable, category, bullets, details, image ⚠️ 重要规则: Best Sellers页面没有月销量(boughtPastMonth)数据 — 亚马逊不在榜单页显示此信息 要获取月销量,必须用搜索页(/s?k=关键词)或产品详情页(/dp/ASIN) 如果用户同时需要排名+月销量,建议:先爬Best Sellers拿排名,再用搜索页补月销 # 畅销榜(有排名,无月销) docker run -t --rm clawd-crawlee node assets/amazon_handler.js "https://www.amazon.com/zgbs/electronics" # 搜索结果(有月销,无排名) docker run -t --rm clawd-crawlee node assets/amazon_handler.js "https://www.amazon.com/s?k=feather+duster" # 产品详情(最全字段:BSR、品牌、卖点、月销) docker run -t --rm clawd-crawlee node assets/amazon_handler.js "https://www.amazon.com/dp/B001TQ6IHS" # 多页爬取 docker run -t --rm clawd-crawlee node assets/amazon_handler.js "URL" --pages 2 输出格式: JSON { "status": "SUCCESS", "type": "bestsellers|search|product-detail", "category": "品类名", "totalProducts": 30, "scrapedAt": "ISO时间", "products": [ { "rank": 1, "title": "产品名", "asin": "B001TQ6IHS", "price": 9.94, "priceStr": "$9.94", "rating": 4.6, "reviews": 20547, "boughtPastMonth": "1K+", "image": "https://...", "url": "https://..." } ] }

2. YouTube模式 (main_handler.js)

自动触发条件: URL包含 youtube.com,或用户提到YouTube/视频字幕/转录/transcript 拦截网络请求捕获字幕API (timedtext) 模拟点击"展开描述"和"转录稿"按钮 输出: {status, type:"TRANSCRIPT"|"DESCRIPTION", videoId, data} docker run -t --rm clawd-crawlee node assets/main_handler.js "https://youtube.com/watch?v=xxx"

3. 通用模式 (main_handler.js)

触发条件: 非Amazon、非YouTube的URL,或用户提到爬取/抓取任意网页/社交媒体 Playwright打开页面,等待JS加载完成 提取 document.body.innerText(纯文本,去广告噪音) 输出上限10000字符 输出: {status:"SUCCESS", type:"GENERIC", title, data} docker run -t --rm clawd-crawlee node assets/main_handler.js "https://任意网址"

Agent调用决策树

用户给了URL? ├─ 包含 amazon.com → 用 amazon_handler.js │ ├─ 需要月销量? → 建议用搜索URL(/s?k=) 或详情页(/dp/) │ └─ 需要排名? → 用畅销榜URL(/zgbs/) ├─ 包含 youtube.com → 用 main_handler.js (自动YouTube模式) └─ 其他网站 → 用 main_handler.js (通用模式) 用户没给URL,只说了需求? ├─ "爬亚马逊XX品类Top" / "XX类目排行" / "XX畅销榜" → 构造 https://www.amazon.com/zgbs/品类 ├─ "搜亚马逊XX" / "XX关键词搜索" / "找XX产品" → 构造 https://www.amazon.com/s?k=关键词 ├─ "分析某个ASIN" / "看看这个产品" / "XX的详情" → 构造 https://www.amazon.com/dp/ASIN ├─ "XX的月销量" / "XX卖了多少" / "XX销量怎么样" → 用搜索页或详情页(有boughtPastMonth) ├─ "竞品分析" / "竞品调研" / "对手在卖什么" → 先搜索再逐个爬详情 ├─ "选品" / "什么好卖" / "品类机会" / "市场调研" → Best Sellers + 搜索结合 └─ 其他 → 先web_search找到URL,再用对应模式爬

常见用户意图 → 操作映射

用户说操作"帮我看看亚马逊XX品类"爬 /zgbs/品类 畅销榜"XX在亚马逊卖得怎么样"搜索 /s?k=XX 看月销"分析一下这个ASIN: BXXXXXXXXX"爬 /dp/ASIN 详情页"XX品类有什么机会"畅销榜 + 搜索 综合分析"帮我爬这个链接"判断URL类型,选对应handler"这个YouTube视频讲了什么"YouTube模式抓字幕"帮我抓XX网站的内容"通用模式"搜一下XX的竞品"搜索页爬取 + 分析"XX月销多少" / "XX一个月卖多少"搜索页或详情页"帮我看看top 100" / "热门产品"Best Sellers畅销榜"新品有哪些" / "最近上了什么新品"/zg/new-releases/"什么产品涨得快" / "飙升榜"/zg/movers-and-shakers/

反爬能力

每次清除Cookie,模拟全新用户 Docker沙箱隔离,无指纹追踪 Playwright模拟真实浏览器行为 自动滚动加载懒加载内容 支持重试(maxRetries: 2)

局限

通用模式输出上限10000字符 Amazon单页最多约30-50个产品 不支持需要登录的页面 Docker容器启动有~10秒冷启动时间

Category context

Code helpers, APIs, CLIs, browser automation, testing, and developer operations.

Source: Tencent SkillHub

Largest current source with strong distribution and engagement signals.

Package contents

Included in package
3 Scripts1 Docs1 Config
  • SKILL.md Primary doc
  • assets/amazon_handler.js Scripts
  • assets/main_handler.js Scripts
  • assets/youtube_handler.js Scripts
  • package.json Config