← All skills
Tencent SkillHub · AI

tappi

Lightweight CDP browser control for AI agents. Token-efficient alternative to the built-in browser tool — 3-10x fewer tokens per interaction. Use when browsi...

skill openclawclawhub Free
0 Downloads
0 Stars
0 Installs
0 Score
High Signal

Lightweight CDP browser control for AI agents. Token-efficient alternative to the built-in browser tool — 3-10x fewer tokens per interaction. Use when browsi...

⬇ 0 downloads ★ 0 stars Unverified but indexed

Install for OpenClaw

Quick setup
  1. Download the package from Yavira.
  2. Extract the archive and review SKILL.md first.
  3. Import or place the package into your OpenClaw setup.

Requirements

Target platform
OpenClaw
Install method
Manual import
Extraction
Extract archive
Prerequisites
OpenClaw
Primary doc
SKILL.md

Package facts

Download mode
Yavira redirect
Package format
ZIP package
Source platform
Tencent SkillHub
What's included
SKILL.md, _meta.json, scripts/browser.js, scripts/package-lock.json, scripts/package.json

Validation

  • Use the Yavira download entry.
  • Review SKILL.md after the package is downloaded.
  • Confirm the extracted package contains the expected setup assets.

Install with your agent

Agent handoff

Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.

  1. Download the package from Yavira.
  2. Extract it into a folder your agent can access.
  3. Paste one of the prompts below and point your agent at the extracted folder.
New install

I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.

Upgrade existing

I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.

Trust & source

Release facts

Source
Tencent SkillHub
Verification
Indexed source record
Version
3.5.0

Documentation

ClawHub primary doc Primary doc: SKILL.md 14 sections Open source page

tappi

Lightweight CLI that talks to Chrome via CDP (Chrome DevTools Protocol). Returns minimal, indexed output that agents can act on immediately — no accessibility tree parsing, no ref hunting.

Setup

# Install dependency (one-time, in the skill scripts/ dir) cd scripts && npm install # Ensure browser is running with CDP enabled. # With OpenClaw: # browser start profile=openclaw # Or manually: # google-chrome --remote-debugging-port=18800 --user-data-dir=~/.browser-data The tool connects to http://127.0.0.1:18800 by default. Override with CDP_URL env var.

Alias setup (optional)

mkdir -p ~/.local/bin cat > ~/.local/bin/bjs << 'WRAPPER' #!/bin/bash exec node /path/to/scripts/browser.js "$@" WRAPPER chmod +x ~/.local/bin/bjs

Commands

bjs tabs List open tabs bjs open <url> Navigate to URL bjs tab <index> Switch to tab bjs newtab [url] Open new tab bjs close [index] Close tab bjs elements [selector] List interactive elements (indexed) Smart actions (handle focus, verification, fallbacks internally): bjs click <index> [Smart] Click + report what changed (navigation, checkbox, dialog) bjs type <index> <text> [Smart] Type into element (auto-focus, auto-verify). Good for short fields. bjs paste <index> <text> [Smart] Paste with auto-verify + fallback. PREFERRED for long content. bjs paste <index> --file <path> [Smart] Paste from file (.md, .txt) Low-level actions (building blocks, rarely needed directly): bjs focus <index> [Low-level] Reclaim focus without click events. Use when smart actions report focus loss. bjs check <index> [Low-level] Read element value + focus state. Use to inspect without modifying. bjs eval <js> [Low-level] Run JavaScript. Last resort when smart actions can't solve the problem. Read & filter: bjs text [selector] Extract visible page text (max 8KB) bjs text | grep -i "pattern" Filter text — much cheaper than full output bjs elements | grep -i "send" Filter elements by pattern bjs html <selector> Get element HTML Other: bjs upload <path> [selector] Upload file to input (bypasses OS dialog) bjs screenshot [path] Save screenshot bjs scroll <up|down|top|bottom> [px] bjs url Current URL bjs back / forward / refresh bjs wait <ms> Raw keyboard input (canvas apps: Google Sheets, Docs, Figma): bjs keys <text> Type text via CDP keyboard events (bypasses DOM) bjs keys <text> --tab Type text then press Tab bjs keys <text> --enter Type text then press Enter bjs keys --combo cmd+b Key combination (cmd/ctrl/shift/alt + key) bjs keys --combo cmd+a Select all bjs keys "hello" --tab "world" --enter Chain text + keys in one command bjs keys --enter --delay 50 Flags: --enter --tab --escape --backspace --delete --up --down --left --right --home --end --space Coordinate commands (cross-origin iframes, captchas, overlays): bjs click-xy <x> <y> Click at page coordinates via CDP Input bjs click-xy <x> <y> --double Double-click at coordinates bjs click-xy <x> <y> --right Right-click at coordinates bjs hover-xy <x> <y> Hover at page coordinates bjs drag-xy <x1> <y1> <x2> <y2> Drag between coordinates bjs iframe-rect <selector> Get iframe bounding box (for click-xy targeting)

How it works

elements scans the page for all interactive elements (links, buttons, inputs, selects, etc.) — including those inside shadow DOM (web components). This means sites like Reddit, GitHub, and other modern SPAs that use shadow DOM are fully supported. The scan recursively pierces all shadow roots. Returns a compact numbered list: [0] (link) Hacker News → https://news.ycombinator.com/news [1] (link) new → https://news.ycombinator.com/newest [2] (input:text) q [3] (button) Submit Then click 3 or type 2 search query — immediately actionable, no interpretation needed. Auto-indexing: click and type auto-index elements if not already indexed. You can skip calling elements first and go straight to click/type after open. Call elements explicitly when you need to see what's on the page. After navigation or AJAX changes: Elements get re-indexed automatically on next click/type if stamps are stale. For manual re-index, call elements again. Real mouse events: click uses CDP Input.dispatchMouseEvent (mousePressed + mouseReleased) instead of JS .click(). This triggers React/Vue/Angular synthetic event handlers that ignore plain .click() calls. Works reliably on SPAs like Instagram, GitHub, LinkedIn.

File uploads

upload uses CDP's DOM.setFileInputFiles to inject files directly into hidden <input type="file"> elements — no OS file picker dialog. Works with Instagram, Twitter, any site with file uploads. bjs upload ~/photos/image.jpg # auto-finds input[type=file] bjs upload ~/docs/resume.pdf "input.file-drop" # specific selector

Token efficiency

ApproachTokens per interactionNotesbjs~50-200Indexed list, 1-line responsesbrowser tool (snapshot)~2,000-5,000Full accessibility treebrowser tool + thinking~3,000-8,000Plus reasoning to find refs Over a 10-step flow: ~1,500 tokens (bjs) vs ~30,000-80,000 (browser tool).

Typical flow

bjs open https://example.com # Navigate bjs elements # See what's clickable bjs click 5 # Click element [5] bjs type 12 "hello world" # Type into element [12] bjs text # Read page content bjs screenshot /tmp/result.png # Verify visually

Shadow DOM support

bjs automatically pierces shadow DOM boundaries. Sites built with web components (Reddit, GitHub, etc.) work out of the box — elements, click, type, and text all recurse into shadow roots. No special flags needed.

Canvas-based apps (Google Sheets, Docs, Slides, Figma)

Some web apps render content on a <canvas> element instead of the DOM. In these apps, bjs type won't work for the content area because there are no DOM input elements to target. Use bjs keys instead — it sends raw CDP keyboard events that the canvas app picks up directly. Navigation elements (menus, toolbars, name boxes, search bars) are still regular DOM — use bjs click/bjs type for those. Example: Google Sheets workflow # Navigate to a cell: click the Name Box (DOM element), type ref, press Enter bjs click <namebox-index> # Click name box input bjs type <namebox-index> "A1" # Type cell ref bjs keys --enter # Press Enter to navigate # Type a row using Tab to move between columns bjs keys "Revenue" --tab "Q1" --tab "Q2" --tab "Total" --tab # IMPORTANT: --enter does NOT reliably move to the next row in Sheets. # Navigate to each row start via the Name Box instead: bjs click <namebox-index> && bjs type <namebox-index> "A2" && bjs keys --enter bjs keys "100" --tab "200" --tab "=SUM(B2:C2)" --tab # Formatting: select a range via Name Box, then apply bjs click <namebox-index> && bjs type <namebox-index> "A1:D1" && bjs keys --enter bjs keys --combo cmd+b # Bold the selection Pattern: Name Box for row navigation + --tab within rows. Don't rely on --enter to advance rows.

Coordinate commands (iframes, captchas, overlays)

When you can't use click by index — e.g. the target is inside a cross-origin iframe (captcha checkbox, payment form, OAuth widget) — use coordinate-based commands that dispatch real CDP Input events at the OS level. These bypass all DOM boundaries. Workflow for clicking inside an iframe: bjs iframe-rect 'iframe[title*="hCaptcha"]' # Get bounding box # Output: x=95 y=440 w=302 h=76 center=(246, 478) bjs click-xy 125 458 # Click checkbox position iframe-rect returns the iframe's position on the page. Add offsets to target specific elements inside it (e.g. a checkbox is typically near the left side). Other uses: hover-xy — trigger hover menus, tooltips that need mouse position drag-xy — slider controls, drag-and-drop, canvas interactions click-xy --double — double-click to select text, expand items click-xy --right — context menus When to use coordinate commands vs click: click <index> — always preferred when the element shows up in elements click-xy — only when the target is inside a cross-origin iframe or otherwise unreachable by DOM indexing

Paste (preferred for long content)

For emails, comments, posts, or any long content — use paste instead of type. It handles focus, insertion, verification, and JS fallback in a single command: # Paste from a file (preferred — avoids passing large text inline) bjs paste 5 --file ~/drafts/email_body.md # → Pasted 4184 chars into [5] (div, contenteditable) — verified ✓ # Paste inline text bjs paste 5 "Hello, this is my comment..." # → Pasted 30 chars into [5] (textarea) — verified ✓ paste is a superset of type + check with auto-fallback. Use type for short fields (usernames, search boxes). Use paste for anything longer than a sentence. For canvas apps (Google Sheets, Docs), use keys — paste targets DOM elements only.

Verify & recover

After filling forms or composing messages, verify the content landed before clicking Send/Submit. Use check for targeted verification — it returns the element's value, length, and focus state in one call: # After typing into a compose body (element [5]): bjs check 5 # → [5] (textarea) Compose body — value: "Hello world" (11 chars, focused) # Empty or wrong value? Focus shifted silently. Recover: bjs focus 5 # Reclaim focus (no click events, won't trigger popups) bjs type 5 "Hello world" # Retype bjs check 5 # Verify again Focus recovery pattern — when a popup, contact card, or autocomplete overlay steals focus: bjs keys --escape # Dismiss the overlay bjs focus 5 # Reclaim focus on target element bjs type 5 "your text" # Type into it bjs check 5 # Verify it landed focus is lighter than click — it calls el.focus() only, without dispatching mouse events that might spawn additional popups or contact cards. For more complex verification, eval and text still work: bjs eval 'document.querySelector("[contenteditable]")?.innerText?.substring(0,100)' bjs text ".compose-area" | head -5 tappi commands report success if the CDP call went through — but that doesn't guarantee the target element received the input (e.g. focus may have shifted). One check before a destructive action (send, submit, delete) is cheap insurance. If check/focus don't resolve the issue, use eval for custom JS fixes.

Tips

elements with a CSS selector narrows scope: bjs elements ".modal" eval runs arbitrary JS and returns the result — use for custom extraction text caps at 8KB — enough for most pages, won't blow up context html <selector> caps at 10KB — for inspecting specific elements Pipe through grep to filter: bjs elements | grep -i "submit\|login"

Category context

Agent frameworks, memory systems, reasoning layers, and model-native orchestration.

Source: Tencent SkillHub

Largest current source with strong distribution and engagement signals.

Package contents

Included in package
3 Config1 Docs1 Scripts
  • SKILL.md Primary doc
  • scripts/browser.js Scripts
  • _meta.json Config
  • scripts/package-lock.json Config
  • scripts/package.json Config