# Send Jina Reader to your agent
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
## Fast path
- Download the package from Yavira.
- Extract it into a folder your agent can access.
- Paste one of the prompts below and point your agent at the extracted folder.
## Suggested prompts
### New install

```text
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
```
### Upgrade existing

```text
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
```
## Machine-readable fields
```json
{
  "schemaVersion": "1.0",
  "item": {
    "slug": "haibo-jina-reader",
    "name": "Jina Reader",
    "source": "tencent",
    "type": "skill",
    "category": "开发工具",
    "sourceUrl": "https://clawhub.ai/smile-xuc/haibo-jina-reader",
    "canonicalUrl": "https://clawhub.ai/smile-xuc/haibo-jina-reader",
    "targetPlatform": "OpenClaw"
  },
  "install": {
    "downloadUrl": "/downloads/haibo-jina-reader",
    "sourceDownloadUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=haibo-jina-reader",
    "sourcePlatform": "tencent",
    "targetPlatform": "OpenClaw",
    "packageFormat": "ZIP package",
    "primaryDoc": "SKILL.md",
    "includedAssets": [
      "SKILL.md",
      "scripts/jina-reader.py",
      "scripts/jina-reader.sh"
    ],
    "downloadMode": "redirect",
    "sourceHealth": {
      "source": "tencent",
      "status": "healthy",
      "reason": "direct_download_ok",
      "recommendedAction": "download",
      "checkedAt": "2026-04-30T16:55:25.780Z",
      "expiresAt": "2026-05-07T16:55:25.780Z",
      "httpStatus": 200,
      "finalUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=network",
      "contentType": "application/zip",
      "probeMethod": "head",
      "details": {
        "probeUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=network",
        "contentDisposition": "attachment; filename=\"network-1.0.0.zip\"",
        "redirectLocation": null,
        "bodySnippet": null
      },
      "scope": "source",
      "summary": "Source download looks usable.",
      "detail": "Yavira can redirect you to the upstream package for this source.",
      "primaryActionLabel": "Download for OpenClaw",
      "primaryActionHref": "/downloads/haibo-jina-reader"
    },
    "validation": {
      "installChecklist": [
        "Use the Yavira download entry.",
        "Review SKILL.md after the package is downloaded.",
        "Confirm the extracted package contains the expected setup assets."
      ],
      "postInstallChecks": [
        "Confirm the extracted package includes the expected docs or setup files.",
        "Validate the skill or prompts are available in your target agent workspace.",
        "Capture any manual follow-up steps the agent could not complete."
      ]
    }
  },
  "links": {
    "detailUrl": "https://openagent3.xyz/skills/haibo-jina-reader",
    "downloadUrl": "https://openagent3.xyz/downloads/haibo-jina-reader",
    "agentUrl": "https://openagent3.xyz/skills/haibo-jina-reader/agent",
    "manifestUrl": "https://openagent3.xyz/skills/haibo-jina-reader/agent.json",
    "briefUrl": "https://openagent3.xyz/skills/haibo-jina-reader/agent.md"
  }
}
```
## Documentation

### Overview

Jina Reader provides clean markdown extraction from any URL, bypassing HTML complexity, JavaScript rendering, and many paywalls. It returns structured text content with metadata (title, URL, published time) that's perfect for AI analysis.

### Extract markdown content

scripts/jina-reader.py <url>

### Extract with JSON metadata

scripts/jina-reader.py <url> --format json

### Save to file

scripts/jina-reader.py <url> -o output.md

### 1. Basic Extraction

Extract clean markdown from any URL:

scripts/jina-reader.py https://example.com/article

Returns: Full markdown content including title, metadata headers, and structured text.

Use when: You need readable text from a webpage for summarization, analysis, or content processing.

### 2. JSON Format

Get structured data with metadata:

scripts/jina-reader.py https://example.com/article --format json

Returns:

{
  "status": "success",
  "metadata": {
    "title": "Article Title",
    "url": "https://example.com/article",
    "published": "Mon, 10 Feb 2026 12:00:00 GMT"
  },
  "content": "Markdown content..."
}

Use when: You need programmatic access to metadata or want to integrate with other tools.

### 3. Shell Script Quick Access

For simple one-liners:

scripts/jina-reader.sh https://example.com/article

Returns: Raw markdown content directly to stdout.

Use when: Quick extraction without arguments or when piping to other commands.

### Combining with Search

When using tavily-search, web_search, or searxng skills:

Get search results with relevant URLs
Extract content from top results using jina-reader
Process and summarize the extracted content

# Example workflow
URL="https://example.com/article"
scripts/jina-reader.py "$URL" --format json | jq -r '.content'

### Batch Processing

Extract from multiple URLs:

for url in $(cat urls.txt); do
  scripts/jina-reader.py "$url" -o "output/$(basename $url).md"
done

### Content Analysis

Pipe extracted content to analysis tools:

scripts/jina-reader.py https://example.com/article | wc -w
scripts/jina-reader.py https://example.com/article | grep -i "keyword"

### Python Script (jina-reader.py)

url (required): The URL to extract content from
-f, --format: Output format - markdown or json (default: markdown)
-t, --timeout: Request timeout in seconds (default: 30)
-o, --output: Save output to file instead of stdout

### Shell Script (jina-reader.sh)

url (required): The URL to extract content from

### Limitations

Timeout: Default 30 seconds. Increase with -t for slow-loading pages
Rate limits: Jina Reader API has rate limits. Use batching strategically
Dynamic content: Can't extract content generated by client-side JavaScript after page load
Authentication: Can't access pages requiring login or special headers

### Timeout errors

scripts/jina-reader.py <url> -t 60  # Increase timeout

### Invalid URLs

The tool auto-prepends https:// if missing. Use fully qualified URLs for reliability.

### Empty content

Some pages may block scraping. Try the shell script as fallback, or verify the URL is accessible.

### scripts/jina-reader.py

Full-featured Python tool with JSON output, metadata extraction, and file saving.

### scripts/jina-reader.sh

Lightweight shell script for quick markdown extraction.
## Trust
- Source: tencent
- Verification: Indexed source record
- Publisher: smile-xuc
- Version: 1.0.0
## Source health
- Status: healthy
- Source download looks usable.
- Yavira can redirect you to the upstream package for this source.
- Health scope: source
- Reason: direct_download_ok
- Checked at: 2026-04-30T16:55:25.780Z
- Expires at: 2026-05-07T16:55:25.780Z
- Recommended action: Download for OpenClaw
## Links
- [Detail page](https://openagent3.xyz/skills/haibo-jina-reader)
- [Send to Agent page](https://openagent3.xyz/skills/haibo-jina-reader/agent)
- [JSON manifest](https://openagent3.xyz/skills/haibo-jina-reader/agent.json)
- [Markdown brief](https://openagent3.xyz/skills/haibo-jina-reader/agent.md)
- [Download page](https://openagent3.xyz/downloads/haibo-jina-reader)