# Send nano-banana-pdf-edit to your agent
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
## Fast path
- Download the package from Yavira.
- Extract it into a folder your agent can access.
- Paste one of the prompts below and point your agent at the extracted folder.
## Suggested prompts
### New install

```text
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Then review README.md for any prerequisites, environment setup, or post-install checks. Tell me what you changed and call out any manual steps you could not complete.
```
### Upgrade existing

```text
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Then review README.md for any prerequisites, environment setup, or post-install checks. Summarize what changed and any follow-up checks I should run.
```
## Machine-readable fields
```json
{
  "schemaVersion": "1.0",
  "item": {
    "slug": "nano-banana-pdf-skill",
    "name": "nano-banana-pdf-edit",
    "source": "tencent",
    "type": "skill",
    "category": "开发工具",
    "sourceUrl": "https://clawhub.ai/ps06756/nano-banana-pdf-skill",
    "canonicalUrl": "https://clawhub.ai/ps06756/nano-banana-pdf-skill",
    "targetPlatform": "OpenClaw"
  },
  "install": {
    "downloadUrl": "/downloads/nano-banana-pdf-skill",
    "sourceDownloadUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=nano-banana-pdf-skill",
    "sourcePlatform": "tencent",
    "targetPlatform": "OpenClaw",
    "packageFormat": "ZIP package",
    "primaryDoc": "SKILL.md",
    "includedAssets": [
      "README.md",
      "SKILL.md"
    ],
    "downloadMode": "redirect",
    "sourceHealth": {
      "source": "tencent",
      "status": "healthy",
      "reason": "direct_download_ok",
      "recommendedAction": "download",
      "checkedAt": "2026-05-07T17:22:31.273Z",
      "expiresAt": "2026-05-14T17:22:31.273Z",
      "httpStatus": 200,
      "finalUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=afrexai-annual-report",
      "contentType": "application/zip",
      "probeMethod": "head",
      "details": {
        "probeUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=afrexai-annual-report",
        "contentDisposition": "attachment; filename=\"afrexai-annual-report-1.0.0.zip\"",
        "redirectLocation": null,
        "bodySnippet": null
      },
      "scope": "source",
      "summary": "Source download looks usable.",
      "detail": "Yavira can redirect you to the upstream package for this source.",
      "primaryActionLabel": "Download for OpenClaw",
      "primaryActionHref": "/downloads/nano-banana-pdf-skill"
    },
    "validation": {
      "installChecklist": [
        "Use the Yavira download entry.",
        "Review SKILL.md after the package is downloaded.",
        "Confirm the extracted package contains the expected setup assets."
      ],
      "postInstallChecks": [
        "Confirm the extracted package includes the expected docs or setup files.",
        "Validate the skill or prompts are available in your target agent workspace.",
        "Capture any manual follow-up steps the agent could not complete."
      ]
    }
  },
  "links": {
    "detailUrl": "https://openagent3.xyz/skills/nano-banana-pdf-skill",
    "downloadUrl": "https://openagent3.xyz/downloads/nano-banana-pdf-skill",
    "agentUrl": "https://openagent3.xyz/skills/nano-banana-pdf-skill/agent",
    "manifestUrl": "https://openagent3.xyz/skills/nano-banana-pdf-skill/agent.json",
    "briefUrl": "https://openagent3.xyz/skills/nano-banana-pdf-skill/agent.md"
  }
}
```
## Documentation

### Nano PDF Editing Skill

Edit PDF files with natural language prompts using the nano-pdf CLI tool.

Nano-PDF converts PDF pages to images, sends them to Google's Gemini 3 Pro Image with your edit instructions, then stitches the AI-edited pages back into the PDF — preserving searchable text via OCR re-hydration.

Source: https://github.com/gavrielc/Nano-PDF

### Prerequisites

Before running any nano-pdf command, ensure the following dependencies are installed. If any are missing, install them before proceeding:

nano-pdf — pip install nano-pdf (or use uvx nano-pdf to run without installing)
poppler — PDF-to-image rendering (brew install poppler on macOS / sudo apt-get install poppler-utils on Linux)
tesseract — OCR for text layer restoration (brew install tesseract on macOS / sudo apt-get install tesseract-ocr on Linux)
GEMINI_API_KEY — A paid Google Gemini API key (free tier does not support image generation). Get one at https://aistudio.google.com/api-keys — then export GEMINI_API_KEY="your_key"

### nano-pdf edit — Modify existing pages

nano-pdf edit <file.pdf> <page> "<prompt>" [<page> "<prompt>" ...] [options]

Pages are 1-indexed. Multiple page+prompt pairs can be provided and are processed in parallel.

### nano-pdf add — Insert new AI-generated slides

nano-pdf add <file.pdf> <position> "<prompt>" [options]

Position 0 inserts at the beginning. The new slide automatically matches the visual style of the existing deck. Document context is enabled by default for add.

### Options Reference

For full details on all available flags, read references/options.md in this skill directory.

Key flags:

--output "new.pdf" — Output filename (default: edited_<original>.pdf)
--resolution "4K" — 4K (default), 2K, or 1K
--style-refs "1,5" — Pages to use as style references
--use-context / --no-use-context — Include full PDF text as model context
--disable-google-search — Prevent model from using Google Search

### Workflow

When a user asks to edit a PDF:

Check dependencies — Ensure nano-pdf, poppler, tesseract, and GEMINI_API_KEY are available. If any are missing, tell the user what to install and stop.
Identify the edit — Determine which page(s) need changes and what the prompt should be
Choose the right command — edit for modifying existing pages, add for inserting new ones
Pick appropriate options:

Use --style-refs if the user wants a specific visual style from certain pages
Use --use-context when editing multiple pages that need to be consistent
Use --resolution "2K" if speed matters more than quality


Run nano-pdf and present the output PDF to the user

### Prompt Writing Tips

The quality of the edit depends heavily on the prompt. Follow these guidelines:

Be specific: "Change the title from 'Overview' to 'Q3 Summary'" beats "update the title"
Reference visible elements: "The bar chart on the left side" helps the model locate what to change
One focused change per prompt: For complex edits, use multiple page+prompt pairs
Mention what to preserve: "Keep the layout the same but change the header color to blue"
Use style refs for consistency: When updating branding across pages, point at a reference page

### Examples

For a comprehensive set of examples covering common use cases (typos, charts, branding, adding slides, batch edits), read references/examples.md in this skill directory.

Quick reference:

# Fix a typo on page 3
nano-pdf edit report.pdf 3 "Fix 'recieve' to 'receive'"

# Update chart data
nano-pdf edit deck.pdf 12 "Update the revenue chart to show Q3 at $2.5M"

# Multi-page branding update
nano-pdf edit slides.pdf \\
  1 "Change header background to dark blue, text to white" \\
  2 "Update the logo to show 'NewCorp' instead of 'OldCorp'" \\
  --style-refs "1" --output branded.pdf

# Add a new title slide at the beginning
nano-pdf add deck.pdf 0 "Title slide: 'Annual Review 2025' with subtitle 'Building the Future'"

# Add a summary slide after page 5 using document context
nano-pdf add deck.pdf 5 "Summary slide with key takeaways as bullet points"

### Troubleshooting

IssueSolutionMissing system dependenciesInstall missing deps (see Prerequisites above), restart terminalGEMINI_API_KEY not foundexport GEMINI_API_KEY="your_key"PAID API key requiredEnable billing at https://aistudio.google.com/api-keysStyle mismatchUse --style-refs "1,3" pointing at pages with desired styleSlow processingUse --resolution "2K" or "1K"Bad OCR / text layerUse --resolution "4K" for better OCR accuracyModel ignores part of promptBreak into smaller, focused edits across multiple runs
## Trust
- Source: tencent
- Verification: Indexed source record
- Publisher: ps06756
- Version: 1.0.1
## Source health
- Status: healthy
- Source download looks usable.
- Yavira can redirect you to the upstream package for this source.
- Health scope: source
- Reason: direct_download_ok
- Checked at: 2026-05-07T17:22:31.273Z
- Expires at: 2026-05-14T17:22:31.273Z
- Recommended action: Download for OpenClaw
## Links
- [Detail page](https://openagent3.xyz/skills/nano-banana-pdf-skill)
- [Send to Agent page](https://openagent3.xyz/skills/nano-banana-pdf-skill/agent)
- [JSON manifest](https://openagent3.xyz/skills/nano-banana-pdf-skill/agent.json)
- [Markdown brief](https://openagent3.xyz/skills/nano-banana-pdf-skill/agent.md)
- [Download page](https://openagent3.xyz/downloads/nano-banana-pdf-skill)