Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports both text-to-image generation and image-to-image editing with configurable resolution (1K default, 2K, or 4K for high resolution). DO NOT read the image file first - use this skill directly with the --input-image parameter.
Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports both text-to-image generation and image-to-image editing with configurable resolution (1K default, 2K, or 4K for high resolution). DO NOT read the image file first - use this skill directly with the --input-image parameter.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Generate new images or edit existing ones using Google's Nano Banana Pro API (Gemini 3 Pro Image).
Google AI Studio (Public Preview): POST https://generativelanguage.googleapis.com/v1beta/models/gemini-3-pro-image-preview:generateContent?key=${API_KEY} Vertex AI (Enterprise): POST https://${REGION}-aiplatform.googleapis.com/v1/projects/${PROJECT_ID}/locations/${REGION}/publishers/google/models/gemini-3-pro-image-preview:predict
API: gemini-3-pro-image-preview SDK interno: nanobanana-pro-001
ParameterValuesDescriptionaspect_ratio1:1, 4:3, 3:4, 16:9, 9:16Output aspect ratiooutput_mime_typeimage/png, image/jpegOutput formatreference_imagesArray (max 14)Reference images for consistencyreference_typeCHARACTER, STYLE, SUBJECTHow to use referenceperson_generationALLOW_ADULT, DONT_ALLOW, FILTER_SENSITIVEPerson generation policyimage_size1K, 2K, 4KOutput resolution
STYLE: Transfer visual style, color palette, mood from reference CHARACTER: Maintain facial features, traits consistency across images SUBJECT: Keep the subject/product consistent (use for product photography!)
Text Rendering: Native text rendering without spelling errors In-context Editing: Send existing image + modification prompt (automatic in-painting) High Resolution: Native upscale to 4K via upscale: true
Run the script using absolute path (do NOT cd to skill directory first): Generate new image: uv run ~/.clawdbot/skills/nano-banana-pro/scripts/generate_image.py \ --prompt "your image description" \ --filename "output-name.png" \ [--resolution 1K|2K|4K] \ [--api-key KEY] Edit existing image: uv run ~/.clawdbot/skills/nano-banana-pro/scripts/generate_image.py \ --prompt "editing instructions" \ --filename "output-name.png" \ --input-image "path/to/input.png" \ [--resolution 1K|2K|4K] With reference image (product/style/character consistency): uv run ~/.clawdbot/skills/nano-banana-pro/scripts/generate_image.py \ --prompt "your description" \ --filename "output-name.png" \ --reference-image "path/to/reference.jpg" \ --reference-type SUBJECT|STYLE|CHARACTER \ [--resolution 1K|2K|4K] Important: Always run from the user's current working directory so images are saved where the user is working, not in the skill directory.
1K (default) - ~1024px resolution 2K - ~2048px resolution (recommended for most uses) 4K - ~4096px resolution (high quality) Map user requests: No mention โ 1K "low resolution", "1080", "1080p", "1K" โ 1K "2K", "2048", "normal", "medium resolution" โ 2K "high resolution", "high-res", "hi-res", "4K", "ultra" โ 4K
The script checks for API key in this order: --api-key argument GEMINI_API_KEY environment variable
Format: {timestamp}-{descriptive-name}.png Timestamp: yyyy-mm-dd-hh-mm-ss (24-hour format) Name: Descriptive lowercase with hyphens Examples: 2025-11-23-14-23-05-japanese-garden.png 2025-11-23-15-30-12-sunset-mountains.png
You are an expert Prompt Engineer specializing in Nano Banana Pro. Transform basic user ideas and reference images into high-fidelity, descriptive prompts.
When provided with a user idea and reference images, evaluate: Subject Matter: Identify primary actors, objects, or focal points Reference Utility: Determine if image provides composition (layout), style (aesthetic/texture), or character (specific features) Text Requirements: Note any specific text to render within the image
Structure optimized prompts using this hierarchy:
Clear description of "who" or "what" is doing "what."
Specify artistic medium: Hyper-realistic photography Oil painting 3D render Minimalist vector Commercial food photography Editorial style
Explicitly instruct on how to use uploaded images: "Retain the product packaging from the reference image as the hero element" "Apply the warm lighting aesthetic from Reference A"
Lighting: Cinematic rim lighting Soft diffused sunlight Harsh strobes Warm tungsten lighting Golden hour warmth Composition: Wide-angle shot Macro detail Bird's-eye view Shallow depth of field Product as hero element Color Theory: Monochromatic blue High-contrast complementary Warm amber tones Dark moody palette Text Rendering: Use double quotes for specific text: "The word 'FUTURE' written in bold, brushed-metal 3D lettering across the center"
Use descriptive positive language Expand vague terms ("cool" โ "iridescent", "pretty" โ "ethereal", "realistic" โ "photorealistic 8k texture") Maintain consistency with reference images Use powerful adjectives for mood ("gritty," "serene," "industrial," "whimsical") Specify "8k texture detail" or "8k photorealistic detail" for quality
Use negative prompts (say what you want, not what you don't) Contradict visual data in reference images Use vague terms without expansion
When generating images with products as protagonists: Always use --reference-type SUBJECT to maintain product consistency Describe the product prominently in the prompt: "Milkaut Crematto container with blue label and red lid prominently displayed" Position product as hero element: "the product container as co-star product placement" "product container in sharp focus" Include product in scene naturally: "positioned next to", "beside", "prominently arranged"
Hyper-realistic commercial food photography with a [PRODUCT NAME] container prominently displayed next to [FOOD ITEM], [food description], [setting/background], [lighting style], the [product] as hero element, 8k photorealistic detail
Provide the optimized prompt in English, without additional commentary.
uv run ~/.clawdbot/skills/nano-banana-pro/scripts/generate_image.py \ --prompt "Hyper-realistic commercial food photography with a Milkaut Crematto container prominently displayed next to a gourmet double smash burger with perfectly melted cheddar cheese cascading down juicy beef patties, artisan brioche bun, wisps of steam rising, dark moody background with dramatic rim lighting, the cream cheese container as hero product placement, 8k texture detail" \ --filename "2026-01-28-product-burger.png" \ --reference-image "product-photo.jpg" \ --reference-type SUBJECT \ --resolution 2K
uv run ~/.clawdbot/skills/nano-banana-pro/scripts/generate_image.py \ --prompt "Using the warm golden hour aesthetic from the reference, create a serene Japanese garden with cherry blossoms, koi pond reflecting soft pink petals, traditional wooden bridge, ethereal morning mist, 8k photorealistic detail" \ --filename "2026-01-28-japanese-garden.png" \ --reference-image "style-reference.jpg" \ --reference-type STYLE \ --resolution 2K
uv run ~/.clawdbot/skills/nano-banana-pro/scripts/generate_image.py \ --prompt "Change the background to a dramatic sunset over mountains, maintain the subject in sharp focus" \ --filename "2026-01-28-edited-sunset.png" \ --input-image "original.jpg" \ --resolution 2K
Code helpers, APIs, CLIs, browser automation, testing, and developer operations.
Largest current source with strong distribution and engagement signals.