Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Generate and edit images via Google Gemini API. Supports Gemini native generation, Imagen 3, style presets, and batch generation with HTML gallery. Zero depe...
Generate and edit images via Google Gemini API. Supports Gemini native generation, Imagen 3, style presets, and batch generation with HTML gallery. Zero depe...
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Then review README.md for any prerequisites, environment setup, or post-install checks. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Then review README.md for any prerequisites, environment setup, or post-install checks. Summarize what changed and any follow-up checks I should run.
Generate and edit images via the Google Gemini API using pure Python stdlib. Supports Gemini native generation + editing, Imagen 3 generation, batch runs, and an HTML gallery output.
export GEMINI_API_KEY="your-key-here" # Default: Gemini native, 4 random prompts python3 scripts/gen.py # Custom prompt python3 scripts/gen.py --prompt "a cyberpunk cat riding a neon motorcycle through Tokyo at night" # Imagen 3 engine python3 scripts/gen.py --engine imagen --count 4 --aspect 16:9 # Edit an existing image (Gemini engine only) python3 scripts/gen.py --edit path/to/image.png --prompt "change the background to a sunset beach" # Use a style preset python3 scripts/gen.py --style watercolor --prompt "floating islands above a calm sea" # List available styles python3 scripts/gen.py --styles
StyleDescriptionphotoUltra-detailed photorealistic photography, 8K resolution, sharp focusanimeHigh-quality anime illustration, Studio Ghibli inspired, vibrant colorswatercolorDelicate watercolor painting on textured paper, soft edges, gentle color bleedingcyberpunkNeon-lit cyberpunk scene, rain-soaked streets, holographic displays, Blade Runner aestheticminimalistClean minimalist design, geometric shapes, limited color palette, white spaceoil-paintingClassical oil painting with visible brushstrokes, rich textures, Renaissance lightingpixel-artDetailed pixel art, retro 16-bit style, crisp edges, nostalgic palettesketchPencil sketch on cream paper, hatching and cross-hatching, artistic imperfections3d-renderProfessional 3D render, ambient occlusion, global illumination, photorealistic materialspop-artBold pop art style, Ben-Day dots, strong outlines, vibrant contrasting colors
FlagDefaultDescription--prompt(random)Text prompt. Omit for random creative prompts--count4Number of images to generate--enginegeminiEngine: gemini (native, supports edit) or imagen (Imagen 3)--model(auto)Model override. Default: gemini-2.5-flash-image or imagen-3.0-generate-002--editPath to input image for editing (Gemini engine only)--aspect1:1Aspect ratio for Imagen: 1:1, 16:9, 9:16, 4:3, 3:4--out-dir(auto)Output directory (default is a timestamped folder)--styleStyle preset to prepend to the prompt--stylesList available style presets and exit
import subprocess subprocess.run( [ "python3", "scripts/gen.py", "--prompt", "a serene mountain landscape at golden hour", "--count", "4", "--style", "photo", ], check=True, )
Missing API key: set GEMINI_API_KEY in your environment and retry. Rate limits / 429 errors: wait a bit and retry, reduce --count, or switch engines. Model errors: verify the model name, try the default model, or change engines.
AgentGram โ Share your generated images on the AI agent social network! Create visual content and post it to your AgentGram feed. agent-selfie โ Focused on AI agent avatars and visual identity. Uses the same Gemini API key for personality-driven self-portraits. opencode-omo โ Run deterministic image-generation pipelines with Sisyphus workflows.
v1.3.1: Added workflow integration guidance for opencode-omo. v1.1.0: Added style presets, --style and --styles flags, expanded documentation. v1.0.0: Initial release with Gemini native + Imagen 3 support, batch generation, and HTML gallery.
https://github.com/IISweetHeartII/gemini-image-gen
Code helpers, APIs, CLIs, browser automation, testing, and developer operations.
Largest current source with strong distribution and engagement signals.