Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Create AI images with prompt engineering, style control, and provider guides for Midjourney, DALL-E, Stable Diffusion, Flux, and Leonardo.
Create AI images with prompt engineering, style control, and provider guides for Midjourney, DALL-E, Stable Diffusion, Flux, and Leonardo.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
On first use, read setup.md.
User needs AI-generated visuals, edits, or consistent image sets. Use this skill to pick the right model, write stronger prompts, and avoid outdated model choices.
User preferences persist in ~/image-generation/. See memory-template.md for setup. ~/image-generation/ โโโ memory.md # Preferred providers, project context, winning recipes โโโ history.md # Optional generation log
TopicFileInitial setupsetup.mdMemory templatememory-template.mdMigration guidemigration.mdBenchmark snapshotsbenchmarks-2026.mdPrompt techniquesprompting.mdAPI handlingapi-patterns.mdGPT Image (OpenAI)gpt-image.mdGemini and Imagen (Google)gemini.mdFLUX (Black Forest Labs)flux.mdMidjourneymidjourney.mdLeonardoleonardo.mdIdeogramideogram.mdReplicatereplicate.mdStable Diffusionstable-diffusion.md
Community names shift quickly. Before calling an API, map the nickname to the provider model ID. Community labelOfficial model ID to try firstNotesNano Bananagemini-2.5-flash-image-previewCommon nickname, not an official Google model IDNano Banana 2 / ProVerify provider docsUsually a provider preset over Gemini image modelsGPT Image 1.5gpt-image-1.5Current OpenAI high-tier image modelGPT Image mini / iMinigpt-image-1-miniBudget/faster OpenAI variantFLUX 2 Pro / Maxflux-pro / flux-ultraMany platforms rename these SKUs
TaskFirst choiceBackupExact text in imagegpt-image-1.5IdeogramMulti-turn editsgemini-2.5-flash-image-previewflux-kontext-proPhotoreal hero shotsimagen-4.0-ultra-generate-001flux-ultraFast low-cost draftsgpt-image-1-miniimagen-4.0-fast-generate-001Character/product consistencyflux-kontext-maxgpt-image-1.5 with referencesLocal no-API workflowsflux-schnellSDXL
Benchmarks drift weekly. Use benchmarks-2026.md as a starting point, then recheck current rankings when quality is critical.
Start with 1-4 low-cost drafts, pick one, then upscale or rerender only the winner.
If the preferred model is unavailable, fallback by tier: same provider lower tier, 2) cross-provider equivalent, 3) local/open model.
OpenAI lists DALL-E 2/3 as legacy. Do not use them as default for new projects.
Using vendor nicknames as model IDs -> API errors and wasted retries Assuming "Nano Banana Pro" or "FLUX 2" are universal IDs -> provider mismatch Copying old DALL-E prompt habits -> weaker output vs modern GPT/Gemini image models Comparing text-to-image and image-editing scores as if they were the same benchmark Optimizing every draft at max quality -> cost spikes without quality gain
Data that leaves your machine: Prompt text Reference images when editing or style matching Data that stays local: Provider preferences in ~/image-generation/memory.md Optional local history file This skill does NOT: Store API keys Upload files outside chosen provider requests Persist generated images unless user asks to save them
ProviderEndpointData SentPurposeOpenAIapi.openai.comPrompt text, optional input imagesGPT Image generation/editingGoogle Gemini APIgenerativelanguage.googleapis.comPrompt text, optional input imagesGemini image generation/editingGoogle Vertex AIaiplatform.googleapis.comPrompt text, optional input imagesImagen 4 generationBlack Forest Labsapi.bfl.aiPrompt text, optional input imagesFLUX generation/editingReplicateapi.replicate.comPrompt text, optional input imagesHosted third-party image modelsMidjourneydiscord.comPrompt textMidjourney generation via Discord workflowsLeonardocloud.leonardo.aiPrompt text, optional input imagesLeonardo generation/editingIdeogramapi.ideogram.aiPrompt textTypography-focused image generation No other data is sent externally.
If upgrading from a previous version, read migration.md before updating local memory structure.
This skill may send prompts and reference images to third-party AI providers. Only install if you trust those providers with your content.
Install with clawhub install <slug> if user confirms: image-edit - Specialized inpainting, outpainting, and mask workflows video-generation - Convert image concepts into video pipelines colors - Build palettes for visual consistency across assets ffmpeg - Post-process image sequences and exports
If useful: clawhub star image-generation Stay updated: clawhub sync
Writing, remixing, publishing, visual generation, and marketing content production.
Largest current source with strong distribution and engagement signals.