Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
YouTube thumbnail design with specific dimensions, contrast rules, and mobile preview optimization. Covers safe zones, text placement, face expression psycho...
YouTube thumbnail design with specific dimensions, contrast rules, and mobile preview optimization. Covers safe zones, text placement, face expression psycho...
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Create high-CTR YouTube thumbnails with AI image generation via inference.sh CLI.
curl -fsSL https://cli.inference.sh | sh && infsh login # Generate a thumbnail infsh app run falai/flux-dev-lora --input '{ "prompt": "YouTube thumbnail style, close-up of a person with surprised excited expression looking at a glowing laptop screen, vibrant blue and orange color scheme, dramatic studio lighting, shallow depth of field, high contrast, cinematic", "width": 1280, "height": 720 }' Install note: The install script only detects your OS/architecture, downloads the matching binary from dist.inference.sh, and verifies its SHA-256 checksum. No elevated permissions or background processes. Manual install & verification available.
SpecValueDimensions1280 x 720 px (minimum)Recommended1920 x 1080 pxAspect ratio16:9Max file size2 MBFormatsJPG, GIF, PNG
Your thumbnail appears at roughly 120px wide on mobile โ that's how most viewers first see it. At 120px, viewers must be able to identify: The mood/emotion (from colors and expression) The general subject (from composition) The text (if any โ only if large enough) Test: view your thumbnail at 120px width. If it's a muddy blur, redesign.
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ โ โ โ โ SAFE FOR TEXT AND KEY ELEMENTS โ โ โ โ โ โ โ โ โ โ โโโโโ โ โ โ โฑ โ โ โ Timestamp overlay โ โโโโโโโโโโดโโโโ โ (bottom-right) โ โโโโโโ โ DURATION โ โ โ CH โ Chapter marker โโโโโโโโโโโโโโโโ โโโโโดโโโโโดโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ โ Bottom-left: chapter/progress markers Avoid placing critical elements in: Bottom-right corner (video duration timestamp) Bottom-left corner (chapter markers, progress bar) Extreme edges (cropping varies by device)
CombinationMoodBest ForYellow + BlackUrgency, attentionTech, business, listsRed + WhiteEnergy, excitementEntertainment, reactionsBlue + OrangeProfessional contrastEducation, tutorialsGreen + WhiteGrowth, moneyFinance, success storiesPurple + YellowPremium, creativeDesign, art, creativityWhite + DarkClean, minimalLuxury, minimalist channels
Background and text/subject should be complementary or high-contrast Avoid same-temperature colors touching (red on orange = mud) Use 3 colors maximum per thumbnail Saturate more than real life โ thumbnails compete with bright UI
Lists/numbers: "7 Tips", "Top 10" Strong opinions: "STOP Doing This" Results: "$10K in 30 Days" Comparisons: "vs" between two things
The video title already says it (redundant) The emotion/visual tells the story You can't make it large enough to read at 120px
RuleReasonMax 6 wordsReadability at thumbnail sizeMin 60pt equivalentMust be legible at 120px widthBold sans-serif fontThin fonts disappear at small sizesContrast stroke/shadowEnsures readability on any backgroundNo small textIf it's not readable small, cut it
Thumbnails with faces get higher CTR than faceless thumbnails. Expression matters: ExpressionCTR ImpactBest ForSurprise/shockHighestReaction, reveal, discovery contentCuriosityHighTutorial, how-to, tipsExcitementHighUnboxing, reviews, announcementsConcern/worryMedium-highWarning, mistake, problem contentConfidenceMediumExpert advice, authority contentNeutralLowestAvoid unless your brand is minimalist
Face should fill 30-50% of the thumbnail Eyes looking toward the text or subject (directs viewer attention) Eyes looking at camera = connection. Eyes looking at object = curiosity. Place face on one side (usually left), text or subject on the other # Generate a face-forward thumbnail infsh app run falai/flux-dev-lora --input '{ "prompt": "close-up portrait of a man with genuinely surprised expression, mouth slightly open, raised eyebrows, looking at camera, left side of frame, vibrant teal background, dramatic rim lighting, YouTube thumbnail style, high contrast, cinematic", "width": 1280, "height": 720 }' # Generate a face-looking-at-subject thumbnail infsh app run bytedance/seedream-4-5 --input '{ "prompt": "person looking amazed at a glowing holographic chart showing upward growth, dramatic blue and green lighting, right side profile view, dark background, tech aesthetic, high energy", "size": "2K" }'
infsh app run falai/flux-dev-lora --input '{ "prompt": "overhead flat lay of organized workspace with laptop showing code editor, colorful sticky notes, coffee cup, clean bright background, professional setup, tutorial style composition, warm lighting", "width": 1280, "height": 720 }'
infsh app run falai/flux-dev-lora --input '{ "prompt": "split composition, left side dark and messy disorganized desk, right side bright clean organized minimalist workspace, dramatic contrast between chaos and order, clear dividing line in center, high contrast", "width": 1280, "height": 720 }'
infsh app run falai/flux-dev-lora --input '{ "prompt": "two products facing each other with dramatic lighting and sparks between them, competition battle concept, dark background with colorful rim lighting, versus comparison style, high energy, product photography", "width": 1280, "height": 720 }'
infsh app run falai/flux-dev-lora --input '{ "prompt": "dynamic arrangement of 7 different colorful objects floating in space against dark gradient background, each item distinct and clearly separated, energetic composition, vibrant saturated colors, studio lighting", "width": 1280, "height": 720 }'
Test one variable at a time: VariableTest A vs BFace vs No faceSame composition, with/without personExpressionSurprise vs curiosityColor schemeWarm vs cool paletteText vs No textWith/without text overlayBackgroundBright vs darkCompositionLeft-facing vs right-facing subject # Generate variant A infsh app run falai/flux-dev-lora --input '{ "prompt": "..., bright yellow background, ...", "width": 1280, "height": 720 }' --no-wait # Generate variant B (same prompt, different background) infsh app run falai/flux-dev-lora --input '{ "prompt": "..., dark navy background, ...", "width": 1280, "height": 720 }' --no-wait
1280x720 minimum (1920x1080 preferred) Under 2MB file size Passes the 120px squint test No critical elements in bottom-right (timestamp) or bottom-left (chapter) Max 3 colors, high contrast Text (if any) is max 6 words, bold, with contrast stroke Face expression matches content energy (if applicable) Doesn't duplicate the video title Stands out from surrounding thumbnails (check your niche) Works on both light and dark YouTube backgrounds
MistakeProblemFixToo much textUnreadable at thumbnail sizeMax 6 words or no textLow contrastDisappears in the feedUse complementary colorsCluttered compositionEye doesn't know where to lookOne focal pointGeneric stock photo feelNo personality, gets skippedAuthentic expressions, unique anglesTiny detailsLost at 120pxBold, simple shapesSame style every videoViewer fatigueVary within brand guidelinesMisleading thumbnailKills trust, hurts retentionMatch the actual content
npx skills add inference-sh/skills@ai-image-generation npx skills add inference-sh/skills@image-upscaling npx skills add inference-sh/skills@prompt-engineering Browse all apps: infsh app list
Writing, remixing, publishing, visual generation, and marketing content production.
Largest current source with strong distribution and engagement signals.