Requirements

- Target platform: OpenClaw
- Install method: Manual import
- Extraction: Extract archive
- Prerequisites: OpenClaw
- Primary doc: SKILL.md
Graceful rate limit handling with Ollama fallback. Notifies on rate limits, offers local model switch with confirmation for code tasks.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Then review README.md for any prerequisites, environment setup, or post-install checks. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Then review README.md for any prerequisites, environment setup, or post-install checks. Summarize what changed and any follow-up checks I should run.
Handles rate limits and model fallbacks gracefully.
When I encounter rate limits or overload errors from cloud providers (Anthropic, OpenAI):

- Tell the user immediately: don't silently fail or retry endlessly
- Offer local fallback: ask if they want to switch to Ollama
- Wait for confirmation: never auto-switch for code generation tasks
Before using local models for code generation, ask: "Cloud is rate-limited. Switch to local Ollama (qwen2.5:7b)? Reply 'yes' to confirm." For simple queries (chat, summaries), the skill can switch without confirmation if the user has previously approved local use.
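A minimal sketch of this fallback flow in TypeScript. The `SessionState` shape mirrors the fields listed under session state below; the `askUser` helper and the task classification are hypothetical stand-ins for the agent's own primitives, not part of the package.

```typescript
// Sketch of the rate-limit fallback flow (assumed helper names).
type Provider = "cloud" | "local";

interface SessionState {
  currentProvider: Provider;
  lastRateLimitAt: number | null;
  localConfirmedForCode: boolean;
}

async function handleRateLimit(
  state: SessionState,
  task: "code" | "simple",
  askUser: (question: string) => Promise<string>
): Promise<Provider> {
  state.lastRateLimitAt = Date.now();

  // Simple queries may switch silently once the user has approved local use.
  if (task === "simple" && state.localConfirmedForCode) {
    state.currentProvider = "local";
    return "local";
  }

  // Code tasks always require explicit confirmation before switching.
  const reply = await askUser(
    "Cloud is rate-limited. Switch to local Ollama (qwen2.5:7b)? Reply 'yes' to confirm."
  );
  if (reply.trim().toLowerCase() === "yes") {
    state.currentProvider = "local";
    state.localConfirmedForCode = true;
    return "local";
  }
  return state.currentProvider; // stay on cloud; caller decides how to retry
}
```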
Report current state:

- Which provider is active (cloud/local)
- Ollama availability and models
- Recent rate limit events
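A sketch of what that status check could look like, reusing the assumed `SessionState` from the fallback sketch above. It probes Ollama's local model-list endpoint (`/api/tags` on the default port); treat the exact output format as illustrative.

```typescript
// Sketch: report active provider, Ollama availability, and last rate limit.
async function statusReport(state: SessionState): Promise<string> {
  let ollama = "unreachable";
  try {
    const res = await fetch("http://localhost:11434/api/tags");
    const data = (await res.json()) as { models: { name: string }[] };
    ollama = `available (${data.models.map((m) => m.name).join(", ")})`;
  } catch {
    // Ollama is not running locally.
  }
  const last = state.lastRateLimitAt
    ? new Date(state.lastRateLimitAt).toISOString()
    : "none";
  return `provider=${state.currentProvider} ollama=${ollama} lastRateLimit=${last}`;
}
```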
Manually switch to Ollama for the session.
Switch back to cloud provider.
```bash
# Check available models
ollama list

# Run a query
ollama run qwen2.5:7b "your prompt here"

# For longer prompts, use stdin
echo "your prompt" | ollama run qwen2.5:7b
```
Check with `ollama list`. Configured default: `qwen2.5:7b`
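If the skill needs to call the model programmatically rather than shelling out to the CLI, Ollama also exposes a local HTTP API. A minimal non-streaming sketch, assuming the default port 11434:

```typescript
// Minimal non-streaming call to Ollama's local generate endpoint.
async function runLocal(prompt: string, model = "qwen2.5:7b"): Promise<string> {
  const res = await fetch("http://localhost:11434/api/generate", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ model, prompt, stream: false }),
  });
  if (!res.ok) throw new Error(`Ollama returned ${res.status}`);
  const data = (await res.json()) as { response: string };
  return data.response;
}
```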
Track in memory during the session:

- currentProvider: "cloud" | "local"
- lastRateLimitAt: timestamp or null
- localConfirmedForCode: boolean

Reset to cloud at session start.
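The reset rule as a small factory over the same assumed `SessionState`, for illustration:

```typescript
// Fresh state for each session: start on cloud, no rate-limit history,
// no standing approval for local code-generation tasks.
function newSession(): SessionState {
  return {
    currentProvider: "cloud",
    lastRateLimitAt: null,
    localConfirmedForCode: false,
  };
}
```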