# Send Smart Model Routing for Z.AI to your agent
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
## Fast path
- Download the package from Yavira.
- Extract it into a folder your agent can access.
- Paste one of the prompts below and point your agent at the extracted folder.
## Suggested prompts
### New install

```text
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
```
### Upgrade existing

```text
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
```
## Machine-readable fields
```json
{
  "schemaVersion": "1.0",
  "item": {
    "slug": "smart-model-routing-for-zai",
    "name": "Smart Model Routing for Z.AI",
    "source": "tencent",
    "type": "skill",
    "category": "Developer Tools",
    "sourceUrl": "https://clawhub.ai/PrincNL/smart-model-routing-for-zai",
    "canonicalUrl": "https://clawhub.ai/PrincNL/smart-model-routing-for-zai",
    "targetPlatform": "OpenClaw"
  },
  "install": {
    "downloadUrl": "/downloads/smart-model-routing-for-zai",
    "sourceDownloadUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=smart-model-routing-for-zai",
    "sourcePlatform": "tencent",
    "targetPlatform": "OpenClaw",
    "packageFormat": "ZIP package",
    "primaryDoc": "SKILL.md",
    "includedAssets": [
      "SKILL.md"
    ],
    "downloadMode": "redirect",
    "sourceHealth": {
      "source": "tencent",
      "status": "healthy",
      "reason": "direct_download_ok",
      "recommendedAction": "download",
      "checkedAt": "2026-05-07T17:22:31.273Z",
      "expiresAt": "2026-05-14T17:22:31.273Z",
      "httpStatus": 200,
      "finalUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=afrexai-annual-report",
      "contentType": "application/zip",
      "probeMethod": "head",
      "details": {
        "probeUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=afrexai-annual-report",
        "contentDisposition": "attachment; filename=\"afrexai-annual-report-1.0.0.zip\"",
        "redirectLocation": null,
        "bodySnippet": null
      },
      "scope": "source",
      "summary": "Source download looks usable.",
      "detail": "Yavira can redirect you to the upstream package for this source.",
      "primaryActionLabel": "Download for OpenClaw",
      "primaryActionHref": "/downloads/smart-model-routing-for-zai"
    },
    "validation": {
      "installChecklist": [
        "Use the Yavira download entry.",
        "Review SKILL.md after the package is downloaded.",
        "Confirm the extracted package contains the expected setup assets."
      ],
      "postInstallChecks": [
        "Confirm the extracted package includes the expected docs or setup files.",
        "Validate the skill or prompts are available in your target agent workspace.",
        "Capture any manual follow-up steps the agent could not complete."
      ]
    }
  },
  "links": {
    "detailUrl": "https://openagent3.xyz/skills/smart-model-routing-for-zai",
    "downloadUrl": "https://openagent3.xyz/downloads/smart-model-routing-for-zai",
    "agentUrl": "https://openagent3.xyz/skills/smart-model-routing-for-zai/agent",
    "manifestUrl": "https://openagent3.xyz/skills/smart-model-routing-for-zai/agent.json",
    "briefUrl": "https://openagent3.xyz/skills/smart-model-routing-for-zai/agent.md"
  }
}
```
## Documentation

### Smart Model Switching

Three-tier z.ai (GLM) routing: Flash → Standard → Plus / 32B

Start with the cheapest model. Escalate only when needed. Designed to minimize API cost without sacrificing correctness.

### The Golden Rule

If a human would need more than 30 seconds of focused thinking, escalate from Flash to Standard.
If the task involves architecture, complex tradeoffs, or deep reasoning, escalate to Plus / 32B.
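
The rule above can be sketched as a small function. This is only an illustration: `Tier`, `choose_tier`, and both parameters are hypothetical names that are not part of the skill, and estimating the "human thinking time" is left to the caller.

```python
from enum import Enum


class Tier(Enum):
    """The three routing tiers named in this document."""
    FLASH = "Flash"
    STANDARD = "Standard"
    PLUS = "Plus / 32B"


def choose_tier(human_thinking_seconds: float, needs_deep_reasoning: bool = False) -> Tier:
    """Apply the golden rule and return the cheapest adequate tier.

    human_thinking_seconds: rough estimate of focused human effort (assumption).
    needs_deep_reasoning: True for architecture, complex tradeoffs, deep reasoning.
    """
    if needs_deep_reasoning:
        return Tier.PLUS
    if human_thinking_seconds > 30:
        return Tier.STANDARD
    return Tier.FLASH
```

For example, `choose_tier(5)` stays on Flash, while `choose_tier(120)` escalates to Standard.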

### Model Reality (Relative)

| Tier | Example Models | Purpose |
| --- | --- | --- |
| Flash | GLM-4.5-Flash, GLM-4.7-Flash | Fastest & cheapest |
| Standard | GLM-4.6, GLM-4.7 | Strong reasoning & code |
| Plus / 32B | GLM-4-Plus, GLM-4-32B-128K | Heavy reasoning & architecture |

Bottom line: choosing the wrong model wastes either money or time. Use Flash for simple tasks, Standard for normal work, and Plus / 32B for complex decisions.
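
One possible way to hold the tiers in code is a plain lookup table with an escalation helper. The model names come from this document; the lowercase keys, the helper names, and the choice of the first listed model as the default for a tier are assumptions made for this sketch.

```python
# Tier -> example models, as listed in this document.
TIER_MODELS = {
    "flash": ["GLM-4.5-Flash", "GLM-4.7-Flash"],
    "standard": ["GLM-4.6", "GLM-4.7"],
    "plus": ["GLM-4-Plus", "GLM-4-32B-128K"],
}

# Cheapest first; escalation moves one step to the right.
ESCALATION_ORDER = ["flash", "standard", "plus"]


def default_model(tier: str) -> str:
    """Return the first listed model for a tier (an arbitrary default)."""
    return TIER_MODELS[tier][0]


def escalate(tier: str) -> str:
    """Return the next tier up, or the same tier if already at the top."""
    idx = ESCALATION_ORDER.index(tier)
    return ESCALATION_ORDER[min(idx + 1, len(ESCALATION_ORDER) - 1)]
```

Keeping the order in one list means a struggling task can be retried with `default_model(escalate(tier))` without hard-coding model names at each call site.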

### 💚 FLASH — Default for Simple Tasks

Stay on Flash for:

- Factual Q&A: “what is X”, “who is Y”, “when did Z”
- Quick lookups: definitions, unit conversions, short translations
- Status checks: monitoring, file reads, session state
- Heartbeats: periodic checks, OK responses
- Memory & reminders
- Casual conversation: greetings, acknowledgments
- Simple file ops: read, list, basic writes
- One-liner tasks: anything answerable in 1–2 sentences
- Cron jobs (always Flash by default)

### NEVER do these on Flash

- ❌ Write code longer than 10 lines
- ❌ Create comparison tables
- ❌ Write more than 3 paragraphs
- ❌ Do multi-step analysis
- ❌ Write reports or proposals
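
These limits can be expressed as a guard that runs before a task is sent to Flash. The sketch below is hypothetical: `TaskShape` and its fields are invented here to describe the expected output, and how those values are estimated is up to the caller.

```python
from dataclasses import dataclass


@dataclass
class TaskShape:
    """Rough description of the expected output (all fields are assumptions)."""
    code_lines: int = 0
    paragraphs: int = 0
    needs_table: bool = False
    multi_step: bool = False
    is_report: bool = False


def flash_allowed(task: TaskShape) -> bool:
    """Return False if the task breaks any of the Flash limits listed above."""
    return not (
        task.code_lines > 10      # code longer than 10 lines
        or task.paragraphs > 3    # more than 3 paragraphs
        or task.needs_table       # comparison tables
        or task.multi_step        # multi-step analysis
        or task.is_report         # reports or proposals
    )
```

A task that fails this check would move to Standard rather than being attempted on Flash first.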

### 💛 STANDARD — Core Workhorse

Escalate to Standard for:

### Code & Technical

- Code generation: functions, scripts, features
- Debugging: normal bug investigation
- Code review: PRs, refactors
- Documentation: README, comments, guides

### Analysis & Planning

- Comparisons and evaluations
- Planning: roadmaps, task breakdowns
- Research synthesis
- Multi-step reasoning

### Writing & Content

- Long-form writing (>3 paragraphs)
- Summaries of long documents
- Structured output: tables, outlines

Most real user conversations belong here.

### ❤️ PLUS / 32B — Complex Reasoning Only

Escalate to Plus / 32B for:

### Architecture & Design

- System and service architecture
- Database schema design
- Distributed or multi-tenant systems
- Major refactors across multiple files

### Deep Analysis

- Complex debugging (race conditions, subtle bugs)
- Security reviews
- Performance optimization strategy
- Root cause analysis

### Strategic & Judgment-Based Work

- Strategic planning
- Nuanced judgment and ambiguity
- Deep or multi-source research
- Critical production decisions

### For Subagents

```text
// Routine monitoring
sessions_spawn(task="Check backup status", model="GLM-4.5-Flash")

// Standard code work
sessions_spawn(task="Build the REST API endpoint", model="GLM-4.7")

// Architecture decisions
sessions_spawn(task="Design the database schema for multi-tenancy", model="GLM-4-Plus")
```

### For Cron Jobs

```json
{
  "payload": {
    "kind": "agentTurn",
    "model": "GLM-4.5-Flash"
  }
}
```

Always use Flash for cron unless the task genuinely needs reasoning.
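
If cron payloads are generated by a script, the Flash default can be built in. This is a minimal sketch that emits only the three fields shown in this document (`payload`, `kind`, `model`); the function name, its parameters, and the use of `GLM-4.7` as the fallback for reasoning-heavy jobs are assumptions, and a real cron entry may need additional fields.

```python
import json


def cron_payload(needs_reasoning: bool = False, reasoning_model: str = "GLM-4.7") -> dict:
    """Build a cron payload that uses Flash unless reasoning is explicitly required."""
    model = reasoning_model if needs_reasoning else "GLM-4.5-Flash"
    return {"payload": {"kind": "agentTurn", "model": model}}


if __name__ == "__main__":
    # Print the default payload as JSON.
    print(json.dumps(cron_payload(), indent=2))
```

Making the cheaper model the default means a job only costs more when someone deliberately opts in.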

### 📊 Quick Decision Tree

```text
Is it a greeting, lookup, status check, or 1–2 sentence answer?
  YES → FLASH
  NO ↓

Is it code, analysis, planning, writing, or multi-step?
  YES → STANDARD
  NO ↓

Is it architecture, deep reasoning, or a critical decision?
  YES → PLUS / 32B
  NO → Default to STANDARD, escalate if struggling
```
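
The tree above can be followed literally in code, asking the three questions in order. The function and parameter names below are hypothetical, and answering each question is left to the caller.

```python
def route(is_quick_answer: bool, is_regular_work: bool, is_complex_decision: bool) -> str:
    """Walk the decision tree top to bottom and return the tier label.

    is_quick_answer: greeting, lookup, status check, or 1-2 sentence answer.
    is_regular_work: code, analysis, planning, writing, or multi-step work.
    is_complex_decision: architecture, deep reasoning, or a critical decision.
    """
    if is_quick_answer:
        return "FLASH"
    if is_regular_work:
        return "STANDARD"
    if is_complex_decision:
        return "PLUS / 32B"
    # Nothing matched: default to Standard and escalate if it struggles.
    return "STANDARD"
```

Because the questions are checked in order, a caller following this sketch would answer the second question "yes" only for work that is not architecture-level; otherwise a task that is both code and architecture stops at Standard.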
### 📋 Quick Reference Card

```text
┌─────────────────────────────────────────────────────────────┐
│                  SMART MODEL SWITCHING                      │
│              Flash → Standard → Plus / 32B                  │
├─────────────────────────────────────────────────────────────┤
│  💚 FLASH (cheapest)                                        │
│  • Greetings, status checks, quick lookups                  │
│  • Factual Q&A, reminders                                   │
│  • Simple file ops, 1–2 sentence answers                    │
├─────────────────────────────────────────────────────────────┤
│  💛 STANDARD (workhorse)                                    │
│  • Code > 10 lines, debugging                               │
│  • Analysis, comparisons, planning                          │
│  • Reports, long writing                                    │
├─────────────────────────────────────────────────────────────┤
│  ❤️ PLUS / 32B (complex)                                    │
│  • Architecture decisions                                   │
│  • Complex debugging, multi-file refactoring                │
│  • Strategic planning, deep research                        │
├─────────────────────────────────────────────────────────────┤
│  💡 RULE: >30 sec human thinking → escalate                 │
│  💰 START CHEAP → SCALE ONLY WHEN NEEDED                    │
└─────────────────────────────────────────────────────────────┘
```

Built for z.ai (GLM) setups.
## Trust
- Source: tencent
- Verification: Indexed source record
- Publisher: PrincNL
- Version: 1.0.0
## Source health
- Status: healthy
- Source download looks usable.
- Yavira can redirect you to the upstream package for this source.
- Health scope: source
- Reason: direct_download_ok
- Checked at: 2026-05-07T17:22:31.273Z
- Expires at: 2026-05-14T17:22:31.273Z
- Recommended action: Download for OpenClaw
## Links
- [Detail page](https://openagent3.xyz/skills/smart-model-routing-for-zai)
- [Send to Agent page](https://openagent3.xyz/skills/smart-model-routing-for-zai/agent)
- [JSON manifest](https://openagent3.xyz/skills/smart-model-routing-for-zai/agent.json)
- [Markdown brief](https://openagent3.xyz/skills/smart-model-routing-for-zai/agent.md)
- [Download page](https://openagent3.xyz/downloads/smart-model-routing-for-zai)