Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Token-safe prompt assembly with memory orchestration. Use for any agent that needs to construct LLM prompts with memory retrieval. Guarantees no API failure due to token overflow. Implements two-phase context construction, memory safety valve, and hard limits on memory injection.
Token-safe prompt assembly with memory orchestration. Use for any agent that needs to construct LLM prompts with memory retrieval. Guarantees no API failure due to token overflow. Implements two-phase context construction, memory safety valve, and hard limits on memory injection.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
A standardized, token-safe prompt assembly framework that guarantees API stability. Implements Two-Phase Context Construction and Memory Safety Valve to prevent token overflow while maximizing relevant context. Design Goals: โ Never fail due to memory-related token overflow โ Memory is always discardable enhancement, never rigid dependency โ Token budget decisions centralized at prompt assemble layer
Use this skill when: Building or modifying any agent that constructs prompts Implementing memory retrieval systems Adding new prompt-related logic to existing agents Any scenario where token budget safety is required
User Input โ Need-Memory Decision โ Minimal Context Build โ Memory Retrieval (Optional) โ Memory Summarization โ Token Estimation โ Safety Valve Decision โ Final Prompt โ LLM Call
# Model Context Windows (2026-02-04) # - MiniMax-M2.1: 204,000 tokens (default) # - Claude 3.5 Sonnet: 200,000 tokens # - GPT-4o: 128,000 tokens MAX_TOKENS = 204000 # Set to your model's context limit SAFETY_MARGIN = 0.75 * MAX_TOKENS # Conservative: 75% threshold = 153,000 tokens MEMORY_TOP_K = 3 # Max 3 memories MEMORY_SUMMARY_MAX = 3 lines # Max 3 lines per memory Design Philosophy: Leave 25% buffer for safety (model overhead, estimation errors, spikes) Better to underutilize capacity than to overflow
System prompt Recent N messages (N=3, trimmed) Current user input No memory by default
def need_memory(user_input): triggers = [ "previously", "earlier we discussed", "do you remember", "as I mentioned before", "continuing from", "before we", "last time", "previously mentioned" ] for trigger in triggers: if trigger.lower() in user_input.lower(): return True return False
memories = memory_search(query=user_input, top_k=MEMORY_TOP_K) for mem in memories: summarized_memories.append(summarize(mem, max_lines=MEMORY_SUMMARY_MAX))
Calculate estimated tokens for base_context + summarized_memories.
if estimated_tokens > SAFETY_MARGIN: base_context.append("[System Notice] Relevant memory skipped due to token budget.") return assemble(base_context) Hard Rules: โ Never downgrade system prompt โ Never truncate user input โ No "lucky splicing" โ Only memory layer is expendable
final_prompt = assemble(base_context + summarized_memories) return final_prompt
โ User preferences / identity / long-term goals โ Confirmed important conclusions โ System-level settings and rules
โ Raw conversation logs โ Reasoning traces โ Temporary discussions โ Information recoverable from chat history
Copy scripts/prompt_assemble.py to your agent and use: from prompt_assemble import build_prompt # In your agent's prompt construction: final_prompt = build_prompt(user_input, memory_search_fn, get_recent_dialog_fn)
prompt_assemble.py - Complete implementation with all phases (PromptAssembler class)
memory_standards.md - Detailed memory content guidelines token_estimation.md - Token counting strategies
Code helpers, APIs, CLIs, browser automation, testing, and developer operations.
Largest current source with strong distribution and engagement signals.