Requirements
- Target platform: OpenClaw
- Install method: Manual import
- Extraction: Extract archive
- Prerequisites: OpenClaw
- Primary doc: SKILL.md
Evaluate Clawdbot skills for quality, reliability, and publish-readiness using a multi-framework rubric (ISO 25010, OpenSSF, Shneiderman, agent-specific heuristics). Use when asked to review, audit, evaluate, score, or assess a skill before publishing, or when checking skill quality. Runs automated structural checks and guides manual assessment across 25 criteria.
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Evaluate skills across 25 criteria using a hybrid automated + manual approach.
```
python3 scripts/eval-skill.py /path/to/skill
python3 scripts/eval-skill.py /path/to/skill --json      # machine-readable
python3 scripts/eval-skill.py /path/to/skill --verbose   # show all details
```

Checks: file structure, frontmatter, description quality, script syntax, dependency audit, credential scan, env var documentation.
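If you consume the `--json` output programmatically (for example, to gate CI on the automated score), a thin wrapper is enough. A minimal sketch; the key names `score` and `max_score` are assumptions about the output schema, not confirmed by the script, so adjust to the real keys:

```python
import json
import subprocess
import sys

def automated_score(skill_path: str) -> dict:
    """Run eval-skill.py in JSON mode and parse its report.

    Assumes --json prints a single JSON object to stdout;
    the field names used below are hypothetical.
    """
    result = subprocess.run(
        ["python3", "scripts/eval-skill.py", skill_path, "--json"],
        capture_output=True, text=True, check=True,
    )
    return json.loads(result.stdout)

if __name__ == "__main__":
    report = automated_score(sys.argv[1])
    # Hypothetical schema: {"score": 18, "max_score": 24, ...}
    print(f"automated: {report.get('score')}/{report.get('max_score')}")
```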
Use the rubric at references/rubric.md to score 25 criteria across 8 categories (0–4 each, 100 total). Each criterion has concrete descriptions per score level.
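The arithmetic is simple: 25 criteria at 0–4 points each gives a 100-point ceiling. A minimal sketch of tallying scores; the entries shown are illustrative, so use the full criterion set from references/rubric.md for a real evaluation:

```python
# Scores per criterion, 0-4 each, keyed by (category, criterion).
# Illustrative subset only; a real evaluation scores all 25 criteria.
scores = {
    ("Functional Suitability", "Completeness"): 3,
    ("Functional Suitability", "Correctness"): 4,
    ("Security", "Credentials"): 2,
    # ... remaining criteria ...
}

assert all(0 <= s <= 4 for s in scores.values()), "each criterion is 0-4"
total = sum(scores.values())  # out of 25 * 4 = 100 when fully scored
print(f"total: {total}/100")
```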
Copy assets/EVAL-TEMPLATE.md to the skill directory as EVAL.md. Fill in automated results + manual scores.
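The same step in code, a sketch that assumes you run it from the evaluator's root; the destination is whatever skill directory you are evaluating:

```python
import shutil

# Seed the evaluation report from the bundled template.
shutil.copy("assets/EVAL-TEMPLATE.md", "/path/to/skill/EVAL.md")
```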
1. Run eval-skill.py → get the automated structural score
2. Read the skill's SKILL.md → understand what it does
3. Read/skim the scripts → assess code quality, error handling, testability
4. Score each manual criterion using references/rubric.md → concrete criteria per level
5. Prioritize findings as P0 (blocks publishing) / P1 (should fix) / P2 (nice to have); see the sketch after this list
6. Write EVAL.md in the skill directory with scores + findings
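For step 5, a small structure keeps findings sortable by priority. The severity labels come straight from the workflow above; the finding entries themselves are hypothetical examples:

```python
from dataclasses import dataclass

@dataclass
class Finding:
    priority: str   # "P0" blocks publishing, "P1" should fix, "P2" nice to have
    criterion: str  # rubric criterion the finding falls under
    note: str       # what to fix and where

findings = [
    Finding("P0", "Credentials", "API key committed in a script"),       # hypothetical
    Finding("P2", "Learnability", "SKILL.md lacks a quick-start example"),  # hypothetical
]

# P0s gate publishing (see the score table below).
blockers = [f for f in findings if f.priority == "P0"]
print(f"{len(blockers)} blocking finding(s)")
```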
| # | Category | Source Framework | Criteria |
|---|----------|------------------|----------|
| 1 | Functional Suitability | ISO 25010 | Completeness, Correctness, Appropriateness |
| 2 | Reliability | ISO 25010 | Fault Tolerance, Error Reporting, Recoverability |
| 3 | Performance / Context | ISO 25010 + Agent | Token Cost, Execution Efficiency |
| 4 | Usability – AI Agent | Shneiderman, Gerhardt-Powals | Learnability, Consistency, Feedback, Error Prevention |
| 5 | Usability – Human | Tognazzini, Norman | Discoverability, Forgiveness |
| 6 | Security | ISO 25010 + OpenSSF | Credentials, Input Validation, Data Safety |
| 7 | Maintainability | ISO 25010 | Modularity, Modifiability, Testability |
| 8 | Agent-Specific | Novel | Trigger Precision, Progressive Disclosure, Composability, Idempotency, Escape Hatches |
| Range | Verdict | Action |
|-------|---------|--------|
| 90–100 | Excellent | Publish confidently |
| 80–89 | Good | Publishable, note known issues |
| 70–79 | Acceptable | Fix P0s before publishing |
| 60–69 | Needs Work | Fix P0+P1 before publishing |
| <60 | Not Ready | Significant rework needed |
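The bands translate directly into a lookup; a minimal sketch:

```python
def verdict(total: int) -> str:
    """Map a 0-100 rubric total to the verdict bands above."""
    if total >= 90:
        return "Excellent: publish confidently"
    if total >= 80:
        return "Good: publishable, note known issues"
    if total >= 70:
        return "Acceptable: fix P0s before publishing"
    if total >= 60:
        return "Needs Work: fix P0+P1 before publishing"
    return "Not Ready: significant rework needed"

print(verdict(83))  # -> "Good: publishable, note known issues"
```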
This evaluator covers security basics (credentials, input validation, data safety), but for thorough security audits of skills under development, consider SkillLens (`npx skilllens scan <path>`). It checks for exfiltration, code execution, persistence, privilege bypass, and prompt injection, complementary to the quality focus here.
- Python 3.6+ (for eval-skill.py)
- PyYAML (`pip install pyyaml`) for frontmatter parsing in automated checks
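A quick preflight sketch that confirms both prerequisites before running the evaluator:

```python
import sys

# eval-skill.py targets Python 3.6+; bail out early on older interpreters.
if sys.version_info < (3, 6):
    sys.exit("Python 3.6+ required")

# PyYAML is only needed for the automated frontmatter checks.
try:
    import yaml  # noqa: F401
except ImportError:
    sys.exit("PyYAML missing: pip install pyyaml")

print("prerequisites OK")
```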
Code helpers, APIs, CLIs, browser automation, testing, and developer operations.
Largest current source with strong distribution and engagement signals.