Requirements
- Target platform: OpenClaw
- Install method: Manual import
- Extraction: Extract archive
- Prerequisites: OpenClaw
- Primary doc: SKILL.md
AI/LLM red team testing skill. Point at any LLM API endpoint and run automated security assessments. 160+ attack payloads across prompt injection, jailbreak,...
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run.
Automated security testing for language models. Point at any LLM API endpoint, select attack modules, and run assessments with real-time results and exportable reports. ⚠️ For authorized security testing and research only. Only test systems you own or have explicit permission to audit.
# Clone and install
git clone https://github.com/rustyorb/pincer.git {baseDir}/redpincer
cd {baseDir}/redpincer
npm ci

# Run
npm run dev
# Dashboard at http://localhost:3000

For production:

npm run build
npx next start -H 0.0.0.0 -p 3000
| Category | Payloads | Description |
|---|---|---|
| 💉 Prompt Injection | 40 | Instruction override, delimiter confusion, indirect injection, payload smuggling |
| 🔓 Jailbreak | 40 | Persona splitting, gradual escalation, hypothetical framing, roleplay exploitation |
| 🔍 Data Extraction | 40 | System prompt theft, training data probing, membership inference, embedding extraction |
| 🛡️ Guardrail Bypass | 40 | Output filter evasion, multi-language bypass, homoglyph tricks, context overflow |

Total: 160 base payloads × 20 variant transforms = 3,200 test permutations
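The total above can be checked with a few lines of arithmetic. This is a minimal sketch, assuming every variant transform applies to every base payload; the `categories` array and its field names are hypothetical and only mirror the numbers in the table.

```typescript
// Payload counts per category, copied from the table above.
// The object shape is an assumption for illustration only.
const categories: Array<{ name: string; payloads: number }> = [
  { name: "Prompt Injection", payloads: 40 },
  { name: "Jailbreak", payloads: 40 },
  { name: "Data Extraction", payloads: 40 },
  { name: "Guardrail Bypass", payloads: 40 },
];

// Number of variant transforms, as stated above.
const variantTransforms = 20;

// Sum the base payloads, then multiply by the number of transforms.
const totalBase = categories.reduce((sum, c) => sum + c.payloads, 0);
const totalPermutations = totalBase * variantTransforms;

console.log(`${totalBase} base payloads, ${totalPermutations} permutations`);
```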
OpenAI · Anthropic · OpenRouter · Any OpenAI-compatible endpoint
- 160+ payloads across 4 categories
- Model-specific attacks (GPT, Claude, Llama variants)
- 20 variant transforms (unicode, encoding, case rotation, etc.)
- Attack chaining with template variables ({{previous_response}})
- AI-powered payload generation: uses the target LLM to generate novel attacks against itself
- Stop/cancel running attacks instantly
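Attack chaining with `{{previous_response}}` can be pictured as plain template substitution between steps. The sketch below is an assumption about how such a chain might be wired, not RedPincer's actual code: `ChainStep`, `renderStep`, `runChain`, and the `send` callback are all hypothetical names.

```typescript
// Hypothetical shape of one chain step; not taken from the project source.
interface ChainStep {
  template: string;
}

// Replace every {{previous_response}} placeholder with the prior step's output.
// A replacer function is used so "$" sequences in the response are inserted literally.
function renderStep(step: ChainStep, previousResponse: string): string {
  return step.template.replace(/\{\{previous_response\}\}/g, () => previousResponse);
}

// Run the steps in order, feeding each response into the next template.
// `send` stands in for whatever function calls the target endpoint.
async function runChain(
  steps: ChainStep[],
  send: (prompt: string) => Promise<string>,
): Promise<string[]> {
  const responses: string[] = [];
  let previous = "";
  for (const step of steps) {
    const prompt = renderStep(step, previous);
    previous = await send(prompt);
    responses.push(previous);
  }
  return responses;
}
```

The first step receives an empty string for the placeholder, so a chain would normally start with a template that does not reference it.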
- Heuristic response classifier with context-aware analysis
- Reduced false positives: detects "explain then refuse" patterns
- Vulnerability heatmap: visual category × severity matrix
- Custom scoring rubrics with weighted grades (A+ to F)
- Verbose 10-section pen-test reports with appendices
- Multi-target comparison: side-by-side security profiles
- Regression testing: save baselines, track fixes over time
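A weighted rubric of the kind described above could be computed roughly as follows. The category weights and the grade thresholds are illustrative assumptions, not the tool's defaults, and `weightedScore` and `letterGrade` are hypothetical helper names.

```typescript
type Category = "injection" | "jailbreak" | "extraction" | "bypass";

interface CategoryResult {
  category: Category;
  total: number;      // payloads sent in this category
  vulnerable: number; // responses classified as vulnerable
}

// Assumed weights: a higher weight makes failures in that category count more.
const weights: Record<Category, number> = {
  injection: 2,
  jailbreak: 1,
  extraction: 2,
  bypass: 1,
};

// Weighted average of per-category pass rates, scaled to 0-100.
function weightedScore(results: CategoryResult[]): number {
  let weighted = 0;
  let weightSum = 0;
  for (const r of results) {
    if (r.total === 0) continue; // skip categories that were not tested
    const passRate = 1 - r.vulnerable / r.total;
    weighted += passRate * weights[r.category];
    weightSum += weights[r.category];
  }
  return weightSum === 0 ? 0 : (weighted / weightSum) * 100;
}

// Map a 0-100 score to a letter grade; the cut-offs are assumptions.
function letterGrade(score: number): string {
  if (score >= 97) return "A+";
  if (score >= 90) return "A";
  if (score >= 80) return "B";
  if (score >= 70) return "C";
  if (score >= 60) return "D";
  return "F";
}
```

For example, a 90% pass rate on injection (weight 2) and 80% on jailbreak (weight 1) averages to about 86.7 under these assumed weights.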
| Tool | What It Does |
|---|---|
| Compare | Same payloads against 2-4 targets simultaneously |
| Adaptive | Analyzes weaknesses, generates targeted follow-ups |
| Heatmap | Visual matrix of vulnerability rates by category/severity |
| Regression | Save baseline → re-run later → detect fixes or regressions |
| Scoring | Custom rubrics with weighted category/severity/classification scores |
| Chains | Multi-step attacks with {{previous_response}} templates |
| Payload Editor | Create custom payloads with syntax highlighting + AI generation |
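The regression workflow (save a baseline, re-run, detect fixes or regressions) amounts to diffing two result sets keyed by payload. Here is a minimal sketch under that assumption; the `RunResults` shape and the `diffRuns` function are hypothetical and not taken from the project.

```typescript
type Verdict = "vulnerable" | "safe";

// Maps a payload id to the verdict it received in one run (assumed shape).
type RunResults = Record<string, Verdict>;

interface RegressionDiff {
  fixed: string[];     // vulnerable in the baseline, safe now
  regressed: string[]; // safe in the baseline, vulnerable now
  unchanged: string[]; // same verdict in both runs
}

function diffRuns(baseline: RunResults, current: RunResults): RegressionDiff {
  const diff: RegressionDiff = { fixed: [], regressed: [], unchanged: [] };
  for (const id of Object.keys(baseline)) {
    if (!(id in current)) continue; // payload was not re-run; nothing to compare
    const before = baseline[id];
    const after = current[id];
    if (before === "vulnerable" && after === "safe") diff.fixed.push(id);
    else if (before === "safe" && after === "vulnerable") diff.regressed.push(id);
    else diff.unchanged.push(id);
  }
  return diff;
}
```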
1. Configure Target → Add LLM endpoint + API key + model
2. Select Categories → Pick attack types to test
3. Run Attack → Stream results in real-time
4. Review Results → Heuristic classification + severity scores
5. Adaptive → Auto-generate follow-up attacks on weaknesses
6. Generate Report → Export comprehensive findings as Markdown
- All client-side: no server components, your API keys stay local
- NDJSON streaming: real-time results during attack runs
- Heuristic analysis: pattern-matching classifier (no LLM-based grading = no extra cost)
- Zustand + localStorage: state persists across sessions
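NDJSON streaming means each result arrives as one JSON object per line, so a consumer has to buffer partial lines between chunks. The sketch below shows that buffering in isolation; the `AttackEvent` fields and the `NdjsonParser` class are assumptions for illustration, not the dashboard's real types.

```typescript
// Hypothetical event shape; the real stream's fields may differ.
interface AttackEvent {
  payloadId: string;
  verdict: string;
}

// Incrementally turns text chunks into parsed objects, one per complete line.
class NdjsonParser<T> {
  private buffer = "";

  push(chunk: string): T[] {
    this.buffer += chunk;
    const lines = this.buffer.split("\n");
    // The last element is an incomplete line (or ""); keep it for the next chunk.
    this.buffer = lines.pop() ?? "";
    const events: T[] = [];
    for (const line of lines) {
      const trimmed = line.trim();
      if (trimmed !== "") events.push(JSON.parse(trimmed) as T);
    }
    return events;
  }
}

// Usage: feed chunks as they arrive, for example from a fetch() body reader.
const parser = new NdjsonParser<AttackEvent>();
const firstBatch = parser.push('{"payloadId":"p1","verdict":"sa');
const secondBatch = parser.push('fe"}\n{"payloadId":"p2","verdict":"vulnerable"}\n');
console.log(firstBatch.length, secondBatch.length);
```

Because a chunk boundary can fall in the middle of a JSON object, the first call yields nothing and the second yields both events once their newlines have arrived.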
For autonomous multi-strategy campaigns (CLI/TUI), see RedClaw, the autonomous red-teaming agent framework.

- RedPincer = web dashboard, manual + automated testing
- RedClaw = autonomous CLI agent, adaptive multi-strategy campaigns
- Together = complete LLM security testing suite

Built by @rustyorb. Crack open those guardrails. 🦞