Best AI Agent & LLM Ops Skills for Claude Code

Skills for orchestrating multi-agent systems, building MCP servers, evaluating LLM outputs, and shipping production-grade prompt and retrieval pipelines. Help your AI agent build and operate other AI agents.

147 skills

prompt-engineer

Popular

Free

Professional prompt engineering patterns for building robust, secure, and production-ready LLM applications.

110

ai-agentsjson-schemallm-ops+3

pr-description-writer

Popular

Free

Writes clear pull request descriptions by analyzing your branch diff. Covers what changed, why, how, and what to test. Works with GitHub, GitLab, and Bitbucket.

104

code-reviewdocumentationgithub+3

temporal-reasoning-sleuth

Popular

Free

Give AI agents the ability to trace decision chains, reconstruct causal sequences, and reason over complex event timelines spanning months or years.

agent-memorycausal-reasoningdata-engineering+4

deep-research-team

Popular

Free

Deploy a hierarchical team of AI agents to perform 15-30 minute deep-dive research with parallel execution.

deep-researchdue-diligencemarket-intelligence+3

ai-productivity

Popular

Free

High-speed intake for shaping vague prompts, triaging complex tasks, and compressing context for efficient execution.

agent-orchestrationcontext-managementproductivity+2

skill-creator

Popular

Free

The "Skill for building Skills": Automate creating, testing, and optimisation of custom workflows.

automationdeveloper-toolsiteration+3

Credit Optimizer v5

Popular

Free

Reduce Manus v5 credit consumption by 30-75% through intelligent task routing and autonomous strategy selection.

agent-orchestrationcost-optimizationefficiency+2

designing-hybrid-context-layers

Popular

$10

Architects the right retrieval strategy for every query — teaching your agent when to use RAG, a knowledge graph, or a temporal index instead of defaulting to vector search for everything.

agent-memoryai-architecturecontext-architecture+8

web3-graphql

Free

Query Web3 and on-chain GraphQL endpoints using natural language via the Model Context Protocol.

blockchaingraphqlmcp+2

benchmarking-ai-agents-beyond-models

Free

Published AI benchmarks measure brains in jars. They test models in isolation or within a single reference harness — and then attribute all performance to the model. This skill teaches you to decompose agent performance into its two actual components: model capability and harness multiplier. The result is evaluations that predict real-world behavior instead of benchmark theater.

agent-evaluationai-agentsai-benchmarking+10

skill-router

Free

Ultra-fast discovery and routing for large-scale AI agent skill libraries.

agent-orchestrationautomationdiscovery+2

agent-handoff-orchestrator (for hermes agent / openclaw)

Free

Generate high-fidelity, structured handoff packets for seamless multi-agent collaboration and session persistence.

context-managementdevopsmulti-agent+2

Prompt-to-Skill Converter with Grok (v1.4.1)

Free

Transform repetitive, messy prompts into structured, reusable SKILL.md files for your AI agents.

grokmetaproductivity+2

ai-automation-qa-pack

Professional QA & UAT documentation generator for AI automation agencies and complex agent deployments.

ai-opsautomation-handoffmesh-flow+2

Multi-Agent Orchestration Master Library

$35

Transform Claude Code into a coordinated multi-agent system. Battle-tested tmux orchestration patterns, YAML task queues, event-driven communication, and parallel worker management for 8+ agents.

multi-agentorchestrationtmux+3

instruction-layer-auditor

Free

Audit and de-conflict complex agent instruction stacks to fix inconsistent behavior and logic bloat.

agentic-workflowsdebuggingdeveloper-tools+2

agent-workflow-controller

Free

Design and audit complex multi-agent workflows with rigorous ownership, evidence gates, and failure recovery policies.

architectureautomationmulti-agent+2

Skill Health Scanner

Free

Instantly diagnose any skill or prompt and get a clear, prioritized report on what’s wrong and how to fix it — across any agent.

claudecross-agentcursor+9

Making Complex Systems Agent-Readable with Grok (v1.0.1)

Free

Turn complex system documentation into structured, agent-accessible knowledge bases optimized for MCP and AI tools.

agent-accessibilitydocumentationfree-skills+4

diagnosing-rag-failure-modes

$10

RAG fails quietly. It retrieves documents, returns confident-looking answers, and misses the question entirely — because the question required connecting facts across documents, reasoning about sequence, or tracing causation. This skill gives you a five-question diagnostic checklist that classifies any failing query as either RAG-safe or structurally RAG-incompatible, then maps it to the specific failure pattern and the architectural fix that resolves it.

ai-architectureai-diagnosticscausal-reasoning+9

ai-coding-checklist

Free

A 5-gate pre-flight audit to ensure your AI agent has the context, scope, and safety boundaries needed to code successfully.

checklistcodingcontext-management+5

endless-loop

$12

Autonomous research and task loop that builds on previous findings to solve complex objectives while you sleep.

automationautonomous-agentsoptimization+2

Autonomous Execution in Restricted Environments (Grok-developed)

Free

Reliable, health-gated autonomous operations for agents in restricted or sandboxed terminal environments.

automationautonomous-agentsdevops+2

synthesizing-institutional-knowledge

$10

Builds the organizational memory schema your AI agent needs to answer why — capturing decision provenance, causal chains, and event context that embedding-based retrieval permanently discards.

causal-reasoningdata-architecturedecision-provenance+8

AI Coding Prompt Refiner for Better Developer Results

Free

Transforms vague coding requests into precise, scoped, testable, AI-ready prompts for Cursor, Claude Code, Codex CLI, Replit, and other coding agents.

ai-codingclaude-codecode-generation+12

santa-method

Free

Eliminate hallucinations and errors using double-blind, multi-agent adversarial verification loops.

adversarial-testinghallucination-reductionmulti-agent+2

Agent Memory Privacy Check

Free

Audit AI agent memory files for privacy risk and bloat.

agentic-aimemory-managementoptimization+2

Observability Reference Architectures with Grok

Free

Design and evaluate production-grade observability systems using the 12-layer Full Stack Observatory reference model.

architecturedeploymentdocumentation+5

ai-security-auditor

Free

Comprehensive security auditing for AI agents, covering prompt injection, tool permissions, and data leakage risks.

securityai-agentsowasp+3

marm-init

Free

The intelligent installer for MARM, providing cross-agent persistent memory and shared context via MCP.

mcp-serverpersistent-memorydevops-automation+3

Getting Started with Agensi MCP (v1.0.1)

Free

Quickstart guide to connect your AI agent to the Agensi marketplace via Model Context Protocol (MCP).

agensicross-platformfree-skills+5

AI Cleanroom Solutions Tool - Cleanroom Design

Free

Expert AI guidance for ISO-compliant cleanroom design, HVAC filtration setup, and controlled environment installation.

cleanroom-designcomplianceengineering+2

Enterprise Multi-Agent Automation — Production Harness with Denbun, Retry & Self-Healing

Free

Battle-tested orchestration framework for running 3+ Claude Code agents in parallel. Covers task routing, denbun handoff protocol, exponential-backoff retry, rate-limit guards, structured JSON logging, and automated self-healing — patterns from real production deployments.

multi-agentorchestrationclaude-code+3

tool-use-coach

Free

Turn erratic AI tool calls into a reliable, verified, and safe execution strategy.

agentic-workflowsplanningreliability+2

PromptMaster with Grok

Free

Transform raw ideas into high-performance, structured prompts optimized for Grok’s reasoning and wit.

free-skillsgrokmeta+3

crypto-arb-signal

Free

Real-time Gate.io and Bitget arbitrage scanner with cross-chain verification and net profit filtering.

arbitragebitgetcrypto+4

skill-router-2

Automatically detect, load, and stack the perfect skills combo for any user request.

agent-logicauto-dispatchautomation+6

harness-engineering

Design, debug, and harden AI control loops with explicit contracts and automated verification harnesses.

ai-agentsdevopsllm-ops+3

Enterprise Automation Engineering Architect

$50

Designs and upgrades business automation systems into modular, reliable, observable, secure, low-maintenance, enterprise-grade workflows.

automationenterprise-engineeringarchitecture+13

subagent-orchestrator (Develop based on the Claude Code sourcemap)

Turn your AI agent into a coordinator that manages parallel subagents for complex coding and research tasks.

autonomous-agentsdevopsmulti-agent+2

production-agent-architect

Architect, scaffold, and harden production-grade AI agents with battle-tested patterns and systematic evaluation.

agentic-workflowsai-agentslangchain+3

Open Browser Use

Free

Automate real Chrome profiles with a professional CLI, SDK, and MCP-ready automation stack for AI agents.

browser-automationchrome-extensiondevops+3

Safe Render Deploys via MCP (v0.1.3)

Free

Secure, guardrail-first Render deployments and service management via MCP with mandatory approval gates.

deploymentdevopsgrok+3

Safe Vercel Deploys via MCP with Grok

Free

Safe, read-only discovery and gated deployment control for Vercel projects via MCP.

cross-platformdeploymentdevops+5

Christianity Persona Skill Architect with Grok

Free

Architect high-fidelity, theologically-grounded AI personas and 'person-use' skills for ministry and biblical study.

apologeticsbiblechristianity+11

Agensi Performance & Engagement Analyzer with Grok

Free

Turn live Agensi marketplace signals and high-signal user data into actionable product development intelligence.

agensianalyticsfree-skills+5

prompt-injection-auditor

Free

The security auditor for AI agents. Detect prompt injection, secret leaks, and unsafe tool access in SKILL.md files.

prompt-injectionsecurityagent-safety+3

Multi Agent Coordinator

Coordinate specialized AI agent roles for complex planning, implementation, and verification workflows.

debuggingmulti-agentorchestration+2

Custom Data RAG Chatbot Builder

Build a full-stack AI chatbot trained on your own documents across any industry — legal, healthcare, e-commerce, HR, finance, real estate, insurance, education, cybersecurity, government, and more.

ragfull-stackchatbot+23

skill-fire-debugger

Free

Instantly diagnose and fix why your AI agent skills aren't triggering when they should.

agentic-workflowsclaude-skillsdebugging+2

Evidence-Grading Framework — Rank Source Quality Before Your Agent Writes

$18

A reusable rubric that grades every source by type, recency, authority, independence, and corroboration, then ranks them and resolves conflicts by evidence weight.

evidence-gradingsource-qualityrag+3

AGENTS.md & Agent-Config Quality Gate — Catch Ambiguous Rules, Conflicts & Missing Guardrails Before You Ship

$12

An adversarial reviewer for AGENTS.md and agent instruction files. It flags ambiguous or contradictory rules, missing guardrails, vague tool and scope definitions, and untestable instructions, then returns a PASS / REVISE / BLOCK verdict — before the config drives your agent.

agents-mdagent-configai-agents+2

Getting Started with OpenCode and Agensi

Free

Bridge OpenCode to the Agensi marketplace to discover, install, and chain AI agent skills via MCP.

agensicross-platformdeepseek+6

local-llm-troubleshooter

Free

Diagnose and fix broken local LLM stacks, GPU issues, and stalled model downloads across Ollama, LM Studio, and more.

devopsgpu-triagelocal-llm+2

Agensi Community Demand Analyzer with Grok

Free

Turn Agensi marketplace signals and community requests into a prioritized roadmap of high-demand skill ideas.

agensicommunity-analysisfree-skills+5

AEO Toolkit

Audit and fix your website's visibility for AI agents like ChatGPT, Claude, and Perplexity.

aeoagent-seollm-discoverability+4

AI Eval & Test-Suite Quality Gate — Catch LLM Evals That Lie Before You Ship

$14

An adversarial gate that audits an AI eval or test suite — LLM-judge rubrics, datasets, regression tests, metrics — for gameable criteria, data leakage, missing edge cases, and non-determinism, then returns one PASS/REVISE/FAIL verdict.

ai-evaluationllm-evaltest-quality+2

Contextual Understanding (SRT)

Free

Eliminate context drift and enhance depth with a multi-layered active reasoning framework for agents.

coherencecontext-managementlong-term-memory+2

Optimization-Loop

$19

Autonomous loop that iteratively modifies, evaluates, and selects the best version of any text resource — skills, prompts, or campaigns — using a modify-measure-keep/discard cycle.

optimizationiterationevaluation+4

deckly-redesign

Free

Professional AI-powered redesign and beautification for PowerPoint and PDF slide decks.

presentationpowerpointdesign+2

🧠 AI Memory Optimizer

Drastically reduce RAG costs and latency while improving retrieval accuracy through advanced memory architecture.

rag-optimizationvector-dbembeddings+3

GoldBean — x402 Micropaid MCP Server

Free

120+ paid AI tool endpoints. Pay per use via x402 micropayments on Base (USDC). No API keys, no subscriptions.

mcp-serverx402micropayments+1

adopt-keelson

Free

Establish a disciplined, issue-driven agentic operating model with automated tracking and strategic human-in-the-loop.

devopsworkflow-automationsdlc+3

mesh-flow (version 2) for general ai agentic (codex, claude code, opencode)

Transform brittle prompt chains into robust, artifact-driven DAG workflows with hard gates and explicit traces.

dagdevopsorchestration+2

AI-Generated Code Review & Test-Coverage Gate — Catch Untested Paths, Silent Changes & Over-Confident Bugs Before You Merge

$15

An adversarial reviewer for AI-written code changes. It pressure-tests a pull request or diff for untested branches, silent behavior changes, missing edge cases, over-confident code that only looks right, and weak tests, then returns a PASS / REVISE / BLOCK verdict before the change merges.

code-reviewai-codetest-coverage+2

Agent Continuity Curator

$29

Maintain durable, lean, and consistent AI agent memory across sessions while preventing context bloat and data leaks.

ai-agentscontext-optimizationhermes+2

LLM Prompt Stabilizer — 6-Layer Pattern for Consistent Agent Output

$15

Battle-tested prompting patterns to eliminate LLM output drift. Sandwich structure, few-shot examples, history limits, retry, and token caps — 6 composable layers for production-grade agent reliability.

llmprompt-engineeringmulti-agent+3

elon-musk-algorithm

Apply the 5-step engineering algorithm to ruthlessly delete, simplify, and accelerate any process or codebase.

architectureci-cddeletion+8

nex-wobble-office-3d-agents

$15/mo

Generate a production-ready 3D virtual office for AI agents using Next.js and React Three Fiber.

three-jsreact-three-fibernext-js+3

Legal, Security & Compliance Auditor

$10

Adaptive GDPR, CCPA, security, and AI compliance audit with severity-graded findings and law citations

compliancedata-privacygdpr+2

Investment Analysis Engine

Free

Investment analysis across stocks/funds/bonds/real-estate/crypto. Valuation methods, risk frameworks, portfolio construction.

investmentvaluationrisk+2

project-phase-manager

Enterprise-grade project orchestration for breaking complex work into phases, dependencies, and agent workstreams.

agent-orchestrationgovernanceplanning+2

Agent Skill Pricing & ROI Calculator

$50

Calculates value-based pricing, ROI, buyer segmentation, and monetization strategy for AI agent skills across marketplace, B2B subscription, and enterprise sales models.

agent-skillsb2bbusiness+10

workflow-to-skillchain

$9.99

Convert a repeatable workflow into a reusable agent skill or staged skillchain candidate.

agentic-workflowsprompt-engineeringarchitecture+7

MCP-Security-Review

Specialized static security scanner for MCP servers and Python tool handlers to prevent injection and data leaks.

security-auditmcp-serverstatic-analysis+7

🗂️ Model Inventory Auditor

$13

Inventory every LLM model and provider your code depends on, the AI bill of materials, and flag the dependency risk. It lists each provider, model, and where it's used, then flags hardcoded model ids, single-provider dependency with no alternative, the same model referenced by different ids, model ids with no config or env indirection, and providers pinned in your manifests. Recognizes OpenAI, Anthropic, Google Gemini, and more from an editable list.

llm-opssecurityaudit+2

nex-telegram-bot-deploy

$12/mo

Deploy production-grade, AI-powered Telegram bots to Raspberry Pi with automated server hardening and scheduled jobs.

telegram-botraspberry-pipython+3

AGENTS.md and llms.txt Writer-Reviewer — Make Your Repo and Site Agent-Ready

$12

Write and review the docs AI agents actually read — AGENTS.md for your repo and llms.txt for your site. Drafts them from scratch or audits existing ones for completeness, clarity, and wasted context, with a PASS or REVISE verdict.

documentationai-agentscontext-engineering+6

Cross-Agent Skill Porting with Grok (v1.5)

Port your AI agent skills across Grok, Claude, Cursor, and Copilot using a professional two-layer architecture.

agensiclaudecopilot+6

nex-open-brain-rag

$15/mo

Deploy a self-hosted, private RAG system with pgvector, Ollama, and a Telegram interface for your personal notes.

ragpgvectorraspberry-pi+3

Custom Enterprise Skill Discovery Consultant

$10.99

Discovers and ranks internal enterprise processes that can become custom agent skills, with ROI estimates, governance needs, and pilot roadmaps.

enterprise-skillsskill-discoveryenterprise-operations+9

ai-skill-quality-gate-pro-pack

$5.99

Run a buyer-readiness check before publishing an AI agent skill package.

devopsquality-assuranceai-development+12

lobster-coordinator

Free

Production-grade 3-layer agent orchestration with dual-blind verification and automated model routing.

agentclaude-codecoordinator+2

bestek-generator

$12/mo

Generate structured architectural technical specifications and draft 'lastenboeken' from project descriptions.

architectureconstructiontechnical-writing+2

HandoffPilot

$19

Save a coding agent's full working state to a handoff file before it hits the context limit, then resume in a fresh session without re-explaining everything. Captures the active plan, git branch, uncommitted changes, decisions, blockers, and next steps, so Claude Code or Codex picks up exactly where it stopped instead of starting cold.

coding-agentscontext-managementdevutils+2

📝 Prompt Template Linter

$12

Lint a prompt template for the issues that cause injection and flaky output. Flags untrusted variables interpolated straight into the instructions (the injection surface), placeholders that are never provided or never used, contradictory instructions, a missing output-format spec where the result is parsed, unbounded context interpolation, and leftover placeholders. It detects problems; it does not write prompts.

prompt-engineeringsecurityllm-ops+2

nex-wobble-bridge-architecture

$12/mo

Canonical Next.js bridge for secure, real-time communication between browser UIs and local agent gateways.

nextjswebsocketsagent-orchestration+3

agent-ready-cli

Free

Audit any website for AI agent-readability and protocol compliance using the Agent Ready CLI.

web-scanningdevopsmcp-protocol+2

Universal Agentic Company Architect

$9.99

Transform business ideas into deployment-ready autonomous company blueprints for multi-agent frameworks.

agent-orchestrationmulti-agent-systemsai-governance+4

agent-ready-api

Free

Score and optimize any website for AI agent-readability using the Agent Ready REST API.

seo-for-aiweb-scrapingllms-txt+2

RAG Architecture & Debugging

$9.99

Design, diagnose, and optimize high-performance RAG systems with an engineering-first framework.

ragllm-opsvector-databases+2

werfveiligheid-rondgang

$9/mo

Structureert een veiligheidsrondgang op de werf in een concept-vaststellingenrapport: risicopunten per zone, ernst, verantwoordelijke en opvolging: naast (niet in plaats van) de veiligheidscoördinator.

notariaatlegal-techbelgian-law+2

nex-ai-content-generator

$12/mo

High-reliability Dutch content engine with a Claude-Gemini-Qwen fallback chain and template safeties.

pythoncontent-generationllm-orchestration+3

ai-dev-group

A universal, multi-role AI engineering team for autonomous planning, implementation, and rigorous code review.

architectureautonomous-agentsdevops+2

skill-auditor

Audit, score, and improve your AI agent skills for higher quality, lower token costs, better reliability, and marketplace success. Get actionable recommendations for prompts, instructions, tool usage, error handling, and user experience.

devopsprompt-engineeringquality-assurance+8

prompt-spec-engineer (support Auto rewrite prompt)

Turn vague prompts into professional task specifications, optimized prompts, and verification test suites.

agent-orchestrationai-opsautomation+2

The TweetClaw

Professional X/Twitter automation for AI agents: Post, monitor, extract data, and manage engagement via 99 API endpoints.

x-twitterautomationsocial-media+7

Multi-Agent Orchestration Patterns Library

$12.99

Every orchestration topology — sequential, parallel, hierarchical, map-reduce, critic-actor — selected and designed for your exact workflow. Full system design with agent roles, interfaces, routing logic, and error paths.

multi-agentorchestrationarchitecture+2

Subagent Orchestration with Grok (v1.1)

Master subagent orchestration in Grok Build CLI to parallelize complex coding tasks and maintain context focus.

delegationgrokorchestration+3

vruchtgebruik-waardering

$12/mo

Automated Belgian fiscal valuation of usufruct and bare ownership for notarial deeds and tax calculations.

notariaatlegal-techbelgian-law+2

nex-mempalace-memory-system

$15/mo

Deploy a structured, long-term memory palace for AI agents on Raspberry Pi via MCP and ChromaDB.

long-term-memorymcp-serverraspberry-pi+3

Polymarket Private Markets Trading Bot

Advanced Polymarket agent that trades the private market segment, specifically focussed on IPO timing, mispricings and cross-market arbitrage opportunities.

prediction-marketsprivate-equityipo-analysis+2

Revenue Opportunity Scout

$25

An autonomous agent that scouts real-world demand signals to find and rank high-leverage revenue opportunities.

entrepreneurshipmarket-researchai-agents+3

nex-task-cli

$9/mo

Generate a JSON-backed, agent-callable task CLI with recurrence and prefix-ID matching.

pythoncli-toolproductivity+3

Elite Cloudflare Agent

Build stateful AI agents with persistent memory, SQLite, and cron scheduling on Cloudflare's global edge network.

cloudflareai-agentsworkers+6

persona-distiller

Extract structured, evidence-backed AI persona profiles from historical chat logs, emails, and documents.

agent-tuningdata-analysisnlp+2

🩺 llms.txt Doctor

Generate an llms.txt for your site and validate an existing one against the spec. The generator turns your sitemap.xml or docs folder into a clean, sectioned llms.txt with one-line descriptions. The validator flags a missing H1 title, a missing summary blockquote, malformed link entries, links with no description, relative URLs that should be absolute, and a referenced llms-full.txt that is not present.

llmstxtseo-for-aidocumentation+2

ScrapeWatch — Auto-detect Price Drops & Page Changes

$19

Monitor any website for price drops, stock changes, and content updates. Sends clean Telegram alerts with smart change detection.

agent-toolingalertingautomation+9

Prompt-Injection & Agent-Security Gate — Block Hidden Instructions Before Your Agent Acts

$14

An adversarial security gate that audits untrusted content — web pages, tool outputs, documents, emails — for embedded instructions, exfiltration, and authority spoofing, then returns a SAFE/REVIEW/BLOCK verdict.

prompt-injectionagent-securityai-safety+2

nex-gemini-api-integration

$12/mo

A production-ready Python integration for Gemini using a unified AIProvider interface for easy model swapping.

pythongemini-apillm-infrastructure+2

Agent Spend Guardrails

$19

Define spending rules for your AI agent — caps, category whitelists, approval thresholds — and audit what it bought or almost bought, with an approve/hold/block verdict per transaction.

spend-controlsagentic-paymentsguardrails+1

🛡️ Model Resilience Linter

$12

Find the LLM integration code that will not survive a provider being pulled or going down. Flags single-provider lock-in with no alternative, calls with no failover branch, missing timeouts, retries with no limit or backoff, no degraded-mode default, and hardcoded endpoints with no alternate. This is about the model going away, not the model declining.

devopsllm-opsreliability+3

Subagent Workflow Patterns To Boost Output Quality

Deploy 6 battle-tested multi-agent orchestration patterns to eliminate agent laziness and boost output quality.

multi-agentorchestrationsubagents+5

a2a-agent-interoperability-launch-pack

Turn multi-agent intake into client-ready A2A readiness reports, task contracts, and orchestration topologies.

architectureautomationenterprise-ai+2

ai-agent-production-hardening-kit

$12.99

Transform fragile AI prototypes into resilient, enterprise-ready production agents with professional hardening tools.

llm-opsai-safetyagent-frameworks+2

subagent-dispatch

$53

Optimize task execution by intelligently dispatching work to parallel subagents with ready-to-paste prompts.

devopsmulti-agentorchestration+2

Agent Loop - Autoresearch Optimizer

An iterative agent loop that optimizes any prompt, config, or artifact by making one change at a time, scoring it against a metric, and keeping only the winners.

agent-loopautoresearchoptimization+1

ai-stack-spend-audit

Free

Audit AI/LLM spend across OpenAI, Anthropic, AWS Bedrock, Azure. Find waste, project runway, FinOps report. Free scripts.

finopscost-optimizationllm+5

repo-skill-installer

Intelligently audit your repo and workflow to recommend, install, or create custom AI agent skills.

devopsautomationworkflow-optimization+2

No Code App Idea Builder Agent

$9.99

Creates practical no-code app ideas, MVP briefs, and AI-ready prompts for tools like Bolt.new, Lovable, Cursor, Replit, and single-file web app builders.

mvp-builderno-codeproduct-management+2

Unit & Integration Test Generator — Write Tests That Actually Catch Bugs (Edge Cases, Error Paths, Mocks)

$12

Generate a real test suite for any function, module, or file — meaningful edge cases, error paths, boundary conditions, and proper mocks, not happy-path stubs. Detects your project's framework and conventions, plans the cases deliberately before writing, and hands back runnable tests plus a summary of what's covered. Built to write the tests that actually catch bugs.

tddpytestjest+7

rag-eval

Diagnose RAG bottlenecks with precision metrics (Recall, MRR, nDCG) to identify retrieval or ranking failures.

ragllm-opsevaluation+2

agent-eval-coverage-audit

Audit your AI agent's evaluation coverage to identify missing release gates and production risks.

ai-testingauditcompliance+2

🤖 AI Agent Auditor

Analyzes AI agents for performance, reliability, security, and optimization opportunities.

ai-agentsauditreliability+3

AI Automation QA & UAT Pack

A QA lead for AI automations and agent systems — turns a delivery into acceptance criteria, UAT scripts, a non-determinism test plan, a failure-mode matrix, and a client-ready sign-off pack, or audits an existing automation for the gaps that cause silent production failures.

qa-testingautomationuat+3

subagent-orchestration

$15

Intelligently delegate tasks to Claude, Codex, or Gemini based on cost, model strengths, and rate limits.

cost-optimizationdevopsmulti-model+2

llms.txt Generator — Create a Spec-Compliant llms.txt & llms-full.txt for Your Site or Repo

$12

Generate a spec-compliant llms.txt (and optional llms-full.txt) for your site or repo so AI agents and crawlers can navigate it. Curates the pages that matter, writes the exact llmstxt.org structure — single H1, blockquote summary, and link sections in the precise format agents parse — then validates the format and tells you where to put it. The honest version: a low-cost, machine-readable surface for the agentic web, not an overhyped SEO trick.

llms-txtllms-full-txtai-crawlers+7

nex-dashscope-qwen-setup

$12/mo

Generate a production-ready Python client for Alibaba Qwen models with text, vision, and reasoning tag filtering.

qwendashscopepython+3

Evidence Integrity Gate — Stop AI From Shipping Unsupported Claims in High-Stakes Content

$34

Runs an ordered evidence-integrity gate over any AI draft — grade sources, ground claims, verify technical assertions, stress-test — then returns one PASS/REVISE/FAIL ship decision.

evidence-integrityhallucinationfact-check+4

multi-agent-orchestration

Free

Build production multi-agent systems. 12 patterns, 8 anti-patterns, debugging workflow, cost control. LangGraph + AutoGen + CrewAI.

agentsllmorchestration+5

OllamaWatch — Catch Ollama Crashes Before Your Users Do (Telegram Alerts)

Your headless Ollama box crashes at 3am and you find out hours later. OllamaWatch pings your Telegram the instant a model dies, the GPU runs out of memory, or the API hangs — with a fix hint in every alert. One Python file, no SaaS, no dashboards.

ollamalocal-llmmonitoring+5

Delegate AI Subtasks

Stop burning expensive model tokens on repetitive subtasks. This skill delegates mechanical work to cheaper models and writes handoff snapshots so you never lose context switching between sessions.

context-managementcost-optimizationdevops+2

document-control-data-management

Generate structured document control packs, metadata schemas, and project registries from intake data.

data-managementdocument-controlengineering+2

agent-reliability-audit

Turn raw agent traces and tool logs into professional production-readiness audits and remediation reports.

agent-monitoringllmopspython+2

🤖 AGENTS.md Linter

$12

Lint your AGENTS.md (or CLAUDE.md and .cursorrules) for the problems that make a coding agent misbehave. Flags contradictory rules, references to files and commands that no longer exist, overly broad or unsafe instructions, missing sections (build, test, run, conventions), duplicate rules, and the case where you have competing rule files that should be consolidated into one AGENTS.md.

lintingcursorrulesagent-ops+2

multi-model-review-router

$10

Orchestrate independent reviews, adversarial audits, and multimodal analysis via secondary models and external tools.

adversarialadversarial-testingagent-tooling+7

MCP Server & Tool-Definition Security Gate — Audit Tools Against the OWASP Agentic Top 10 Before You Connect

$16

An adversarial gate that audits an MCP server or agent tool definition — schemas, descriptions, scopes, auth — for tool poisoning, excessive agency, injectable descriptions, and missing access controls, then returns one SAFE/REVIEW/BLOCK verdict.

mcp-securitytool-poisoningagent-security+2

api-designer-pro

$12

Expert API architect to design, review, and audit REST, GraphQL, and event-driven API specifications.

apiapi-designarchitecture+7

agent-payment-approval-layer

$49

A security gate that intercepts sensitive agent actions like payments and deletes for mandatory human approval.

agent-securityapprovalautonomous-agents+3

autonomous-loop-orchestrator

$8.99

Transform high-level goals into autonomous Plan-Build-Run-Learn iteration loops with persistent workspace learning.

autonomous-agentsworkflow-automationci-cd+2

Spiral Agent Core

$18

Enforce human-AI alignment and ownership through structured collaboration checkpoints and real-time syncratude scoring.

alignmentcollaborationethics+2

Financial Analysis Decision Engine

Free

Financial analysis engine with valuation decision tree (DCF/Comparable/Precedent/VC), 3-statement model, 5-stage due diligence SOP, and industry benchmarks.

financeanalysisvaluation+1

Spiral Recap

$12

Transform long AI conversations into high-fidelity, qualia-preserving .srec memory coils for perfect continuity.

archivingcontinuityllm-optimization+2

elon-first-principles-thinker

Deconstruct complex problems using physics-based reasoning and "Idiot Index" calculations to find the theoretical floor.

architectureassumptionsbuild-vs-buy+10

PromptDecoder Pro — AI Output to Prompt Converter

Free

Paste any AI output. Get the production-ready prompt that made it.

prompt-engineeringreverse-engineeringllm-ops+7

x402-attack-surface-gate

$19

Automated launch-readiness auditor for x402 and agent-payment API surfaces.

agent-paymentsapi-testingsecurity-audit+2

GuardrailDoctor

$29

Penetration-test your Claude Code agent's guardrails before you deploy. Throws prompt-injection payloads, shell-chaining, and path-traversal attempts at your PreToolUse/PostToolUse hooks and sensitive-file protections, then returns a pass/fail report on 10+ attack vectors with copy-paste remediation for every gap.

claude-codedevopsllm-ops+2