code-reviewer
Reviews your code for bugs, security vulnerabilities, logic errors, performance issues, and style violations. Organizes findings by severity and suggests fixes with code examples.
Equip your AI coding agent with skills for writing unit tests, integration tests, and end-to-end tests. Improve code coverage, catch regressions early, and automate quality assurance workflows.
📖 Related guide: Best Testing & QA Skills for Claude Code →
108 skills
Reviews your code for bugs, security vulnerabilities, logic errors, performance issues, and style violations. Organizes findings by severity and suggests fixes with code examples.

High-integrity landing page audits that identify ad spend hazards and conversion blockers for Google Ads traffic.
Turn your AI agent into a senior engineer with strict task classification and verification-driven coding protocols.
Identify robotic prose and AI-generated patterns with a 0-5 diagnostic score and structured linguistic analysis.
Professional QA & UAT documentation generator for AI automation agencies and complex agent deployments.
A risk-aware, evidence-based engineering lifecycle protocol for robust agentic task execution and safety.
A systematic 4-phase debugging framework to find root causes, eliminate flaky tests, and prevent regressions.
Expert Lean Six Sigma guidance, statistical formulas, and operational tools for industrial-grade process improvement.
Eliminate hallucinations and errors using double-blind, multi-agent adversarial verification loops.

Design and audit complex multi-agent workflows with rigorous ownership, evidence gates, and failure recovery policies.
Audit your codebase for technical debt and generate a prioritized, actionable remediation report.
A 5-gate pre-flight audit to ensure your AI agent has the context, scope, and safety boundaries needed to code successfully.
Turn OpenAPI specs into exhaustive, framework-ready test suites covering happy paths, edge cases, and security gaps.
A high-performance scraping engine with Playwright stealth, proxy rotation, and anti-bot bypass capabilities.
Transform technical debt into a prioritized roadmap with professional-grade refactoring reports.
Select the smallest honest verification set for a change, including targeted tests, manual checks, missing-test recommendations, a broader fallback, and named remaining risk.

Automatically detect accessibility issues in websites and applications following WCAG and accessibility standards.
Paste your Python code and get a plain-English test report — what works, what doesn't, what's unfinished, and exactly what to do next.

Design rigorous chaos engineering experiments and resilience audits to verify production system reliability.
Enforce senior-level coding standards (Surgical, Simple, Goal-Driven) on every AI-generated code change.
High-precision test gap analysis that prioritizes untested code by risk and identifies missing edge cases.

Master Red-Green-Refactor with an opinionated TDD mentor that guides coding, reviews PRs, and secures legacy systems.
Automatically validate OpenAPI specs, detect breaking changes, and sync API implementation with documentation.
Conduct MBB-level operational excellence audits and maturity assessments with calibrated findings and roadmaps.

Audit, score, and improve your AI agent skills for higher quality, lower token costs, better reliability, and marketplace success. Get actionable recommendations for prompts, instructions, tool usage, error handling, and user experience.

Lint an exported n8n workflow before it ships: catches broken or duplicated nodes, missing error handlers, credential stubs, unhandled retries, unsafe webhooks, brittle expressions, and missing idempotency. A read-only pass over your workflow JSON that ranks production-readiness gaps with evidence and concrete fixes.
A structured CRO audit workflow that identifies conversion killers and generates prioritized fix lists and AB tests.

A pre-publish audit gate to extract claims, verify facts, and flag compliance risks in public-facing content.
Professional security audit skill for web apps and APIs with structured severity-based findings and remediation plans.

Enforce small, verified, and rollback-safe code increments to prevent AI scope creep and broken builds.

You changed the prompt, tried four inputs, it looked better, you shipped — and three days later support tickets say outputs are worse for an entire class of inputs you didn't test
The ultimate pre-commit checklist agent for cleaning code, updating docs, and validating repository state.

An adversarial reviewer for AGENTS.md and agent instruction files. It flags ambiguous or contradictory rules, missing guardrails, vague tool and scope definitions, and untestable instructions, then returns a PASS / REVISE / BLOCK verdict — before the config drives your agent.

Transform chaotic support into structured operations with playbooks, triage rules, and automated QA rubrics.

Reliable UIA-based Windows desktop automation with OCR and image matching fallbacks.
Generate realistic JSON or CSV test data from plain-English schema descriptions with up to 1,000 rows.

Catch typos, homophones, and near-miss misspellings across code, docs, and markdown, in a commit, your staged changes, a file, or a whole directory. Layers dictionary, phonetic (Soundex), and edit-distance checks with context-aware homonym rules to flag the their/there and its/it's a basic spellcheck sails right past.

A reusable rubric that grades every source by type, recency, authority, independence, and corroboration, then ranks them and resolves conflicts by evidence weight.
Design and scaffold professional integration and contract testing suites with Testcontainers, Pact, and WireMock.
Diagnose why your AI skills are underperforming and systematically turn weak SKILL.md files into reliable, high-quality, marketplace-ready assets.

Architect, scaffold, and audit enterprise-grade Playwright test suites with professional CI and auth patterns.
Automated launch-readiness auditor for x402 and agent-payment API surfaces.
Automated test generation with edge case analysis and framework-native syntax for TS, Python, Go, and Rust.
Automated risk classification and regression checking to stop AI agents from breaking your codebase.
Generate high-quality Jest unit tests with automatic dependency mocking for JavaScript and Angular applications.

Generate a deep, evidence-based Software Quality Strategy grounded in your repository's actual code and maturity level.

Convert Robot Framework tests to idiomatic Playwright TypeScript with automated keyword and locator mapping.

Designs robust Windows desktop automation workflows using pywinauto, UI Automation, hotkeys, image matching, OCR, retries, logging, screenshots, and safety controls.

An adversarial reviewer for job descriptions and candidate-facing hiring content. It flags biased or exclusionary language, risky questions, inflated or vague requirements, and missing disclosures, then returns a POST / REVISE / HOLD verdict. A bias-and-clarity aid, not legal advice.

Run structural QA on your translation files across locales. Flags missing keys, placeholder mismatches ({name}, %s, {{var}}), strings left untranslated and identical to the source, length-overflow risk that breaks UI, terminology drift against a glossary, empty targets, and plural-category gaps. Works on JSON, gettext .po/.pot, and .properties. It checks form, not meaning, so you do not need to speak the target language to use it.
Scan multi-language codebases for unused variables, orphaned functions, and unreachable code with severity ranking.

An adversarial reviewer for Dockerfiles and container builds. It flags root users, image bloat, unpinned or cache-busting layers, leaked secrets, and missing hardening, then returns a PASS / FIX / BLOCK verdict — before you build or push the image.
Run a complete pre-publish readiness check on your skills — version consistency, marketplace reference audit, internal path cleanup, and clear 0.0.1 vs 0.1.0 publishing decision.

Drive a browser from your agent without the token bloat. Batches navigate/click/type into one call, stays logged in with persistent sessions, and feeds the model compact DOM snapshots instead of giant HTML, so multi-step flows like logins, form-filling, and scraping behind auth stay fast and cheap. Runs on the uBrowser MCP server.
Design, debug, and harden AI control loops with explicit contracts and automated verification harnesses.
Automated 8-point release audit for project templates, ZIP packages, and developer starter kits.
A Master Black Belt mentor for DMAIC projects, providing phase-by-phase coaching, tollgate prep, and statistical rigor.

Expert WCAG 2.1/2.2 AA accessibility audit and ADA litigation-risk review for any website, web app, design, or page.
Pre-commit security hooks: secret detection, destructive command prevention

Find the unit tests that pass without testing anything. Flags tests with no assertions, trivial existence-only checks (toBeDefined, assertIsNotNone), tests that assert the exact value they just mocked, snapshot-only tests, tautological assertions (expect(true).toBe(true)), empty placeholders, and over-mocked tests with more setup than assertions. Works on Jest/Vitest and pytest/unittest.

Penetration-test your Claude Code agent's guardrails before you deploy. Throws prompt-injection payloads, shell-chaining, and path-traversal attempts at your PreToolUse/PostToolUse hooks and sensitive-file protections, then returns a pass/fail report on 10+ attack vectors with copy-paste remediation for every gap.

An adversarial gate that reviews any drafted support reply — email, chat, help-desk macro, or review response — before it goes out, checking that it answers the question, gets the tone right, avoids overpromising and risk, and gives clear next steps, then returns a structured SEND/REVISE/ESCALATE verdict with exact fixes.
Accelerate your bounty hunting with smart program prioritization and vulnerability report triaging for DeFi protocols.
Generate professional, formula-driven DMAIC/DFSS artifacts, Excel tools, and interactive Lean Six Sigma visualizations.

An adversarial reviewer for social posts, ads, and marketing copy. It flags off-brand tone, unsupported claims, weak or missing calls-to-action, platform mismatches, and compliance risks, then returns a PUBLISH / REVISE / HOLD verdict — before you hit publish.

An evidence-first debugging workflow for agents to identify, reproduce, and surgically fix software defects.
Project-specific Claude Code agent harness setup
Professional prompt engineering, audit, and evaluation system for production-grade AI agents and workflows.
Automate data profiling with type detection, statistical analysis, and quality flags saved to a Markdown report.
Three-pass automated code review that catches error handling gaps, structural issues, and naming problems — then auto-fixes everything before code reaches the user.
Generate meaningful, maintainable tests that actually protect your code — not just inflate coverage numbers.

Protects API endpoints from accidental breaking changes by generating contract maps, validation rules, integration tests, documentation, and safe AI coding prompts.
Audit your AI agent's evaluation coverage to identify missing release gates and production risks.

An adversarial gate that audits meeting notes or a transcript summary before you share them — catching missed decisions, vague or unowned action items, and tasks with no deadline — and returns a structured PASS/REVISE verdict plus a cleaned action list where every task has an owner and a date.

Enforce strict Red-Green-Refactor discipline to build robust, test-driven software with 100% meaningful coverage.
Battle-tested prompting patterns to eliminate LLM output drift. Sandwich structure, few-shot examples, history limits, retry, and token caps — 6 composable layers for production-grade agent reliability.

An adversarial gate that audits a resume or cover letter for overclaims, unverifiable metrics, vague impact, and ATS keyword gaps, then returns one PASS/REVISE/FAIL verdict.
Turn raw agent traces and tool logs into professional production-readiness audits and remediation reports.

Analyzes AI agents for performance, reliability, security, and optimization opportunities.

Safely capture and inspect test emails in a staging environment without sending to real recipients.
Bypass OS-native file upload dialogs in browser automation using JavaScript interception and DataTransfer injection.

Audits AI agent failures and converts recurring mistakes into durable rules, anti-patterns, regression tests, memory candidates, and improved SKILL.md sections.

Runs an ordered evidence-integrity gate over any AI draft — grade sources, ground claims, verify technical assertions, stress-test — then returns one PASS/REVISE/FAIL ship decision.

An adversarial reviewer for invoices, receipts, and expense reports. It re-checks the math, flags missing details, duplicate or already-paid items, policy breaches, and fraud-risk red flags, then returns a PASS / REVISE / HOLD verdict — before you send, pay, or approve.
Audit README files for broken links, missing sections, and formatting issues to ensure professional documentation.

An adversarial pre-publish gate that audits any article or web page draft for on-page SEO — title and meta, heading structure, keyword use without stuffing, internal links, readability, and search intent — returning a structured PASS/REVISE/FAIL verdict with a fix for each issue.
Audit codebases for structural debt, TODOs, and dependency rot to generate prioritized remediation reports.

An adversarial reviewer for landing pages and conversion copy. It flags an unclear value proposition, a weak or buried call to action, missing social proof, funnel friction, and unsupported claims, then returns a PUBLISH / REVISE / HOLD verdict — before the page goes live.
Establish and refine surgical CI quality gates and automated verification loops for any repository.
Turn bounce rates into conversions with prioritized fix lists and high-converting copy variants for any SaaS landing page.
A professional CRO and SEO audit workflow to identify conversion leaks and prioritize website fixes.

An adversarial reviewer for product listings and e-commerce copy. It flags weak or keyword-poor titles, feature-only bullets, missing trust and specifics, unsupported claims, and conversion friction, then returns a PUBLISH / REVISE / HOLD verdict — before the listing goes live.

An adversarial reviewer for cold emails, follow-ups, and outreach sequences. It flags weak subject lines, unclear asks, spam-filter triggers, broken or generic personalization, and pushy tone, then returns a SEND / REVISE / HOLD verdict — before you hit send.
Enforce senior-level coding standards with a focus on verification, minimal diffs, and evidence-based bug fixing.

Deep repository inspection to generate a pragmatic quality risk strategy and interactive HTML dashboard.
Bypass Cloudflare WAF, reCAPTCHA v3, and Vue.js bot detection in one skill.

Audit your dbt project for the test and documentation gaps that let bad data ship. Flags models with no unique or not_null tests, sources missing freshness config or tests, likely keys without a not_null test, models missing descriptions, SELECT * in models, and raw table references that should use ref() or source(). Each finding comes with a suggested tests: YAML snippet to drop into schema.yml.

A senior-level code reviewer that uses Socratic questioning to identify architectural flaws and teach better patterns.
Automated 8-point pre-deployment safety audit to catch breaking migrations, missing env vars, and CVEs.

An adversarial gate that audits a research brief or AI-generated answer for unsupported claims, weak or outdated sources, missing citations, and one-sided framing — returning a structured TRUST/VERIFY/REJECT verdict with the exact passage quoted and what to verify for each.

One-line summary description Stop your agent from claiming "done" before it's proven. A verification gate that classifies each change by risk (payment, auth, database, user-facing), picks the tests that actually cover it, demands evidence, maps regression risk, and outputs an honest pass/fail report. Turns "looks good to me" into "here's what I ran, and here's what's still unverified."

An adversarial self-review gate that hunts your agent's weakest claim, overclaims, and missing limitations before a human sees the output.

Orchestrate independent reviews, adversarial audits, and multimodal analysis via secondary models and external tools.

An adversarial gate that audits any chart, data summary, or statistic for misleading visuals and unsound inference, then returns one PASS/REVISE/FAIL verdict.

Run real Playwright E2E tests on your web app: login, checkout, and form flows across desktop and mobile viewports, with screenshots, traces, and console logs captured on every failure. Catches broken flows and UI regressions before release, and tells you the likely fix, not just that something broke.

Diagnoses unreliable tests, identifies root causes, creates stabilization plans, and generates safe AI coding prompts for fixing flaky unit, integration, E2E, and CI tests.
Cuts the back-and-forth in half. 12 rules that stop your AI from rushing, guessing, and making you repeat yourself.

Migrate Robot Framework tests from SeleniumLibrary to Playwright-based Browser library with architectural integrity.