
evidence-guard
Audit any AI-generated output for unsupported claims, then verify every factual and technical assertion against its real source before it ships.
- Catch unsupported or hallucinated claims in docs and READMEs before merge
- Verify documentation and API references match the actual code
- Flag version mismatches, deprecated references, and uncited performance figures
$14
· or 70 creditsSecure checkout via Stripe
Included in download
- Catch unsupported or hallucinated claims in docs and READMEs before merge
- Verify documentation and API references match the actual code
- Ready for dependencies
- Includes example output and usage patterns
Sample input
Verify the Node version and performance claims in the README and docs/api.md against our package.json and bench results before I ship this release.
Sample output
VERIFICATION NOTE — 5 claims checked. CRITICAL: README:12 — "Supports Node 16" — engines in package.json requires >=18.0.0. WARNING: docs/api.md — "ultra-low latency" — No benchmark data found to support quantitative claim. VERDICT: FAIL — 1 critical version mismatch.
Audit any AI-generated output for unsupported claims, then verify every factual and technical assertion against its real source before it ships.
$14
· or 70 creditsSecure checkout via Stripe
Also available in a bundle
Included in download
- Catch unsupported or hallucinated claims in docs and READMEs before merge
- Verify documentation and API references match the actual code
- Ready for dependencies
- Includes example output and usage patterns
- Instant install
Sample input
Verify the Node version and performance claims in the README and docs/api.md against our package.json and bench results before I ship this release.
Sample output
VERIFICATION NOTE — 5 claims checked. CRITICAL: README:12 — "Supports Node 16" — engines in package.json requires >=18.0.0. WARNING: docs/api.md — "ultra-low latency" — No benchmark data found to support quantitative claim. VERDICT: FAIL — 1 critical version mismatch.
About This Skill
evidence-guard brings regulated-industry evidence standards — the same rigor used to pass medical and scientific MLR and peer review — to any AI-generated output. Before your agent ships documentation, READMEs, PR descriptions, API references, changelogs, or technical claims, evidence-guard forces it to prove every assertion against a real, verifiable source. The problem: AI agents write confident, plausible text that quietly contains unsupported claims — wrong version numbers, deprecated API references, performance figures with no benchmark, and documentation that drifts away from the actual code. A single model reviewing its own work rarely catches these, because the same bias that produced the claim approves it. evidence-guard fixes this at the level of substance, not just process. It runs a structured Claims-QC pass: it extracts every factual, technical, and quantitative claim from the output, classifies each one, verifies it against the repo or a citable source, grades the strength of the evidence, and flags risk patterns like version mismatches and doc-vs-code drift. Nothing passes the verdict gate until every critical claim is traceable. The result is a short, audit-ready Verification Note you can drop into a PR or a docs review. Built by PubsProToolkit, applying the evidence discipline of medical and scientific publishing to everyday agent output.
Use Cases
- Catch unsupported or hallucinated claims in docs and READMEs before merge
- Verify documentation and API references match the actual code
- Flag version mismatches, deprecated references, and uncited performance figures
- Produce an audit-ready Verification Note for PRs and docs reviews
Known Limitations
- Cannot run code for runtime behavior.
- Requires read access to files/sources.
- Opinion-based claims are advisory and cannot be hard-verified.
How to Install
mkdir -p ~/.claude/skills && curl -sL https://www.agensi.io/api/install/evidence-guard -o /tmp/evidence-guard.zip && unzip -o /tmp/evidence-guard.zip -d ~/.claude/skills && rm /tmp/evidence-guard.zipFree skills install directly. Paid skills require purchase - use the download button above after buying.
Reviews
No reviews yet - be the first to share your experience.
Only users who have downloaded or purchased this skill can leave a review.
Early access skill
Be the first to review this skill.
Only users who have downloaded or purchased this skill can leave a review.
Security Scanned
Passed automated security review
Permissions
No special permissions declared or detected
Tags
Pure SKILL.md instruction set — no runtime, dependencies, or install required. Works with Claude Code, Cursor, Codex CLI, GitHub Copilot CLI, and any SKILL.md-compatible agent. Best results when the agent has read access to the referenced code or sources.
Creator
PubsProToolkit builds adversarial "gate" skills for AI agents — they catch problems before your output ships, instead of just generating more. From code, security, and infrastructure to content, hiring, contracts, and finance. Built by a CMPP-certified, PhD medical writer who brings regulated-industry rigor to every domain.
Frequently Asked Questions
Learn More About AI Agent Skills
More Premium Skills

inline-comment
Best way to steer your agents, effortlessly.
designing-hybrid-context-layers
Architects the right retrieval strategy for every query — teaching your agent when to use RAG, a knowledge graph, or a temporal index instead of defaulting to vector search for everything.
ai-automation-qa-pack
Professional QA & UAT documentation generator for AI automation agencies and complex agent deployments.
Bounty Security Pattern Master Library — 399 Vulnerability Patterns
A premium library of 399 vulnerability patterns and DeFi attack vectors for AI-driven bug hunting and security audits.