Works with the AI tools you already use

CCCGGVW+15 more

Evidence Grading Framework

A reusable rubric that grades every source by type, recency, authority, independence, and corroboration, then ranks them and resolves conflicts by evidence weight.

$18

· or 90 credits

30-day refund guarantee

Secure checkout via Stripe

1 installNo reviews yet

About this skill

## What it does Most agents treat every source as equally trustworthy — a dated forum post and a primary specification get the same weight as long as both mention the keyword. Evidence-Grading Framework gives your agent a structured rubric to rank sources by quality BEFORE it writes a single sentence, so the strongest evidence drives the answer and weak sources are demoted or set aside. It is domain-agnostic and built to sit in front of any RAG pipeline or research agent. Feed it retrieved documents, search results, or a reading list, and it returns a graded source ledger plus a clear recommendation for what to actually rely on. ## Evidence-type tiers - PRIMARY — original, first-hand material: the actual data, spec, filing, official record, or direct statement from the entity in question. - SECONDARY — reputable analysis or reporting that interprets primaries and cites them. - TERTIARY — aggregated summaries (encyclopedic entries, roundups, listicles) that compile other people's work. - UNRANKED — undated, anonymous, or self-referential content with no traceable basis. ## The five scoring dimensions Each source is scored on: TYPE (its tier above), RECENCY (how current it is relative to how fast the topic changes), AUTHORITY (credibility for this specific claim — expertise is domain-specific, not transferable), INDEPENDENCE (whether the source has a stake in the conclusion; vendor and marketing pages are flagged), and CORROBORATION (how many independent sources agree — one source repeated across ten sites is still one source). ## Overall grade (A–D) - A — primary or strongly corroborated, current, authoritative, independent. Rely on it. - B — solid secondary or well-supported source with a minor weakness. Usable; corroborate key numbers. - C — tertiary, dated, or non-independent. Use only as a lead; verify before relying. - D — unranked, conflicted, or contradicted by stronger evidence. Do not rely on. ## Conflict resolution When two sources disagree, the agent does NOT default to the most recent or the most confident. It resolves by evidence weight: the higher-graded source wins, ties go to the better-corroborated claim, and unresolved conflicts are reported as open rather than papered over. A confident D never overrides a careful A. ## Sample output SOURCE LEDGER Question: What is the current API rate limit? [A] Official API reference (updated this quarter) Primary · current · authoritative · independent · corroborated -> RELY ON [B] Vendor engineering blog (6 months old) Secondary · slightly dated · non-independent -> CORROBORATE (matches A) [C] Third-party tutorial citing an older limit (2022 aggregator) Tertiary · stale · uncorroborated -> AVOID (superseded by A) CONFLICT: C states 60 req/min; A states 100 req/min. RESOLUTION: A wins (primary, current). No open conflicts. RECOMMENDATION: Base the answer on A; cite the changelog for corroboration. ## Why use this skill RAG and research agents usually fail not because they cannot find sources, but because they cannot tell a strong source from a weak one — and let the loudest or most recent text win. This framework makes source quality explicit and auditable, so the agent's output inherits the credibility of its best evidence rather than the average of everything it retrieved. The rubric is adapted from hierarchy-of-evidence practice in regulated scientific and technical research, where ranking sources before drawing conclusions is standard discipline — and almost entirely absent from general-purpose AI tooling. ## Use cases - Rank retrieved documents in a RAG pipeline before generation, so weak sources are down-weighted automatically. - Triage a research reading list into rely / corroborate / avoid tiers. - Resolve contradictory sources with a defensible, evidence-weighted rule instead of recency bias. - Produce a graded source ledger to attach to any research deliverable for transparency. - Pair with a claim-checking step: grade the sources first, then verify claims against the A and B sources only. ## Known limitations - Grades reflect source quality and provenance, not absolute truth. A well-produced primary source can still be wrong; the framework tells you how much weight a source has earned, not that its content is correct. - Authority and recency judgments depend on correctly identifying the topic and how fast it changes; supply the research question for best results. - It evaluates the sources provided to it and does not search for better ones unless your agent separately provides that capability. - Independence can only be assessed from what a source discloses; hidden conflicts of interest may not be detectable.

Details

How to install

Drop the file into your AI Agent. Works with Claude, Cursor, ChatGPT, and 20+ more.

Reviews

No reviews yet - be the first to share your experience.

Only users who have downloaded or purchased this skill can leave a review.

Be the first to review this skill.

Only users who have downloaded or purchased this skill can leave a review.

Security Scanned

Passed automated security review

Permissions

Read Files

File Scopes

evidence-grading-framework/**

Read-only. The skill reads the sources and any provided documents to grade them. It does not write, execute, or access the network.

Creator

PubsProToolkit

PubsProToolkit builds rigor-first skills for AI agents — they write your docs and content properly, then adversarially review them to catch what's wrong before it ships. The result: cleaner output and a hard quality gate in one toolkit. Built by a CMPP-certified, PhD medical writer who brings regulated-industry standards to developer docs, content, compliance, and research integrity.

Also available in a bundle

Medical & Pharma AI Compliance Pack

3 skills · $35

Save 20%

View bundle

AI Evidence & Fact-Checking Suite

4 skills · $60

Save 23%

View bundle

Data & Statistics Integrity Suite

3 skills · $37

Save 24%

View bundle

Evidence Integrity Suite

4 skills · $39

Save 30%

View bundle

Medical & Scientific Writing Suite

3 skills · $32

Save 27%

View bundle

Frequently Asked Questions

Browse More Skills

designing-hybrid-context-layers

Architects the right retrieval strategy for every query — teaching your agent when to use RAG, a knowledge graph, or a temporal index instead of defaulting to vector search for everything.

$10

165.0(1)

synthesizing-institutional-knowledge

Builds the organizational memory schema your AI agent needs to answer why — capturing decision provenance, causal chains, and event context that embedding-based retrieval permanently discards.

$10

55.0(1)

code-reviewer

Reviews your code for bugs, security vulnerabilities, logic errors, performance issues, and style violations. Organizes findings by severity and suggests fixes with code examples.

Free

7995.0(3)

frontend-motion-wizard

Advanced responsive layout and interactive micro-interaction engine for React, Tailwind CSS, and Framer Motion. Automatically injects fluid element states, mobile-first touch behaviors, adaptive viewports, and non-destructive layout transitions into static codebases

Free

425.0(2)

Evidence Grading Framework

About this skill

Details

Use cases

Known limitations

How to install

Reviews

Permissions

Creator

Also available in a bundle

Medical & Pharma AI Compliance Pack

AI Evidence & Fact-Checking Suite

Data & Statistics Integrity Suite

Evidence Integrity Suite

Medical & Scientific Writing Suite

Frequently Asked Questions

How is this different from a standard RAG search or document retrieval process?

What exactly is included in the skill package?

Can I use this framework for highly specialized fields like medical or legal research?

Which AI agents or platforms are compatible with this framework?

How does the skill handle it when two sources provide completely different information?

Does the skill automatically find better sources if the current ones are graded poorly?

Browse More Skills

designing-hybrid-context-layers

synthesizing-institutional-knowledge

code-reviewer

frontend-motion-wizard