Can I buy the skills individually instead?

Yes — every skill in Agent Optimization & Output-Quality Suite is also sold on its own product page. The bundle just packages them together at a discount (save 23% vs buying each one separately).

Do I get updates to all included skills?

Yes. When you own the bundle you can re-download any included skill at its latest version from your library. Updates the creator publishes after your purchase are included at no extra cost.

How does the discount work?

The bundle price is set 23% below the combined retail price of the 3 paid skills included. You pay once and unlock every skill in your library instantly.

What's the refund policy?

Bundles are covered by the same 30-day refund guarantee as individual skills. If the bundle doesn't work for your workflow, contact support within 30 days for a full refund.

BUNDLE Security scanned3 skills

Agent Optimization & Output-Quality Suite

A tight loop for making AI agent output measurably better and proving it. Agent Loop iteratively optimizes any prompt, config, or artifact - changing one thing at a time, scoring it against your metric, and keeping only the winners. The AI Eval & Test-Suite Quality Gate makes sure the metric you optimize against is trustworthy, catching gameable criteria, data leakage, and missing edge cases before they mislead you. And the Peer-Review Stress Test turns your agent into its own harshest reviewer, hunting weak claims and missing limitations before a human sees the output. Built for prompt engineers, AI builders, and teams shipping agent-powered features. Optimize it, trust the score, then stress-test it.

365 views

Works with every agent that reads SKILL.md — Claude Code, Cursor, Codex CLI, Gemini CLI, GitHub Copilot, Windsurf, OpenClaw, and more.

Created by

PubsProToolkit

$24$31

Save 23% · $7

One payment, lifetime access. 3 skills unlock instantly in your library.

30-day refund guarantee

Instant unlock in your library

Free updates from the creator

What's included

3 skills

1/3

AI Eval & Test Suite Quality Gate

An adversarial gate that audits an AI eval or test suite — LLM-judge rubrics, datasets, regression tests, metrics — for gameable criteria, data leakage, missing edge cases, and non-determinism, then returns one PASS/REVISE/FAIL verdict.

View skill

$14$10.84Save 23%

2/3

Peer Review Stress Test

An adversarial self-review gate that hunts your agent's weakest claim, overclaims, and missing limitations before a human sees the output.

View skill

$12$9.29Save 23%

3/3

Autoresearch Optimizer

An iterative agent loop that optimizes any prompt, config, or artifact by making one change at a time, scoring it against a metric, and keeping only the winners.

View skill

$5$3.87Save 23%

See it in action

You say

Audit this prompt-regression suite for a support agent. Grader prompt: "Rate the answer 1-5 on helpfulness and completeness. A good answer is detailed and covers the topic." 40 test cases, each with its golden answer included in the same prompt block shown to the model. Reported result: average 4.6/5, ship it.

Your agent does

EVAL TRUST: FAIL WEAKNESSES FOUND:

[gameable-rubric] "detailed and covers the topic" rewards length and breadth, not factual correctness -> a verbose wrong answer scores high
[data-leakage] golden answers sit in the same block sent to the model -> the model can copy them; 4.6/5 measures copying, not capability
[determinism] single-run average, no seed, temperature, or variance -> 4.6 is one sample, not a stable measure
[metric-validity] one averaged helpfulness number hides whether any answer was actually wrong BLIND SPOT: A regression that makes the agent confidently incorrect would still average near 4.6 and pass. VERDICT: FAIL - remove golden answers from the model's context, anchor the rubric to correctness, and report seeded multi-run results before trusting any score.

How to install

Drop the file into your AI Agent. Works with Claude, Cursor, ChatGPT, and 20+ more.

Reviews

No reviews yet on the included skills. Be the first to try this bundle.

Frequently asked questions

More bundles from PubsProToolkit

Content Trust Suite

3 skills · $31Save 31%

Reliable Multi-Agent Suite

2 skills · $24Save 29%

Agent Code Quality Suite

3 skills · $35Save 30%

Evidence Integrity Suite

4 skills · $39Save 30%

What's included

See it in action

How to install

Reviews

Frequently asked questions

Can I buy the skills individually instead?

Do I get updates to all included skills?

Which agents are compatible?

How does the discount work?

What's the refund policy?

More bundles from PubsProToolkit