Works with the AI tools you already use

Autoresearch Optimizer

Name: Autoresearch Optimizer
Price: 5 USD
Availability: InStock
Author: Agensi

An iterative agent loop that optimizes any prompt, config, or artifact by making one change at a time, scoring it against a metric, and keeping only the winners.

30-day refund guarantee

Secure checkout via Stripe

0 installs · Security scanned

See it in action

You say

Optimize my customer-support system prompt at ./target/support_prompt.md. Test inputs are in ./tests/. Score each variant with: did it stay under 120 words, did it include a next step, and did it avoid making promises about refunds. Run 40 rounds and keep the best version.

Your agent does

Baseline score 4/9 checks passing. Ran 40 rounds, 31 edits reverted, 9 kept. Final score 9/9. Top winning changes: added an explicit word limit, moved the next-step instruction earlier, and replaced a vague refund line with a deferral to policy. Best prompt written back to ./target/support_prompt.md; full round-by-round log saved to ./target/loop_log.md.

What you get

Improving a system prompt from good-enough to high-reliability
Tuning a data-extraction pipeline against test cases
Optimizing an ML training script or config overnight

About this skill

This skill turns any agent into a relentless self-improving optimizer using the "agent loop" pattern popularized by recent autoresearch work. Instead of hand-tuning a prompt or config once and walking away at "good enough," the agent runs a tight loop: propose one change, test it against a defined metric, keep it if it beats the current best, revert it if it doesn't, and repeat. Because only winning changes are kept, the score on your checks never drops from one round to the next, and over dozens or hundreds of cheap iterations the artifact typically ends up well past the point where manual tuning stopped.
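The loop described above can be sketched in a few lines of Python. This is a minimal illustration, not the skill's actual implementation: the names `optimize`, `toy_propose`, and `toy_score` are hypothetical, and the toy proposer simply appends a random instruction where a real run would ask the agent to suggest an edit.

```python
import random
from typing import Callable, Dict, List, Tuple


def optimize(
    artifact: str,
    propose: Callable[[str, random.Random], str],
    score: Callable[[str], float],
    rounds: int,
    seed: int = 0,
) -> Tuple[str, float, List[Dict]]:
    """Greedy keep-or-revert loop: one proposed change per round."""
    rng = random.Random(seed)
    best, best_score = artifact, score(artifact)
    log: List[Dict] = []
    for i in range(1, rounds + 1):
        candidate = propose(best, rng)       # exactly one change
        cand_score = score(candidate)        # exactly one score
        kept = cand_score > best_score       # keep only strict winners
        if kept:
            best, best_score = candidate, cand_score
        # A losing candidate is simply dropped, which is the "revert".
        log.append({"round": i, "score": cand_score, "kept": kept})
    return best, best_score, log


# Toy stand-ins (assumptions for illustration only).
HINTS = [
    "Keep replies under 120 words.",
    "End with a next step.",
    "Defer refund questions to policy.",
    "Use many exclamation marks!",  # deliberately unhelpful
]


def toy_propose(text: str, rng: random.Random) -> str:
    # Hypothetical proposer: append one randomly chosen instruction.
    return text + "\n" + rng.choice(HINTS)


def toy_score(text: str) -> float:
    # Reward the three useful hints, lightly penalise every added line.
    useful = sum(h in text for h in HINTS[:3])
    return useful - 0.1 * text.count("\n")


best, best_score, log = optimize(
    "You are a support agent.", toy_propose, toy_score, rounds=40
)
print(best_score, sum(entry["kept"] for entry in log))
```

The strict `>` comparison is a design choice in this sketch: ties are reverted, so the artifact does not grow with changes that fail to improve the score.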

WHAT IT DOES

The skill takes a target you want to improve (a system prompt, an extraction pipeline, a code-review instruction, a model config, or any artifact you can evaluate), a small set of realistic test inputs, and a handful of binary yes/no quality checks. It then drives the optimization loop autonomously: one edit per round, one score per round, winners kept and losers reverted, with a running log of what changed and why.
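As a rough illustration of what "binary yes/no quality checks" could look like, the sketch below scores a set of replies against three checks modelled on the demo request earlier on this page. The function names and the keyword heuristics are assumptions made for this example; the skill itself may evaluate checks differently (for instance by asking the agent to judge each one).

```python
import re
from typing import Callable, Dict, List, Tuple

Check = Callable[[str], bool]


def under_120_words(reply: str) -> bool:
    return len(reply.split()) < 120


def has_next_step(reply: str) -> bool:
    # Crude heuristic (assumption): look for the literal phrase.
    return re.search(r"\bnext step\b", reply, re.IGNORECASE) is not None


def no_refund_promise(reply: str) -> bool:
    # Crude heuristic (assumption): flag "will ... refund" or
    # "guarantee ... refund" within a single sentence.
    pattern = r"\b(will|guarantee[sd]?)\b[^.]*\brefund"
    return re.search(pattern, reply, re.IGNORECASE) is None


CHECKS: Dict[str, Check] = {
    "under_120_words": under_120_words,
    "has_next_step": has_next_step,
    "no_refund_promise": no_refund_promise,
}


def score_replies(replies: List[str], checks: Dict[str, Check]) -> Tuple[int, int]:
    """Return (checks passed, checks run) across all replies."""
    passed = sum(check(reply) for reply in replies for check in checks.values())
    return passed, len(replies) * len(checks)


# Hypothetical replies produced by running the target on two test inputs.
REPLIES = [
    "Thanks for reaching out. Next step: reply with your order number.",
    "We will refund you in full today.",
]

print(score_replies(REPLIES, CHECKS))  # (4, 6)
```

With three checks and three test inputs, a score reads as "N/9 checks passing", the same shape as the baseline and final scores reported in the demo.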

WHY IT MATTERS

Manual iteration hits diminishing returns fast because humans get tired and stop. An agent doesn't. The bottleneck flips from "can we run this experiment" to "do we even know what question to ask," which means your job becomes curating hypotheses and defining good metrics rather than grinding through trial and error. Any metric you care about that is reasonably cheap to evaluate becomes fair game for automated optimization.

WHO IT'S FOR

Anyone who maintains prompts or configs they rely on repeatedly: customer-support agents, internal workflow automations, data-extraction pipelines, code-review instructions, or ML training scripts. If you've ever written something, gotten it to "good enough," and moved on, this loop picks up exactly where you stopped.

How to install

Drop the file into your AI agent. Works with Claude, Cursor, ChatGPT, and 20+ more.

Reviews

No reviews yet

Be one of the first to try it. Every listed skill passes our trust checks below.

Security scanned

Passed our 8-point scan before listing

Fresh listing

Recently published to Agensi

30-day refund

Not a fit? Get your money back

Trust & safety

Security scanned

Verified clean 1 month ago

One-time purchase, yours forever

Listed 1 month ago

Creator

PubsProToolkit

PubsProToolkit builds rigor-first skills for AI agents: the skills write your docs and content properly, then adversarially review them to catch what's wrong before it ships. The result is cleaner output and a hard quality gate in one toolkit. Built by a CMPP-certified, PhD medical writer who brings regulated-industry standards to developer docs, content, compliance, and research integrity.

Also available in a bundle

Agent Optimization & Output-Quality Suite

3 skills · $24

Save 23%

View bundle

Popular in AI Agents & LLM Ops

designing-hybrid-context-layers

Architects the right retrieval strategy for every query — teaching your agent when to use RAG, a knowledge graph, or a temporal index instead of defaulting to vector search for everything.

$10
