skill-evaluation-and-iteration-with-gemini

Name: skill-evaluation-and-iteration-with-gemini
Price: 5 USD
Availability: InStock
Author: Agensi

by Markus Isaksson

Diagnose, score, and systematically refactor Gemini CLI skills for high-efficiency performance and reliability.

Updated May 2026

Audit skills for context-window inefficiency and token bloat.
Upgrade basic prompts into robust, marketplace-ready agent protocols.
Fix hallucination issues by enforcing strict tool boundaries and guardrails.

Security scannedOne-time purchaseInstant install

One-time purchase

30-day refund guarantee

Secure checkout via Stripe

Included in download

Audit skills for context-window inefficiency and token bloat.
Upgrade basic prompts into robust, marketplace-ready agent protocols.
file_write, file_read automation included
Includes example output and usage patterns

Markus Isaksson

Sample Output

A real example of what this skill produces.

Tool Precision: 4/10. Notes: Skill uses 'run_shell_command' to read files instead of 'read_file' with line offsets, causing context bloat. Diagnosis: Protocol Weakness. Plan: Update instructions to mandate 'grep_search' for initial discovery. Recommended: Execute surgical rewrite? (Y/N)

skill-evaluation-and-iteration-with-gemini

by Markus Isaksson

Diagnose, score, and systematically refactor Gemini CLI skills for high-efficiency performance and reliability.

Updated May 2026

Security scanned

One-time purchase

30-day refund guarantee

Secure checkout via Stripe

⚡ Also available via Agensi MCP — your AI agent can load this skill on demand via MCP. Learn more →

Included in download

Audit skills for context-window inefficiency and token bloat.
Upgrade basic prompts into robust, marketplace-ready agent protocols.
file_write, file_read automation included
Includes example output and usage patterns
Instant install

Sample Output

A real example of what this skill produces.

Security scanned

About This Skill

Optimize Your AI Agent Performance

Low-quality skills lead to context exhaustion, hallucinations, and agent drift. This skill provides a rigorous diagnostic framework designed specifically for developer-centric agent architectures like the Gemini CLI. It systematically audits your instruction sets to ensure they meet professional distribution standards.

What it does

The evaluator performs a multi-phase audit of any instruction file (SKILL.md). It uses a 9-dimension rubric—including Tool Precision, Orchestration, and Output Contracts—to assign objective scores. After diagnosis, it identifies "context leaks" and protocol weaknesses, providing a prioritized plan to refactor vague advice into hard agent protocols. Finally, it can surgically rewrite your skill to implement these improvements automatically.

Why use this skill

Prompting an AI yourself is often a trial-and-error process. This skill automates the "red-teaming" of your agent logic. It replaces general prompts with high-efficiency tool patterns (like replacing generic shell commands with targeted search tools), drastically reducing token usage and improving the reliability of your local workflows. It ensures your skills are marketplace-ready by enforcing strict guardrails and clear triggering conditions.

Supported Frameworks

Optimized for Gemini CLI and agents utilizing Plan Mode, sub-agent orchestration, and advanced file manipulation tools.

📖 Learn more: Best DevOps & Deployment Skills for Claude Code →

Use Cases

Audit skills for context-window inefficiency and token bloat.
Upgrade basic prompts into robust, marketplace-ready agent protocols.
Fix hallucination issues by enforcing strict tool boundaries and guardrails.
Refactor vague instructions into imperative, phased execution steps.

How to Install

mkdir -p ~/.claude/skills && curl -sL https://www.agensi.io/api/install/skill-evaluation-and-iteration-with-gemini | tar xz -C ~/.claude/skills/

Free skills install directly. Paid skills require purchase - use the download button above after buying.

Reviews

No reviews yet - be the first to share your experience.

Only users who have downloaded or purchased this skill can leave a review.

Early access skill

Security scanned

Built by Markus Isaksson

This skill is optimized for evaluating skills meant to be…

Example output available

Be the first to review this skill.

Only users who have downloaded or purchased this skill can leave a review.

Security Scanned

Passed automated security review

Permissions

Write Files

Read Files

Allowed Hosts

mcp.agensi.io

File Scopes

.gemini/skills/**

**/*.md

**Permission Profile**: Analysis & Refactoring (Read + Write)

Creator

Markus Isaksson

Frequently Asked Questions

Learn More About AI Agent Skills

More Premium Skills

diagnosing-rag-failure-modes

RAG fails quietly. It retrieves documents, returns confident-looking answers, and misses the question entirely — because the question required connecting facts across documents, reasoning about sequence, or tracing causation. This skill gives you a five-question diagnostic checklist that classifies any failing query as either RAG-safe or structurally RAG-incompatible, then maps it to the specific failure pattern and the architectural fix that resolves it.

$105 installs

subagent-orchestrator (Develop based on the Claude Code sourcemap)

Turn your AI agent into a coordinator that manages parallel subagents for complex coding and research tasks.

$52 installs

software-architect

A structured framework for planning, reviewing, and evolving complex software systems with explicit trade-offs.

$52 installs

designing-hybrid-context-layers

Architects the right retrieval strategy for every query — teaching your agent when to use RAG, a knowledge graph, or a temporal index instead of defaulting to vector search for everything.

$1012 installs

skill-evaluation-and-iteration-with-gemini

Included in download

Sample Output

skill-evaluation-and-iteration-with-gemini

Included in download

Sample Output

About This Skill

Optimize Your AI Agent Performance

What it does

Why use this skill

Supported Frameworks

Use Cases

How to Install

How to Install

Reviews

Permissions

Tags

Creator

Frequently Asked Questions

What specific problems does this evaluation skill solve for my AI agents?

Is this skill only compatible with the Gemini CLI?

How does the skill measure the quality of my agent's instructions?

What exactly is included in the purchase of this skill?

Can the skill automatically apply the improvements it suggests to my SKILL.md files?

How does using this evaluator help with context exhaustion and token usage?

Learn More About AI Agent Skills

More Premium Skills

diagnosing-rag-failure-modes

subagent-orchestrator (Develop based on the Claude Code sourcemap)

software-architect

designing-hybrid-context-layers