evaluating ai harness dimensions
by loreto
Evaluates AI coding agent platforms across five structural dimensions that determine real-world performance independently of model quality, so teams select on architectural fit rather than benchmark scores.
Ship agent workflows in 30 seconds. Browse 1,500+ expert-built and security scanned skills. Browse skills
THE AGENSI STORE
314 skills found
by loreto
Evaluates AI coding agent platforms across five structural dimensions that determine real-world performance independently of model quality, so teams select on architectural fit rather than benchmark scores.
by Kevin Cline
Convert source code into OpenAPI 3.0 specs or Markdown docs via static analysis of Express, FastAPI, and more.
by Roy Yuen
Turn raw agent traces and tool logs into professional production-readiness audits and remediation reports.
by Roy Yuen
Automated governance and risk audit for AI agent tool permissions and authentication boundaries.
by Julian
Expert API architect to design, review, and audit REST, GraphQL, and event-driven API specifications.
Scan AI agent skill definitions for malicious instructions, prompt injections, and security risks—locally.
by Roy Yuen
Compress noisy chat logs and logs into durable, high-signal memory reports with built-in duplicate suppression.
by Kevin Cline
Convert natural language to cron expressions and explain complex schedules with run-time projections.
by Shandra
Teaches AI coding agents to make software engineering decisions before coding, including layer placement, complexity control, refactor timing, and framework-change assessment.
by Roy Yuen
Automate the packaging, versioning, and distribution strategy for AI agents, CLIs, and marketplace skills.
Automated 8-point release audit for project templates, ZIP packages, and developer starter kits.
by Roy Yuen
Supercharge your agent with semantic code intelligence for safer refactors, precise navigation, and zero-error edits.
by Shandra
Designs robust Windows desktop automation workflows using pywinauto, UI Automation, hotkeys, image matching, OCR, retries, logging, screenshots, and safety controls.
by Shandra
Finds accessibility problems in UI code and turns them into prioritized fixes, WCAG-aware checklists, test plans, remediation tickets, and safe AI coding prompts.
Automated security auditing and risk assessment for Model Context Protocol (MCP) servers.
by Shandra
Transforms undocumented repositories into professional README files, setup guides, command maps, architecture notes, environment references, testing docs, and AI agent handoff files.
Penetration-test your Claude Code agent's guardrails before you deploy. Throws prompt-injection payloads, shell-chaining, and path-traversal attempts at your PreToolUse/PostToolUse hooks and sensitive-file protections, then returns a pass/fail report on 10+ attack vectors with copy-paste remediation for every gap.
by Julian
A rigorous security auditor that scans code for OWASP Top 10 vulnerabilities with severity ratings and concrete fixes.
by LocoLoboZ
Orchestrate independent reviews, adversarial audits, and multimodal analysis via secondary models and external tools.
by Matthew King
Audit, score, and improve your AI agent skills for higher quality, lower token costs, better reliability, and marketplace success. Get actionable recommendations for prompts, instructions, tool usage, error handling, and user experience.
Audit any AI-generated output for unsupported claims, then verify every factual and technical assertion against its real source before it ships.
by Shandra
Professional DevOps diagnostics for AI agents to solve failed deployments, Docker crashes, and CI/CD pipeline errors.
by LocoLoboZ
Build structured, tool-agnostic ransomware incident response playbooks tailored to your SOC and organizational context.
by LocoLoboZ
A proactive governance layer that validates MCP tool intent and scope to ensure safe, compliant agent behavior.