LLM Eval Framework Builder
You changed the prompt, tried four inputs, it looked better, you shipped — and three days later support tickets say outputs are worse for an entire class of inputs you didn't test
Ship agent workflows in 30 seconds. Browse 1,500+ expert-built and security scanned skills. Browse skills
THE AGENSI STORE
976 skills found
You changed the prompt, tried four inputs, it looked better, you shipped — and three days later support tickets say outputs are worse for an entire class of inputs you didn't test
A professional-grade toolkit for SAST code reviews, PII scanning, and automated compliance gap analysis.
An adversarial security gate to detect and redact PII, secrets, and confidential data before sending prompts.
Hold your bios, footers, and profiles to one brand spec. Flags brand-name spelling and casing that does not match your canonical form, off-spec taglines, links that are not on your official list, leftover placeholders (Lorem, TODO, "your tagline here"), and handles that differ from one surface to the next. You define the spec once and it enforces it everywhere.
Audit a JavaScript or TypeScript frontend for missing translations and hardcoded UI strings before you ship a new locale. Flags hardcoded JSX text and UI props (title, placeholder, aria-label, label, alt) not wrapped in t(), i18n.t(), or <Trans>; keys present in the default locale but missing from other locale files; keys referenced in code but absent from the locales (the raw dotted keys that leak to users); unused locale keys; and unparseable locale JSON.
Systematically refactor large codebases, eliminate circular dependencies, and define clean module boundaries.
Plan, review, and execute safe database migrations with automatic rollback plans, backfill strategies, and zero-downtime sequencing.
Cost-aware execution planning for AI agents — estimate cost-vs-value before expensive steps, propose cheaper paths (cache, summarize once, downshift models), and track spend against a session budget with a PROCEED / OPTIMIZE / DEFER verdict.
by Nex AI
Deploy production-ready real estate websites with property listings, AI-optimized SEO, and a Firebase admin panel.
Define spending rules for your AI agent — caps, category whitelists, approval thresholds — and audit what it bought or almost bought, with an approve/hold/block verdict per transaction.
Reviewer left comments and your PR is stuck? Find the #1 blocking comment and get a finished reply — acknowledge, the fix, what to test — written to move the reviewer to approve.
by Nex AI
Production-ready Resend integration with tracked outbox, open/click analytics, and per-archetype stats.
by Nex AI
Transform political ideas into structured, defensible election manifesto proposals and policy briefs.
An adversarial gate to verify AI-use disclosures and draft compliant provenance statements for any venue.
Find the model-version coupling that breaks when you swap LLMs. Flags hardcoded model names and versions, deprecated or renamed parameters (the max_tokens to max_completion_tokens class of change), hardcoded token and context-window limits, response-format parsing tied to one model's output, tool-schema format coupling between providers, and hardcoded per-token cost constants. The patterns load from an editable model-rules table you update as new models ship.
Catch the dangerous migration before it locks or wrecks your production database. Scans SQL migration files for destructive and risky operations: DROP and TRUNCATE, drops without IF EXISTS, lossy column-type changes, NOT NULL added without a default, DELETE or UPDATE with no WHERE, non-concurrent index builds, dropped constraints, renames, and data backfills mixed into schema changes. Each finding is ranked by severity with a safer rewrite. Postgres, MySQL, and SQLite.
Point it at an unfamiliar or inherited repo and quickly understand it. Maps the architecture, identifies the key modules and entry points, traces the core end-to-end flows, surfaces the conventions and gotchas, and assembles a clean ONBOARDING.md — turning a strange codebase into a clear mental model fast. Built for the moment you join a project, take one over, or have to explain a repo before changing it.
by Nex AI
Professional accessibility auditing for architects, balancing legal regulations with real-world usability.
by Nex AI
Protect your IP by embedding invisible, redundant buyer fingerprints and license terms into your AI skill files.
Production prompts grow by accretion — every failure gets another appended rule until the prompt is two thousand words of contradictions that the model navigates unpredictably
Expert accessibility auditing that prioritizes user impact and provides production-ready code fixes for WCAG compliance.
Transform fragile AI prototypes into resilient, enterprise-ready production agents with professional hardening tools.
by Nex AI
Automate Belgian vzw (non-profit) administration, WVV-compliant bylaws, UBO filings, and GA minutes.
by Nex AI
Generate structured architectural technical specifications and draft 'lastenboeken' from project descriptions.