What specific RAG performance issues does this skill help resolve?

This skill identifies and fixes common production bottlenecks such as low retrieval precision, poor document chunking, inefficient hybrid search configurations, and hallucination issues caused by weak grounding.

Is this skill compatible with my existing AI agent framework?

The RAG Architect is designed to work with any agentic framework that utilizes retrieval pipelines, offering architecture-agnostic advice that applies to various vector databases and embedding models.

What exactly is included with the purchase of this skill?

You receive a specialized architectural framework that includes diagnostic checklists, configuration templates for hybrid search, and a suite of evaluation strategies for your retrieval pipeline.

Do I need to use a specific vector database or model provider for this to work?

No, this skill focuses on the architecture and logic of yours systems; you can apply its strategies to any vector database (like Qdrant or Milvus) and any LLM provider you prefer.

Does the RAG Architect provide support for enterprise-grade features like multi-tenancy?

Yes, the skill includes guidance on implementing secure multi-tenant isolation, citation-based grounding, and cost-efficient scaling strategies for production environments.

rag-architect

Name: rag-architect
Price: 5 USD
Availability: InStock
Author: Agensi

by Roy Yuen

Design, debug, and optimize production RAG systems with expert architecture, hybrid search, and grounding strategies.

Updated May 2026

Construct hybrid search pipelines combining semantic and keyword retrieval
Debug hallucination risks by implementing strict source grounding protocols
Optimize indexing strategies for low-latency document retrieval at scale

Claude Code

GitHub Copilot Extensions

Cursor

and SKILL.md-compatible agents.

Security scannedOne-time purchaseInstant install

One-time purchase · Own forever

Included in download

Construct hybrid search pipelines combining semantic and keyword retrieval
Debug hallucination risks by implementing strict source grounding protocols
Ready for Claude Code
Includes example output and usage patterns

Roy Yuen

See it in action

Diagnosis: Low recall@k.
Hypothesis: Missing BM25/keyword search for technical identifiers.
Evidence: Search 'error 402' returns generic HTTP docs, not specific logs. 
Fix: Implement Hybrid Search with RRF + Metadata filters for log levels.
Expected Impact: +25% precision on technical queries.

rag-architect

by Roy Yuen

Design, debug, and optimize production RAG systems with expert architecture, hybrid search, and grounding strategies.

Updated May 2026

114 views

Security scanned

One-time purchase · Own forever

⚡ Also available via Agensi MCP — your AI agent can load this skill on demand via MCP. Learn more →

Included in download

Construct hybrid search pipelines combining semantic and keyword retrieval
Debug hallucination risks by implementing strict source grounding protocols
Ready for Claude Code
Includes example output and usage patterns
Instant install

See it in action

Diagnosis: Low recall@k.
Hypothesis: Missing BM25/keyword search for technical identifiers.
Evidence: Search 'error 402' returns generic HTTP docs, not specific logs. 
Fix: Implement Hybrid Search with RRF + Metadata filters for log levels.
Expected Impact: +25% precision on technical queries.

114 views

Security scanned

About This Skill

Advanced RAG System Architecture & Debugging

Designing a production-ready Retrieval-Augmented Generation (RAG) system requires more than just a vector database and a prompt. The RAG Architect skill provides a developer-centric framework for building, hardening, and troubleshooting complex retrieval stacks, moving beyond generic implementations to high-performance architecture.

What it does

This skill acts as a senior systems architect for your AI pipeline. It analyzes ingestion workflows, document parsing, chunking strategies, embedding selection, and vector store performance. Whether you are building from scratch or fixing a broken implementation, it applies a rigorous, evidence-based methodology to ensure your agent stays grounded and accurate.

Supported Capabilities

Architecture Design: Decisions for hybrid search, reranking, and context packing tailored to your specific corpus (Legal, Code, Product Docs, etc.).
Truth-First Debugging: Systematic isolation of failures across the pipeline—from bad parsing to stale indexes and tenant leakage.
Infrastructure Selection: Unbiased tradeoff analysis for vector databases (pgvector, Qdrant, Milvus), embedding models, and rerankers.
Production Hardening: Implementing multi-tenant isolation, citation grounding, and incremental re-indexing strategies.
Evaluation Frameworks: Establishing metrics for recall@k, precision, and faithfulness to ensure changes are data-driven rather than anecdotal.

Why use this skill?

Standard LLM prompts often treat "bad answers" as model hallucinations. This skill identifies when the problem is actually a metadata filter mismatch, poor chunking semantics, or an inefficient reranker. It helps you reduce latency and cost by optimizing the weakest stage of your pipeline rather than over-relying on expensive long-context windows.

Use Cases

Construct hybrid search pipelines combining semantic and keyword retrieval
Debug hallucination risks by implementing strict source grounding protocols
Optimize indexing strategies for low-latency document retrieval at scale
Architect multi-stage re-ranking workflows to improve answer precision

Known Limitations

- Cannot perform the actual vector DB migration or infrastructure provisioning. - Effectiveness is limited without access to specific log samples or retrieval metrics. - Does not generate frontend UI.

How to Install

unzip rag-architect.zip -d ~/.claude/skills/

Reviews

No reviews yet — be the first to share your experience.

Only users who have downloaded or purchased this skill can leave a review.

Early access skill

Security scanned

Built by Roy Yuen

Claude Code, GitHub Copilot Extensions, Cursor, and SKILL…

Example output available

Be the first to review this skill.

Only users who have downloaded or purchased this skill can leave a review.

Security Scanned

Passed automated security review

Permissions

No special permissions declared or detected

Creator

Roy Yuen

Frequently Asked Questions

Learn More About AI Agent Skills

Similar Skills

prompt-engineer

Professional prompt engineering patterns for building robust, secure, and production-ready LLM applications.

Free42 installs

env-doctor

Diagnoses why your project will not start. Checks runtime versions, dependencies, environment variables, databases, ports, and build artifacts systematically.

Free50 installs

code-reviewer

Reviews your code for bugs, security vulnerabilities, logic errors, performance issues, and style violations. Organizes findings by severity and suggests fixes with code examples.

Free181 installs

git-commit-writer

Writes conventional commit messages by analyzing your staged git changes. Detects commit type, scope, and breaking changes automatically.

Free93 installs

rag-architect

Included in download

See it in action

rag-architect

Included in download

See it in action

About This Skill

Advanced RAG System Architecture & Debugging

What it does

Supported Capabilities

Why use this skill?

Use Cases

Known Limitations

Known Limitations

How to Install

How to Install

Reviews

Permissions

Tags

Creator

Frequently Asked Questions

What specific RAG performance issues does this skill help resolve?

Is this skill compatible with my existing AI agent framework?

What exactly is included with the purchase of this skill?

Do I need to use a specific vector database or model provider for this to work?

Does the RAG Architect provide support for enterprise-grade features like multi-tenancy?

Learn More About AI Agent Skills

Similar Skills

prompt-engineer

env-doctor

code-reviewer

git-commit-writer