nvidia-ocr
by Kevin Cline
High-precision OCR for images, tables, and handwriting using NVIDIA NeMo Retriever.
- Extract tabular data from screenshots or PDFs into structured text.
- Digitize handwritten notes and save them as searchable markdown.
- Batch process a folder of images to extract and aggregate text data.
Secure checkout via Stripe
Included in download
- Extract tabular data from screenshots or PDFs into structured text.
- Digitize handwritten notes and save them as searchable markdown.
- terminal, network, env_vars automation included
- Includes example output and usage patterns
See it in action
A real example of what this skill takes in and produces.
Sample output
[99.2%] INVOICE #1024 [98.5%] Date: 2023-11-15 [95.1%] Total: $1,250.00 [88.4%] Item: NVIDIA H100 GPU (Qty: 1) Full text saved to: ~/.claude-ocr/ocr_1700000000.txt Total text blocks: 4
nvidia-ocr
by Kevin Cline
High-precision OCR for images, tables, and handwriting using NVIDIA NeMo Retriever.
Secure checkout via Stripe
Included in download
- Extract tabular data from screenshots or PDFs into structured text.
- Digitize handwritten notes and save them as searchable markdown.
- terminal, network, env_vars automation included
- Includes example output and usage patterns
- Instant install
See it in action
A real example of what this skill takes in and produces.
Sample output
[99.2%] INVOICE #1024 [98.5%] Date: 2023-11-15 [95.1%] Total: $1,250.00 [88.4%] Item: NVIDIA H100 GPU (Qty: 1) Full text saved to: ~/.claude-ocr/ocr_1700000000.txt Total text blocks: 4
About This Skill
What it does
This skill provides high-performance Optical Character Recognition (OCR) by leveraging the NVIDIA NeMo Retriever API. It allows your AI agent to "see" and extract text from images and documents with professional-grade accuracy. It handles complex structures like tables, charts, receipts, and even handwriting, returning structured text along with confidence scores and bounding box data.
Why use this skill
Standard LLM vision capabilities can sometimes hallucinate text or struggle with small, dense data like tables or low-quality screenshots. This skill uses a specialized OCR model optimized for precision. It supports batch processing of entire directories, provides confidence metrics to ensure data reliability, and automatically saves output to structured files for further analysis. It is significantly faster and more accurate for data extraction tasks than generic vision prompting.
Supported tools
- NVIDIA NeMo Retriever: State-of-the-art OCR foundation model.
- Python Integration: Built-in handling for Base64 encoding and batch file processing.
- Exporting: Saves results locally in .txt or .md formats for easy developer access.
Use Cases
- Extract tabular data from screenshots or PDFs into structured text.
- Digitize handwritten notes and save them as searchable markdown.
- Batch process a folder of images to extract and aggregate text data.
- Verify automated test results by extracting text from UI screenshots.
How to Install
mkdir -p ~/.claude/skills && curl -sL https://www.agensi.io/api/install/nvidia-ocr | tar xz -C ~/.claude/skills/Free skills install directly. Paid skills require purchase - use the download button above after buying.
Reviews
No reviews yet - be the first to share your experience.
Only users who have downloaded or purchased this skill can leave a review.
Early access skill
Be the first to review this skill.
Only users who have downloaded or purchased this skill can leave a review.
Security Scanned
Passed automated security review
Permissions
Allowed Hosts
Creator
ClawdWorks
Builder of autonomous AI agents and Claude Code skills. ClawdWorks creates tools that make AI work harder and longer — from research loops to code optimization to lead gen. Powered by Claude Opus 4.6 + Codex 5.4.
Frequently Asked Questions
Learn More About AI Agent Skills
More Premium Skills
designing-hybrid-context-layers
Architects the right retrieval strategy for every query — teaching your agent when to use RAG, a knowledge graph, or a temporal index instead of defaulting to vector search for everything.
consumer-motivation-analyzer
Go beyond surface-level feedback to uncover the psychological drivers and hidden motivations behind buyer behavior.
keyword-research
Transform URLs or product lists into SEO keyword research packs with Google Ads data and intent-based clustering.
Bounty Security Pattern Master Library — 399 Vulnerability Patterns
A premium library of 399 vulnerability patterns and DeFi attack vectors for AI-driven bug hunting and security audits.