
    cf-ai-whisper

    by Kevin Cline

    Free speech-to-text and SRT subtitle generation using OpenAI Whisper on Cloudflare Workers AI.

    Updated Apr 2026
    Security scanned
    One-time purchase

    $12

    One-time purchase · Own forever

    ⚡ Also available via Agensi Pro: your AI agent can load this skill on demand via MCP.

    Included in download

    • Generate SRT subtitles for videos with precise word-level timestamps
    • Convert voice memos and podcast episodes into searchable text transcripts
    • Terminal, network, and environment-variable automation included
    • Includes example output and usage patterns
    • Instant install

    See it in action

    --- TRANSCRIPT (11 words) ---
    Hello and welcome to today's meeting. We'll be discussing the results.
    
    Subtitles saved: meeting_20231027.srt
    1
    00:00:00,000 --> 00:00:03,200
    Hello and welcome to today's meeting.

    About This Skill

    What it does

    The cf-ai-whisper skill transcribes audio with OpenAI Whisper running on Cloudflare Workers AI. It lets developers and other users convert speech to text with high accuracy without an OpenAI API key or account. The skill accepts common audio formats and can automatically generate SRT subtitle files from word-level timestamps.
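As a rough illustration of what a call to Whisper on Workers AI can look like, the sketch below builds (but does not send) a request against Cloudflare's REST route for running a model. It is not the skill's actual code: the function name `build_whisper_request` is hypothetical, and the route and model ID follow Cloudflare's public documentation as understood here, so check them against the current docs before relying on them.

```python
import urllib.request

# Assumption: Cloudflare's documented REST route for running a Workers AI model.
API_BASE = "https://api.cloudflare.com/client/v4"
# Assumption: model ID for Whisper in the Workers AI catalog.
MODEL = "@cf/openai/whisper"


def build_whisper_request(account_id: str, api_token: str, audio: bytes) -> urllib.request.Request:
    """Build a POST request that carries the raw audio bytes as its body.

    Hypothetical helper for illustration; nothing is sent over the network here.
    """
    url = f"{API_BASE}/accounts/{account_id}/ai/run/{MODEL}"
    return urllib.request.Request(
        url,
        data=audio,
        method="POST",
        headers={
            # A Cloudflare API token is used; no OpenAI key is involved.
            "Authorization": f"Bearer {api_token}",
            "Content-Type": "application/octet-stream",
        },
    )
```

In practice the account ID and token would most likely come from environment variables (this listing does not say which names the skill expects), and the request would be sent with `urllib.request.urlopen`, with the JSON response parsed for the transcript.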

    Why use this skill

    Most transcription services are expensive or require complex local setups. This skill offers a cost-effective alternative by using Cloudflare’s free tier. Unlike basic LLM prompting, it manages the binary data handling, audio MIME-type mapping, and timestamp calculations required to create accurately synced subtitles. It removes the need for manual transcription while keeping your data within the Cloudflare ecosystem you already use.
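The MIME-type mapping mentioned above could look something like the following sketch. The table and the `guess_audio_mime` helper are hypothetical, since the skill's real mapping is not shown in this listing; the extensions are taken from the supported formats it names.

```python
from pathlib import Path

# Hypothetical mapping for illustration; the skill's own table may differ.
AUDIO_MIME_TYPES = {
    ".wav": "audio/wav",
    ".mp3": "audio/mpeg",
    ".mp4": "audio/mp4",
    ".m4a": "audio/mp4",
    ".ogg": "audio/ogg",
    ".flac": "audio/flac",
    ".webm": "audio/webm",
}


def guess_audio_mime(filename: str) -> str:
    """Return a MIME type based on the file extension (case-insensitive).

    Unknown extensions fall back to a generic binary type.
    """
    suffix = Path(filename).suffix.lower()
    return AUDIO_MIME_TYPES.get(suffix, "application/octet-stream")
```

Falling back to `application/octet-stream` keeps unknown files uploadable as raw bytes rather than rejecting them outright, which is one reasonable design choice among several.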

    Supported tools

    • Frameworks: Cloudflare Workers AI, OpenAI Whisper
    • Audio Formats: WAV, MP3, MP4, M4A, OGG, FLAC, WebM, and more
    • Outputs: Plain text transcripts (.txt) and SubRip Subtitle files (.srt)

    The Output

    The skill produces two primary artifacts: a clean, high-accuracy text transcript and a professionally formatted SRT file. The output includes metadata such as word count and relies on word-level timestamps to ensure subtitle synchronization is precise.
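To make the timestamp step concrete, here is a minimal sketch of turning word-level timestamps into SRT cues. It assumes each word arrives as a dict with `word`, `start`, and `end` keys (times in seconds), and it groups a fixed number of words per cue; both the input shape and the grouping rule are assumptions, not the skill's confirmed behavior.

```python
def format_srt_time(seconds: float) -> str:
    """Format seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def words_to_srt(words: list[dict], max_words: int = 7) -> str:
    """Group words into cues of at most max_words and render them as SRT.

    Each cue starts at its first word's start time and ends at its last
    word's end time. The grouping rule is a simplification for illustration.
    """
    cues = []
    for i in range(0, len(words), max_words):
        chunk = words[i:i + max_words]
        start = format_srt_time(chunk[0]["start"])
        end = format_srt_time(chunk[-1]["end"])
        text = " ".join(w["word"].strip() for w in chunk)
        cues.append(f"{len(cues) + 1}\n{start} --> {end}\n{text}\n")
    # A blank line separates consecutive cues.
    return "\n".join(cues)
```

For example, `format_srt_time(3.2)` returns `00:00:03,200`, the same timestamp form used in the sample output. A production version would probably also break cues at sentence boundaries or long pauses rather than on word count alone.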


    Use Cases

    • Generate SRT subtitles for videos with precise word-level timestamps
    • Convert voice memos and podcast episodes into searchable text transcripts
    • Transcribe recorded meetings without paying for expensive third-party seats
    • Batch process audio folders into text using Cloudflare's free AI tier


    Security Scanned

    Passed automated security review

    Permissions

    Terminal / Shell
    Network Access
    Environment Variables

    Allowed Hosts

    dash.cloudflare.com
    api.cloudflare.com

    Creator

    Kevin Cline

    ClawdWorks

    Builder of autonomous AI agents and Claude Code skills. ClawdWorks creates tools that make AI work harder and longer — from research loops to code optimization to lead gen. Powered by Claude Opus 4.6 + Codex 5.4.
