1

    fal-whisper

    by Kevin Cline

    Fast, asynchronous Whisper transcription for audio and video files with SRT subtitle export.

    Updated Apr 2026
    Security scanned
    One-time purchase

    $12

    One-time purchase · Own forever

    ⚡ Also available via Agensi Pro — your AI agent can load this skill on demand via MCP. Learn more →

    Included in download

    • Convert podcast recordings or interviews into searchable text documents.
    • Generate professional SRT captions for YouTube or social media videos.
    • terminal, network, env_vars automation included
    • Includes example output and usage patterns
    • Instant install

    See it in action

    Transcript saved to: ~/.fal-whisper/interview_1715832000.srt
    Preview:
    1
    00:00:00,000 --> 00:00:04,500
    Thanks for joining us today. Start by telling us about your vision.
    
    2
    00:00:04,500 --> 00:00:08,200
    In our vision, AI agents will handle 90% of routine CRUD operations autonomously.

    About This Skill

    High-Speed Whisper Transcription

    Integrate professional-grade audio transcription into your agent's workflow using the fal.ai Fast Whisper model. This skill provides a robust Python-based interface to transcribe audio or video files into accurate, searchable text with optional word-level timestamps.

    What it does

    • Async Processing: Handles transcription through an asynchronous queue (submit, poll, retrieve) designed for stability with large files.
    • Local & Remote Support: Transcribe files directly from your local machine using base64 encoding or provide a public URL for cloud-hosted files.
    • SRT Generation: Automatically generates industry-standard SubRip (.srt) subtitle files with precision timestamps.
    • Broad Format Support: Works with MP3, MP4, M4A, WAV, FLAC, and more.

    Why use this skill?

    Transcribing audio is computationally expensive and difficult to get right with vanilla prompting. This skill offloads the heavy lifting to fal.ai's optimized hardware, returning structured data including text chunks and segment timestamps. It specifically solves the problem of file size limitations and provides persistent local storage for your transcripts in a dedicated directory (~/.fal-whisper/).

    Output Format

    The skill produces two primary outputs: a clean .txt file containing the full transcript and an optional .srt file ready for use in video editors like Premiere Pro or DaVinci Resolve.

    Use Cases

    • Convert podcast recordings or interviews into searchable text documents.
    • Generate professional SRT captions for YouTube or social media videos.
    • Extract minutes and actionable notes from recorded team meetings.
    • Create accessibility-compliant transcripts for educational video content.

    Reviews

    No reviews yet — be the first to share your experience.

    Only users who have downloaded or purchased this skill can leave a review.

    Security Scanned

    Passed automated security review

    Permissions

    Terminal / Shell
    Network Access
    Environment Variables

    Allowed Hosts

    fal.ai
    example.com
    queue.fal.run

    Creator

    K
    Kevin Cline

    ClawdWorks

    Builder of autonomous AI agents and Claude Code skills. ClawdWorks creates tools that make AI work harder and longer — from research loops to code optimization to lead gen. Powered by Claude Opus 4.6 + Codex 5.4.

    Frequently Asked Questions

    Similar Skills

    $12

    One-time