1

    nvidia-voice-clone

    by Kevin Cline

    Clone any voice or generate professional text-to-speech using NVIDIA's zero-shot Magpie NIM technology.

    Updated Apr 2026
    Security scanned
    One-time purchase

    $12

    One-time purchase · Own forever

    ⚡ Also available via Agensi Pro — your AI agent can load this skill on demand via MCP. Learn more →

    Included in download

    • Create personalized voiceovers for demos using a short audio reference
    • Generate multilingual narration for documentation and tutorials
    • terminal, network, env_vars automation included
    • Includes example output and usage patterns
    • Instant install

    See it in action

    Cloning voice and generating speech...
    Text: Welcome to the new platform interface.
    Voice sample: ./samples/founder_voice.wav
    VOICE: /home/user/.claude-voice-clone/voice_1715234892.wav (452 KB)
    Done! Play the output file to hear the cloned voice.

    About This Skill

    What it does

    This skill enables high-fidelity voice cloning and text-to-speech (TTS) generation directly through your AI agent. By leveraging the NVIDIA Magpie TTS NIM, it can replicate any voice from a brief 10-30 second audio sample or generate professional narration using high-quality preset voices.

    Why use this skill

    Integrating professional-grade voice synthesis into a developer workflow usually requires complex SDKs or expensive subscriptions. This skill streamlines the process by using NVIDIA's zero-shot cloning technology, allowing your agent to produce localized audio assets, narration, or personalized voice feedback without leaving the terminal. It is significantly faster than manual audio processing and utilizes a powerful cloud infrastructure for low-latency synthesis.

    Supported tools & features

    • NVIDIA Magpie TTS Zeroshot: Clone voices from WAV/MP3 files with minimal data.
    • NVIDIA Magpie Multilingual: Support for diverse accents and languages including Spanish, French, and German.
    • Local File Management: Automatically manages audio output and storage in a dedicated local directory.
    • Bypass Setup: Works with a simple API key, removing the need for local GPU-heavy TTS models.

    Output format

    The skill produces high-fidelity WAV audio files stored locally, providing clear, natural-sounding speech that is ready for use in applications, videos, or testing.

    Use Cases

    • Create personalized voiceovers for demos using a short audio reference
    • Generate multilingual narration for documentation and tutorials
    • Prototype voice-enabled applications without local GPU resources
    • Automate the production of audio assets for developer presentations

    Reviews

    No reviews yet — be the first to share your experience.

    Only users who have downloaded or purchased this skill can leave a review.

    Security Scanned

    Passed automated security review

    Permissions

    Terminal / Shell
    Network Access
    Environment Variables

    Allowed Hosts

    build.nvidia.com
    integrate.api.nvidia.com

    Creator

    K
    Kevin Cline

    ClawdWorks

    Builder of autonomous AI agents and Claude Code skills. ClawdWorks creates tools that make AI work harder and longer — from research loops to code optimization to lead gen. Powered by Claude Opus 4.6 + Codex 5.4.

    Frequently Asked Questions

    Similar Skills

    $12

    One-time