0

    describe-rename-sound-files

    by Fredrik Akerstrom

    SoundTag AI: Automatically describe and batch-rename audio files based on their actual sound using local ML or Gemini AI.

    0 installs
    1 views

    About This Skill

    SoundTag AI -Listens & Renames Your Sound Files

    What it does

    This skill solves the problem of messy, auto-generated audio filenames like audio_track_v2_final_99.wav. It analyzes the actual content of sound files and renames them with human-readable, descriptive titles such as ElevenLabs_2024-07-21T15_43_56_George_pre_s50_sb75_se0_b_m2.mp3 → Elevenlabs_George_Voice_Speech.mp3 or Bright_Trumpet_Fanfare.wav or Large_Crowd_Cheering.mp3.

    Supported tools

    • Local ML (AST): Uses the MIT Audio Spectrogram Transformer to classify sounds into 527 categories (Speech, Music, Explosion, etc.) entirely offline.
    • Google Gemini API: Leverages advanced multimodal AI for nuanced descriptions of cinematic SFX, moods, and complex textures.
    • Batch Processing: Supports .wav, .mp3, .ogg, .flac, .aac, .m4a, and more.

    Why use this skill

    Unlike simple prompting, this skill implements a sophisticated two-step workflow. It first attempts a high-speed local classification to save on API costs and privacy. For ambiguous sounds, it provides a structured "improvement pass" using Gemini. It intelligently combines ML labels with hidden hints from the original filename to ensure context is never lost. It handles environment constraints automatically, including specific dependency versions (Torch/Transformers) to fit within sandboxed resource limits.

    Output

    The result is a clean, organized directory where every sound file follows a consistent Title_Case_With_Underscores naming convention, making your sample libraries and field recordings instantly searchable.

    Known Limitations

    Requires Python 3.8+ installed. Model download is ~350MB on first run. Works best on clearly identifiable sounds — abstract/cinematic SFX may need the optional Gemini enhancement step. Processes first 10 seconds of each file.

    Example Outputs

    AST Model (local, instant):
    ElevenLabs_2024-10-14T02_58_09_Eric_pre_s23_sb19_se69_b_m2.mp3 → Elevenlabs_Eric_Voice_Speech.mp3
    Epic_trumpet_birthda_#1-1772723461703.wav → Bright_Trumpet_Horn_Fanfare.wav
    AST + Gemini two-step (best quality):
    Lots_Snow_From_Knock.mp3 → Heavy_Wooden_Knocking.mp3
    Rover_Car_Wheele_Vehicle_Aircraft.mp3 → Jet_Turbine_Spin_Up.mp3
    Electronic_Glitch_Owl.mp3 → Digital_Radio_Static.mp3

    How to Install

    unzip describe-rename-sound-files.zip -d ~/.claude/skills/

    $10

    One-time purchase • Own forever

    Security Scanned

    Passed automated security review

    Permissions

    Terminal / Shell
    Network Access
    Environment Variables
    Read Files
    Browser
    Write Files

    Allowed Hosts

    download.pytorch.org
    generativelanguage.googleapis.com
    huggingface.co

    File Scopes

    describe-rename-sound-files/**

    Requirements: Python 3 (free from python.org - skill downloads it automatically if needed), 500 MB disk space, internet for first-time setup only, or if Gemini overkill solution is selected.

    Tags

    audio-processing
    machine-learning
    file-management
    productivity
    python
    Sound-file-renamer
    SFX-classification-tool
    audio-sample-organizer

    Works on Mac and Windows. Supports MP3, WAV, OGG, FLAC, AAC, M4A, OPUS, and WMA. The AI model downloads automatically on first run (~350 MB, one time), then everything runs 100% offline on your computer. No subscriptions, no API keys (except if you want to improve further with the overkill Gemini solution), no cloud — buy once, use forever. Perfect for music producers, sound designers, podcasters, creators, youtubers, game developers, and anyone with a folder full of unreadable audio filenames from Elevenlabs etc.

    Creator

    F
    Fredrik Akerstrom

    Claude Skills

    Freelancer, internet entrepreneur and music producer from Sweden.

    Frequently Asked Questions

    Similar Skills