Describe Rename Sound Files
SoundTag AI: Automatically describe and batch-rename audio files based on their actual sound using local ML or Gemini AI.
- Identify specific instruments to rename generic music project tracks
- Convert cryptic field recording names into descriptive environmental labels
- Organize voiceover exports by speaker name and performance style
$10
· or 50 creditsSecure checkout via Stripe
Included in download
- Identify specific instruments to rename generic music project tracks
- Convert cryptic field recording names into descriptive environmental labels
- terminal, network, env_vars automation included
- Ready for Works on Mac and Windows. Supports MP3
Freelancer, internet entrepreneur and music producer from Sweden.
See it in action
You say
I have a folder of audio files with messy names like 'ElevenLabs' timestamps and 'Epic_trumpet' IDs. Can you analyze what they actually sound like and rename them to be descriptive?
Your agent does
AST Model (local, instant): ElevenLabs_2024-10-14T02_58_09_Eric_pre_s23_sb19_se69_b_m2.mp3 → Elevenlabs_Eric_Voice_Speech.mp3 Epic_trumpet_birthda_#1-1772723461703.wav → Bright_Trumpet_Horn_Fanfare.wav AST + Gemini two-step (best quality): Lots_Snow_From_Knock.mp3 → Heavy_Wooden_Knocking.mp3 Rover_Car_Wheele_Vehicle_Aircraft.mp3 → Jet_Turbine_Spin_Up.mp3 Electronic_Glitch_Owl.mp3 → Digital_Radio_Static.mp3
Describe Rename Sound Files
SoundTag AI: Automatically describe and batch-rename audio files based on their actual sound using local ML or Gemini AI.
$10
· or 50 creditsSecure checkout via Stripe
Included in download
- Identify specific instruments to rename generic music project tracks
- Convert cryptic field recording names into descriptive environmental labels
- terminal, network, env_vars automation included
- Ready for Works on Mac and Windows. Supports MP3
- Instant install
See it in action
You say
I have a folder of audio files with messy names like 'ElevenLabs' timestamps and 'Epic_trumpet' IDs. Can you analyze what they actually sound like and rename them to be descriptive?
Your agent does
AST Model (local, instant): ElevenLabs_2024-10-14T02_58_09_Eric_pre_s23_sb19_se69_b_m2.mp3 → Elevenlabs_Eric_Voice_Speech.mp3 Epic_trumpet_birthda_#1-1772723461703.wav → Bright_Trumpet_Horn_Fanfare.wav AST + Gemini two-step (best quality): Lots_Snow_From_Knock.mp3 → Heavy_Wooden_Knocking.mp3 Rover_Car_Wheele_Vehicle_Aircraft.mp3 → Jet_Turbine_Spin_Up.mp3 Electronic_Glitch_Owl.mp3 → Digital_Radio_Static.mp3
About This Skill
SoundTag AI -Listens & Renames Your Sound Files
What it does
This skill solves the problem of messy, auto-generated audio filenames like audio_track_v2_final_99.wav. It analyzes the actual content of sound files and renames them with human-readable, descriptive titles such as
ElevenLabs_2024-07-21T15_43_56_George_pre_s50_sb75_se0_b_m2.mp3 → Elevenlabs_George_Voice_Speech.mp3 or
Bright_Trumpet_Fanfare.wav or Large_Crowd_Cheering.mp3.
Supported tools
- Local ML (AST): Uses the MIT Audio Spectrogram Transformer to classify sounds into 527 categories (Speech, Music, Explosion, etc.) entirely offline.
- Google Gemini API: Leverages advanced multimodal AI for nuanced descriptions of cinematic SFX, moods, and complex textures.
- Batch Processing: Supports .wav, .mp3, .ogg, .flac, .aac, .m4a, and more.
Why use this skill
Unlike simple prompting, this skill implements a sophisticated two-step workflow. It first attempts a high-speed local classification to save on API costs and privacy. For ambiguous sounds, it provides a structured "improvement pass" using Gemini. It intelligently combines ML labels with hidden hints from the original filename to ensure context is never lost. It handles environment constraints automatically, including specific dependency versions (Torch/Transformers) to fit within sandboxed resource limits.
Output
The result is a clean, organized directory where every sound file follows a consistent Title_Case_With_Underscores naming convention, making your sample libraries and field recordings instantly searchable.
Use Cases
- Identify specific instruments to rename generic music project tracks
- Convert cryptic field recording names into descriptive environmental labels
- Organize voiceover exports by speaker name and performance style
- Batch-process sound effect libraries using AI-generated content tags
Known Limitations
Requires Python 3.8+ installed. Model download is ~350MB on first run. Works best on clearly identifiable sounds — abstract/cinematic SFX may need the optional Gemini enhancement step. Processes first 10 seconds of each file.
How to Install
mkdir -p ~/.claude/skills && curl -sL https://www.agensi.io/api/install/describe-rename-sound-files -o /tmp/describe-rename-sound-files.zip && unzip -o /tmp/describe-rename-sound-files.zip -d ~/.claude/skills && rm /tmp/describe-rename-sound-files.zipFree skills install directly. Paid skills require purchase - use the download button above after buying.
Reviews
No reviews yet - be the first to share your experience.
Only users who have downloaded or purchased this skill can leave a review.
Early access skill
Be the first to review this skill.
Only users who have downloaded or purchased this skill can leave a review.
Security Scanned
Passed automated security review
Permissions
Allowed Hosts
File Scopes
Requirements: Python 3 (free from python.org - skill downloads it automatically if needed), 500 MB disk space, internet for first-time setup only, or if Gemini overkill solution is selected.
Tags
Works on Mac and Windows. Supports MP3, WAV, OGG, FLAC, AAC, M4A, OPUS, and WMA. The AI model downloads automatically on first run (~350 MB, one time), then everything runs 100% offline on your computer. No subscriptions, no API keys (except if you want to improve further with the overkill Gemini solution), no cloud — buy once, use forever. Perfect for music producers, sound designers, podcasters, creators, youtubers, game developers, and anyone with a folder full of unreadable audio filenames from Elevenlabs etc.
Creator
Claude Skills
Freelancer, internet entrepreneur and music producer from Sweden.
Frequently Asked Questions
Learn More About AI Agent Skills
More Premium Skills

inline-comment
Best way to steer your agents, effortlessly.
designing-hybrid-context-layers
Architects the right retrieval strategy for every query — teaching your agent when to use RAG, a knowledge graph, or a temporal index instead of defaulting to vector search for everything.
ai-automation-qa-pack
Professional QA & UAT documentation generator for AI automation agencies and complex agent deployments.
World-Class Site & App Design
Every AI-built UI looks generic and templated. This skill teaches your agent to actually match the design to the project, the industry, and the audience.