I have a folder of audio files with messy names like 'ElevenLabs' timestamps and 'Epic_trumpet' IDs. Can you analyze what they actually sound like and rename them to be descriptive?

Your agent does

AST Model (local, instant): ElevenLabs_2024-10-14T02_58_09_Eric_pre_s23_sb19_se69_b_m2.mp3 → Elevenlabs_Eric_Voice_Speech.mp3 Epic_trumpet_birthda_#1-1772723461703.wav → Bright_Trumpet_Horn_Fanfare.wav AST + Gemini two-step (best quality): Lots_Snow_From_Knock.mp3 → Heavy_Wooden_Knocking.mp3 Rover_Car_Wheele_Vehicle_Aircraft.mp3 → Jet_Turbine_Spin_Up.mp3 Electronic_Glitch_Owl.mp3 → Digital_Radio_Static.mp3

Describe Rename Sound Files

Name: Describe Rename Sound Files
Price: 10 USD
Availability: InStock
Author: Agensi

by Fredrik Akerstrom

SoundTag AI: Automatically describe and batch-rename audio files based on their actual sound using local ML or Gemini AI.

Updated Jun 2026

194 views

Security scanned

$10

· or 50 credits

30-day refund guarantee

Secure checkout via Stripe

⚡ Also available via Agensi MCP - your AI agent can load this skill on demand via MCP. Learn more →

Included in download

Identify specific instruments to rename generic music project tracks
Convert cryptic field recording names into descriptive environmental labels
terminal, network, env_vars automation included
Ready for Works on Mac and Windows. Supports MP3
Instant install

See it in action

You say

I have a folder of audio files with messy names like 'ElevenLabs' timestamps and 'Epic_trumpet' IDs. Can you analyze what they actually sound like and rename them to be descriptive?

Your agent does

194 views

Security scanned

About This Skill

SoundTag AI -Listens & Renames Your Sound Files

What it does

This skill solves the problem of messy, auto-generated audio filenames like audio_track_v2_final_99.wav. It analyzes the actual content of sound files and renames them with human-readable, descriptive titles such as ElevenLabs_2024-07-21T15_43_56_George_pre_s50_sb75_se0_b_m2.mp3 → Elevenlabs_George_Voice_Speech.mp3 or Bright_Trumpet_Fanfare.wav or Large_Crowd_Cheering.mp3.

Supported tools

Local ML (AST): Uses the MIT Audio Spectrogram Transformer to classify sounds into 527 categories (Speech, Music, Explosion, etc.) entirely offline.
Google Gemini API: Leverages advanced multimodal AI for nuanced descriptions of cinematic SFX, moods, and complex textures.
Batch Processing: Supports .wav, .mp3, .ogg, .flac, .aac, .m4a, and more.

Why use this skill

Unlike simple prompting, this skill implements a sophisticated two-step workflow. It first attempts a high-speed local classification to save on API costs and privacy. For ambiguous sounds, it provides a structured "improvement pass" using Gemini. It intelligently combines ML labels with hidden hints from the original filename to ensure context is never lost. It handles environment constraints automatically, including specific dependency versions (Torch/Transformers) to fit within sandboxed resource limits.

Output

The result is a clean, organized directory where every sound file follows a consistent Title_Case_With_Underscores naming convention, making your sample libraries and field recordings instantly searchable.

Use Cases

Identify specific instruments to rename generic music project tracks
Convert cryptic field recording names into descriptive environmental labels
Organize voiceover exports by speaker name and performance style
Batch-process sound effect libraries using AI-generated content tags

Known Limitations

Requires Python 3.8+ installed. Model download is ~350MB on first run. Works best on clearly identifiable sounds — abstract/cinematic SFX may need the optional Gemini enhancement step. Processes first 10 seconds of each file.

How to Install

mkdir -p ~/.claude/skills && curl -sL https://www.agensi.io/api/install/describe-rename-sound-files -o /tmp/describe-rename-sound-files.zip && unzip -o /tmp/describe-rename-sound-files.zip -d ~/.claude/skills && rm /tmp/describe-rename-sound-files.zip

Free skills install directly. Paid skills require purchase - use the download button above after buying.

Reviews

No reviews yet - be the first to share your experience.

Only users who have downloaded or purchased this skill can leave a review.

Early access skill

Security scanned

Built by Fredrik Akerstrom

Works on Mac and Windows. Supports MP3, WAV, OGG, FLAC, A…

Example output available

Be the first to review this skill.

Only users who have downloaded or purchased this skill can leave a review.

Security Scanned

Passed automated security review

Permissions

Terminal / Shell

Network Access

Environment Variables

Read Files

Browser

Write Files

Allowed Hosts

download.pytorch.org

generativelanguage.googleapis.com

huggingface.co

File Scopes

describe-rename-sound-files/**

Requirements: Python 3 (free from python.org - skill downloads it automatically if needed), 500 MB disk space, internet for first-time setup only, or if Gemini overkill solution is selected.

Creator

Fredrik Akerstrom

Claude Skills

Freelancer, internet entrepreneur and music producer from Sweden.

Frequently Asked Questions

Learn More About AI Agent Skills

More Premium Skills

inline-comment

Best way to steer your agents, effortlessly.

$9.994 installs

designing-hybrid-context-layers

Architects the right retrieval strategy for every query — teaching your agent when to use RAG, a knowledge graph, or a temporal index instead of defaulting to vector search for everything.

$1016 installs

ai-automation-qa-pack

Professional QA & UAT documentation generator for AI automation agencies and complex agent deployments.

$510 installs

World-Class Site & App Design

Every AI-built UI looks generic and templated. This skill teaches your agent to actually match the design to the project, the industry, and the audience.

$510 installs