Audio Editing
Compare the latest audio editing offerings side by side. Every listing links to pricing, free plan information, and direct competitors.
Fugatto
https://blogs.nvidia.com/blog/fugatto-gen-ai-sound-model/
Fugatto by NVIDIA is a generative audio model that lets you produce and transform music, voices, and soundscapes from text or audio.
MMAudio
https://hkchengrex.com/MMAudio/
MMAudio is an AI tool that automatically generates context-appropriate audio (sound effects, ambience, etc.) from videos or text prompts, syncing sound to visuals to make content more immersive.
Covers AI
https://covers.ai/
Covers.ai is an AI music tool that lets creators generate song covers, swap genres or lyrics, use custom voices, and produce viral TikTok-style music content in minutes.
CloneDub
https://www.clonedub.com/
CloneDub is a tool for high-quality automated video dubbing in 20-plus languages, preserving the original voice and audio context. Upload content, choose target language, and get dubbed video/audio.
Emote Portrait Alive (EMO)
https://humanaigc.github.io/emote-portrait-alive/
EMO (Emote Portrait Alive) is an AI innovation that animates static portraits to talk or sing — using a single image and audio input, it generates expressive video-avatars with realistic expression.
Voicemy.ai
https://voicemy.ai/
Clone voices or train your own custom model, compose melodies, and create AI voiceovers with Voicemy.ai. Free tier available; paid plans from ~$9.99/month. Coming soon: full text-to-speech support.
Epidemic Sound
https://www.epidemicsound.com/voices/use-your-voice/
Epidemic Sound is a subscription-based royalty-free music & sound effects service offering thousands of tracks and tools with global licensing so creators can use premium audio.
voice.ai
https://voice.ai/
Voice.ai is an AI-powered voice changer & text-to-speech tool with thousands of realistic voices, voice cloning, and real-time voice transformations — great for streaming, content creators, gamers, an
Video Candy
https://videocandy.com/
Video Candy is an online video editor with 70+ tools (trim, merge, compress, add text/music etc.), freemium pricing, and no software install needed — ideal for quick video edits, social content, and m
Vidiofy
https://www.vidiofy.ai/
Vidiofy converts your articles or blog posts into engaging vertical videos for Instagram Reels, TikTok, Shorts etc., using AI — complete with branded templates, voice narration, and multilingual suppo
Aidaptive
https://aidaptive.com/
Aidaptive is an AI personalization platform for e-commerce & hospitality, automating product recommendations, search, audience segmentation & marketing content to deliver personalized shopping experie
Tad AI
https://tad.ai/
An AI-powered song generator that transforms text prompts or brief lyrics into royalty-free music—ideal for creators needing fast, customized background tracks.
T2A-Feedback
https://t2a.org.uk/2011/09/29/new-t2a-website/
A large dataset and AI feedback system for evaluating and improving text-to-audio generation models—scoring outputs on event occurrence, order, and acoustic/harmonic quality, and enabling better model
Audio Flamingo 3
https://research.nvidia.com/labs/adlr/AF3/
An open-source audio-language model by NVIDIA for in-depth audio understanding and reasoning (speech, sound, music), supporting long audio, multi-turn chat, and voice-to-voice interaction.
CosyAudio
https://www.researchgate.net/publication/388459609_CosyAudio_Improving_Audio_Generation_with_Confidence_Scores_and_Synthetic_Captions
A research framework for text-to-audio generation that uses synthetic captions + confidence scoring to filter noisy data and improve the quality and faithfulness of generated audio.
WaveLLDM
https://www.aimodels.fyi/papers/arxiv/wavelldm-design-development-lightweight-latent-diffusion-model
A research model for efficient speech denoising and restoration using a compressed latent space via a diffusion approach—faster and less resource-intensive than many waveform-based methods.
AudioGen-Omni
http://ciyou2.github.io/AudioGen-Omni
An advanced AI model that generates speech, song, or general audio synchronized with video or text input; supports multimodal inputs and efficient generation with strong lip-sync and alignment.
Voice Restore
https://huggingface.co/jadechoghari/VoiceRestore
An AI model that restores speech recordings by using advanced transformer techniques to clean up noise, reverb, distortion and other defects — even in severely degraded audio.
iZotope RX Elements
https://www.izotope.com/en/shop/rx-elements/
A basic audio repair toolkit with AI help — clean up clicks, hum, reverb, clipping, and background noise quickly with six essential plugins.
Acon Digital Acoustica
https://acondigital.com/products/acoustica
Acoustica is a professional audio editor designed for editing, mastering, restoration, post-production, and sound design. It is developed by Acon Digital and available for both Windows and macOS. Ther
ElevenLabs Voice
https://elevenlabs.io/docs/capabilities/voices
A high-fidelity AI voice platform that lets you generate, clone, design, and dub voices with realism and expressiveness—powered by text-to-speech, voice cloning, and conversational agents.
End Boost
https://alexaudiobutler.com/
Automatic AI-powered audio mixing tool that balances voice, music, and effects with presets, ducking, mastering & denoising — without needing deep audio skills.
GoldWave
https://goldwave.com/
A long-standing, full-featured audio editor for recording, editing, restoration, and conversion now also usable in browsers via “Infinity” for cross-platform flexibility.
WAVS
https://wavs.com/
An AI-powered sample discovery tool that finds sonically similar sounds from WAVS’ extensive library—integrated directly into your DAW for instant, key/tempo-matched results.