Creating and transforming audio has always been challenging. NVIDIA’s latest AI tool, Fugatto, allows you to generate and edit music, voices and sounds using simple text prompts or audio files. Whether describing a sound or envisioning something entirely new, this innovative tool turns your ideas into reality.
How Fugatto works
With Fugatto, you can compose music snippets based on written prompts, alter existing audio by adding or removing instruments, or modify the tone and emotion of a voice. The tool even pushes the boundaries of creativity, enabling you to produce unique sounds like a trumpet that barks or a saxophone that meows. “Whatever users can describe, the model can create,” NVIDIA stated in a blog post.
Rafael Valle, manager of applied audio research at NVIDIA, and an experienced orchestral conductor, explained the vision behind Fugatto:
“We wanted to create a model that understands and generates sound like humans do. Fugatto is our first step toward a future where unsupervised multitask learning in audio synthesis, and transformation emerges from data and model scale.”
Transforming audio possibilities
Fugatto’s capabilities make it a powerful tool for musicians, sound designers, and creators in various fields. For example, you can quickly prototype a song idea, experiment with different musical styles, or enhance the audio quality of an existing track. Need to adjust the mood of a composition? Fugatto allows you to effortlessly alter emotions or accents.
Learning a new language becomes even more engaging with Fugatto. Imagine being taught by a voice that sounds like a loved one—personalising the learning experience in an entirely new way.
Fugatto also introduces temporal interpolation, which creates dynamic sounds that change over time. A striking example is crafting the audio of a rainstorm moving across an area, complete with the crackle of thunder fading into the distance.
Endless creative opportunities
Fugatto’s applications are vast and exciting. For composers and audio engineers, it’s a time-saving tool that enhances creativity and flexibility. For educators, it offers innovative ways to teach through personalised soundscapes. It even opens doors for filmmakers and game developers looking to design immersive audio experiences.
As AI tools like Fugatto advance, how we create, transform, and understand audio will change dramatically. NVIDIA’s innovation is paving the way for a future where sound is limited only by your imagination.