Nvidia Debuts Fugatto AI Model That Can Generate Music, Voices and Sound Effects

Nvidia’s Fugatto is short for Foundational Generative Audio Transformer Opus 1.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 26 November 2024 14:26 IST
Highlights
  • The AI audio model accepts a combination of text and audio as input
  • Nvidia’s Fugatto can remove or add instruments from an existing song
  • Nvidia says Fugatto has the ability to combine free-form instructions
Nvidia Debuts Fugatto AI Model That Can Generate Music, Voices and Sound Effects

The full version of Fugatto uses 2.5 billion parameters and was trained on Nvidia DGX systems’ datasets

Photo Credit: Reuters

Nvidia introduced a new artificial intelligence (AI) model on Monday that can generate a variety of audio and mix different types of sounds. The tech giant calls the foundation model Fugatto, which is short for Foundational Generative Audio Transformer Opus 1. While audio-focused AI platforms such as Beatoven and Suno exist, the company highlighted that Fugatto offers users granular control over the desired output. The AI model can generate or transform any mix of music, voices and sound based on specific prompts.

Nvidia Introduces AI Audio Model Fugatto

In a blog post, the tech giant detailed its new large language model (LLM). Nvidia said Fugatto can generate music snippets, remove or add instruments from an existing song, change accent or emotion in a voice, and “even let people produce sounds never heard before.”

The AI model accepts both text and audio files as input, and users can combine both to fine-tune their requests. Under the hood, the foundation model's architecture is based on the company's previous work in speech modelling, audio vocoding, and audio understanding. Its full version uses 2.5 billion parameters and was trained on the datasets of Nvidia DGX systems.

Nvidia highlighted that the team that built Fugatto collaborated from different countries globally including Brazil, China, India, Jordan, and South Korea. The collaboration of people from different ethnicities has also contributed to developing the AI model's multi-accent and multilingual capabilities, the company said.

Advertisement

Coming to the AI audio model's capabilities, the tech giant highlighted that it has the capability to generate audio output types that it was not pre-trained on. Highlighting an example, Nvidia said, “Fugatto can make a trumpet bark or a saxophone meow. Whatever users can describe, the model can create.”

Additionally, Fugatto can combine specific audio capabilities using a technique called ComposableART. With this, users can ask the AI model to generate an audio of a person speaking French with a sad feeling. Users can also control the degree of sorrow and the heaviness of the accent with specific instructions.

Advertisement

Further, the foundation model can also generate audio with temporal interpolation, or sounds that change over time. For instance, users can generate the sound of a rainstorm with crescendos of thunder that fade into the distance. These soundscapes can also be experimented with, and even if it is a sound that the model has never processed before, it can create them.

At present, the company has not shared any plans to make the AI model available to users or enterprises.

 

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

Advertisement

Related Stories

Popular Mobile Brands
  1. Oppo Reno 14 5G, Reno 14 Pro 5G India Launch Timeline Leaked
  2. Apple Back to School Offer Brings Discounts on iPad Air, Other Products
  3. Infinix Note 50s 5G+ Gets a New RAM and Storage Option in India: See Price
  4. iQOO Z10 Lite 5G With 6,000mAh Battery Launched in India: Price, Features
  5. Redmi Pad 2 With 11-Inch 2.5K Display, 9,000mAh Battery Launched in India
  6. Nothing Phone 3 to Offer Longer Software Support Than Its Predecessor
  7. Vivo X200 FE Launch Date, Colours, and Design Revealed Ahead of Launch
  8. Nothing Headphone 1 Price, Colour Options Leaked Ahead of Launch
  1. SpaceX Launches 26 Starlink Satellites from California to Expand Low Earth Orbit Internet Network
  2. NASA and DoD Simulate Critical Abort Scenarios to Secure Artemis II Moon Mission
  3. Brain’s Built-In Signal Threshold Helps Differentiate Imagination from Reality
  4. Feather-Legged Lace Weaver Spider Uses Toxic Silk Instead of Fangs to Kill Its Prey
  5. New Habitability Model Helps Identify Which Alien Planets Might Be Able to Host Life
  6. Warner Bros. Games Restructures to Focus on Harry Potter, Game of Thrones, Mortal Kombat and DC Franchises
  7. Google Pixel 10, Pixel 10 Pro Alleged Case Suggests Minor Design Changes From Predecessors
  8. Oppo Reno 14 5G, Reno 14 Pro 5G India Launch Timeline Leaked
  9. Nothing Phone 3 to Offer Longer Android and Security Update Support Than Its Predecessor
  10. Boat Wave Fortune Smartwatch With NFC Tap & Pay Feature, Bluetooth Calling Launched in India
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.