In the following, we present samples for MAGNeT MusicGen, MusicLM, using the public AI Test Kitchen demo, AudioLDM2, and Mousai, which we retrained on the same dataset as MAGNeT.
Description | MAGNeT | MusicGen | MusicLM | AudioLDM2 | Mousai |
Earthy tones, environmentally conscious, ukulele-infused, harmonic, breezy, easygoing, organic instrumentation, gentle grooves | |||||
80s electronic track with melodic synthesizers, catchy beat and groovy bass | |||||
Smooth jazz, with a saxophone solo, piano chords, and snare full drums | |||||
A grand orchestral arrangement with thunderous percussion, epic brass fanfares, and soaring strings, creating a cinematic atmosphere fit for a heroic battle | |||||
Rock with saturated guitars, a heavy bass line and crazy drum break and fills |
In the following, we present samples for MAGNeT AudioGen, and AudioLDM2.
Description | MAGNeT | AudioGen | AudioLDM2 |
Whistling with wind blowing | |||
A toilet flushing as music is playing and a man is singing in the distance | |||
Pigeons are making grunting sounds and snapping beaks | |||
Seagulls squawking as ocean waves crash while wind blows heavily into a microphone |
We present samples of Hybrid-MAGNeT where the first 5-seconds were generated using an autoregressive mode, while the rest were generated in a non-autoregressive manner.
Description | Hybrid-MAGNeT |
Hypnotic and bouncy, with hip hop trap elements featuring trippy synthesizer and synth drums to create a content and chill mood | |
Funky and confident, featuring groovy electric guitar, keyboards that create a chill, laid-back mood | |
Heavy, hard and driving, in the style of Pop Punk, featuring edgy electric guitar that creates a bold, rebellious mood | |
Contemporary Jazz Waltz featuring a fabulous guitar solo | |
Bright and groovy, featuring a Tropical House feel and warm synth textures that create an enthusiastic mood. |
We present 10-second samples from MAGNeT trained with and without the temporal context restriction as defined in our paper.
Description | MAGNeT w.o. restricted context | MAGNeT |
House track with pads and synths creating a tripping harmony | ||
House track with pads and synths creating a tripping harmony | ||
House track with pads and synths creating a tripping harmony | ||
Funky groove with electric piano playing blue chords rhythmically | ||
Funky groove with electric piano playing blue chords rhythmically | ||
Funky groove with electric piano playing blue chords rhythmically |
@misc{ziv2024masked, title={Masked Audio Generation using a Single Non-Autoregressive Transformer}, author={Alon Ziv and Itai Gat and Gael Le Lan and Tal Remez and Felix Kreuk and Alexandre Défossez and Jade Copet and Gabriel Synnaeve and Yossi Adi}, year={2024}, eprint={2401.04577}, archivePrefix={arXiv}, primaryClass={cs.SD} }