
| Input Audio | Text 1 | Text 2 |
|---|---|---|
| Boléro - Ravel | An 80s driving pop song with electronic drums and synth pads in the background | Folk song with accordion and acoustic guitar |
| Flight of the Bumblebee - Rimsky-Korsakov | Fast tempo country song with dominating banjo and acoustic guitars | Psychedelic trance music with deep synth bass |

| text prompt → drum prompt ↓ | Input Audio | 90s rock with electric guitar and heavy drums | Reggae with ukelele and percussions | An 80s driving pop song with heavy drums and synth pads in the background |
|---|---|---|---|---|
| separated drums | | | | |
| separated drums | | | | |
| separated drums | | | | |
| beatboxing | | | | |
| beatboxing | | | | |
| beatboxing | | | | |

| Chord Progression | 90s rock with electric guitar and heavy drums | Reggae with ukelele and percussions | An 80s driving pop song with heavy drums and synth pads in the background |
|---|---|---|---|
| (C, 0.0), (F, 1.0), (G, 1.75), (C, 4.0), (F, 5.0), (G, 5.75), (C, 8.0), (F, 9.0), (A7, 9.75) | | | |
| (Em, 0.0), (G, 1.5), (D, 3.0), (A, 4.5), (Em, 6.0), (G, 7.5), (D, 9.0) | | | |
| (E7, 0.0), (A7, 1.0), (E7, 2.0), (A7, 4.0), (E7, 6.0), (B7, 8.0), (A7, 8.5), (E7, 9.0) | | | |
| (D, 0.0), (F#m, 2.5), (G, 5.0), (D, 7.5) | | | |
| (E, 0.0), (D, 1.25), (A, 2.5), (E, 5.0), (D, 6.25), (A, 7.5) | | | |

| Input Audio | 90s rock with electric guitar and heavy drums | Reggae with ukelele and percussions | An 80s driving pop song with heavy drums and synth pads in the background |
|---|---|---|---|

| Input Sample 1 | Input Sample 2 | Input Sample 3 |
|---|---|---|

| condition ↓ text → | 90s rock with electric guitar and heavy drums | lofi slow bpm electro chill with organic samples | Groovy and bright, with funk elements featuring lively horns, bass, and drums to create a positive and confident mood. |
|---|---|---|---|
| Melody | | | |
| Drums | | | |
| Chords | | | |
| Text-Only | | | |

| text → condition ↓ | A modern Bossa Nova using traditional Brazilian instruments. With nylon stringed guitar, piano, bass, flugel horn and percussion. bpm 112 | Groovy and bright, with funk elements featuring lively horns, bass, and drums to create a positive and confident mood. | Bright and grooving, featuring vocal chops, synthesizers, bass, and beats that create a proud, soaring mood. | Soaring and hopeful, featuring atmospheric electric guitar, floating chops, bouncy choir and light synth drums that create a dreamy, inspirational mood. bpm 105 |
|---|---|---|---|---|
| Source Audio | | | | |
| Chords + Drums | | | | |
| Chords + Melody | | | | |
| Drums + Melody | | | | |
| All Controls | | | | |

@misc{tal2024joint,
title={Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation},
author={Or Tal and Alon Ziv and Itai Gat and Felix Kreuk and Yossi Adi},
year={2024},
eprint={2406.10970},
archivePrefix={arXiv},
primaryClass={cs.SD}
}