-
Long-form music generation with latent diffusion
Paper • 2404.10301 • Published • 27 -
Rhythmic Foley: A Framework For Seamless Audio-Visual Alignment In Video-to-Audio Synthesis
Paper • 2409.08628 • Published -
SpecDiff-GAN: A Spectrally-Shaped Noise Diffusion GAN for Speech and Music Synthesis
Paper • 2402.01753 • Published -
Apollo: Band-sequence Modeling for High-Quality Audio Restoration
Paper • 2409.08514 • Published • 11
Matteo Lorito
mattebass
AI & ML interests
None yet
Organizations
None yet
doc-doc
-
Long-form music generation with latent diffusion
Paper • 2404.10301 • Published • 27 -
Rhythmic Foley: A Framework For Seamless Audio-Visual Alignment In Video-to-Audio Synthesis
Paper • 2409.08628 • Published -
SpecDiff-GAN: A Spectrally-Shaped Noise Diffusion GAN for Speech and Music Synthesis
Paper • 2402.01753 • Published -
Apollo: Band-sequence Modeling for High-Quality Audio Restoration
Paper • 2409.08514 • Published • 11
models
0
None public yet
datasets
0
None public yet