Title | Authors | Venue | Cited by | Year
Noise estimation for generative diffusion models | R San-Roman, E Nachmani, L Wolf | arXiv preprint arXiv:2104.02600, 2021 | 113 | 2021
Seamless: Multilingual Expressive and Streaming Speech Translation | L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, M Duppenthaler, ... | arXiv preprint arXiv:2312.05187, 2023 | 100 | 2023
Non Gaussian Denoising Diffusion Models | E Nachmani, RS Roman, L Wolf | arXiv preprint arXiv:2106.07582, 2021 | 66 | 2021
Proactive detection of voice cloning with localized watermarking | R San Roman, P Fernandez, H Elsahar, A Défossez, T Furon, T Tran | International Conference on Machine Learning 235, 2024 | 28* | 2024
Denoising diffusion gamma models | E Nachmani, RS Roman, L Wolf | arXiv preprint arXiv:2110.05948, 2021 | 25 | 2021
From discrete tokens to high-fidelity audio using multi-band diffusion | R San Roman, Y Adi, A Deleforge, R Serizel, G Synnaeve, A Défossez | Advances in Neural Information Processing Systems 36, 1526-1538, 2023 | 20 | 2023
Latent Watermarking of Audio Generative Models | R San Roman, P Fernandez, A Deleforge, Y Adi, R Serizel | | 2 | 2024
MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling | S Rouard, RS Roman, Y Adi, A Roebel | arXiv preprint arXiv:2501.01757, 2025 | | 2025
Large Concept Models: Language Modeling in a Sentence Representation Space | LCM team, L Barrault, PA Duquenne, M Elbayad, A Kozhevnikov, ... | arXiv preprint arXiv:2412.08821, 2024 | | 2024