Wei-Ning Hsu

Cited by

	All	Since 2019
Citations	8710	8455
h-index	39	34
i10-index	67	66

3100

1550

775

2325

2017201820192020202120222023202454 182 328 495 726 1674 3023 2177

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

James GlassMIT Computer Science and Artificial Intelligence LaboratoryVerified email at mit.edu
Abdelrahman MohamedResearch scientist, Facebook AI ResearchVerified email at fb.com
Alexei BaevskiFacebook AI ResearchVerified email at fb.com
Michael AuliMeta, FAIRVerified email at meta.com
Bowen ShiFacebook AI ResearchVerified email at meta.com
Yu ZhangOpenAIVerified email at csail.mit.edu
Emmanuel DupouxProfessor of Cognitive Psychology, Ecole des Hautes Etudes en Sciences Sociales, ParisVerified email at ehess.fr
Apoorv VyasFAIR Labs MetaVerified email at meta.com
Yu-An ChungFacebook AI Research (FAIR)Verified email at fb.com
Yuxuan WangByteDanceVerified email at cse.ohio-state.edu
Andros TjandraFacebook AI (research scientist)Verified email at fb.com
Gabriel SynnaeveResearch scientist at Facebook AI ResearchVerified email at fb.com
Matthew LeFacebook AI ResearchVerified email at fb.com
David HarwathThe University of Texas at AustinVerified email at utexas.edu
Ron J WeissGoogleVerified email at google.com
Awni HannunMachine Learning Research, AppleVerified email at apple.com
Hsuan-Tien LinProfessor of Computer Science and Information Engineering, National Taiwan UniversityVerified email at csie.ntu.edu.tw
Jonathan Le RouxMERLVerified email at merl.com
John HersheyGoogle (formerly MERL, IBM, MSR, UCSD)Verified email at google.com
Shinji WatanabeCarnegie Mellon UniversityVerified email at cmu.edu

Wei-Ning Hsu

Facebook AI Research (FAIR)

Verified email at csail.mit.edu - Homepage

Speech Processing Machine Learning Natural Language Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Hubert: Self-supervised speech representation learning by masked prediction of hidden units WN Hsu, B Bolte, YHH Tsai, K Lakhotia, R Salakhutdinov, A Mohamed IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3451-3460, 2021	2227	2021
Data2vec: A general framework for self-supervised learning in speech, vision and language A Baevski, WN Hsu, Q Xu, A Babu, J Gu, M Auli International Conference on Machine Learning, 1298-1312, 2022	736	2022
An unsupervised autoregressive model for speech representation learning YA Chung, WN Hsu, H Tang, J Glass INTERSPEECH, 2019	443	2019
Unsupervised learning of disentangled and interpretable representations from sequential data WN Hsu, Y Zhang, J Glass Thirty-first Conference on Neural Information Processing Systems (NeurIPS), 2017	406	2017
Hierarchical generative modeling for controllable speech synthesis WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ... Seventh International Conference on Learning Representations (ICLR), 2019	297*	2019
Unsupervised speech recognition A Baevski, WN Hsu, A Conneau, M Auli Advances in Neural Information Processing Systems 34, 27826-27839, 2021	281	2021
On generative spoken language modeling from raw audio K Lakhotia, E Kharitonov, WN Hsu, Y Adi, A Polyak, B Bolte, TA Nguyen, ... Transactions of the Association for Computational Linguistics 9, 1336-1354, 2021	274	2021
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations A Polyak, Y Adi, J Copet, E Kharitonov, K Lakhotia, WN Hsu, A Mohamed, ... INTERSPEECH, 2021	244	2021
Learning audio-visual speech representation by masked multimodal cluster prediction B Shi, WN Hsu, K Lakhotia, A Mohamed arXiv preprint arXiv:2201.02184, 2022	237	2022
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training WN Hsu, A Sriram, A Baevski, T Likhomanenko, Q Xu, V Pratap, J Kahn, ... INTERSPEECH, 2021	228	2021
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019	203	2019
Active learning by learning WN Hsu, HT Lin Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015	194	2015
Learning Latent Representations for Speech Generation and Transformation WN Hsu, Y Zhang, J Glass INTERSPEECH, 1273-1277, 2017	182	2017
Scaling speech technology to 1,000+ languages V Pratap, A Tjandra, B Shi, P Tomasello, A Babu, S Kundu, A Elkahky, ... Journal of Machine Learning Research 25 (97), 1-52, 2024	167	2024
Unsupervised domain adaptation for robust speech recognition via variational autoencoder-based data augmentation WN Hsu, Y Zhang, J Glass 2017 IEEE automatic speech recognition and understanding workshop (ASRU), 16-23, 2017	164	2017
Semi-supervised training for improving data efficiency in end-to-end speech synthesis YA Chung, Y Wang, WN Hsu, Y Zhang, RJ Skerry-Ryan ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	139	2019
Direct speech-to-speech translation with discrete units A Lee, PJ Chen, C Wang, J Gu, S Popuri, X Ma, A Polyak, Y Adi, Q He, ... arXiv preprint arXiv:2107.05604, 2021	137	2021
Disentangling correlated speaker and noise for speech synthesis via data augmentation and adversarial factorization WN Hsu, Y Zhang, RJ Weiss, YA Chung, Y Wang, Y Wu, J Glass ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	124	2019
Voicebox: Text-guided multilingual universal speech generation at scale M Le, A Vyas, B Shi, B Karrer, L Sari, R Moritz, M Williamson, V Manohar, ... Advances in neural information processing systems 36, 2024	116	2024
Textless speech-to-speech translation on real data A Lee, H Gong, PA Duquenne, H Schwenk, PJ Chen, C Wang, S Popuri, ... arXiv preprint arXiv:2112.08352, 2021	115	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors