Yu-An Chung

Cited by

	All	Since 2019
Citations	3878	3757
h-index	27	27
i10-index	33	33

1300

650

325

975

2017201820192020202120222023202432 84 231 366 517 934 1266 416

Co-authors

James GlassMIT Computer Science and Artificial Intelligence LaboratoryVerified email at mit.edu
Wei-Ning HsuFacebook AI Research (FAIR)Verified email at csail.mit.edu
Yuan GongResearch Scientist, MIT CSAILVerified email at mit.edu
Yu ZhangOpenAIVerified email at csail.mit.edu
Wei-Hung WengGoogle ResearchVerified email at mit.edu
Hung-yi LeeNational Taiwan UniversityVerified email at ntu.edu.tw
Hsuan-Tien LinProfessor of Computer Science and Information Engineering, National Taiwan UniversityVerified email at csie.ntu.edu.tw
Schrasing TongMIT Computer Science and Artificial Intelligence LaboratoryVerified email at mit.edu
Yuxuan WangByteDanceVerified email at cse.ohio-state.edu
Shao-Wen YangSr. Applied Scientist at AmazonVerified email at amazon.com
Chenguang ZhuHead of Zoom GenAI ScienceVerified email at zoom.us
RJ Skerry-RyanGoogle, Inc.Verified email at alum.mit.edu
Sravya PopuriResearch Engineer, Facebook AI ResearchVerified email at fb.com
Alexander H. LiuMassachusetts Institute of TechnologyVerified email at mit.edu
Peng-Jen ChenFacebookVerified email at fb.com
Juan PinoMetaVerified email at fb.com
Hirofumi InagumaFundamental AI Research (FAIR) at MetaVerified email at meta.com
Ann LeeMeta AIVerified email at csail.mit.edu
Anmol GulatiResearcher, Google DeepmindVerified email at google.com
Alexis ConneauOpenAIVerified email at openai.com

Yu-An Chung

Facebook AI Research (FAIR)

Verified email at fb.com - Homepage

Machine Learning Speech Processing Natural Language Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Ast: Audio spectrogram transformer Y Gong, YA Chung, J Glass arXiv preprint arXiv:2104.01778, 2021	675	2021
An unsupervised autoregressive model for speech representation learning YA Chung, WN Hsu, H Tang, J Glass arXiv preprint arXiv:1904.03240, 2019	433	2019
W2v-bert: Combining contrastive learning and masked language modeling for self-supervised speech pre-training YA Chung, Y Zhang, W Han, CC Chiu, J Qin, R Pang, Y Wu 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	297	2021
Audio word2vec: Unsupervised learning of audio segment representations using sequence-to-sequence autoencoder YA Chung, CC Wu, CH Shen, HY Lee, LS Lee arXiv preprint arXiv:1603.00982, 2016	210	2016
Ssast: Self-supervised audio spectrogram transformer Y Gong, CI Lai, YA Chung, J Glass Proceedings of the AAAI Conference on Artificial Intelligence 36 (10), 10699 …, 2022	207	2022
Speech2vec: A sequence-to-sequence framework for learning word embeddings from speech YA Chung, J Glass arXiv preprint arXiv:1803.08976, 2018	205	2018
Generative pre-training for speech with autoregressive predictive coding YA Chung, J Glass ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	195	2020
Psla: Improving audio tagging with pretraining, sampling, labeling, and aggregation Y Gong, YA Chung, J Glass IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3292-3306, 2021	141	2021
Semi-supervised training for improving data efficiency in end-to-end speech synthesis YA Chung, Y Wang, WN Hsu, Y Zhang, RJ Skerry-Ryan ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	136	2019
Disentangling correlated speaker and noise for speech synthesis via data augmentation and adversarial factorization WN Hsu, Y Zhang, RJ Weiss, YA Chung, Y Wang, Y Wu, J Glass ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	123	2019
Vector-quantized autoregressive predictive coding YA Chung, H Tang, J Glass arXiv preprint arXiv:2005.08392, 2020	109	2020
Unsupervised cross-modal alignment of speech and text embedding spaces YA Chung, WH Weng, S Tong, J Glass Advances in neural information processing systems 31, 2018	108	2018
Supervised and unsupervised transfer learning for question answering YA Chung, HY Lee, J Glass arXiv preprint arXiv:1711.05345, 2017	104	2017
Cost-aware pre-training for multiclass cost-sensitive deep learning YA Chung, HT Lin, SW Yang arXiv preprint arXiv:1511.09337, 2015	104	2015
Learning deep representations of medical images using siamese cnns with application to content-based image retrieval YA Chung, WH Weng arXiv preprint arXiv:1711.08490, 2017	89	2017
Non-autoregressive predictive coding for learning speech representations from local dependencies AH Liu, YA Chung, J Glass arXiv preprint arXiv:2011.00406, 2020	86	2020
SLAM: A unified encoder for speech and language modeling via speech-text joint pre-training A Bapna, Y Chung, N Wu, A Gulati, Y Jia, JH Clark, M Johnson, J Riesa, ... arXiv preprint arXiv:2110.10329, 2021	71	2021
Splat: Speech-language joint pre-training for spoken language understanding YA Chung, C Zhu, M Zeng arXiv preprint arXiv:2010.02295, 2020	66	2020
libact: Pool-based active learning in python YY Yang, SC Lee, YA Chung, TE Wu, SA Chen, HT Lin arXiv preprint arXiv:1710.00379, 2017	60	2017
Improved speech representations with multi-target autoregressive predictive coding YA Chung, J Glass arXiv preprint arXiv:2004.05274, 2020	56	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors