Follow
Jiatong Shi (史嘉彤)
Jiatong Shi (史嘉彤)
Verified email at andrew.cmu.edu - Homepage
Title
Cited by
Cited by
Year
SUPERB: Speech processing Universal PERformance Benchmark
S Yang, PH Chi, YS Chuang, CIJ Lai, K Lakhotia, YY Lin, AT Liu, J Shi, ...
Proceedings of the Interspeech, 1194--1198, 2021
6482021
Recent developments on ESPnet toolkit boosted by Conformer
P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
2542021
Findings of the IWSLT 2022 Evaluation Campaign.
A Anastasopoulos, L Barrault, L Bentivogli, MZ Boito, O Bojar, R Cattoni, ...
Proceedings of the 19th International Conference on Spoken Language …, 2022
862022
Audiogpt: Understanding and generating speech, music, sound, and talking head
R Huang, M Li, D Yang, J Shi, X Chang, Z Ye, Y Wu, Z Hong, J Huang, ...
Proceedings of the AAAI Conference on Artificial Intelligence 38 (21), 23802 …, 2024
852024
SUPERB-SG: Enhanced speech processing universal performance benchmark for semantic and generative capabilities
HS Tsai, HJ Chang, WC Huang, Z Huang, K Lakhotia, S Yang, S Dong, ...
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
702022
Large-Scale End-to-End Multilingual Speech Recognition and Language Identification with Multi-Task Learning
W Hou, Y Dong, B Zhuang, L Yang, J Shi, T Shinozaki
Proceedings of the Interspeech, 1037-1041, 2020
692020
Context-aware Goodness of Pronunciation for Computer-Assisted Pronunciation Training
J Shi, N Huo, Q Jin
Proceedings of the Interspeech, 3057-3061, 2020
552020
ESPnet2-TTS: Extending the edge of TTS research
T Hayashi, R Yamamoto, T Yoshimura, P Wu, J Shi, T Saeki, Y Ju, ...
arXiv preprint arXiv:2110.07840, 2021
422021
Leveraging End-to-End ASR for Endangered Language Documentation: An Empirical Study on Yolox\'ochitl Mixtec
J Shi, JD Amith, RC García, EG Sierra, K Duh, S Watanabe
Proceedings of the 16th Conference of the European Chapter of the …, 2021
302021
SUPERB@ SLT 2022: Challenge on generalization and efficiency of self-supervised speech representation learning
T Feng, A Dong, CF Yeh, S Yang, TQ Lin, J Shi, KW Chang, Z Huang, ...
2022 IEEE Spoken Language Technology Workshop (SLT), 1096-1103, 2023
232023
Findings of the IWSLT 2023 evaluation campaign
M Agarwal, S Agarwal, A Anastasopoulos, L Bentivogli, O Bojar, C Borg, ...
Association for Computational Linguistics, 2023
232023
Improving massively multilingual ASR with auxiliary CTC objectives
W Chen, B Yan, J Shi, Y Peng, S Maiti, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
202023
The singing voice conversion challenge 2023
WC Huang, LP Violeta, S Liu, J Shi, T Toda
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
192023
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark
J Shi, D Berrebbi, W Chen, HL Chung, EP Hu, WP Huang, X Chang, ...
Proceedings of the Interspeech, 884--888, 2023
192023
Sequence-to-sequence singing voice synthesis with perceptual entropy loss
J Shi, S Guo, N Huo, Y Zhang, Q Jin
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
192021
ESPnet-ST IWSLT 2021 Offline Speech Translation System
H Inaguma, B Yan, S Dalmia, P Gu, J Shi, K Duh, S Watanabe
Proceedings of the 18th International Conference on Spoken Language …, 2021
192021
Leveraging deep learning with audio analytics to predict the success of crowdfunding projects
J Shi, K Yang, W Xu, M Wang
The Journal of Supercomputing 77, 7833-7853, 2021
182021
Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation
D Berrebbi, J Shi, B Yan, O Lopez-Francisco, JD Amith, S Watanabe
Proceedings of the Interspeech, 3533--3537, 2022
172022
Uniaudio: An audio foundation model toward universal audio generation
D Yang, J Tian, X Tan, R Huang, S Liu, X Chang, J Shi, S Zhao, J Bian, ...
arXiv preprint arXiv:2310.00704, 2023
142023
On compressing sequences for self-supervised speech models
Y Meng, HJ Chen, J Shi, S Watanabe, P Garcia, H Lee, H Tang
2022 IEEE Spoken Language Technology Workshop (SLT), 1128-1135, 2023
122023
The system can't perform the operation now. Try again later.
Articles 1–20