Shaohuai Shi

Cited by

	All	Since 2019
Citations	2580	2368
h-index	24	23
i10-index	31	30

Citations per year

Year	Citations
2016	8
2017	50
2018	121
2019	210
2020	353
2021	422
2022	428
2023	599
2024	346

Public access

20 articles available

3 articles not available

Based on funding mandates

Co-authors

Xiaowen Chu	Professor, Data Science and Analytics, HKUST(GZ)	Verified email at ust.hk
Qiang Wang	School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen	Verified email at hit.edu.cn
Zhenheng Tang	Hong Kong Baptist University	Verified email at comp.hkbu.edu.hk
Bo Li	Chair Professor at Hong Kong University of Science and Technology	Verified email at cse.ust.hk
Kaiyong Zhao	XGRIDS	Verified email at xgrids.com
Pengfei XU	PhD Student, The University of Hong Kong	Verified email at connect.hku.hk
Yuxin Wang	Hong Kong Baptist University	Verified email at comp.hkbu.edu.hk
Chengjian Liu	College of Big Data and Internet, Shenzhen Technology University	Verified email at sztu.edu.cn
Yangzihao Wang	Sea AI Lab	Verified email at sea.com
Xianyan Jia	Alibaba	Verified email at alibaba-inc.com
Wei Wang	The Hong Kong University of Science and Technology	Verified email at cse.ust.hk
Ka Chun Cheung	NVIDIA	Verified email at nvidia.com
Simon See	NVIDIA	Verified email at nvidia.com

Shaohuai Shi

Professor, Harbin Institute of Technology, Shenzhen

Verified email at hit.edu.cn

GPU Computing, Parallel and Distributed Computing, Deep Learning


Title	Authors	Venue	Cited by	Year
Benchmarking state-of-the-art deep learning software tools	S Shi, Q Wang, P Xu, X Chu	2016 7th International Conference on Cloud Computing and Big Data (CCBD), 99-104, 2016	452	2016
Highly scalable deep learning training system with mixed-precision: Training imagenet in four minutes	X Jia, S Song, W He, Y Wang, H Rong, F Zhou, L Xie, Z Guo, Y Yang, L Yu, ...	NeurIPS Workshop on Systems for ML and Open Source Software, 2018	433	2018
A Distributed Synchronous SGD Algorithm with Global Top-k Sparsification for Low Bandwidth Networks	S Shi, Q Wang, K Zhao, Z Tang, Y Wang, X Huang, X Chu	IEEE ICDCS 2019, 2019	147	2019
Communication-efficient distributed deep learning: A comprehensive survey	Z Tang, S Shi, W Wang, B Li, X Chu	arXiv preprint arXiv:2003.06307, 2020	129	2020
Performance modeling and evaluation of distributed deep learning frameworks on gpus	S Shi, Q Wang, X Chu	2018 IEEE 16th Intl Conf on Dependable, Autonomic and Secure Computing, 16th …, 2018	122	2018
MG-WFBP: Efficient data communication for distributed synchronous SGD algorithms	S Shi, X Chu, B Li	IEEE INFOCOM 2019-IEEE International Conference on Computer Communications …, 2019	99	2019
Understanding top-k sparsification in distributed deep learning	S Shi, X Chu, KC Cheung, S See	arXiv preprint arXiv:1911.08772, 2019	85	2019
A Convergence Analysis of Distributed SGD with Communication-Efficient Gradient Sparsification	S Shi, K Zhao, Q Wang, Z Tang, X Chu	IJCAI, 3411-3417, 2019	83	2019
FADNet: A Fast and Accurate Network for Disparity Estimation	Q Wang, S Shi, S Zheng, K Zhao, X Chu	International Conference on Robotics and Automation (ICRA) 2020, 2020	82	2020
Benchmarking the performance and energy efficiency of AI accelerators for AI training	Y Wang, Q Wang, S Shi, X He, Z Tang, K Zhao, X Chu	2020 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet …, 2020	77*	2020
Performance evaluation of deep learning tools in docker containers	P Xu, S Shi, X Chu	2017 3rd International Conference on Big Data Computing and Communications …, 2017	67	2017
Virtual Homogeneity Learning: Defending against Data Heterogeneity in Federated Learning	Z Tang, Y Zhang, S Shi, X He, B Han, X Chu	ICML 2022, 2022	64	2022
Communication-efficient distributed deep learning with merged gradient sparsification on gpus	S Shi, Q Wang, X Chu, B Li, Y Qin, R Liu, X Zhao	IEEE INFOCOM 2020-IEEE International Conference on Computer Communications, 2020	64	2020
Benchmarking deep learning models and automated model design for COVID-19 detection with chest CT scans	X He, S Wang, S Shi, X Chu, J Tang, X Liu, C Yan, J Zhang, G Ding	medRxiv, 2020.06.08.20125963, 2020	58	2020
Towards Scalable Distributed Training of Deep Learning on Public Cloud Clusters	S Shi, X Zhou, S Song, X Wang, Z Zhu, X Huang, X Jiang, F Zhou, Z Guo, ...	Fourth Conference on Machine Learning and Systems (MLSys 2021), 2021	55	2021
Communication-efficient decentralized learning with sparsification and adaptive peer selection	Z Tang, S Shi, X Chu	2020 IEEE 40th International Conference on Distributed Computing Systems …, 2020	55	2020
Speeding up convolutional neural networks by exploiting the sparsity of rectifier units	S Shi, X Chu	arXiv preprint arXiv:1704.07724, 2017	54	2017
GossipFL: A Decentralized Federated Learning Framework with Sparsified and Adaptive Communication	Z Tang, S Shi, B Li, X Chu	IEEE Transactions on Parallel and Distributed Systems 34 (3), 909-922, 2023	53	2023
Automated Model Design and Benchmarking of 3D Deep Learning Models for COVID-19 Detection with Chest CT Scans	X He, S Wang, X Chu, S Shi, J Tang, X Liu, C Yan, J Zhang, G Ding	AAAI 2021, 2021	43	2021
A Quantitative Survey of Communication Optimizations in Distributed Deep Learning	S Shi, Z Tang, X Chu, C Liu, W Wang, B Li	IEEE Network 35 (3), 230-237, 2020	42	2020
