Follow
Xu Liu
Title
Cited by
Cited by
Year
Evaluating modern gpu interconnect: Pcie, nvlink, nv-sli, nvswitch and gpudirect
A Li, SL Song, J Chen, J Li, X Liu, NR Tallent, KJ Barker
IEEE Transactions on Parallel and Distributed Systems 31 (1), 94-110, 2019
2782019
OMPT: An OpenMP tools application programming interface for performance analysis
AE Eichenberger, J Mellor-Crummey, M Schulz, M Wong, N Copty, ...
OpenMP in the Era of Low Power Devices and Accelerators: 9th International …, 2013
1412013
Locality-aware CTA clustering for modern GPUs
A Li, SL Song, W Liu, X Liu, A Kumar, H Corporaal
ACM SIGARCH Computer Architecture News 45 (1), 297-311, 2017
942017
Cvr: Efficient vectorization of spmv on x86 processors
B Xie, J Zhan, X Liu, W Gao, Z Jia, X He, L Zhang
Proceedings of the 2018 International Symposium on Code Generation and …, 2018
912018
A tool to analyze the performance of multithreaded programs on NUMA architectures
X Liu, J Mellor-Crummey
ACM Sigplan Notices 49 (8), 259-272, 2014
912014
Flep: Enabling flexible and efficient preemption on gpus
B Wu, X Liu, X Zhou, C Jiang
ACM SIGPLAN Notices 52 (4), 483-496, 2017
902017
memif Towards Programming Heterogeneous Memory Asynchronously
FX Lin, X Liu
ACM SIGPLAN Notices 51 (4), 369-383, 2016
772016
A data-centric profiler for parallel programs
X Liu, J Mellor-Crummey
Proceedings of the International Conference on High Performance Computing …, 2013
712013
Tartan: evaluating modern GPU interconnect via a multi-GPU benchmark suite
A Li, SL Song, J Chen, X Liu, N Tallent, K Barker
2018 IEEE International Symposium on Workload Characterization (IISWC), 191-202, 2018
682018
Scaanalyzer: A tool to identify memory scalability bottlenecks in parallel programs
X Liu, B Wu
Proceedings of the International Conference for High Performance Computing …, 2015
582015
Towards efficient spmv on sunway manycore architectures
C Liu, B Xie, X Liu, W Xue, H Yang, X Liu
Proceedings of the 2018 International Conference on Supercomputing, 363-373, 2018
572018
Pinpointing data locality problems using data-centric analysis
X Liu, J Mellor-Crummey
International Symposium on Code Generation and Optimization (CGO 2011), 171-180, 2011
572011
Cudaadvisor: Llvm-based runtime profiling for modern gpus
D Shen, SL Song, A Li, X Liu
Proceedings of the 2018 International Symposium on Code Generation and …, 2018
552018
Redspy: Exploring value locality in software
S Wen, M Chabbi, X Liu
Proceedings of the Twenty-Second International Conference on Architectural …, 2017
452017
Watching for software inefficiencies with witch
S Wen, X Liu, J Byrne, M Chabbi
Proceedings of the Twenty-Third International Conference on Architectural …, 2018
392018
DR-BW: identifying bandwidth contention in NUMA architectures with supervised learning
H Xu, S Wen, A Gimenez, T Gamblin, X Liu
2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2017
392017
Pinpointing data locality bottlenecks with low overhead
X Liu, J Mellor-Crummey
2013 IEEE International Symposium on Performance Analysis of Systems and …, 2013
382013
Atmem: Adaptive data placement in graph applications on heterogeneous memories
Y Chen, IB Peng, Z Peng, X Liu, B Ren
Proceedings of the 18th ACM/IEEE International Symposium on Code Generation …, 2020
372020
Redundant loads: A software inefficiency indicator
P Su, S Wen, H Yang, M Chabbi, X Liu
2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE …, 2019
372019
Characterizing emerging heterogeneous memory
D Shen, X Liu, FX Lin
ACM SIGPLAN Notices 51 (11), 13-23, 2016
372016
The system can't perform the operation now. Try again later.
Articles 1–20