Numa-caffe: Numa-aware deep learning neural networks P Roy, SL Song, S Krishnamoorthy, A Vishnu, D Sengupta, X Liu ACM Transactions on Architecture and Code Optimization (TACO) 15 (2), 1-26, 2018 | 23 | 2018 |
StructSlim: A lightweight profiler to guide structure splitting P Roy, X Liu Proceedings of the 2016 International Symposium on Code Generation and …, 2016 | 21 | 2016 |
LWPTool: A lightweight profiler to guide data layout optimization C Yu, P Roy, Y Bai, H Yang, X Liu IEEE Transactions on Parallel and Distributed Systems 29 (11), 2489-2502, 2018 | 16 | 2018 |
Lightweight detection of cache conflicts P Roy, SL Song, S Krishnamoorthy, X Liu Proceedings of the 2018 International Symposium on Code Generation and …, 2018 | 16 | 2018 |
An Empirical Study of High Performance Computing (HPC) Performance Bugs MAK Azad, N Iqbal, F Hassan, P Roy International Conference on Mining Software Repositories, 2023 | 10 | 2023 |
MicroProf: Code-level Attribution of Unnecessary Data Transfer in Microservice Applications SSM Tariq, L Menard, P Su, P Roy ACM Transactions on Architecture and Code Optimization 20 (4), 1-26, 2023 | 2 | 2023 |
Smt-aware instantaneous footprint optimization P Roy, X Liu, SL Song Proceedings of the 25th ACM International Symposium on High-Performance …, 2016 | 2 | 2016 |
LibProf: A Python Profiler for Improving Cold Start Performance in Serverless Applications SSM Tariq, AA Zein, SS Vaidya, A Khanolkar, P Roy arXiv preprint arXiv:2406.11734, 2024 | 1 | 2024 |
PerfCurator: Curating a large-scale dataset of performance bug-related commits from public repositories MAK Azad, M Alexender, M Alexender, SSM Tariq, F Hassan, P Roy arXiv preprint arXiv:2406.11731, 2024 | | 2024 |
Designing Secure Performance Metrics for Last Level Cache P Roy, B Eshete, P Su 2023 IEEE International Parallel and Distributed Processing Symposium …, 2023 | | 2023 |
PcMINER: Mining Performance Related Commits at Scale MAK Azad, M Alexerder, M Alexender, SSM Tariq, F Hassan, P Roy | | |