NWChem: Past, present, and future E Apra, EJ Bylaska, WA De Jong, N Govind, K Kowalski, TP Straatsma, ... The Journal of chemical physics 152 (18), 2020 | 603 | 2020 |
Addressing failures in exascale computing M Snir, RW Wisniewski, JA Abraham, SV Adve, S Bagchi, P Balaji, J Belak, ... The International Journal of High Performance Computing Applications 28 (2 …, 2014 | 528 | 2014 |
Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model U Bondhugula, M Baskaran, S Krishnamoorthy, J Ramanujam, A Rountev, ... Compiler Construction: 17th International Conference, CC 2008, Held as Part …, 2008 | 455 | 2008 |
Scalable work stealing J Dinan, DB Larkins, P Sadayappan, S Krishnamoorthy, J Nieplocha Proceedings of the Conference on High Performance Computing Networking …, 2009 | 412 | 2009 |
Effective automatic parallelization of stencil computations S Krishnamoorthy, M Baskaran, U Bondhugula, J Ramanujam, A Rountev, ... ACM sigplan notices 42 (6), 235-244, 2007 | 320 | 2007 |
A compiler framework for optimization of affine loop nests for GPGPUs MM Baskaran, U Bondhugula, S Krishnamoorthy, J Ramanujam, ... Proceedings of the 22nd annual international conference on Supercomputing …, 2008 | 300 | 2008 |
Synthesis of high-performance parallel programs for a class of ab initio quantum chemistry models G Baumgartner, A Auer, DE Bernholdt, A Bibireata, V Choppella, ... Proceedings of the IEEE 93 (2), 276-292, 2005 | 256 | 2005 |
Dynamic load balancing on single-and multi-GPU systems L Chen, O Villa, S Krishnamoorthy, GR Gao 2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010 | 224 | 2010 |
NWChem E Apra, EJ Bylaska, WA de Jong, N Govind, K Kowalski, TP Straatsma, ... American Institute of Physics, 2020 | 215 | 2020 |
Automatic code generation for many-body electronic structure methods: the tensor contraction engine AA Auer, G Baumgartner, DE Bernholdt, A Bibireata, V Choppella, ... Molecular Physics 104 (2), 211-228, 2006 | 174 | 2006 |
Qasmbench: A low-level qasm benchmark suite for nisq evaluation and simulation A Li, S Stein, S Krishnamoorthy, J Ang arXiv preprint arXiv:2005.13018, 2020 | 172* | 2020 |
Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories MM Baskaran, U Bondhugula, S Krishnamoorthy, J Ramanujam, ... Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of …, 2008 | 160 | 2008 |
Argobots: A lightweight low-level threading and tasking framework S Seo, A Amer, P Balaji, C Bordage, G Bosilca, A Brooks, P Carns, ... IEEE Transactions on Parallel and Distributed Systems 29 (3), 512-526, 2017 | 153 | 2017 |
Lifeline-based global load balancing VA Saraswat, P Kambadur, S Kodali, D Grove, S Krishnamoorthy ACM SIGPLAN Notices 46 (8), 201-212, 2011 | 152 | 2011 |
Solving large, irregular graph problems using adaptive work-stealing G Cong, S Kodali, S Krishnamoorthy, D Lea, V Saraswat, T Wen Parallel Processing, 2008. ICPP'08. 37th International Conference on, 536-545, 2008 | 132 | 2008 |
Parametric multi-level tiling of imperfectly nested loops A Hartono, MM Baskaran, C Bastoul, A Cohen, S Krishnamoorthy, ... Proceedings of the 23rd international conference on Supercomputing, 147-157, 2009 | 117 | 2009 |
Data layout transformation for enhancing data locality on nuca chip multiprocessors Q Lu, C Alias, U Bondhugula, T Henretty, S Krishnamoorthy, ... 2009 18th International Conference on Parallel Architectures and Compilation …, 2009 | 108 | 2009 |
Scioto: A framework for global-view task parallelism J Dinan, S Krishnamoorthy, DB Larkins, J Nieplocha, P Sadayappan 2008 37th International Conference on Parallel Processing, 586-593, 2008 | 103 | 2008 |
GPU-based implementations of the noniterative regularized-CCSD (T) corrections: applications to strongly correlated systems W Ma, S Krishnamoorthy, O Villa, K Kowalski Journal of chemical theory and computation 7 (5), 1316-1327, 2011 | 92 | 2011 |
Work stealing and persistence-based load balancers for iterative overdecomposed applications J Lifflander, S Krishnamoorthy, LV Kale Proceedings of the 21st international symposium on High-Performance Parallel …, 2012 | 88 | 2012 |