The scalable heterogeneous computing (SHOC) benchmark suite A Danalis, G Marin, C McCurdy, JS Meredith, PC Roth, K Spafford, ... Proceedings of the 3rd workshop on general-purpose computation on graphics …, 2010 | 849 | 2010 |
DAGuE: A generic distributed DAG engine for high performance computing G Bosilca, A Bouteiller, A Danalis, T Herault, P Lemarinier, J Dongarra Parallel Computing 38 (1-2), 37-51, 2012 | 501 | 2012 |
PaRSEC: Exploiting heterogeneity to enhance scalability G Bosilca, A Bouteiller, A Danalis, M Faverge, T Hérault, JJ Dongarra Computing in Science & Engineering 15 (6), 36-45, 2013 | 433 | 2013 |
Flexible development of dense linear algebra algorithms on massively parallel architectures with DPLASMA G Bosilca, A Bouteiller, A Danalis, M Faverge, A Haidar, T Herault, ... 2011 IEEE International Symposium on Parallel and Distributed Processing …, 2011 | 202 | 2011 |
Transformations to parallel codes for communication-computation overlap A Danalis, KY Kim, L Pollock, M Swany SC'05: Proceedings of the 2005 ACM/IEEE conference on Supercomputing, 58-58, 2005 | 110 | 2005 |
MPI-aware compiler optimizations for improving communication-computation overlap A Danalis, L Pollock, M Swany, J Cavazos Proceedings of the 23rd international conference on Supercomputing, 316-325, 2009 | 74 | 2009 |
PTG: an abstraction for unhindered parallelism A Danalis, G Bosilca, A Bouteiller, T Herault, J Dongarra 2014 Fourth International Workshop on Domain-Specific Languages and High …, 2014 | 73 | 2014 |
Online impact analysis via dynamic compilation technology B Breech, A Danalis, S Shindo, L Pollock 20th IEEE International Conference on Software Maintenance, 2004 …, 2004 | 72 | 2004 |
Distributed dense numerical linear algebra algorithms on massively parallel architectures: DPLASMA G Bosilca, A Bouteiller, A Danalis, M Faverge, A Haidar, T Herault, ... | 56 | 2010 |
Power monitoring with PAPI for extreme scale architectures and dataflow-based programming models H McCraw, J Ralph, A Danalis, J Dongarra 2014 IEEE International Conference on Cluster Computing (CLUSTER), 385-391, 2014 | 44 | 2014 |
Dense linear algebra on distributed heterogeneous hardware with a symbolic DAG approach G Bosilca | 37 | 2012 |
Efficient quality threshold clustering for parallel architectures A Danalis, C McCurdy, JS Vetter 2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012 | 28 | 2012 |
Distributed-memory task execution and dependence tracking within DAGuE and the DPLASMA project G Bosilca, A Bouteiller, A Danalis, M Faverge, A Haidar, T Herault, ... Innovative Computing Laboratory, University of Tennessee, Technical Report …, 2010 | 28 | 2010 |
PAPI software-defined events for in-depth performance analysis H Jagode, A Danalis, H Anzt, J Dongarra The International Journal of High Performance Computing Applications 33 (6 …, 2019 | 27 | 2019 |
Automatic MPI application transformation with ASPhALT A Danalis, L Pollock, M Swany 2007 IEEE International Parallel and Distributed Processing Symposium, 1-8, 2007 | 27 | 2007 |
PaRSEC in practice: Optimizing a legacy chemistry application through distributed task-based execution A Danalis, H Jagode, G Bosilca, J Dongarra 2015 IEEE International Conference on Cluster Computing, 304-313, 2015 | 25 | 2015 |
Power management and event verification in PAPI H Jagode, A YarKhan, A Danalis, J Dongarra Tools for High Performance Computing 2015: Proceedings of the 9th …, 2016 | 22 | 2016 |
An efficient distributed randomized algorithm for solving large dense symmetric indefinite linear systems M Baboulin, D Becker, G Bosilca, A Danalis, J Dongarra Parallel Computing 40 (7), 213-223, 2014 | 22 | 2014 |
Gravel: A communication library to fast path MPI A Danalis, A Brown, L Pollock, M Swany, J Cavazos Recent Advances in Parallel Virtual Machine and Message Passing Interface …, 2008 | 21 | 2008 |
Search space generation and pruning system for autotuners P Luszczek, M Gates, J Kurzak, A Danalis, J Dongarra 2016 IEEE International Parallel and Distributed Processing Symposium …, 2016 | 15 | 2016 |