Hari Subramoni
Verified email at cse.ohio-state.edu
Title
Cited by
Year
Memcached design on high performance RDMA capable interconnects
J Jose, H Subramoni, M Luo, M Zhang, J Huang, M Wasi-ur-Rahman, ...
2011 International Conference on Parallel Processing, 743-752, 2011
Cited by: 264; Year: 2011
High performance RDMA-based design of HDFS over InfiniBand
NS Islam, MW Rahman, J Jose, R Rajachandrasekar, H Wang, ...
SC'12: Proceedings of the International Conference on High Performance …, 2012
Cited by: 223; Year: 2012
High-performance design of Hadoop RPC with RDMA over InfiniBand
X Lu, NS Islam, M Wasi-Ur-Rahman, J Jose, H Subramoni, H Wang, ...
2013 42nd International Conference on Parallel Processing, 641-650, 2013
Cited by: 161; Year: 2013
High-performance design of HBase with RDMA over InfiniBand
J Huang, X Ouyang, J Jose, M Wasi-ur-Rahman, H Wang, M Luo, ...
2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012
Cited by: 110; Year: 2012
Designing topology-aware collective communication algorithms for large scale InfiniBand clusters: Case studies with scatter and gather
K Kandalla, H Subramoni, A Vishnu, DK Panda
2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010
Cited by: 106; Year: 2010
High-performance RDMA-based design of Hadoop MapReduce over InfiniBand
M Wasi-ur-Rahman, NS Islam, X Lu, J Jose, H Subramoni, H Wang, ...
2013 IEEE International Symposium on Parallel & Distributed Processing …, 2013
Cited by: 86; Year: 2013
Design of a scalable InfiniBand topology service to enable network-topology-aware placement of processes
H Subramoni, S Potluri, K Kandalla, B Barth, J Vienne, J Keasler, ...
SC'12: Proceedings of the International Conference on High Performance …, 2012
Cited by: 84; Year: 2012
Performance analysis and evaluation of InfiniBand FDR and 40GigE RoCE on HPC and cloud computing systems
J Vienne, J Chen, M Wasi-Ur-Rahman, NS Islam, H Subramoni, ...
2012 IEEE 20th Annual Symposium on High-Performance Interconnects, 48-55, 2012
Cited by: 83; Year: 2012
An in-depth performance characterization of CPU- and GPU-based DNN training on modern architectures
AA Awan, H Subramoni, DK Panda
Proceedings of the Machine Learning on HPC Environments, 1-8, 2017
Cited by: 82; Year: 2017
Scalable Memcached design for InfiniBand clusters using hybrid transports
J Jose, H Subramoni, K Kandalla, M Wasi-ur-Rahman, H Wang, ...
2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2012
Cited by: 79; Year: 2012
High-performance and scalable non-blocking all-to-all with collective offload on InfiniBand clusters: a study with parallel 3D FFT
K Kandalla, H Subramoni, K Tomko, D Pekurovsky, S Sur, DK Panda
Computer Science-Research and Development 26 (3), 237-246, 2011
Cited by: 78; Year: 2011
Designing multi-leader-based allgather algorithms for multi-core clusters
K Kandalla, H Subramoni, G Santhanaraman, M Koop, DK Panda
2009 IEEE International Symposium on Parallel & Distributed Processing, 1-8, 2009
Cited by: 67; Year: 2009
The MVAPICH project: Transforming research into high-performance MPI library for HPC community
DK Panda, H Subramoni, CH Chu, M Bayatpour
Journal of Computational Science 52, 101208, 2021
Cited by: 65; Year: 2021
RDMA over Ethernet: a preliminary study
H Subramoni, P Lai, M Luo, DK Panda
2009 IEEE International Conference on Cluster Computing and Workshops, 1-9, 2009
Cited by: 65; Year: 2009
Design and evaluation of benchmarks for financial applications using Advanced Message Queuing Protocol (AMQP) over InfiniBand
H Subramoni, G Marsh, S Narravula, P Lai, DK Panda
2008 Workshop on High Performance Computational Finance, 1-8, 2008
Cited by: 59; Year: 2008
Scalable distributed DNN training using TensorFlow and CUDA-aware MPI: Characterization, designs, and performance evaluation
AA Awan, J Bédorf, CH Chu, H Subramoni, DK Panda
2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2019
Cited by: 57; Year: 2019
Design and evaluation of network topology-/speed-aware broadcast algorithms for InfiniBand clusters
H Subramoni, K Kandalla, J Vienne, S Sur, B Barth, K Tomko, R Mclay, ...
2011 IEEE International Conference on Cluster Computing, 317-325, 2011
Cited by: 56; Year: 2011
Optimized broadcast for deep learning workloads on dense-GPU InfiniBand clusters: MPI or NCCL?
AA Awan, CH Chu, H Subramoni, DK Panda
Proceedings of the 25th European MPI Users' Group Meeting, 1-9, 2018
Cited by: 53; Year: 2018
MVAPICH-PRISM: A proxy-based communication framework using InfiniBand and SCIF for Intel MIC clusters
S Potluri, D Bureddy, K Hamidouche, A Venkatesh, K Kandalla, ...
Proceedings of the International Conference on High Performance Computing …, 2013
Cited by: 52; Year: 2013
GEMS: GPU-enabled memory-aware model-parallelism system for distributed DNN training
A Jain, AA Awan, AM Aljuhani, JM Hashmi, QG Anthony, H Subramoni, ...
SC20: International Conference for High Performance Computing, Networking …, 2020
Cited by: 49; Year: 2020