Saeed Maleki
xAI
Verified email at x.ai
Title
Cited by
Year
An evaluation of vectorizing compilers
S Maleki, Y Gao, MJ Garzarán, T Wong, DA Padua
2011 International Conference on Parallel Architectures and Compilation …, 2011
Cited by: 315; Year: 2011
CHET: an optimizing compiler for fully-homomorphic neural-network inferencing
R Dathathri, O Saarikivi, H Chen, K Laine, K Lauter, S Maleki, ...
Proceedings of the 40th ACM SIGPLAN conference on programming language …, 2019
Cited by: 251; Year: 2019
Performance portability with the chapel language
A Sidelnik, S Maleki, BL Chamberlain, MJ Garzarán, D Padua
2012 IEEE 26th international parallel and distributed processing symposium …, 2012
Cited by: 66; Year: 2012
Is Moore's Party Over?
MY Vardi
Commun. ACM 54 (11), 5, 2011
Cited by: 64*; Year: 2011
DSMR: A parallel algorithm for single-source shortest path problem
S Maleki, D Nguyen, A Lenharth, M Garzarán, D Padua, K Pingali
Proceedings of the 2016 International Conference on Supercomputing, 1-14, 2016
Cited by: 53; Year: 2016
Parallelizing dynamic programming through rank convergence
S Maleki, M Musuvathi, T Mytkowicz
ACM SIGPLAN Notices 49 (8), 219-232, 2014
Cited by: 49; Year: 2014
An empirical study of the effect of source-level loop transformations on compiler stability
Z Gong, Z Chen, J Szaday, D Wong, Z Sura, N Watkinson, S Maleki, ...
Proceedings of the ACM on Programming Languages 2 (OOPSLA), 1-29, 2018
Cited by: 40; Year: 2018
Synthesizing optimal collective algorithms
Z Cai, Z Liu, S Maleki, M Musuvathi, T Mytkowicz, J Nelson, O Saarikivi
Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of …, 2021
Cited by: 38; Year: 2021
Splitwise: Efficient generative llm inference using phase splitting
P Patel, E Choukse, C Zhang, A Shah, Í Goiri, S Maleki, R Bianchini
Power 400 (700W), 1.75, 2023
Cited by: 34; Year: 2023
Breaking the computation and communication abstraction barrier in distributed machine learning workloads
A Jangda, J Huang, G Liu, AHN Sabet, S Maleki, Y Miao, M Musuvathi, ...
Proceedings of the 27th ACM International Conference on Architectural …, 2022
Cited by: 33; Year: 2022
Inter-disciplinary research challenges in computer systems for the 2020s
A Cohen, X Shen, J Torrellas, J Tuck, Y Zhou, S Adve, I Akturk, S Bagchi, ...
National Science Foundation, 2018
Cited by: 29; Year: 2018
Parallel dynamic programming through rank convergence
TD Mytkowicz, M Musuvathi, S Maleki
US Patent 9,195,436, 2015
Cited by: 28; Year: 2015
Implementing network security measures in response to a detected cyber attack
MS Musuvathi, TD Mytkowicz, S Maleki, Y Ding
US Patent 10,805,317, 2020
Cited by: 27; Year: 2020
TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches
A Shah, V Chidambaram, M Cowan, S Maleki, M Musuvathi, T Mytkowicz, ...
20th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2023
Cited by: 22; Year: 2023
Homomorphic evaluation of tensor programs
MS Musuvathi, K Laine, KE Lauter, H Chen, OI Saarikivi, S Maleki, ...
US Patent 11,177,935, 2021
Cited by: 19; Year: 2021
Determining a likelihood of a user interaction with a content element
MS Musuvathi, TD Mytkowicz, S Maleki, Y Ding
US Patent 11,062,226, 2021
Cited by: 19; Year: 2021
Lore: A loop repository for the evaluation of compilers
Z Chen, Z Gong, JJ Szaday, DC Wong, D Padua, A Nicolau, ...
2017 IEEE International Symposium on Workload Characterization (IISWC), 219-228, 2017
Cited by: 19; Year: 2017
Parallelizing wfst speech decoders
C Mendis, J Droppo, S Maleki, M Musuvathi, T Mytkowicz, G Zweig
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
Cited by: 18; Year: 2016
CHET: compiler and runtime for homomorphic evaluation of tensor programs
R Dathathri, O Saarikivi, H Chen, K Laine, K Lauter, S Maleki, ...
arXiv preprint arXiv:1810.00845, 2018
Cited by: 16; Year: 2018
Efficient parallelization using rank convergence in dynamic programming algorithms
S Maleki, M Musuvathi, T Mytkowicz
Communications of the ACM 59 (10), 85-92, 2016
Cited by: 15; Year: 2016