Google's neural machine translation system: Bridging the gap between human and machine translation Y Wu, M Schuster, Z Chen, QV Le, M Norouzi, W Macherey, M Krikun, ... arXiv preprint arXiv:1609.08144, 2016 | 8469 | 2016 |
In-datacenter performance analysis of a tensor processing unit NP Jouppi, C Young, N Patil, D Patterson, G Agrawal, R Bajwa, S Bates, ... Proceedings of the 44th annual international symposium on computer …, 2017 | 5212 | 2017 |
Anton, a special-purpose machine for molecular dynamics simulation DE Shaw, MM Deneroff, RO Dror, JS Kuskin, RH Larson, JK Salmon, ... Communications of the ACM 51 (7), 91-97, 2008 | 970 | 2008 |
Anton 2: raising the bar for performance and programmability in a special-purpose molecular dynamics supercomputer DE Shaw, JP Grossman, JA Bank, B Batson, JA Butts, JC Chao, ... SC'14: Proceedings of the International Conference for High Performance …, 2014 | 701 | 2014 |
Millisecond-scale molecular dynamics simulations on Anton DE Shaw, RO Dror, JK Salmon, JP Grossman, KM Mackenzie, JA Bank, ... Proceedings of the conference on high performance computing networking …, 2009 | 684 | 2009 |
Embedded computing: a VLIW approach to architecture, compilers and tools JA Fisher, P Faraboschi, C Young Elsevier, 2005 | 519 | 2005 |
Anton, a special-purpose machine for molecular dynamics simulation DE Shaw, MM Deneroff, RO Dror, JS Kuskin, RH Larson, JK Salmon, ... ACM SIGARCH Computer Architecture News 35 (2), 1-12, 2007 | 366 | 2007 |
Mesh-tensorflow: Deep learning for supercomputers N Shazeer, Y Cheng, N Parmar, D Tran, A Vaswani, P Koanantakool, ... Advances in neural information processing systems 31, 2018 | 365 | 2018 |
Mlperf training benchmark P Mattson, C Cheng, G Diamos, C Coleman, P Micikevicius, D Patterson, ... Proceedings of Machine Learning and Systems 2, 336-349, 2020 | 309 | 2020 |
Ten lessons from three generations shaped google’s tpuv4i: Industrial product NP Jouppi, DH Yoon, M Ashcraft, M Gottscho, TB Jablin, G Kurian, ... 2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture …, 2021 | 299 | 2021 |
Motivation for and evaluation of the first tensor processing unit N Jouppi, C Young, N Patil, D Patterson ieee Micro 38 (3), 10-19, 2018 | 269 | 2018 |
A domain-specific supercomputer for training deep neural networks NP Jouppi, DH Yoon, G Kurian, S Li, N Patil, J Laudon, C Young, ... Communications of the ACM 63 (7), 67-78, 2020 | 263 | 2020 |
A new golden age in computer architecture: Empowering the machine-learning revolution J Dean, D Patterson, C Young IEEE Micro 38 (2), 21-29, 2018 | 222 | 2018 |
A comparative analysis of schemes for correlated branch prediction C Young, N Gloy, MD Smith ACM SIGARCH Computer Architecture News 23 (2), 276-286, 1995 | 219 | 1995 |
Sparse gpu kernels for deep learning T Gale, M Zaharia, C Young, E Elsen SC20: International Conference for High Performance Computing, Networking …, 2020 | 212 | 2020 |
A domain-specific architecture for deep neural networks NP Jouppi, C Young, N Patil, D Patterson Communications of the ACM 61 (9), 50-59, 2018 | 193 | 2018 |
Search for a heavy charged boson in events with a charged lepton and missing transverse momentum from collisions at with the ATLAS detector G Aad, B Abbott, DC Abbott, AA Abud, K Abeling, DK Abhayasinghe, ... Physical review D 100 (5), 052013, 2019 | 172 | 2019 |
Improving the accuracy of static branch prediction using branch correlation C Young, MD Smith ACM SIGOPS Operating Systems Review 28 (5), 232-241, 1994 | 168 | 1994 |
Measurements of differential cross-sections of highly boosted top quarks decaying to all-hadronic final states in collisions at using the ATLAS … M Aaboud, G Aad, B Abbott, B Abeloos, SH Abidi, OS Abouzeid, ... Physical Review D 98 (1), 012003, 2018 | 154 | 2018 |
Search for long-lived neutral particles in pp collisions at that decay into displaced hadronic jets in the ATLAS calorimeter M Aaboud, G Aad, B Abbott, DC Abbott, DK Abhayasinghe, SH Abidi, ... The European Physical Journal C 79 (6), 1-31, 2019 | 145 | 2019 |