An image is worth 16x16 words: Transformers for image recognition at scale A Dosovitskiy, L Beyer, A Kolesnikov, D Weissenborn, X Zhai, ... arXiv preprint arXiv:2010.11929, 2020 | 16666 | 2020 |
Vivit: A video vision transformer A Arnab, M Dehghani, G Heigold, C Sun, M Lučić, C Schmid Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 952 | 2021 |
An image is worth 16x16 words: Transformers for image recognition at scale. arXiv 2020 A Dosovitskiy, L Beyer, A Kolesnikov, D Weissenborn, X Zhai, ... arXiv preprint arXiv:2010.11929, 2010 | 881 | 2010 |
End-to-end text-dependent speaker verification G Heigold, I Moreno, S Bengio, N Shazeer 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 695 | 2016 |
Small-footprint keyword spotting using deep neural networks G Chen, C Parada, G Heigold 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 598 | 2014 |
Object-centric learning with slot attention F Locatello, D Weissenborn, T Unterthiner, A Mahendran, G Heigold, ... Advances in Neural Information Processing Systems 33, 11525-11538, 2020 | 408 | 2020 |
Multilingual acoustic models using distributed deep neural networks G Heigold, V Vanhoucke, A Senior, P Nguyen, MA Ranzato, M Devin, ... 2013 IEEE international conference on acoustics, speech and signal …, 2013 | 365 | 2013 |
An empirical study of learning rates in deep neural networks for speech recognition A Senior, G Heigold, MA Ranzato, K Yang 2013 IEEE international conference on acoustics, speech and signal …, 2013 | 185 | 2013 |
Word embeddings for speech recognition S Bengio, G Heigold | 175 | 2014 |
Sequence discriminative distributed training of long short-term memory recurrent neural networks H Sak, O Vinyals, G Heigold, A Senior, E McDermott, R Monga, M Mao | 160 | 2014 |
The RWTH Aachen University open source speech recognition system D Rybach, C Gollan, G Heigold, B Hoffmeister, J Lööf, R Schlüter, H Ney Tenth Annual Conference of the International Speech Communication Association, 2009 | 143 | 2009 |
Asynchronous optimization for sequence training of neural networks G Heigold, E McDermott, VO Vanhoucke, AW Senior, MAU Bacchiani US Patent 10,019,985, 2018 | 94 | 2018 |
A linguistic evaluation of rule-based, phrase-based, and neural MT engines A Burchardt, V Macketanz, J Dehdari, G Heigold, P Jan-Thorsten, ... The Prague Bulletin of Mathematical Linguistics 108 (1), 159, 2017 | 92 | 2017 |
Conditional object-centric learning from video T Kipf, GF Elsayed, A Mahendran, A Stone, S Sabour, G Heigold, ... arXiv preprint arXiv:2111.12594, 2021 | 86 | 2021 |
The RWTH 2007 TC-STAR evaluation system for european English and Spanish. J Lööf, C Gollan, S Hahn, G Heigold, B Hoffmeister, C Plahl, D Rybach, ... Interspeech, 2145-2148, 2007 | 77 | 2007 |
Asynchronous stochastic optimization for sequence training of deep neural networks G Heigold, E McDermott, V Vanhoucke, A Senior, M Bacchiani 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 76 | 2014 |
Cross-lingual, character-level neural morphological tagging R Cotterell, G Heigold arXiv preprint arXiv:1708.09157, 2017 | 69 | 2017 |
Speech recognition process G Heigold, PAP Nguyen, M Weintraub, VO Vanhoucke US Patent 8,775,177, 2014 | 68 | 2014 |
Multiframe deep neural networks for acoustic modeling V Vanhoucke, M Devin, G Heigold 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 66 | 2013 |
Modified MMI/MPE: A direct evaluation of the margin in speech recognition G Heigold, T Deselaers, R Schlüter, H Ney Proceedings of the 25th international conference on Machine learning, 384-391, 2008 | 66 | 2008 |