Palm: Scaling language modeling with pathways A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... arXiv preprint arXiv:2204.02311, 2022 | 3187 | 2022 |
Efficiently Scaling Transformer Inference R Pope, S Douglas, A Chowdhery, J Devlin, J Bradbury, A Levskaya, ... arXiv preprint arXiv:2211.05102, 2022 | 123 | 2022 |