Follow
Tristan Thrush
Tristan Thrush
Research Engineer, Hugging Face
Verified email at huggingface.co - Homepage
Title
Cited by
Cited by
Year
Dynabench: Rethinking benchmarking in NLP
D Kiela, M Bartolo, Y Nie, D Kaushik, A Geiger, Z Wu, B Vidgen, G Prasad, ...
NAACL, 2021
1282021
Learning from the worst: Dynamically generated datasets to improve online hate detection
B Vidgen, T Thrush, Z Waseem, D Kiela
ACL, 2021
532021
Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation
M Bartolo, T Thrush, R Jia, S Riedel, P Stenetorp, D Kiela
EMNLP, 2021
282021
Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality
T Thrush*, R Jiang, M Bartolo, A Singh, A Williams, D Kiela, C Ross*
CVPR, 2022
242022
Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking
Z Ma*, K Ethayarajh*, T Thrush*, S Jain, L Wu, R Jia, C Potts, A Williams, ...
NeurIPS, 2021
232021
Anlizing the adversarial natural language inference dataset
A Williams, T Thrush, D Kiela
SCiL, 2022
202022
Human-adversarial visual question answering
S Sheng, A Singh, V Goswami, JAL Magana, T Thrush, W Galuba, ...
NeurIPS, 2021
182021
Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-based Hate
HR Kirk, B Vidgen, P Röttger, T Thrush, SA Hale
NAACL, 2022
102022
Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants
M Bartolo, T Thrush, S Riedel, P Stenetorp, R Jia, D Kiela
NAACL, 2022
92022
Findings of the WMT 2021 Shared Task on Large-Scale Multilingual Machine Translation
G Wenzek, V Chaudhary, A Fan, S Gomez, N Goyal, S Jain, D Kiela, ...
WMT at EMNLP, 2021
62021
Investigating Novel Verb Learning in BERT: Selectional Preference Classes and Alternation-Based Syntactic Generalization
T Thrush, E Wilcox, R Levy
BlackboxNLP at EMNLP, 2020
52020
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...
arXiv preprint arXiv:2211.05100, 2022
32022
DataPerf: Benchmarks for Data-Centric AI Development
M Mazumder, C Banbury, X Yao, B Karlaš, WG Rojas, S Diamos, ...
White Paper, 2022
32022
Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks
T Thrush, K Tirumala, A Gupta, M Bartolo, P Rodriguez, T Kane, ...
ACL System Demos, 2022
32022
Rover Relocalization for Mars Sample Return by Virtual Template Synthesis and Matching
TH Pham, W Seto, S Daftry, B Ridge, J Hansen, T Thrush, ...
IEEE Robotics and Automation Letters 6 (2), 4009-4016, 2021
32021
The partial mental state inducer: Learning intuition with few training examples and K-line theory
T Thrush, P Winston
Advances in Cognitive Systems, 2018
32018
The BigScience ROOTS Corpus: A 1.6 TB Composite Multilingual Dataset
H Laurençon, L Saulnier, T Wang, C Akiki, AV del Moral, T Le Scao, ...
NeurIPS Datasets and Benchmarks, 2022
22022
Compositional neural machine translation by removing the lexicon from syntax
T Thrush
CogSci (Abstract), 2020
12020
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurement
L von Werra, L Tunstall, A Thakur, AS Luccioni, T Thrush, A Piktus, ...
EMNLP System Demos, 2022
2022
Proceedings of the First Workshop on Dynamic Adversarial Data Collection
M Bartolo, H Kirk, P Rodriguez, K Margatina, T Thrush, R Jia, P Stenetorp, ...
Proceedings of the First Workshop on Dynamic Adversarial Data Collection, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–20