All Works



.

RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?
Adrian de Wynter, Ishaan Watts, et al.
(Ann. 2024); AAAI 2025
If Eleanor Rigby Had Met ChatGPT: A Study on Loneliness in a Post-LLM World
Adrian de Wynter
Preprint
Awes, Laws and Flaws of Today's LLM Research
Adrian de Wynter
Preprint
Will GPT-4 Run DOOM?
Adrian de Wynter
IEEE Transactions on Games (2024)
LLM as a Mastermind: A Survey of Strategic Reasoning with Large Language Models
Yadong Zhang, Shaoguang Mao, Tao Ge, Xun Wang, Adrian de Wynter, Yan Xia, Wenshan Wu, Ting Song, Man Lan and Furu Wei
COLM 2024
One Language, Many Gaps: Evaluating Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks
Fangru Lin, Shaoguang Mao, Emanuele La Malfa, Valentin Hofmann, Adrian de Wynter, Jing Yao, Si-Qing Chen, Michael Wooldridge, Furu Wei
Preprint
"I'd Like to Have an Argument, Please": Argumentative Reasoning in Large Language Models
Adrian de Wynter and Tangming Yuan
(Ann. 2023); COMMA 2024
Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation?
Rishav Hada, Varun Gumma, Adrian de Wynter, Harshita Diddee, Mohamed Ahmed, Monojit Choudhury, Kalika Bali, and Sunayana Sitaram
(Ann. 2023); EACL 2024
An Algorithm for Learning Smaller Representations of Models With Scarce Data
Adrian de Wynter
(Ann. 2020); Information Geometry (2024)
CELA Open Data Award
Adrian de Wynter
For the RTP-LX corpus, a dataset for toxic multilingual prompt evaluation.
On Meta-Prompting
Adrian de Wynter, Xun Wang, Qilong Gu, and Si-Qing Chen
Preprint
An Evaluation of LLM Outputs: Discourse and Memorization
Adrian de Wynter, Xun Wang, Alex Sokolov, Qilong Gu, and Si-Qing Chen
The Natural Language Processing Journal
On the Opportunities and Dangers of LLM-Based Evaluation
Chris Quirk and Adrian de Wynter
Invited talk at the 2023 MLADS Conference
The Curse of the Biased Researcher: Common Pitfalls in LLM-based Evaluation
Adrian de Wynter
Invited talk at the 2023 MLADS Conference
A User-Centered Evaluation of Spanish Text Simplification
Adrian de Wynter, Anthony Hevia, and Si-Qing Chen
Preprint
CELA Open Data Award
Adrian de Wynter
For the CLANDESTINO corpus, a dataset for localized Spanish toxic-language detection.
Turing Completeness and Sid Meier's Civilization
Adrian de Wynter
IEEE Transactions on Games (2022)
Bort: Algorithms and Applications
Adrian de Wynter
Invited talk at the 2021 Alexa Prize Summit
Optimal Subarchitecture Extraction for BERT
Adrian de Wynter and Daniel J. Perry
Preprint (2020)
An Approximation Algorithm for Optimal Subarchitecture Extraction
Adrian de Wynter
Preprint (2020)
Mischief: A Simple Black-Box Attack Against Transformer Architectures
Adrian de Wynter
Preprint (2020)
Harder Performance Measures for Language Models
Adrian de Wynter
Invited talk at the 2020 Alexa Prize Summit
On the Bounds of Function Approximations
Adrian de Wynter
ICANN 2019 (oral presentation)