Older Papers

2025

The Thin Line Between Comprehension and Persuasion in LLMs

[pdf] [BibTex] <-- [Code] -->

Adrian de Wynter and Tangming Yuan

Preprint

Labelling Data with Unknown References

[pdf] [BibTex]

Adrian de Wynter

Preprint

A Multilingual, Culture-First Approach to Addressing Misgendering in LLM Applications

[pdf] [BibTex]

Sunayana Sitaram, Adrian de Wynter, Isobel McCrum, Qilong Gu, and Si-Qing Chen

Preprint

RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?

[pdf] [BibTex] [Code]

Adrian de Wynter et al.

(Ann. 2024); AAAI 2025

2024

If Eleanor Rigby Had Met ChatGPT: A Study on Loneliness in a Post-LLM World

[pdf] [BibTex] [Code]

Adrian de Wynter

Preprint

Awes, Laws and Flaws of Today's LLM Research

[pdf] [BibTex] [Code]

Adrian de Wynter

Preprint

Will GPT-4 Run DOOM?

[pdf] [BibTex] [Code] [Post]

Adrian de Wynter

IEEE Transactions on Games (2024)

LLM as a Mastermind: A Survey of Strategic Reasoning with Large Language Models

[pdf] [BibTex]

Yadong Zhang, Shaoguang Mao, Tao Ge, Xun Wang, Adrian de Wynter, Yan Xia, Wenshan Wu, Ting Song, Man Lan and Furu Wei

COLM 2024

One Language, Many Gaps: Evaluating Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks

[pdf] [BibTex]

Fangru Lin, Shaoguang Mao, Emanuele La Malfa, Valentin Hofmann, Adrian de Wynter, Jing Yao, Si-Qing Chen, Michael Wooldridge, Furu Wei

(Ann. 2024); ICLR 2025 Workshop on LLM Planning and Reasoning

"I'd Like to Have an Argument, Please": Argumentative Reasoning in Large Language Models

[pdf] [BibTex] [Code]

Adrian de Wynter and Tangming Yuan

(Ann. 2023); COMMA 2024

Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation?

[pdf] [BibTex]

Rishav Hada, Varun Gumma, Adrian de Wynter, Harshita Diddee, Mohamed Ahmed, Monojit Choudhury, Kalika Bali, and Sunayana Sitaram

(Ann. 2023); EACL 2024

An Algorithm for Learning Smaller Representations of Models With Scarce Data

[pdf] [BibTex] [Code]

Adrian de Wynter

(Ann. 2020); Information Geometry (2024)

CELA Open Data Award

Adrian de Wynter

For the RTP-LX corpus, a dataset for toxic multilingual prompt evaluation.

2023

On Meta-Prompting

[pdf] [BibTex] [Code]

Adrian de Wynter, Xun Wang, Qilong Gu, and Si-Qing Chen

Preprint

An Evaluation of LLM Outputs: Discourse and Memorization

[pdf] [BibTex]

Adrian de Wynter, Xun Wang, Alex Sokolov, Qilong Gu, and Si-Qing Chen

The Natural Language Processing Journal

On the Opportunities and Dangers of LLM-Based Evaluation

Chris Quirk and Adrian de Wynter

Invited talk at the 2023 MLADS Conference

The Curse of the Biased Researcher: Common Pitfalls in LLM-based Evaluation

Adrian de Wynter

Invited talk at the 2023 MLADS Conference

A User-Centered Evaluation of Spanish Text Simplification

[pdf] [BibTex] [Data]

Adrian de Wynter, Anthony Hevia, and Si-Qing Chen

Preprint

CELA Open Data Award

Adrian de Wynter

For the CLANDESTINO corpus, a dataset for localized Spanish toxic-language detection.

Older

Turing Completeness and Sid Meier's Civilization

[pdf] [BibTex] [The Turing Machine in Action]

Adrian de Wynter

IEEE Transactions on Games (2022)

Bort: Algorithms and Applications

Adrian de Wynter

Invited talk at the 2021 Alexa Prize Summit

Optimal Subarchitecture Extraction for BERT

[pdf] [BibTex] [Code]

Adrian de Wynter and Daniel J. Perry

Preprint (2020)

An Approximation Algorithm for Optimal Subarchitecture Extraction

[pdf] [BibTex]

Adrian de Wynter

Preprint (2020)

Mischief: A Simple Black-Box Attack Against Transformer Architectures

[pdf] [BibTex]

Adrian de Wynter

Preprint (2020)

Harder Performance Measures for Language Models

Adrian de Wynter

Invited talk at the 2020 Alexa Prize Summit

On the Bounds of Function Approximations

[pdf] [BibTex]

Adrian de Wynter

ICANN 2019 (oral presentation)

Back