Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2210.03050
Cited By
v1
v2
v3
v4 (latest)
State-of-the-art generalisation research in NLP: A taxonomy and review
Nature Machine Intelligence (Nat. Mach. Intell.), 2022
6 October 2022
Dieuwke Hupkes
Mario Giulianelli
Verna Dankers
Mikel Artetxe
Yanai Elazar
Tiago Pimentel
Christos Christodoulopoulos
Karim Lasri
Naomi Saphra
Arabella J. Sinclair
Dennis Ulmer
Florian Schottmann
Khuyagbaatar Batsuren
Kaiser Sun
Koustuv Sinha
Leila Khalatbari
Maria Ryskina
Rita Frieske
Robert Bamler
Zhijing Jin
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"State-of-the-art generalisation research in NLP: A taxonomy and review"
50 / 77 papers shown
Title
Z-Space: A Multi-Agent Tool Orchestration Framework for Enterprise-Grade LLM Automation
Qingsong He
Jing Nan
Jiayu Jiao
Liangjie Tang
Xiaodong Xu
Mengmeng Sun
Qingyao Wang
Minghui Yan
LLMAG
162
0
0
23 Nov 2025
On the Measure of a Model: From Intelligence to Generality
Ruchira Dhar
Ninell Oldenburg
Anders Soegaard
ELM
125
0
0
14 Nov 2025
Lightweight CNN Model Hashing with Higher-Order Statistics and Chaotic Mapping for Piracy Detection and Tamper Localization
Kunming Yang
Ling Chen
AAML
52
0
0
31 Oct 2025
MERGE: Minimal Expression-Replacement GEneralization Test for Natural Language Inference
Mădălina Zgreabăn
Tejaswini Deoskar
Lasha Abzianidze
102
0
0
28 Oct 2025
Resource-sensitive but language-blind: Community size and not grammatical complexity better predicts the accuracy of Large Language Models in a novel Wug Test
Nikoleta Pantelidou
Evelina Leivada
Paolo Morosi
52
1
0
14 Oct 2025
FOSSIL: Harnessing Feedback on Suboptimal Samples for Data-Efficient Generalisation with Imitation Learning for Embodied Vision-and-Language Tasks
Sabrina McCallum
Amit Parekh
Alessandro Suglia
LM&Ro
116
0
0
13 Oct 2025
Steering Embedding Models with Geometric Rotation: Mapping Semantic Relationships Across Languages and Models
Michael Freenor
Lauren Alvarez
LLMSV
177
0
0
10 Oct 2025
Hybrid Models for Natural Language Reasoning: The Case of Syllogistic Logic
Manuel Vargas Guzmán
Jakub Szymanik
Maciej Malicki
NAI
LRM
ELM
54
0
0
10 Oct 2025
MoVa: Towards Generalizable Classification of Human Morals and Values
Ziyu Chen
Junfei Sun
Chenxi Li
Tuan Dung Nguyen
Jing Yao
Xiaoyuan Yi
Xing Xie
Chenhao Tan
Lexing Xie
100
1
0
29 Sep 2025
AgentCoMa: A Compositional Benchmark Mixing Commonsense and Mathematical Reasoning in Real-World Scenarios
Lisa Alazraki
Lihu Chen
Ana Brassard
Joe Stacey
Hossein A. Rahmani
Marek Rei
CoGe
LRM
126
0
0
27 Aug 2025
Numerical models outperform AI weather forecasts of record-breaking extremes
Zhongwei Zhang
Erich Fischer
Jakob Zscheischler
Sebastian Engelke
AI4Cl
ELM
172
9
0
21 Aug 2025
How Causal Abstraction Underpins Computational Explanation
Atticus Geiger
Jacqueline Harding
Thomas Icard
105
2
0
15 Aug 2025
Are Knowledge and Reference in Multilingual Language Models Cross-Lingually Consistent?
Xi Ai
Mahardika Krisna Ihsani
Min-Yen Kan
HILM
161
1
0
17 Jul 2025
Assessing Intersectional Bias in Representations of Pre-Trained Image Recognition Models
Valerie Krug
Sebastian Stober
241
0
0
04 Jun 2025
Systematic Generalization in Language Models Scales with Information Entropy
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Sondre Wold
Lucas Georges Gabriel Charpentier
Étienne Simon
406
0
0
19 May 2025
Domain Regeneration: How well do LLMs match syntactic properties of text domains?
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Da Ju
Hagen Blix
Adina Williams
DeLMO
312
2
0
12 May 2025
FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation
Yulia Otmakhova
Hung Thinh Truong
Rahmad Mahendra
Zenan Zhai
Rongxin Zhu
Daniel Beck
Jey Han Lau
ELM
445
0
0
24 Apr 2025
FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference Benchmarking
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Jabez Magomere
Elena Kochkina
Samuel Mensah
Simerjot Kaur
Charese Smiley
312
3
0
22 Apr 2025
MiMu: Mitigating Multiple Shortcut Learning Behavior of Transformers
Lili Zhao
Qi Liu
Wei-neng Chen
Xiaoou Liu
R.-H. Sun
Min Hou
Yang Wang
Shijin Wang
411
2
0
14 Apr 2025
MultiLoKo: a multilingual local knowledge benchmark for LLMs spanning 31 languages
Dieuwke Hupkes
Nikolay Bogoychev
922
11
0
14 Apr 2025
TRA: Better Length Generalisation with Threshold Relative Attention
Mattia Opper
Roland Fernandez
P. Smolensky
Jianfeng Gao
480
1
0
29 Mar 2025
Probing LLMs for Multilingual Discourse Generalization Through a Unified Label Set
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Florian Eichin
Wenshu Fan
Yun Xue
Michael A. Hedderich
376
6
0
13 Mar 2025
Structural Deep Encoding for Table Question Answering
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Raphael Mouravieff
Benjamin Piwowarski
Sylvain Lamprier
LMTD
253
2
0
03 Mar 2025
Gradient-Guided Annealing for Domain Generalization
Computer Vision and Pattern Recognition (CVPR), 2025
Aristotelis Ballas
Christos Diou
OOD
1.3K
4
0
27 Feb 2025
Can LLMs Help Uncover Insights about LLMs? A Large-Scale, Evolving Literature Analysis of Frontier LLMs
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jungsoo Park
Junmo Kang
Gabriel Stanovsky
Alan Ritter
375
0
0
26 Feb 2025
Learning Latent Spaces for Domain Generalization in Time Series Forecasting
Songgaojun Deng
Maarten de Rijke
CML
AI4TS
OOD
BDL
286
1
0
15 Dec 2024
Quantifying artificial intelligence through algorithmic generalization
Nature Machine Intelligence (Nat. Mach. Intell.), 2024
Takuya Ito
Murray Campbell
L. Horesh
Tim Klinger
Parikshit Ram
ELM
400
0
0
08 Nov 2024
Beyond the Numbers: Transparency in Relation Extraction Benchmark Creation and Leaderboards
Varvara Arzt
Allan Hanbury
236
2
0
07 Nov 2024
Frequency matters: Modeling irregular morphological patterns in Spanish with Transformers
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Akhilesh Kakolu Ramarao
Kevin Tang
Dinah Baer-Henney
332
1
0
28 Oct 2024
Tokenization and Morphology in Multilingual Language Models: A Comparative Analysis of mT5 and ByT5
Thao Anh Dang
Limor Raviv
Lukas Galke
261
9
0
15 Oct 2024
The Mystery of Compositional Generalization in Graph-based Generative Commonsense Reasoning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Xiyan Fu
Anette Frank
LRM
414
1
0
08 Oct 2024
Data Contamination Report from the 2024 CONDA Shared Task
Oscar Sainz
Iker García-Ferrero
Alon Jacovi
Jonas Hanselle
Yanai Elazar
...
Yu-Min Tseng
Vishaal Udandarao
Zengzhi Wang
Ruijie Xu
Jinglin Yang
259
13
0
31 Jul 2024
Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
Amit Parekh
Nikolas Vitsakis
Alessandro Suglia
Ioannis Konstas
AAML
251
8
0
04 Jul 2024
Black Big Boxes: Do Language Models Hide a Theory of Adjective Order?
Jaap Jumelet
Lisa Bylinina
Willem H. Zuidema
Jakub Szymanik
247
5
0
02 Jul 2024
Detection and Measurement of Syntactic Templates in Generated Text
Chantal Shaib
Yanai Elazar
Junyi Jessy Li
Byron C. Wallace
248
37
0
28 Jun 2024
Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence
Abhinav Patil
Jaap Jumelet
Yu Ying Chiu
Andy Lapastora
Peter Shen
Lexie Wang
Clevis Willrich
Shane Steinert-Threlkeld
218
17
0
24 May 2024
From Frege to chatGPT: Compositionality in language, cognition, and deep neural networks
Jacob Russin
Sam Whitman McGrath
Danielle J. Williams
AI4CE
481
6
0
24 May 2024
Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge
Khuyagbaatar Batsuren
Ekaterina Vylomova
Verna Dankers
Tsetsuukhei Delgerbaatar
Omri Uzan
Yuval Pinter
Gábor Bella
160
16
0
20 Apr 2024
From Form(s) to Meaning: Probing the Semantic Depths of Language Models Using Multisense Consistency
Xenia Ohmer
Elia Bruni
Dieuwke Hupkes
AI4CE
269
9
0
18 Apr 2024
Multilingual Pretraining and Instruction Tuning Improve Cross-Lingual Knowledge Alignment, But Only Shallowly
Changjiang Gao
Hongda Hu
Peng Hu
Jiajun Chen
Jixing Li
Shujian Huang
278
30
0
06 Apr 2024
From Robustness to Improved Generalization and Calibration in Pre-trained Language Models
Josip Jukić
Jan Snajder
337
2
0
31 Mar 2024
THE COLOSSEUM: A Benchmark for Evaluating Generalization for Robotic Manipulation
Wilbert Pumacay
Ishika Singh
Jiafei Duan
Ranjay Krishna
Jesse Thomason
Dieter Fox
295
92
0
13 Feb 2024
Efficient and Scalable Fine-Tune of Language Models for Genome Understanding
Huixin Zhan
Ying Nian Wu
Zijun Zhang
ALM
87
2
0
12 Feb 2024
A Philosophical Introduction to Language Models -- Part I: Continuity With Classic Debates
Raphael Milliere
Cameron Buckner
LRM
ELM
167
37
0
08 Jan 2024
The ICL Consistency Test
Lucas Weber
Elia Bruni
Dieuwke Hupkes
ALM
216
7
0
08 Dec 2023
Walking a Tightrope -- Evaluating Large Language Models in High-Risk Domains
Chia-Chien Hung
Wiem Ben-Rim
Lindsay Frost
Lars Bruckner
Carolin (Haas) Lawrence
AILaw
ALM
ELM
235
11
0
25 Nov 2023
GenCodeSearchNet: A Benchmark Test Suite for Evaluating Generalization in Programming Language Understanding
Andor Diera
Abdelhalim Hafedh Dahou
Lukas Galke
Fabian Karl
Florian Sihler
A. Scherp
ELM
136
7
0
16 Nov 2023
Whispers of Doubt Amidst Echoes of Triumph in NLP Robustness
Ashim Gupta
Rishanth Rajendhran
Nathan Stringham
Vivek Srikumar
Ana Marasović
AAML
258
8
0
16 Nov 2023
On Using Distribution-Based Compositionality Assessment to Evaluate Compositional Generalisation in Machine Translation
Anssi Moisio
Mathias Creutz
M. Kurimo
CoGe
173
1
0
14 Nov 2023
Robust Generalization Strategies for Morpheme Glossing in an Endangered Language Documentation Context
Michael Ginn
Alexis Palmer
171
5
0
05 Nov 2023
1
2
Next