ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.03050
  4. Cited By
State-of-the-art generalisation research in NLP: A taxonomy and review
v1v2v3v4 (latest)

State-of-the-art generalisation research in NLP: A taxonomy and review

Nature Machine Intelligence (Nat. Mach. Intell.), 2022
6 October 2022
Dieuwke Hupkes
Mario Giulianelli
Verna Dankers
Mikel Artetxe
Yanai Elazar
Tiago Pimentel
Christos Christodoulopoulos
Karim Lasri
Naomi Saphra
Arabella J. Sinclair
Dennis Ulmer
Florian Schottmann
Khuyagbaatar Batsuren
Kaiser Sun
Koustuv Sinha
Leila Khalatbari
Maria Ryskina
Rita Frieske
Robert Bamler
Zhijing Jin
ArXiv (abs)PDFHTMLGithub

Papers citing "State-of-the-art generalisation research in NLP: A taxonomy and review"

50 / 77 papers shown
Z-Space: A Multi-Agent Tool Orchestration Framework for Enterprise-Grade LLM Automation
Z-Space: A Multi-Agent Tool Orchestration Framework for Enterprise-Grade LLM Automation
Qingsong He
Jing Nan
Jiayu Jiao
Liangjie Tang
Xiaodong Xu
Mengmeng Sun
Qingyao Wang
Minghui Yan
LLMAG
261
0
0
23 Nov 2025
On the Measure of a Model: From Intelligence to Generality
On the Measure of a Model: From Intelligence to Generality
Ruchira Dhar
Ninell Oldenburg
Anders Soegaard
ELM
158
0
0
14 Nov 2025
Lightweight CNN Model Hashing with Higher-Order Statistics and Chaotic Mapping for Piracy Detection and Tamper Localization
Lightweight CNN Model Hashing with Higher-Order Statistics and Chaotic Mapping for Piracy Detection and Tamper Localization
Kunming Yang
Ling Chen
AAML
107
0
0
31 Oct 2025
MERGE: Minimal Expression-Replacement GEneralization Test for Natural Language Inference
MERGE: Minimal Expression-Replacement GEneralization Test for Natural Language Inference
Mădălina Zgreabăn
Tejaswini Deoskar
Lasha Abzianidze
184
0
0
28 Oct 2025
Community size rather than grammatical complexity better predicts Large Language Model accuracy in a novel Wug Test
Community size rather than grammatical complexity better predicts Large Language Model accuracy in a novel Wug Test
Nikoleta Pantelidou
Evelina Leivada
Paolo Morosi
Paolo Morosi
ELM
164
1
0
14 Oct 2025
FOSSIL: Harnessing Feedback on Suboptimal Samples for Data-Efficient Generalisation with Imitation Learning for Embodied Vision-and-Language Tasks
FOSSIL: Harnessing Feedback on Suboptimal Samples for Data-Efficient Generalisation with Imitation Learning for Embodied Vision-and-Language Tasks
Sabrina McCallum
Amit Parekh
Alessandro Suglia
LM&Ro
176
0
0
13 Oct 2025
Mapping Semantic & Syntactic Relationships with Geometric Rotation
Mapping Semantic & Syntactic Relationships with Geometric Rotation
Michael Freenor
Lauren Alvarez
LLMSV
247
1
0
10 Oct 2025
Hybrid Models for Natural Language Reasoning: The Case of Syllogistic Logic
Hybrid Models for Natural Language Reasoning: The Case of Syllogistic Logic
Manuel Vargas Guzmán
Jakub Szymanik
Maciej Malicki
NAILRMELM
89
0
0
10 Oct 2025
MoVa: Towards Generalizable Classification of Human Morals and Values
MoVa: Towards Generalizable Classification of Human Morals and Values
Ziyu Chen
Junfei Sun
Chenxi Li
Tuan Dung Nguyen
Jing Yao
Xiaoyuan Yi
Xing Xie
Chenhao Tan
Lexing Xie
146
7
0
29 Sep 2025
AgentCoMa: A Compositional Benchmark Mixing Commonsense and Mathematical Reasoning in Real-World Scenarios
AgentCoMa: A Compositional Benchmark Mixing Commonsense and Mathematical Reasoning in Real-World Scenarios
Lisa Alazraki
Lihu Chen
Ana Brassard
Joe Stacey
Hossein A. Rahmani
Marek Rei
CoGeLRM
204
1
0
27 Aug 2025
Numerical models outperform AI weather forecasts of record-breaking extremes
Numerical models outperform AI weather forecasts of record-breaking extremes
Zhongwei Zhang
Erich Fischer
Jakob Zscheischler
Sebastian Engelke
AI4ClELM
303
16
0
21 Aug 2025
How Causal Abstraction Underpins Computational Explanation
How Causal Abstraction Underpins Computational Explanation
Atticus Geiger
Jacqueline Harding
Thomas Icard
180
3
0
15 Aug 2025
Are Knowledge and Reference in Multilingual Language Models Cross-Lingually Consistent?
Are Knowledge and Reference in Multilingual Language Models Cross-Lingually Consistent?
Xi Ai
Mahardika Krisna Ihsani
Min-Yen Kan
HILM
275
2
0
17 Jul 2025
Assessing Intersectional Bias in Representations of Pre-Trained Image Recognition Models
Assessing Intersectional Bias in Representations of Pre-Trained Image Recognition Models
Valerie Krug
Sebastian Stober
340
0
0
04 Jun 2025
Systematic Generalization in Language Models Scales with Information Entropy
Systematic Generalization in Language Models Scales with Information EntropyAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Sondre Wold
Lucas Georges Gabriel Charpentier
Étienne Simon
485
2
0
19 May 2025
Domain Regeneration: How well do LLMs match syntactic properties of text domains?
Domain Regeneration: How well do LLMs match syntactic properties of text domains?Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Da Ju
Hagen Blix
Adina Williams
DeLMO
423
3
0
12 May 2025
FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation
FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation
Yulia Otmakhova
Hung Thinh Truong
Rahmad Mahendra
Zenan Zhai
Rongxin Zhu
Daniel Beck
Jey Han Lau
ELM
577
1
0
24 Apr 2025
FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference Benchmarking
FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference BenchmarkingNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025
Jabez Magomere
Elena Kochkina
Samuel Mensah
Simerjot Kaur
Charese Smiley
444
4
0
22 Apr 2025
MiMu: Mitigating Multiple Shortcut Learning Behavior of Transformers
MiMu: Mitigating Multiple Shortcut Learning Behavior of Transformers
Lili Zhao
Qi Liu
Wei-neng Chen
Xiaoou Liu
R.-H. Sun
Min Hou
Yang Wang
Shijin Wang
496
2
0
14 Apr 2025
MultiLoKo: a multilingual local knowledge benchmark for LLMs spanning 31 languages
MultiLoKo: a multilingual local knowledge benchmark for LLMs spanning 31 languages
Dieuwke Hupkes
Nikolay Bogoychev
1.1K
15
0
14 Apr 2025
TRA: Better Length Generalisation with Threshold Relative Attention
TRA: Better Length Generalisation with Threshold Relative Attention
Mattia Opper
Roland Fernandez
P. Smolensky
Jianfeng Gao
641
1
0
29 Mar 2025
Probing LLMs for Multilingual Discourse Generalization Through a Unified Label Set
Probing LLMs for Multilingual Discourse Generalization Through a Unified Label SetAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Florian Eichin
Wenshu Fan
Yun Xue
Michael A. Hedderich
496
6
0
13 Mar 2025
Structural Deep Encoding for Table Question Answering
Structural Deep Encoding for Table Question AnsweringAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Raphael Mouravieff
Benjamin Piwowarski
Sylvain Lamprier
LMTD
332
2
0
03 Mar 2025
Gradient-Guided Annealing for Domain Generalization
Gradient-Guided Annealing for Domain GeneralizationComputer Vision and Pattern Recognition (CVPR), 2025
Aristotelis Ballas
Christos Diou
OOD
1.5K
10
0
27 Feb 2025
Can LLMs Help Uncover Insights about LLMs? A Large-Scale, Evolving Literature Analysis of Frontier LLMs
Can LLMs Help Uncover Insights about LLMs? A Large-Scale, Evolving Literature Analysis of Frontier LLMsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Jungsoo Park
Junmo Kang
Gabriel Stanovsky
Alan Ritter
468
0
0
26 Feb 2025
Learning Latent Spaces for Domain Generalization in Time Series
  Forecasting
Learning Latent Spaces for Domain Generalization in Time Series Forecasting
Songgaojun Deng
Maarten de Rijke
CMLAI4TSOODBDL
407
1
0
15 Dec 2024
Quantifying artificial intelligence through algorithmic generalization
Quantifying artificial intelligence through algorithmic generalizationNature Machine Intelligence (Nat. Mach. Intell.), 2024
Takuya Ito
Murray Campbell
L. Horesh
Tim Klinger
Parikshit Ram
ELM
504
0
0
08 Nov 2024
Beyond the Numbers: Transparency in Relation Extraction Benchmark
  Creation and Leaderboards
Beyond the Numbers: Transparency in Relation Extraction Benchmark Creation and Leaderboards
Varvara Arzt
Allan Hanbury
375
2
0
07 Nov 2024
Frequency matters: Modeling irregular morphological patterns in Spanish with Transformers
Frequency matters: Modeling irregular morphological patterns in Spanish with TransformersAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Akhilesh Kakolu Ramarao
Kevin Tang
Dinah Baer-Henney
421
2
0
28 Oct 2024
Tokenization and Morphology in Multilingual Language Models: A
  Comparative Analysis of mT5 and ByT5
Tokenization and Morphology in Multilingual Language Models: A Comparative Analysis of mT5 and ByT5
Thao Anh Dang
Limor Raviv
Lukas Galke
430
13
0
15 Oct 2024
The Mystery of Compositional Generalization in Graph-based Generative
  Commonsense Reasoning
The Mystery of Compositional Generalization in Graph-based Generative Commonsense ReasoningConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Xiyan Fu
Anette Frank
LRM
518
1
0
08 Oct 2024
Data Contamination Report from the 2024 CONDA Shared Task
Data Contamination Report from the 2024 CONDA Shared Task
Oscar Sainz
Iker García-Ferrero
Alon Jacovi
Jonas Hanselle
Yanai Elazar
...
Yu-Min Tseng
Vishaal Udandarao
Zengzhi Wang
Ruijie Xu
Jinglin Yang
338
19
0
31 Jul 2024
Investigating the Role of Instruction Variety and Task Difficulty in
  Robotic Manipulation Tasks
Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
Amit Parekh
Nikolas Vitsakis
Alessandro Suglia
Ioannis Konstas
AAML
368
9
0
04 Jul 2024
Black Big Boxes: Tracing Adjective Order Preferences in Large Language Models
Black Big Boxes: Tracing Adjective Order Preferences in Large Language Models
Jaap Jumelet
Lisa Bylinina
Willem H. Zuidema
Jakub Szymanik
357
5
0
02 Jul 2024
Detection and Measurement of Syntactic Templates in Generated Text
Detection and Measurement of Syntactic Templates in Generated Text
Chantal Shaib
Yanai Elazar
Junyi Jessy Li
Byron C. Wallace
313
40
0
28 Jun 2024
Filtered Corpus Training (FiCT) Shows that Language Models can
  Generalize from Indirect Evidence
Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence
Abhinav Patil
Jaap Jumelet
Yu Ying Chiu
Andy Lapastora
Peter Shen
Lexie Wang
Clevis Willrich
Shane Steinert-Threlkeld
268
26
0
24 May 2024
From Frege to chatGPT: Compositionality in language, cognition, and deep neural networks
From Frege to chatGPT: Compositionality in language, cognition, and deep neural networks
Jacob Russin
Sam Whitman McGrath
Danielle J. Williams
AI4CE
675
9
0
24 May 2024
Evaluating Subword Tokenization: Alien Subword Composition and OOV
  Generalization Challenge
Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge
Khuyagbaatar Batsuren
Ekaterina Vylomova
Verna Dankers
Tsetsuukhei Delgerbaatar
Omri Uzan
Yuval Pinter
Gábor Bella
210
16
0
20 Apr 2024
From Form(s) to Meaning: Probing the Semantic Depths of Language Models
  Using Multisense Consistency
From Form(s) to Meaning: Probing the Semantic Depths of Language Models Using Multisense Consistency
Xenia Ohmer
Elia Bruni
Dieuwke Hupkes
AI4CE
340
11
0
18 Apr 2024
Multilingual Pretraining and Instruction Tuning Improve Cross-Lingual
  Knowledge Alignment, But Only Shallowly
Multilingual Pretraining and Instruction Tuning Improve Cross-Lingual Knowledge Alignment, But Only Shallowly
Changjiang Gao
Hongda Hu
Peng Hu
Jiajun Chen
Jixing Li
Shujian Huang
352
38
0
06 Apr 2024
From Robustness to Improved Generalization and Calibration in
  Pre-trained Language Models
From Robustness to Improved Generalization and Calibration in Pre-trained Language Models
Josip Jukić
Jan Snajder
441
2
0
31 Mar 2024
THE COLOSSEUM: A Benchmark for Evaluating Generalization for Robotic
  Manipulation
THE COLOSSEUM: A Benchmark for Evaluating Generalization for Robotic Manipulation
Wilbert Pumacay
Ishika Singh
Jiafei Duan
Ranjay Krishna
Jesse Thomason
Dieter Fox
452
119
0
13 Feb 2024
Efficient and Scalable Fine-Tune of Language Models for Genome
  Understanding
Efficient and Scalable Fine-Tune of Language Models for Genome Understanding
Huixin Zhan
Ying Nian Wu
Zijun Zhang
ALM
144
2
0
12 Feb 2024
A Philosophical Introduction to Language Models -- Part I: Continuity
  With Classic Debates
A Philosophical Introduction to Language Models -- Part I: Continuity With Classic Debates
Raphael Milliere
Cameron Buckner
LRMELM
261
43
0
08 Jan 2024
The ICL Consistency Test
The ICL Consistency Test
Lucas Weber
Elia Bruni
Dieuwke Hupkes
ALM
370
7
0
08 Dec 2023
Walking a Tightrope -- Evaluating Large Language Models in High-Risk
  Domains
Walking a Tightrope -- Evaluating Large Language Models in High-Risk Domains
Chia-Chien Hung
Wiem Ben-Rim
Lindsay Frost
Lars Bruckner
Carolin (Haas) Lawrence
AILawALMELM
349
15
0
25 Nov 2023
GenCodeSearchNet: A Benchmark Test Suite for Evaluating Generalization
  in Programming Language Understanding
GenCodeSearchNet: A Benchmark Test Suite for Evaluating Generalization in Programming Language Understanding
Andor Diera
Abdelhalim Hafedh Dahou
Lukas Galke
Fabian Karl
Florian Sihler
A. Scherp
ELM
196
10
0
16 Nov 2023
Whispers of Doubt Amidst Echoes of Triumph in NLP Robustness
Whispers of Doubt Amidst Echoes of Triumph in NLP Robustness
Ashim Gupta
Rishanth Rajendhran
Nathan Stringham
Vivek Srikumar
Ana Marasović
AAML
325
8
0
16 Nov 2023
On Using Distribution-Based Compositionality Assessment to Evaluate
  Compositional Generalisation in Machine Translation
On Using Distribution-Based Compositionality Assessment to Evaluate Compositional Generalisation in Machine Translation
Anssi Moisio
Mathias Creutz
M. Kurimo
CoGe
291
1
0
14 Nov 2023
Robust Generalization Strategies for Morpheme Glossing in an Endangered
  Language Documentation Context
Robust Generalization Strategies for Morpheme Glossing in an Endangered Language Documentation Context
Michael Ginn
Alexis Palmer
285
5
0
05 Nov 2023
12
Next
Page 1 of 2