v1v2 (latest)

Shortcut Learning of Large Language Models in Natural Language Understanding

Communications of the ACM (CACM), 2022

25 August 2022

Papers citing "Shortcut Learning of Large Language Models in Natural Language Understanding"

50 / 62 papers shown

Do AI Models Perform Human-like Abstract Reasoning Across Modalities?

Sivasankaran Rajamanickam

Melanie Mitchell

ReLM ELM LRM

247

02 Oct 2025

Detecting Regional Spurious Correlations in Vision Transformers via Token Discarding

158

04 Sep 2025

STREAM (ChemBio): A Standard for Transparently Reporting Evaluations in AI Model Reports

266

13 Aug 2025

SCOPE: Stochastic and Counterbiased Option Placement for Evaluating Large Language Models

Wonjun Jeong

Dongseok Kim

Taegkeun Whangbo

219

24 Jul 2025

Large Learning Rates Simultaneously Achieve Robustness to Spurious Correlations and Compressibility

284

23 Jul 2025

LoRA Users Beware: A Few Spurious Tokens Can Manipulate Your Finetuned Model

340

13 Jun 2025

Physics-informed Temporal Alignment for Auto-regressive PDE Foundation Models

436

16 May 2025

Benign Samples Matter! Fine-tuning On Outlier Benign Samples Severely Breaks Safety

332

11 May 2025

MiMu: Mitigating Multiple Shortcut Learning Behavior of Transformers

427

14 Apr 2025

Gradient Extrapolation for Debiased Representation Learning

Ihab Asaad

M. Shadaydeh

Joachim Denzler

310

17 Mar 2025

DBR: Divergence-Based Regularization for Debiasing Natural Language Understanding ModelsSIGKDD Explorations (SIGKDD Explor.), 2025

361

25 Feb 2025

Unveiling Scoring Processes: Dissecting the Differences between LLMs and Human Graders in Automatic ScoringTechnology, Knowledge and Learning (TKL), 2024

325

24 Feb 2025

Show Me the Work: Fact-Checkers' Requirements for Explainable Automated Fact-CheckingInternational Conference on Human Factors in Computing Systems (CHI), 2025

599

13 Feb 2025

Should Code Models Learn Pedagogically? A Preliminary Evaluation of Curriculum Learning for Real-World Software Engineering TasksIEEE Working Conference on Mining Software Repositories (MSR), 2025

Kyi Shin Khant

Hong Yi Lin

Patanamon Thongtanunam

ELM

322

06 Feb 2025

On Adversarial Robustness of Language Models in Transfer Learning

366

29 Dec 2024

Boosting LLM-based Relevance Modeling with Distribution-Aware Robust LearningInternational Conference on Information and Knowledge Management (CIKM), 2024

385

17 Dec 2024

On the Shortcut Learning in Multilingual Neural Machine Translation

913

15 Nov 2024

Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers

1.2K

30 Oct 2024

Large Language Model Benchmarks in Medical Tasks

...

695

28 Oct 2024

Leaving the barn door open for Clever Hans: Simple features predict LLM benchmark answers

Lorenzo Pacchiardi

Marko Tesic

Lucy G. Cheke

José Hernández-Orallo

243

15 Oct 2024

ELF-Gym: Evaluating Large Language Models Generated Features for Tabular PredictionInternational Conference on Information and Knowledge Management (CIKM), 2024

135

13 Oct 2024

Co-occurrence is not Factual Association in Language ModelsNeural Information Processing Systems (NeurIPS), 2024

410

21 Sep 2024

Large Language Models and Cognitive Science: A Comprehensive Review of Similarities, Differences, and Challenges

Qian Niu

Junyu Liu

Ziqian Bi

Pohsun Feng

Benji Peng

...

Ming Li

Lawrence KQ Yan

Yichao Zhang

Caitlyn Heqi Yin

Cheng Fei

404

04 Sep 2024

Logistic Regression makes small LLMs strong and explainable "tens-of-shot" classifiers

Marcus Buckmann

Edward Hill

223

06 Aug 2024

Enhancing Retrieval and Managing Retrieval: A Four-Module Synergy for Improved Quality and Efficiency in RAG Systems

Xing Zi

Qiang Wu

296

15 Jul 2024

Source Code Summarization in the Era of Large Language Models

395

09 Jul 2024

ESALE: Enhancing Code-Summary Alignment Learning for Source Code Summarization

Quanjun Zhang

Bin Luo

Yang Liu

Zhenyu Chen

AI4TS

301

01 Jul 2024

ImageNet3D: Towards General-Purpose Object-Level 3D Understanding

Yaoyao Liu

Alan Yuille

VLM 3DV

270

13 Jun 2024

Conditional Language Learning with Context

X. Zhang

Chenyi Guo

Ji Wu

247

04 Jun 2024

Limits of Deep Learning: Sequence Modeling through the Lens of Complexity Theory

367

26 May 2024

ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token IdentificationNeural Information Processing Systems (NeurIPS), 2024

Jing Liu

Bohan Zhuang

295

23 May 2024

From Form(s) to Meaning: Probing the Semantic Depths of Language Models Using Multisense Consistency

293

18 Apr 2024

Defending Against Unforeseen Failure Modes with Latent Adversarial Training

Stephen Casper

Lennart Schulze

Oam Patel

Dylan Hadfield-Menell

AAML

702

08 Mar 2024

On the Challenges and Opportunities in Generative AI

...

756

28 Feb 2024

The Clever Hans Mirage: A Comprehensive Survey on Spurious Correlations in Machine Learning

...

529

20 Feb 2024

Language-Based Augmentation to Address Shortcut Learning in Object Goal Navigation

Dennis Hoftijzer

Gertjan J. Burghouts

Luuk J. Spreeuwers

233

07 Feb 2024

The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models

242

02 Feb 2024

Rethinking Interpretability in the Era of Large Language Models

296

106

30 Jan 2024

Black-Box Access is Insufficient for Rigorous AI AuditsConference on Fairness, Accountability and Transparency (FAccT), 2024

...

Dylan Hadfield-Menell

AAML

552

128

25 Jan 2024

Learning Shortcuts: On the Misleading Promise of NLU in Language Models

Geetanjali Bihani

Julia Taylor Rayz

262

17 Jan 2024

Fast and Efficient 2-bit LLM Inference on GPU: 2/4/16-bit in a Weight Matrix with Asynchronous DequantizationInternational Conference on Computer Aided Design (ICCAD), 2023

228

28 Nov 2023

Large Language Models in Law: A SurveyAI Open (AO), 2023

Wensheng Gan

Philip S. Yu

303

166

26 Nov 2023

Can ChatGPT Perform Reasoning Using the IRAC Method in Analyzing Legal Scenarios Like a Lawyer?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Patrick Charles Emerton

Genevieve Grant

LRM AILaw ELM

357

23 Oct 2023

Fool Your (Vision and) Language Model With Embarrassingly Simple PermutationsInternational Conference on Machine Learning (ICML), 2023

Timothy M. Hospedales

MLLM AAML LRM

283

02 Oct 2023

Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context LearningInternational Conference on Learning Representations (ICLR), 2023

428

01 Oct 2023

Mitigating Shortcuts in Language Models with Soft Label EncodingInternational Conference on Language Resources and Evaluation (LREC), 2023

Ninghao Liu

176

17 Sep 2023

Explainability for Large Language Models: A SurveyACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023

Haiyan Zhao

Hanjie Chen

Fan Yang

Ninghao Liu

456

706

02 Sep 2023

ExpeL: LLM Agents Are Experiential LearnersAAAI Conference on Artificial Intelligence (AAAI), 2023

Gao Huang

463

337

20 Aug 2023

Large Language Models and Knowledge Graphs: Opportunities and Challenges

...

285

115

11 Aug 2023

Large Language Models Can be Lazy Learners: Analyze Shortcuts in In-Context LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

300

26 May 2023