Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2208.11857
Cited By
v1
v2 (latest)
Shortcut Learning of Large Language Models in Natural Language Understanding
Communications of the ACM (CACM), 2022
25 August 2022
Mengnan Du
Fengxiang He
Na Zou
Dacheng Tao
Helen Zhou
KELM
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Shortcut Learning of Large Language Models in Natural Language Understanding"
50 / 62 papers shown
Title
Do AI Models Perform Human-like Abstract Reasoning Across Modalities?
Claas Beger
Ryan Yi
Shuhao Fu
A. Moskvichev
Sarah W. Tsai
Sivasankaran Rajamanickam
Melanie Mitchell
ReLM
ELM
LRM
219
0
1
02 Oct 2025
Detecting Regional Spurious Correlations in Vision Transformers via Token Discarding
Solha Kang
Esla Timothy Anzaku
W. D. Neve
Arnout Van Messem
J. Vankerschaver
Francois Rameau
Utku Ozbulak
ViT
122
0
0
04 Sep 2025
STREAM (ChemBio): A Standard for Transparently Reporting Evaluations in AI Model Reports
Tegan McCaslin
Jide Alaga
Samira Nedungadi
Seth Donoughe
Tom Reed
Rishi Bommasani
Chris Painter
Luca Righetti
186
2
0
13 Aug 2025
SCOPE: Stochastic and Counterbiased Option Placement for Evaluating Large Language Models
Wonjun Jeong
Dongseok Kim
Taegkeun Whangbo
183
1
0
24 Jul 2025
Large Learning Rates Simultaneously Achieve Robustness to Spurious Correlations and Compressibility
Melih Barsbey
Lucas Prieto
Stefanos Zafeiriou
Tolga Birdal
208
0
0
23 Jul 2025
LoRA Users Beware: A Few Spurious Tokens Can Manipulate Your Finetuned Model
Pradyut Sekhsaria
Marcel Mateos Salles
Hai Huang
Randall Balestriero
Randall Balestriero
268
1
0
13 Jun 2025
Physics-informed Temporal Alignment for Auto-regressive PDE Foundation Models
Congcong Zhu
Xiaoyan Xu
Jiayue Han
Jingrun Chen
OOD
AI4CE
364
0
0
16 May 2025
Benign Samples Matter! Fine-tuning On Outlier Benign Samples Severely Breaks Safety
Zihan Guan
Mengxuan Hu
Ronghang Zhu
Sheng Li
Anil Vullikanti
AAML
267
10
0
11 May 2025
MiMu: Mitigating Multiple Shortcut Learning Behavior of Transformers
Lili Zhao
Qi Liu
Wei-neng Chen
Xiaoou Liu
R.-H. Sun
Min Hou
Yang Wang
Shijin Wang
363
2
0
14 Apr 2025
Gradient Extrapolation for Debiased Representation Learning
Ihab Asaad
M. Shadaydeh
Joachim Denzler
274
1
0
17 Mar 2025
DBR: Divergence-Based Regularization for Debiasing Natural Language Understanding Models
SIGKDD Explorations (SIGKDD Explor.), 2025
Zihao Li
Ruixiang Tang
Lu Cheng
Shuaiqiang Wang
D. Yin
Jundong Li
307
0
0
25 Feb 2025
Unveiling Scoring Processes: Dissecting the Differences between LLMs and Human Graders in Automatic Scoring
Technology, Knowledge and Learning (TKL), 2024
Xuansheng Wu
Padmaja Pravin Saraf
Gyeong-Geon Lee
Ehsan Latif
Ninghao Liu
Xiaoming Zhai
289
23
0
24 Feb 2025
Show Me the Work: Fact-Checkers' Requirements for Explainable Automated Fact-Checking
International Conference on Human Factors in Computing Systems (CHI), 2025
Greta Warren
Irina Shklovski
Isabelle Augenstein
OffRL
507
25
0
13 Feb 2025
Should Code Models Learn Pedagogically? A Preliminary Evaluation of Curriculum Learning for Real-World Software Engineering Tasks
IEEE Working Conference on Mining Software Repositories (MSR), 2025
Kyi Shin Khant
Hong Yi Lin
Patanamon Thongtanunam
ELM
306
0
0
06 Feb 2025
On Adversarial Robustness of Language Models in Transfer Learning
Bohdan Turbal
Anastasiia Mazur
Jiaxu Zhao
Mykola Pechenizkiy
AAML
306
0
0
29 Dec 2024
Boosting LLM-based Relevance Modeling with Distribution-Aware Robust Learning
International Conference on Information and Knowledge Management (CIKM), 2024
Hong Liu
Saisai Gong
Yixin Ji
Hai Ye
Jia Xu
Jinjie Gu
317
5
0
17 Dec 2024
On the Shortcut Learning in Multilingual Neural Machine Translation
Wenxuan Wang
Wenxiang Jiao
Shu Yang
Zhaopeng Tu
Michael R. Lyu
849
2
0
15 Nov 2024
Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers
Lam Nguyen Tung
Steven Cho
Xiaoning Du
Neelofar Neelofar
Valerio Terragni
Stefano Ruberto
Aldeida Aleti
1.0K
2
0
30 Oct 2024
Large Language Model Benchmarks in Medical Tasks
Lawrence K. Q. Yan
Ming Li
Yujiao Shi
Cheng Fei
Cheng Fei
...
Junyu Liu
Xinyuan Song
Riyang Bao
Zekun Jiang
Ziyuan Qin
LM&MA
AI4MH
531
16
0
28 Oct 2024
Leaving the barn door open for Clever Hans: Simple features predict LLM benchmark answers
Lorenzo Pacchiardi
Marko Tesic
Lucy G. Cheke
José Hernández-Orallo
198
3
0
15 Oct 2024
ELF-Gym: Evaluating Large Language Models Generated Features for Tabular Prediction
International Conference on Information and Knowledge Management (CIKM), 2024
Yanlin Zhang
Ning Li
Quan Gan
Weinan Zhang
David Wipf
Minjie Wang
123
4
0
13 Oct 2024
Co-occurrence is not Factual Association in Language Models
Neural Information Processing Systems (NeurIPS), 2024
Xiao Zhang
Chenyi Guo
Ji Wu
KELM
346
8
0
21 Sep 2024
Large Language Models and Cognitive Science: A Comprehensive Review of Similarities, Differences, and Challenges
Qian Niu
Junyu Liu
Ziqian Bi
Pohsun Feng
Benji Peng
...
Ming Li
Lawrence KQ Yan
Yichao Zhang
Caitlyn Heqi Yin
Cheng Fei
300
42
0
04 Sep 2024
Logistic Regression makes small LLMs strong and explainable "tens-of-shot" classifiers
Marcus Buckmann
Edward Hill
175
4
0
06 Aug 2024
Enhancing Retrieval and Managing Retrieval: A Four-Module Synergy for Improved Quality and Efficiency in RAG Systems
Yunxiao Shi
Xing Zi
Zijing Shi
Haimin Zhang
Qiang Wu
Min Xu
228
20
0
15 Jul 2024
Source Code Summarization in the Era of Large Language Models
Weisong Sun
Yun Miao
Yuekang Li
Hongyu Zhang
Chunrong Fang
Yi Liu
Gelei Deng
Yang Liu
Zhenyu Chen
ELM
337
42
0
09 Jul 2024
ESALE: Enhancing Code-Summary Alignment Learning for Source Code Summarization
Chunrong Fang
Weisong Sun
Yuchen Chen
Xiao Chen
Zhao Wei
Quanjun Zhang
Yudu You
Bin Luo
Yang Liu
Zhenyu Chen
AI4TS
249
20
0
01 Jul 2024
ImageNet3D: Towards General-Purpose Object-Level 3D Understanding
Wufei Ma
Guanning Zeng
Guofeng Zhang
Qihao Liu
Letian Zhang
Adam Kortylewski
Yaoyao Liu
Alan Yuille
VLM
3DV
202
15
0
13 Jun 2024
Conditional Language Learning with Context
X. Zhang
Chenyi Guo
Ji Wu
175
5
0
04 Jun 2024
Limits of Deep Learning: Sequence Modeling through the Lens of Complexity Theory
Nikola Zubić
Federico Soldá
Aurelio Sulser
Davide Scaramuzza
LRM
BDL
311
15
0
26 May 2024
ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification
Neural Information Processing Systems (NeurIPS), 2024
Yefei He
Luoming Zhang
Weijia Wu
Jing Liu
Hong Zhou
Bohan Zhuang
MQ
255
47
0
23 May 2024
From Form(s) to Meaning: Probing the Semantic Depths of Language Models Using Multisense Consistency
Xenia Ohmer
Elia Bruni
Dieuwke Hupkes
AI4CE
241
9
0
18 Apr 2024
Defending Against Unforeseen Failure Modes with Latent Adversarial Training
Stephen Casper
Lennart Schulze
Oam Patel
Dylan Hadfield-Menell
AAML
588
56
0
08 Mar 2024
On the Challenges and Opportunities in Generative AI
Laura Manduchi
Kushagra Pandey
Kushagra Pandey
Robert Bamler
Sina Daubener
...
Yixin Wang
F. Wenzel
Frank Wood
Stephan Mandt
Vincent Fortuin
662
40
0
28 Feb 2024
The Clever Hans Mirage: A Comprehensive Survey on Spurious Correlations in Machine Learning
Wenqian Ye
Luyang Jiang
Xu Cao
Guangtao Zheng
Yunsheng Ma
...
Huang
Ziran Wang
James M. Rehg
Henry Kautz
Andrew Gordon Wilson
OOD
AAML
CML
465
58
0
20 Feb 2024
Language-Based Augmentation to Address Shortcut Learning in Object Goal Navigation
Dennis Hoftijzer
Gertjan J. Burghouts
Luuk J. Spreeuwers
201
3
0
07 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
182
23
0
02 Feb 2024
Rethinking Interpretability in the Era of Large Language Models
Chandan Singh
J. Inala
Michel Galley
Rich Caruana
Jianfeng Gao
LRM
AI4CE
236
101
0
30 Jan 2024
Black-Box Access is Insufficient for Rigorous AI Audits
Conference on Fairness, Accountability and Transparency (FAccT), 2024
Stephen Casper
Carson Ezell
Charlotte Siegmann
Noam Kolt
Taylor Lynn Curtis
...
Michael Gerovitch
David Bau
Max Tegmark
David M. Krueger
Dylan Hadfield-Menell
AAML
440
124
0
25 Jan 2024
Learning Shortcuts: On the Misleading Promise of NLU in Language Models
Geetanjali Bihani
Julia Taylor Rayz
198
4
0
17 Jan 2024
Fast and Efficient 2-bit LLM Inference on GPU: 2/4/16-bit in a Weight Matrix with Asynchronous Dequantization
International Conference on Computer Aided Design (ICCAD), 2023
Jinhao Li
Jiaming Xu
Shiyao Li
Shan Huang
Jun Liu
Yaoxiu Lian
Guohao Dai
MQ
138
10
0
28 Nov 2023
Large Language Models in Law: A Survey
AI Open (AO), 2023
Jinqi Lai
Wensheng Gan
Jiayang Wu
Zhenlian Qi
Philip S. Yu
ELM
AILaw
259
152
0
26 Nov 2023
Can ChatGPT Perform Reasoning Using the IRAC Method in Analyzing Legal Scenarios Like a Lawyer?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xiaoxi Kang
Zhuang Li
Lay-Ki Soon
Adnan Trakic
Terry Yue Zhuo
Patrick Charles Emerton
Genevieve Grant
LRM
AILaw
ELM
309
16
0
23 Oct 2023
Fool Your (Vision and) Language Model With Embarrassingly Simple Permutations
International Conference on Machine Learning (ICML), 2023
Yongshuo Zong
Tingyang Yu
Ruchika Chavhan
Bingchen Zhao
Timothy M. Hospedales
MLLM
AAML
LRM
211
25
0
02 Oct 2023
Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context Learning
International Conference on Learning Representations (ICLR), 2023
Mustafa Shukor
Alexandre Ramé
Corentin Dancette
Matthieu Cord
LRM
MLLM
324
26
0
01 Oct 2023
Mitigating Shortcuts in Language Models with Soft Label Encoding
International Conference on Language Resources and Evaluation (LREC), 2023
Zirui He
Huiqi Deng
Haiyan Zhao
Ninghao Liu
Jundong Li
124
2
0
17 Sep 2023
Explainability for Large Language Models: A Survey
ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023
Haiyan Zhao
Hanjie Chen
Fan Yang
Ninghao Liu
Huiqi Deng
Hengyi Cai
Shuaiqiang Wang
D. Yin
Jundong Li
LRM
361
675
0
02 Sep 2023
ExpeL: LLM Agents Are Experiential Learners
AAAI Conference on Artificial Intelligence (AAAI), 2023
Andrew Zhao
Daniel Huang
Quentin Xu
Matthieu Lin
Wenshu Fan
Gao Huang
LLMAG
391
325
0
20 Aug 2023
Large Language Models and Knowledge Graphs: Opportunities and Challenges
Jeff Z. Pan
Simon Razniewski
Jan-Christoph Kalo
Sneha Singhania
Jiaoyan Chen
...
Gerard de Melo
A. Bonifati
Edlira Vakaj
M. Dragoni
D. Graux
KELM
229
110
0
11 Aug 2023
Large Language Models Can be Lazy Learners: Analyze Shortcuts in In-Context Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Ruixiang Tang
Dehan Kong
Lo-li Huang
Hui Xue
244
73
0
26 May 2023
1
2
Next