2208.11857
Cited By
Shortcut Learning of Large Language Models in Natural Language Understanding
25 August 2022
Mengnan Du
Fengxiang He
Na Zou
Dacheng Tao
Xia Hu
KELM
OffRL
Papers citing
"Shortcut Learning of Large Language Models in Natural Language Understanding"
50 / 62 papers shown
Title
Benign Samples Matter! Fine-tuning On Outlier Benign Samples Severely Breaks Safety
Zihan Guan
Mengxuan Hu
Ronghang Zhu
Sheng R. Li
Anil Vullikanti
AAML
13
0
0
11 May 2025
Gradient Extrapolation for Debiased Representation Learning
Ihab Asaad
M. Shadaydeh
Joachim Denzler
31
0
0
17 Mar 2025
DBR: Divergence-Based Regularization for Debiasing Natural Language Understanding Models
Zihao Li
Ruixiang Tang
Lu Cheng
S. Wang
Dawei Yin
Mengnan Du
66
0
0
25 Feb 2025
Unveiling Scoring Processes: Dissecting the Differences between LLMs and Human Graders in Automatic Scoring
Xuansheng Wu
Padmaja Pravin Saraf
Gyeong-Geon Lee
Ehsan Latif
Ninghao Liu
Xiaoming Zhai
53
4
0
24 Feb 2025
Show Me the Work: Fact-Checkers' Requirements for Explainable Automated Fact-Checking
Greta Warren
Irina Shklovski
Isabelle Augenstein
OffRL
62
4
0
13 Feb 2025
On Adversarial Robustness of Language Models in Transfer Learning
Bohdan Turbal
Anastasiia Mazur
Jiaxu Zhao
Mykola Pechenizkiy
AAML
38
0
0
03 Jan 2025
On the Shortcut Learning in Multilingual Neural Machine Translation
Wenxuan Wang
Wenxiang Jiao
Jen-tse Huang
Zhaopeng Tu
Michael R. Lyu
31
0
0
15 Nov 2024
Large Language Model Benchmarks in Medical Tasks
Lawrence K. Q. Yan
Ming Li
Y. Zhang
Caitlyn Heqi Yin
Cheng Fei
...
Ziqian Bi
Pohsun Feng
Keyu Chen
Junyu Liu
Qian Niu
LM&MA
AI4MH
51
4
0
28 Oct 2024
Leaving the barn door open for Clever Hans: Simple features predict LLM benchmark answers
Lorenzo Pacchiardi
Marko Tesic
Lucy G. Cheke
José Hernández Orallo
31
3
0
15 Oct 2024
ELF-Gym: Evaluating Large Language Models Generated Features for Tabular Prediction
Yanlin Zhang
Ning Li
Quan Gan
W. Zhang
David Wipf
Minjie Wang
11
0
0
13 Oct 2024
Co-occurrence is not Factual Association in Language Models
Xiao Zhang
Miao Li
Ji Wu
KELM
59
2
0
21 Sep 2024
Large Language Models and Cognitive Science: A Comprehensive Review of Similarities, Differences, and Challenges
Qian Niu
Junyu Liu
Ziqian Bi
Pohsun Feng
Benji Peng
...
Ming Li
Lawrence KQ Yan
Yichao Zhang
Caitlyn Heqi Yin
Cheng Fei
32
13
0
04 Sep 2024
Logistic Regression makes small LLMs strong and explainable "tens-of-shot" classifiers
Marcus Buckmann
Edward Hill
16
1
0
06 Aug 2024
Enhancing Retrieval and Managing Retrieval: A Four-Module Synergy for Improved Quality and Efficiency in RAG Systems
Yunxiao Shi
Xing Zi
Zijing Shi
Haimin Zhang
Qiang Wu
Min Xu
21
7
0
15 Jul 2024
Source Code Summarization in the Era of Large Language Models
Weisong Sun
Yun Miao
Yuekang Li
Hongyu Zhang
Chunrong Fang
Yi Liu
Gelei Deng
Yang Liu
Zhenyu Chen
ELM
34
11
0
09 Jul 2024
ESALE: Enhancing Code-Summary Alignment Learning for Source Code Summarization
Chunrong Fang
Weisong Sun
Yuchen Chen
Xiao Chen
Zhao Wei
Quanjun Zhang
Yudu You
Bin Luo
Yang Liu
Zhenyu Chen
AI4TS
27
3
0
01 Jul 2024
ImageNet3D: Towards General-Purpose Object-Level 3D Understanding
Wufei Ma
Guanning Zeng
Guofeng Zhang
Qihao Liu
Letian Zhang
Adam Kortylewski
Yaoyao Liu
Alan Yuille
VLM
3DV
26
7
0
13 Jun 2024
Conditional Language Learning with Context
X. Zhang
Miao Li
Ji Wu
36
1
0
04 Jun 2024
Limits of Deep Learning: Sequence Modeling through the Lens of Complexity Theory
Nikola Zubić
Federico Soldá
Aurelio Sulser
Davide Scaramuzza
LRM
BDL
37
5
0
26 May 2024
ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification
Yefei He
Luoming Zhang
Weijia Wu
Jing Liu
Hong Zhou
Bohan Zhuang
MQ
35
24
0
23 May 2024
From Form(s) to Meaning: Probing the Semantic Depths of Language Models Using Multisense Consistency
Xenia Ohmer
Elia Bruni
Dieuwke Hupkes
AI4CE
23
6
0
18 Apr 2024
Defending Against Unforeseen Failure Modes with Latent Adversarial Training
Stephen Casper
Lennart Schulze
Oam Patel
Dylan Hadfield-Menell
AAML
49
27
0
08 Mar 2024
On the Challenges and Opportunities in Generative AI
Laura Manduchi
Kushagra Pandey
Robert Bamler
Ryan Cotterell
Sina Daubener
...
F. Wenzel
Frank Wood
Stephan Mandt
Vincent Fortuin
42
17
0
28 Feb 2024
Spurious Correlations in Machine Learning: A Survey
Wenqian Ye
Guangtao Zheng
Xu Cao
Yunsheng Ma
Aidong Zhang
OOD
AAML
CML
23
33
0
20 Feb 2024
Language-Based Augmentation to Address Shortcut Learning in Object Goal Navigation
Dennis Hoftijzer
Gertjan J. Burghouts
Luuk J. Spreeuwers
8
1
0
07 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
6
6
0
02 Feb 2024
Rethinking Interpretability in the Era of Large Language Models
Chandan Singh
J. Inala
Michel Galley
Rich Caruana
Jianfeng Gao
LRM
AI4CE
75
59
0
30 Jan 2024
Black-Box Access is Insufficient for Rigorous AI Audits
Stephen Casper
Carson Ezell
Charlotte Siegmann
Noam Kolt
Taylor Lynn Curtis
...
Michael Gerovitch
David Bau
Max Tegmark
David M. Krueger
Dylan Hadfield-Menell
AAML
13
75
0
25 Jan 2024
Learning Shortcuts: On the Misleading Promise of NLU in Language Models
Geetanjali Bihani
Julia Taylor Rayz
12
3
0
17 Jan 2024
Fast and Efficient 2-bit LLM Inference on GPU: 2/4/16-bit in a Weight Matrix with Asynchronous Dequantization
Jinhao Li
Jiaming Xu
Shiyao Li
Shan Huang
Jun Liu
Yaoxiu Lian
Guohao Dai
MQ
13
1
0
28 Nov 2023
Large Language Models in Law: A Survey
Jinqi Lai
Wensheng Gan
Jiayang Wu
Zhenlian Qi
Philip S. Yu
ELM
AILaw
10
69
0
26 Nov 2023
Can ChatGPT Perform Reasoning Using the IRAC Method in Analyzing Legal Scenarios Like a Lawyer?
Xiaoxi Kang
Lizhen Qu
Lay-Ki Soon
Adnan Trakic
Terry Yue Zhuo
Patrick Charles Emerton
Genevieve Grant
LRM
AILaw
ELM
101
13
0
23 Oct 2023
Fool Your (Vision and) Language Model With Embarrassingly Simple Permutations
Yongshuo Zong
Tingyang Yu
Ruchika Chavhan
Bingchen Zhao
Timothy M. Hospedales
MLLM
AAML
LRM
16
17
0
02 Oct 2023
Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context Learning
Mustafa Shukor
Alexandre Ramé
Corentin Dancette
Matthieu Cord
LRM
MLLM
27
20
0
01 Oct 2023
Mitigating Shortcuts in Language Models with Soft Label Encoding
Zirui He
Huiqi Deng
Haiyan Zhao
Ninghao Liu
Mengnan Du
8
2
0
17 Sep 2023
Explainability for Large Language Models: A Survey
Haiyan Zhao
Hanjie Chen
Fan Yang
Ninghao Liu
Huiqi Deng
Hengyi Cai
Shuaiqiang Wang
Dawei Yin
Mengnan Du
LRM
19
324
0
02 Sep 2023
ExpeL: LLM Agents Are Experiential Learners
Andrew Zhao
Daniel Huang
Quentin Xu
Matthieu Lin
Y. Liu
Gao Huang
LLMAG
17
192
0
20 Aug 2023
Large Language Models and Knowledge Graphs: Opportunities and Challenges
Jeff Z. Pan
Simon Razniewski
Jan-Christoph Kalo
Sneha Singhania
Jiaoyan Chen
...
Gerard de Melo
A. Bonifati
Edlira Vakaj
M. Dragoni
D. Graux
KELM
25
71
0
11 Aug 2023
Large Language Models Can be Lazy Learners: Analyze Shortcuts in In-Context Learning
Ruixiang Tang
Dehan Kong
Longtao Huang
Hui Xue
17
33
0
26 May 2023
Controlling Learned Effects to Reduce Spurious Correlations in Text Classifiers
Parikshit Bansal
Amit Sharma
CML
11
5
0
26 May 2023
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
Jingfeng Yang
Hongye Jin
Ruixiang Tang
Xiaotian Han
Qizhang Feng
Haoming Jiang
Bing Yin
Xia Hu
LM&MA
123
593
0
26 Apr 2023
Language Model Behavior: A Comprehensive Survey
Tyler A. Chang
Benjamin Bergen
VLM
LRM
LM&MA
16
100
0
20 Mar 2023
Testing AI on language comprehension tasks reveals insensitivity to underlying meaning
Vittoria Dentella
Fritz Guenther
Elliot Murphy
G. Marcus
Evelina Leivada
ELM
17
24
0
23 Feb 2023
Does Deep Learning Learn to Abstract? A Systematic Probing Framework
Shengnan An
Zeqi Lin
B. Chen
Qiang Fu
Nanning Zheng
Jian-Guang Lou
21
4
0
23 Feb 2023
DISCO: Distilling Counterfactuals with Large Language Models
Zeming Chen
Qiyue Gao
Antoine Bosselut
Ashish Sabharwal
Kyle Richardson
14
25
0
20 Dec 2022
Feature-Level Debiased Natural Language Understanding
Yougang Lyu
Piji Li
Yechang Yang
Maarten de Rijke
Pengjie Ren
Yukun Zhao
Dawei Yin
Z. Ren
17
10
0
11 Dec 2022
Can Language Representation Models Think in Bets?
Zhi-Bin Tang
M. Kejriwal
8
6
0
14 Oct 2022
Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners
Seonghyeon Ye
Doyoung Kim
Joel Jang
Joongbo Shin
Minjoon Seo
FedML
VLM
UQCV
LRM
11
25
0
06 Oct 2022
A Survey on Measuring and Mitigating Reasoning Shortcuts in Machine Reading Comprehension
Xanh Ho
Johannes Mario Meissner
Saku Sugawara
Akiko Aizawa
OffRL
22
3
0
05 Sep 2022
Rectify ViT Shortcut Learning by Visual Saliency
Chong Ma
Lin Zhao
Yuzhong Chen
David Liu
Xi Jiang
Tuo Zhang
Xintao Hu
Dinggang Shen
Dajiang Zhu
Tianming Liu
ViT
14
20
0
17 Jun 2022