v1v2 (latest)

Shortcut Learning of Large Language Models in Natural Language Understanding

Communications of the ACM (CACM), 2022

25 August 2022

Papers citing "Shortcut Learning of Large Language Models in Natural Language Understanding"

50 / 62 papers shown

Title
Do AI Models Perform Human-like Abstract Reasoning Across Modalities? Claas Beger Ryan Yi Shuhao Fu A. Moskvichev Sarah W. Tsai Sivasankaran Rajamanickam Melanie Mitchell ReLM ELM LRM 219 0 1 02 Oct 2025
Detecting Regional Spurious Correlations in Vision Transformers via Token Discarding Solha Kang Esla Timothy Anzaku W. D. Neve Arnout Van Messem J. Vankerschaver Francois Rameau Utku Ozbulak ViT 122 0 0 04 Sep 2025
STREAM (ChemBio): A Standard for Transparently Reporting Evaluations in AI Model Reports Tegan McCaslin Jide Alaga Samira Nedungadi Seth Donoughe Tom Reed Rishi Bommasani Chris Painter Luca Righetti 186 2 0 13 Aug 2025
SCOPE: Stochastic and Counterbiased Option Placement for Evaluating Large Language Models Wonjun Jeong Dongseok Kim Taegkeun Whangbo 183 1 0 24 Jul 2025
Large Learning Rates Simultaneously Achieve Robustness to Spurious Correlations and Compressibility Melih Barsbey Lucas Prieto Stefanos Zafeiriou Tolga Birdal 208 0 0 23 Jul 2025
LoRA Users Beware: A Few Spurious Tokens Can Manipulate Your Finetuned Model Pradyut Sekhsaria Marcel Mateos Salles Hai Huang Randall Balestriero Randall Balestriero 268 1 0 13 Jun 2025
Physics-informed Temporal Alignment for Auto-regressive PDE Foundation Models Congcong Zhu Xiaoyan Xu Jiayue Han Jingrun Chen OOD AI4CE 364 0 0 16 May 2025
Benign Samples Matter! Fine-tuning On Outlier Benign Samples Severely Breaks Safety Zihan Guan Mengxuan Hu Ronghang Zhu Sheng Li Anil Vullikanti AAML 267 10 0 11 May 2025
MiMu: Mitigating Multiple Shortcut Learning Behavior of Transformers Lili Zhao Qi Liu Wei-neng Chen Xiaoou Liu R.-H. Sun Min Hou Yang Wang Shijin Wang 363 2 0 14 Apr 2025
Gradient Extrapolation for Debiased Representation Learning Ihab Asaad M. Shadaydeh Joachim Denzler 274 1 0 17 Mar 2025
DBR: Divergence-Based Regularization for Debiasing Natural Language Understanding ModelsSIGKDD Explorations (SIGKDD Explor.), 2025 Zihao Li Ruixiang Tang Lu Cheng Shuaiqiang Wang D. Yin Jundong Li 307 0 0 25 Feb 2025
Unveiling Scoring Processes: Dissecting the Differences between LLMs and Human Graders in Automatic ScoringTechnology, Knowledge and Learning (TKL), 2024 Xuansheng Wu Padmaja Pravin Saraf Gyeong-Geon Lee Ehsan Latif Ninghao Liu Xiaoming Zhai 289 23 0 24 Feb 2025
Show Me the Work: Fact-Checkers' Requirements for Explainable Automated Fact-CheckingInternational Conference on Human Factors in Computing Systems (CHI), 2025 Greta Warren Irina Shklovski Isabelle Augenstein OffRL 507 25 0 13 Feb 2025
Should Code Models Learn Pedagogically? A Preliminary Evaluation of Curriculum Learning for Real-World Software Engineering TasksIEEE Working Conference on Mining Software Repositories (MSR), 2025 Kyi Shin Khant Hong Yi Lin Patanamon Thongtanunam ELM 306 0 0 06 Feb 2025
On Adversarial Robustness of Language Models in Transfer Learning Bohdan Turbal Anastasiia Mazur Jiaxu Zhao Mykola Pechenizkiy AAML 306 0 0 29 Dec 2024
Boosting LLM-based Relevance Modeling with Distribution-Aware Robust LearningInternational Conference on Information and Knowledge Management (CIKM), 2024 Hong Liu Saisai Gong Yixin Ji Hai Ye Jia Xu Jinjie Gu 317 5 0 17 Dec 2024
On the Shortcut Learning in Multilingual Neural Machine Translation Wenxuan Wang Wenxiang Jiao Shu Yang Zhaopeng Tu Michael R. Lyu 849 2 0 15 Nov 2024
Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers Lam Nguyen Tung Steven Cho Xiaoning Du Neelofar Neelofar Valerio Terragni Stefano Ruberto Aldeida Aleti 1.0K 2 0 30 Oct 2024
Large Language Model Benchmarks in Medical Tasks Lawrence K. Q. Yan Ming Li Yujiao Shi Cheng Fei Cheng Fei ... Junyu Liu Xinyuan Song Riyang Bao Zekun Jiang Ziyuan Qin LM&MA AI4MH 531 16 0 28 Oct 2024
Leaving the barn door open for Clever Hans: Simple features predict LLM benchmark answers Lorenzo Pacchiardi Marko Tesic Lucy G. Cheke José Hernández-Orallo 198 3 0 15 Oct 2024
ELF-Gym: Evaluating Large Language Models Generated Features for Tabular PredictionInternational Conference on Information and Knowledge Management (CIKM), 2024 Yanlin Zhang Ning Li Quan Gan Weinan Zhang David Wipf Minjie Wang 123 4 0 13 Oct 2024
Co-occurrence is not Factual Association in Language ModelsNeural Information Processing Systems (NeurIPS), 2024 Xiao Zhang Chenyi Guo Ji Wu KELM 346 8 0 21 Sep 2024
Large Language Models and Cognitive Science: A Comprehensive Review of Similarities, Differences, and Challenges Qian Niu Junyu Liu Ziqian Bi Pohsun Feng Benji Peng ... Ming Li Lawrence KQ Yan Yichao Zhang Caitlyn Heqi Yin Cheng Fei 300 42 0 04 Sep 2024
Logistic Regression makes small LLMs strong and explainable "tens-of-shot" classifiers Marcus Buckmann Edward Hill 175 4 0 06 Aug 2024
Enhancing Retrieval and Managing Retrieval: A Four-Module Synergy for Improved Quality and Efficiency in RAG Systems Yunxiao Shi Xing Zi Zijing Shi Haimin Zhang Qiang Wu Min Xu 228 20 0 15 Jul 2024
Source Code Summarization in the Era of Large Language Models Weisong Sun Yun Miao Yuekang Li Hongyu Zhang Chunrong Fang Yi Liu Gelei Deng Yang Liu Zhenyu Chen ELM 337 42 0 09 Jul 2024
ESALE: Enhancing Code-Summary Alignment Learning for Source Code Summarization Chunrong Fang Weisong Sun Yuchen Chen Xiao Chen Zhao Wei Quanjun Zhang Yudu You Bin Luo Yang Liu Zhenyu Chen AI4TS 249 20 0 01 Jul 2024
ImageNet3D: Towards General-Purpose Object-Level 3D Understanding Wufei Ma Guanning Zeng Guofeng Zhang Qihao Liu Letian Zhang Adam Kortylewski Yaoyao Liu Alan Yuille VLM 3DV 202 15 0 13 Jun 2024
Conditional Language Learning with Context X. Zhang Chenyi Guo Ji Wu 175 5 0 04 Jun 2024
Limits of Deep Learning: Sequence Modeling through the Lens of Complexity Theory Nikola Zubić Federico Soldá Aurelio Sulser Davide Scaramuzza LRM BDL 311 15 0 26 May 2024
ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token IdentificationNeural Information Processing Systems (NeurIPS), 2024 Yefei He Luoming Zhang Weijia Wu Jing Liu Hong Zhou Bohan Zhuang MQ 255 47 0 23 May 2024
From Form(s) to Meaning: Probing the Semantic Depths of Language Models Using Multisense Consistency Xenia Ohmer Elia Bruni Dieuwke Hupkes AI4CE 241 9 0 18 Apr 2024
Defending Against Unforeseen Failure Modes with Latent Adversarial Training Stephen Casper Lennart Schulze Oam Patel Dylan Hadfield-Menell AAML 588 56 0 08 Mar 2024
On the Challenges and Opportunities in Generative AI Laura Manduchi Kushagra Pandey Kushagra Pandey Robert Bamler Sina Daubener ... Yixin Wang F. Wenzel Frank Wood Stephan Mandt Vincent Fortuin 662 40 0 28 Feb 2024
The Clever Hans Mirage: A Comprehensive Survey on Spurious Correlations in Machine Learning Wenqian Ye Luyang Jiang Xu Cao Guangtao Zheng Yunsheng Ma ... Huang Ziran Wang James M. Rehg Henry Kautz Andrew Gordon Wilson OOD AAML CML 465 58 0 20 Feb 2024
Language-Based Augmentation to Address Shortcut Learning in Object Goal Navigation Dennis Hoftijzer Gertjan J. Burghouts Luuk J. Spreeuwers 201 3 0 07 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models M. Pternea Prerna Singh Abir Chakraborty Y. Oruganti M. Milletarí Sayli Bapat Kebei Jiang OffRL 182 23 0 02 Feb 2024
Rethinking Interpretability in the Era of Large Language Models Chandan Singh J. Inala Michel Galley Rich Caruana Jianfeng Gao LRM AI4CE 236 101 0 30 Jan 2024
Black-Box Access is Insufficient for Rigorous AI AuditsConference on Fairness, Accountability and Transparency (FAccT), 2024 Stephen Casper Carson Ezell Charlotte Siegmann Noam Kolt Taylor Lynn Curtis ... Michael Gerovitch David Bau Max Tegmark David M. Krueger Dylan Hadfield-Menell AAML 440 124 0 25 Jan 2024
Learning Shortcuts: On the Misleading Promise of NLU in Language Models Geetanjali Bihani Julia Taylor Rayz 198 4 0 17 Jan 2024
Fast and Efficient 2-bit LLM Inference on GPU: 2/4/16-bit in a Weight Matrix with Asynchronous DequantizationInternational Conference on Computer Aided Design (ICCAD), 2023 Jinhao Li Jiaming Xu Shiyao Li Shan Huang Jun Liu Yaoxiu Lian Guohao Dai MQ 138 10 0 28 Nov 2023
Large Language Models in Law: A SurveyAI Open (AO), 2023 Jinqi Lai Wensheng Gan Jiayang Wu Zhenlian Qi Philip S. Yu ELM AILaw 259 152 0 26 Nov 2023
Can ChatGPT Perform Reasoning Using the IRAC Method in Analyzing Legal Scenarios Like a Lawyer?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023 Xiaoxi Kang Zhuang Li Lay-Ki Soon Adnan Trakic Terry Yue Zhuo Patrick Charles Emerton Genevieve Grant LRM AILaw ELM 309 16 0 23 Oct 2023
Fool Your (Vision and) Language Model With Embarrassingly Simple PermutationsInternational Conference on Machine Learning (ICML), 2023 Yongshuo Zong Tingyang Yu Ruchika Chavhan Bingchen Zhao Timothy M. Hospedales MLLM AAML LRM 211 25 0 02 Oct 2023
Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context LearningInternational Conference on Learning Representations (ICLR), 2023 Mustafa Shukor Alexandre Ramé Corentin Dancette Matthieu Cord LRM MLLM 324 26 0 01 Oct 2023
Mitigating Shortcuts in Language Models with Soft Label EncodingInternational Conference on Language Resources and Evaluation (LREC), 2023 Zirui He Huiqi Deng Haiyan Zhao Ninghao Liu Jundong Li 124 2 0 17 Sep 2023
Explainability for Large Language Models: A SurveyACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023 Haiyan Zhao Hanjie Chen Fan Yang Ninghao Liu Huiqi Deng Hengyi Cai Shuaiqiang Wang D. Yin Jundong Li LRM 361 675 0 02 Sep 2023
ExpeL: LLM Agents Are Experiential LearnersAAAI Conference on Artificial Intelligence (AAAI), 2023 Andrew Zhao Daniel Huang Quentin Xu Matthieu Lin Wenshu Fan Gao Huang LLMAG 391 325 0 20 Aug 2023
Large Language Models and Knowledge Graphs: Opportunities and Challenges Jeff Z. Pan Simon Razniewski Jan-Christoph Kalo Sneha Singhania Jiaoyan Chen ... Gerard de Melo A. Bonifati Edlira Vakaj M. Dragoni D. Graux KELM 229 110 0 11 Aug 2023
Large Language Models Can be Lazy Learners: Analyze Shortcuts in In-Context LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Ruixiang Tang Dehan Kong Lo-li Huang Hui Xue 244 73 0 26 May 2023

All Papers

Shortcut Learning of Large Language Models in Natural Language Understanding

Papers citing "Shortcut Learning of Large Language Models in Natural Language Understanding"