Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization

8 October 2024

Wei Liu

Jun Wang

Haozhao Wang

YuanKai Zhang

Ruixuan Li

Papers citing "Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization"

Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets
W. Liu, Zhongyu Niu, Lang Gao, Zhiying Deng, Jun Wang, H. Wang, Ruixuan Li
04 May 2025

When Backdoors Speak: Understanding LLM Backdoor Attacks Through Model-Generated Explanations
Huaizhi Ge, Yiming Li, Qifan Wang, Yongfeng Zhang, Ruixiang Tang
AAML SILM
19 Nov 2024