Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.06003
Cited By
Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization
8 October 2024
Wei Liu
Zhiying Deng
Zhongyu Niu
Jun Wang
Haozhao Wang
YuanKai Zhang
Ruixuan Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization"
2 / 2 papers shown
Title
Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets
W. Liu
Zhongyu Niu
Lang Gao
Zhiying Deng
Jun Wang
H. Wang
Ruixuan Li
55
1
0
04 May 2025
When Backdoors Speak: Understanding LLM Backdoor Attacks Through Model-Generated Explanations
Huaizhi Ge
Yiming Li
Qifan Wang
Yongfeng Zhang
Ruixiang Tang
AAML
SILM
72
0
0
19 Nov 2024
1