Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.04103
Cited By
Enhancing the Rationale-Input Alignment for Self-explaining Rationalization
7 December 2023
Wei Liu
Haozhao Wang
Jun Wang
Zhiying Deng
Yuankai Zhang
Chengwei Wang
Ruixuan Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Enhancing the Rationale-Input Alignment for Self-explaining Rationalization"
11 / 11 papers shown
Title
Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets
W. Liu
Zhongyu Niu
Lang Gao
Zhiying Deng
Jun Wang
H. Wang
Ruixuan Li
46
1
0
04 May 2025
Breaking Free from MMI: A New Frontier in Rationalization by Probing Input Utilization
W. Liu
Zhiying Deng
Zhongyu Niu
Jun Wang
Haozhao Wang
Zhigang Zeng
Ruixuan Li
31
2
0
08 Mar 2025
When Backdoors Speak: Understanding LLM Backdoor Attacks Through Model-Generated Explanations
Huaizhi Ge
Yiming Li
Qifan Wang
Yongfeng Zhang
Ruixiang Tang
AAML
SILM
72
0
0
19 Nov 2024
Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization
Wei Liu
Zhiying Deng
Zhongyu Niu
Jun Wang
Haozhao Wang
YuanKai Zhang
Ruixuan Li
20
4
0
08 Oct 2024
MARE: Multi-Aspect Rationale Extractor on Unsupervised Rationale Extraction
Han Jiang
Junwen Duan
Zhe Qu
Jianxin Wang
19
1
0
04 Oct 2024
Adversarial Attack for Explanation Robustness of Rationalization Models
Yuankai Zhang
Lingxiao Kong
Haozhao Wang
Ruixuan Li
Jun Wang
Yuhua Li
Wei Liu
AAML
22
1
0
20 Aug 2024
Give Me More Details: Improving Fact-Checking with Latent Retrieval
Xuming Hu
Guan-Huei Wu
Zhijiang Guo
Philip S. Yu
HILM
17
4
0
25 May 2023
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,261
0
28 Jan 2022
Invariant Rationalization
Shiyu Chang
Yang Zhang
Mo Yu
Tommi Jaakkola
179
197
0
22 Mar 2020
Bi-level Actor-Critic for Multi-agent Coordination
Haifeng Zhang
Weizhe Chen
Zeren Huang
Minne Li
Yaodong Yang
Weinan Zhang
Jun Wang
94
90
0
08 Sep 2019
Learning Attitudes and Attributes from Multi-Aspect Reviews
Julian McAuley
J. Leskovec
Dan Jurafsky
183
292
0
15 Oct 2012
1