ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.08220
  4. Cited By
Understanding Neural Networks through Representation Erasure

Understanding Neural Networks through Representation Erasure

24 December 2016
Jiwei Li
Will Monroe
Dan Jurafsky
    AAML
    MILM
ArXivPDFHTML

Papers citing "Understanding Neural Networks through Representation Erasure"

50 / 149 papers shown
Title
Automatic Sleep Staging of EEG Signals: Recent Development, Challenges,
  and Future Directions
Automatic Sleep Staging of EEG Signals: Recent Development, Challenges, and Future Directions
Huy P Phan
Kaare B. Mikkelsen
19
94
0
03 Nov 2021
Adversarial Attacks and Defenses for Social Network Text Processing
  Applications: Techniques, Challenges and Future Research Directions
Adversarial Attacks and Defenses for Social Network Text Processing Applications: Techniques, Challenges and Future Research Directions
I. Alsmadi
Kashif Ahmad
Mahmoud Nazzal
Firoj Alam
Ala I. Al-Fuqaha
Abdallah Khreishah
A. Algosaibi
AAML
37
16
0
26 Oct 2021
Understanding Interlocking Dynamics of Cooperative Rationalization
Understanding Interlocking Dynamics of Cooperative Rationalization
Mo Yu
Yang Zhang
Shiyu Chang
Tommi Jaakkola
29
41
0
26 Oct 2021
Double Trouble: How to not explain a text classifier's decisions using
  counterfactuals synthesized by masked language models?
Double Trouble: How to not explain a text classifier's decisions using counterfactuals synthesized by masked language models?
Thang M. Pham
Trung H. Bui
Long Mai
Anh Totti Nguyen
23
7
0
22 Oct 2021
Interpreting Deep Learning Models in Natural Language Processing: A
  Review
Interpreting Deep Learning Models in Natural Language Processing: A Review
Xiaofei Sun
Diyi Yang
Xiaoya Li
Tianwei Zhang
Yuxian Meng
Han Qiu
Guoyin Wang
Eduard H. Hovy
Jiwei Li
24
45
0
20 Oct 2021
A Framework for Rationale Extraction for Deep QA models
A Framework for Rationale Extraction for Deep QA models
Sahana Ramnath
Preksha Nema
Deep Sahni
Mitesh M. Khapra
AAML
FAtt
27
0
0
09 Oct 2021
CheerBots: Chatbots toward Empathy and Emotionusing Reinforcement
  Learning
CheerBots: Chatbots toward Empathy and Emotionusing Reinforcement Learning
Jiun-hao Jhan
Chao-Peng Liu
Shyh-Kang Jeng
Hung-yi Lee
72
9
0
08 Oct 2021
D-REX: Dialogue Relation Extraction with Explanations
D-REX: Dialogue Relation Extraction with Explanations
Alon Albalak
Varun R. Embar
Yi-Lin Tuan
Lise Getoor
Wenjie Wang
67
9
0
10 Sep 2021
AutoTriggER: Label-Efficient and Robust Named Entity Recognition with
  Auxiliary Trigger Extraction
AutoTriggER: Label-Efficient and Robust Named Entity Recognition with Auxiliary Trigger Extraction
Dong-Ho Lee
Ravi Kiran Selvam
Sheikh Muhammad Sarwar
Bill Yuchen Lin
Fred Morstatter
Jay Pujara
Elizabeth Boschee
James Allan
Xiang Ren
44
2
0
10 Sep 2021
Enjoy the Salience: Towards Better Transformer-based Faithful
  Explanations with Word Salience
Enjoy the Salience: Towards Better Transformer-based Faithful Explanations with Word Salience
G. Chrysostomou
Nikolaos Aletras
37
16
0
31 Aug 2021
Discretized Integrated Gradients for Explaining Language Models
Discretized Integrated Gradients for Explaining Language Models
Soumya Sanyal
Xiang Ren
FAtt
17
53
0
31 Aug 2021
T3-Vis: a visual analytic framework for Training and fine-Tuning
  Transformers in NLP
T3-Vis: a visual analytic framework for Training and fine-Tuning Transformers in NLP
Raymond Li
Wen Xiao
Lanjun Wang
Hyeju Jang
Giuseppe Carenini
ViT
31
23
0
31 Aug 2021
Neuron-level Interpretation of Deep NLP Models: A Survey
Neuron-level Interpretation of Deep NLP Models: A Survey
Hassan Sajjad
Nadir Durrani
Fahim Dalvi
MILM
AI4CE
40
82
0
30 Aug 2021
Doing Good or Doing Right? Exploring the Weakness of Commonsense Causal
  Reasoning Models
Doing Good or Doing Right? Exploring the Weakness of Commonsense Causal Reasoning Models
Mingyue Han
Yinglin Wang
LRM
21
11
0
05 Jul 2021
What Context Features Can Transformer Language Models Use?
What Context Features Can Transformer Language Models Use?
J. O'Connor
Jacob Andreas
KELM
29
75
0
15 Jun 2021
On Sample Based Explanation Methods for NLP:Efficiency, Faithfulness,
  and Semantic Evaluation
On Sample Based Explanation Methods for NLP:Efficiency, Faithfulness, and Semantic Evaluation
Wei Zhang
Ziming Huang
Yada Zhu
Guangnan Ye
Xiaodong Cui
Fan Zhang
33
17
0
09 Jun 2021
The Out-of-Distribution Problem in Explainability and Search Methods for
  Feature Importance Explanations
The Out-of-Distribution Problem in Explainability and Search Methods for Feature Importance Explanations
Peter Hase
Harry Xie
Joey Tianyi Zhou
OODD
LRM
FAtt
31
91
0
01 Jun 2021
Improving the Faithfulness of Attention-based Explanations with
  Task-specific Information for Text Classification
Improving the Faithfulness of Attention-based Explanations with Task-specific Information for Text Classification
G. Chrysostomou
Nikolaos Aletras
32
37
0
06 May 2021
Making Attention Mechanisms More Robust and Interpretable with Virtual
  Adversarial Training
Making Attention Mechanisms More Robust and Interpretable with Virtual Adversarial Training
Shunsuke Kitada
Hitoshi Iyatomi
AAML
30
8
0
18 Apr 2021
BERT: A Review of Applications in Natural Language Processing and
  Understanding
BERT: A Review of Applications in Natural Language Processing and Understanding
M. V. Koroteev
VLM
25
197
0
22 Mar 2021
Contrastive Explanations for Model Interpretability
Contrastive Explanations for Model Interpretability
Alon Jacovi
Swabha Swayamdipta
Shauli Ravfogel
Yanai Elazar
Yejin Choi
Yoav Goldberg
46
95
0
02 Mar 2021
On the Post-hoc Explainability of Deep Echo State Networks for Time
  Series Forecasting, Image and Video Classification
On the Post-hoc Explainability of Deep Echo State Networks for Time Series Forecasting, Image and Video Classification
Alejandro Barredo Arrieta
S. Gil-Lopez
I. Laña
Miren Nekane Bilbao
Javier Del Ser
AI4TS
41
13
0
17 Feb 2021
Explain and Predict, and then Predict Again
Explain and Predict, and then Predict Again
Zijian Zhang
Koustav Rudra
Avishek Anand
FAtt
30
51
0
11 Jan 2021
FastIF: Scalable Influence Functions for Efficient Model Interpretation
  and Debugging
FastIF: Scalable Influence Functions for Efficient Model Interpretation and Debugging
Han Guo
Nazneen Rajani
Peter Hase
Joey Tianyi Zhou
Caiming Xiong
TDI
43
102
0
31 Dec 2020
Self-Explaining Structures Improve NLP Models
Self-Explaining Structures Improve NLP Models
Zijun Sun
Chun Fan
Qinghong Han
Xiaofei Sun
Yuxian Meng
Fei Wu
Jiwei Li
MILM
XAI
LRM
FAtt
46
38
0
03 Dec 2020
TimeSHAP: Explaining Recurrent Models through Sequence Perturbations
TimeSHAP: Explaining Recurrent Models through Sequence Perturbations
João Bento
Pedro Saleiro
André F. Cruz
Mário A. T. Figueiredo
P. Bizarro
FAtt
AI4TS
24
88
0
30 Nov 2020
Parameterized Explainer for Graph Neural Network
Parameterized Explainer for Graph Neural Network
Dongsheng Luo
Wei Cheng
Dongkuan Xu
Wenchao Yu
Bo Zong
Haifeng Chen
Xiang Zhang
53
542
0
09 Nov 2020
Interpretation of NLP models through input marginalization
Interpretation of NLP models through input marginalization
Siwon Kim
Jihun Yi
Eunji Kim
Sungroh Yoon
MILM
FAtt
30
58
0
27 Oct 2020
Geometry matters: Exploring language examples at the decision boundary
Geometry matters: Exploring language examples at the decision boundary
Debajyoti Datta
Shashwat Kumar
Laura E. Barnes
Tom Fletcher
AAML
20
3
0
14 Oct 2020
Neural Databases
Neural Databases
James Thorne
Majid Yazdani
Marzieh Saeidi
Fabrizio Silvestri
Sebastian Riedel
A. Halevy
NAI
34
9
0
14 Oct 2020
The elephant in the interpretability room: Why use attention as
  explanation when we have saliency methods?
The elephant in the interpretability room: Why use attention as explanation when we have saliency methods?
Jasmijn Bastings
Katja Filippova
XAI
LRM
64
175
0
12 Oct 2020
A Geometry-Inspired Attack for Generating Natural Language Adversarial
  Examples
A Geometry-Inspired Attack for Generating Natural Language Adversarial Examples
Zhao Meng
Roger Wattenhofer
GAN
AAML
35
32
0
03 Oct 2020
Interpreting Graph Neural Networks for NLP With Differentiable Edge
  Masking
Interpreting Graph Neural Networks for NLP With Differentiable Edge Masking
M. Schlichtkrull
Nicola De Cao
Ivan Titov
AI4CE
36
214
0
01 Oct 2020
Distillation of Weighted Automata from Recurrent Neural Networks using a
  Spectral Approach
Distillation of Weighted Automata from Recurrent Neural Networks using a Spectral Approach
Rémi Eyraud
Stéphane Ayache
24
16
0
28 Sep 2020
Evaluation of Local Explanation Methods for Multivariate Time Series
  Forecasting
Evaluation of Local Explanation Methods for Multivariate Time Series Forecasting
Ozan Ozyegen
Igor Ilic
Mucahit Cevik
FAtt
AI4TS
24
2
0
18 Sep 2020
Tasty Burgers, Soggy Fries: Probing Aspect Robustness in Aspect-Based
  Sentiment Analysis
Tasty Burgers, Soggy Fries: Probing Aspect Robustness in Aspect-Based Sentiment Analysis
Xiaoyu Xing
Zhijing Jin
Di Jin
Bingning Wang
Qi Zhang
Xuanjing Huang
CoGe
21
44
0
16 Sep 2020
Can We Trust Your Explanations? Sanity Checks for Interpreters in
  Android Malware Analysis
Can We Trust Your Explanations? Sanity Checks for Interpreters in Android Malware Analysis
Ming Fan
Wenying Wei
Xiaofei Xie
Yang Liu
X. Guan
Ting Liu
FAtt
AAML
24
36
0
13 Aug 2020
On the Generalizability of Neural Program Models with respect to
  Semantic-Preserving Program Transformations
On the Generalizability of Neural Program Models with respect to Semantic-Preserving Program Transformations
Md Rafiqul Islam Rabin
Nghi D. Q. Bui
Ke Wang
Yijun Yu
Lingxiao Jiang
Mohammad Amin Alipour
30
90
0
31 Jul 2020
Machine Learning Explanations to Prevent Overtrust in Fake News
  Detection
Machine Learning Explanations to Prevent Overtrust in Fake News Detection
Sina Mohseni
Fan Yang
Shiva K. Pentyala
Mengnan Du
Yi Liu
Nic Lupfer
Xia Hu
Shuiwang Ji
Eric D. Ragan
21
41
0
24 Jul 2020
Rationalizing Text Matching: Learning Sparse Alignments via Optimal
  Transport
Rationalizing Text Matching: Learning Sparse Alignments via Optimal Transport
Kyle Swanson
L. Yu
Tao Lei
OT
29
37
0
27 May 2020
Explaining Black Box Predictions and Unveiling Data Artifacts through
  Influence Functions
Explaining Black Box Predictions and Unveiling Data Artifacts through Influence Functions
Xiaochuang Han
Byron C. Wallace
Yulia Tsvetkov
MILM
FAtt
AAML
TDI
28
165
0
14 May 2020
DQI: Measuring Data Quality in NLP
DQI: Measuring Data Quality in NLP
Swaroop Mishra
Anjana Arunkumar
Bhavdeep Singh Sachdeva
Chris Bryan
Chitta Baral
36
30
0
02 May 2020
Explainable Deep Learning: A Field Guide for the Uninitiated
Explainable Deep Learning: A Field Guide for the Uninitiated
Gabrielle Ras
Ning Xie
Marcel van Gerven
Derek Doran
AAML
XAI
49
371
0
30 Apr 2020
Sequential Interpretability: Methods, Applications, and Future Direction
  for Understanding Deep Learning Models in the Context of Sequential Data
Sequential Interpretability: Methods, Applications, and Future Direction for Understanding Deep Learning Models in the Context of Sequential Data
B. Shickel
Parisa Rashidi
AI4TS
33
17
0
27 Apr 2020
Adversarial Attacks and Defenses: An Interpretation Perspective
Adversarial Attacks and Defenses: An Interpretation Perspective
Ninghao Liu
Mengnan Du
Ruocheng Guo
Huan Liu
Xia Hu
AAML
31
8
0
23 Apr 2020
Frequency-Guided Word Substitutions for Detecting Textual Adversarial
  Examples
Frequency-Guided Word Substitutions for Detecting Textual Adversarial Examples
Maximilian Mozes
Pontus Stenetorp
Bennett Kleinberg
Lewis D. Griffin
AAML
30
99
0
13 Apr 2020
Generating Hierarchical Explanations on Text Classification via Feature
  Interaction Detection
Generating Hierarchical Explanations on Text Classification via Feature Interaction Detection
Hanjie Chen
Guangtao Zheng
Yangfeng Ji
FAtt
38
92
0
04 Apr 2020
BAE: BERT-based Adversarial Examples for Text Classification
BAE: BERT-based Adversarial Examples for Text Classification
Siddhant Garg
Goutham Ramakrishnan
AAML
SILM
39
542
0
04 Apr 2020
HotelRec: a Novel Very Large-Scale Hotel Recommendation Dataset
HotelRec: a Novel Very Large-Scale Hotel Recommendation Dataset
Diego Antognini
Boi Faltings
DML
25
29
0
17 Feb 2020
Description Based Text Classification with Reinforcement Learning
Description Based Text Classification with Reinforcement Learning
Duo Chai
Wei Wu
Qinghong Han
Fei Wu
Jiwei Li
VLM
121
66
0
08 Feb 2020
Previous
123
Next