ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.08220
  4. Cited By
Understanding Neural Networks through Representation Erasure

Understanding Neural Networks through Representation Erasure

24 December 2016
Jiwei Li
Will Monroe
Dan Jurafsky
    AAML
    MILM
ArXivPDFHTML

Papers citing "Understanding Neural Networks through Representation Erasure"

49 / 149 papers shown
Title
On Interpretability of Artificial Neural Networks: A Survey
On Interpretability of Artificial Neural Networks: A Survey
Fenglei Fan
Jinjun Xiong
Mengzhou Li
Ge Wang
AAML
AI4CE
43
301
0
08 Jan 2020
Generalizing Natural Language Analysis through Span-relation
  Representations
Generalizing Natural Language Analysis through Span-relation Representations
Zhengbao Jiang
Wenyuan Xu
Jun Araki
Graham Neubig
33
60
0
10 Nov 2019
Rethinking Cooperative Rationalization: Introspective Extraction and
  Complement Control
Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control
Mo Yu
Shiyu Chang
Yang Zhang
Tommi Jaakkola
21
140
0
29 Oct 2019
A Game Theoretic Approach to Class-wise Selective Rationalization
A Game Theoretic Approach to Class-wise Selective Rationalization
Shiyu Chang
Yang Zhang
Mo Yu
Tommi Jaakkola
22
60
0
28 Oct 2019
CXPlain: Causal Explanations for Model Interpretation under Uncertainty
CXPlain: Causal Explanations for Model Interpretation under Uncertainty
Patrick Schwab
W. Karlen
FAtt
CML
40
206
0
27 Oct 2019
Perturbation Sensitivity Analysis to Detect Unintended Model Biases
Perturbation Sensitivity Analysis to Detect Unintended Model Biases
Vinodkumar Prabhakaran
Ben Hutchinson
Margaret Mitchell
22
117
0
09 Oct 2019
Towards Explainable Artificial Intelligence
Towards Explainable Artificial Intelligence
Wojciech Samek
K. Müller
XAI
32
437
0
26 Sep 2019
Explaining and Interpreting LSTMs
Explaining and Interpreting LSTMs
L. Arras
Jose A. Arjona-Medina
Michael Widrich
G. Montavon
Michael Gillhofer
K. Müller
Sepp Hochreiter
Wojciech Samek
FAtt
AI4TS
21
79
0
25 Sep 2019
Learning to Deceive with Attention-Based Explanations
Learning to Deceive with Attention-Based Explanations
Danish Pruthi
Mansi Gupta
Bhuwan Dhingra
Graham Neubig
Zachary Chase Lipton
19
193
0
17 Sep 2019
Towards Understanding Neural Machine Translation with Word Importance
Towards Understanding Neural Machine Translation with Word Importance
Shilin He
Zhaopeng Tu
Xing Wang
Longyue Wang
Michael R. Lyu
Shuming Shi
AAML
26
39
0
01 Sep 2019
Universal Adversarial Triggers for Attacking and Analyzing NLP
Universal Adversarial Triggers for Attacking and Analyzing NLP
Eric Wallace
Shi Feng
Nikhil Kandpal
Matt Gardner
Sameer Singh
AAML
SILM
60
842
0
20 Aug 2019
Understanding Memory Modules on Learning Simple Algorithms
Understanding Memory Modules on Learning Simple Algorithms
Kexin Wang
Yu Zhou
Shaonan Wang
Jiajun Zhang
Chengqing Zong
34
0
0
01 Jul 2019
ML-LOO: Detecting Adversarial Examples with Feature Attribution
ML-LOO: Detecting Adversarial Examples with Feature Attribution
Puyudi Yang
Jianbo Chen
Cho-Jui Hsieh
Jane-ling Wang
Michael I. Jordan
AAML
22
101
0
08 Jun 2019
Combating Adversarial Misspellings with Robust Word Recognition
Combating Adversarial Misspellings with Robust Word Recognition
Danish Pruthi
Bhuwan Dhingra
Zachary Chase Lipton
25
300
0
27 May 2019
Software and application patterns for explanation methods
Software and application patterns for explanation methods
Maximilian Alber
38
11
0
09 Apr 2019
Semantics Preserving Adversarial Learning
Semantics Preserving Adversarial Learning
Ousmane Amadou Dia
Elnaz Barshan
Reza Babanezhad
AAML
GAN
36
2
0
10 Mar 2019
Reliable Deep Grade Prediction with Uncertainty Estimation
Reliable Deep Grade Prediction with Uncertainty Estimation
Qian Hu
Huzefa Rangwala
26
39
0
26 Feb 2019
Attention is not Explanation
Attention is not Explanation
Sarthak Jain
Byron C. Wallace
FAtt
31
1,301
0
26 Feb 2019
Re-evaluating ADEM: A Deeper Look at Scoring Dialogue Responses
Re-evaluating ADEM: A Deeper Look at Scoring Dialogue Responses
Ananya B. Sai
Mithun Das Gupta
Mitesh M. Khapra
Mukundhan Srinivasan
27
48
0
23 Feb 2019
Saliency Learning: Teaching the Model Where to Pay Attention
Saliency Learning: Teaching the Model Where to Pay Attention
Reza Ghaeini
Xiaoli Z. Fern
Hamed Shahbazi
Prasad Tadepalli
FAtt
XAI
32
30
0
22 Feb 2019
Understanding Impacts of High-Order Loss Approximations and Features in
  Deep Learning Interpretation
Understanding Impacts of High-Order Loss Approximations and Features in Deep Learning Interpretation
Sahil Singla
Eric Wallace
Shi Feng
S. Feizi
FAtt
34
59
0
01 Feb 2019
Code Failure Prediction and Pattern Extraction using LSTM Networks
Code Failure Prediction and Pattern Extraction using LSTM Networks
Mahdi Hajiaghayi
E. Vahedi
35
24
0
13 Dec 2018
Discrete Adversarial Attacks and Submodular Optimization with
  Applications to Text Classification
Discrete Adversarial Attacks and Submodular Optimization with Applications to Text Classification
Qi Lei
Lingfei Wu
Pin-Yu Chen
A. Dimakis
Inderjit S. Dhillon
Michael Witbrock
AAML
21
92
0
01 Dec 2018
Explaining Deep Learning Models - A Bayesian Non-parametric Approach
Explaining Deep Learning Models - A Bayesian Non-parametric Approach
Wenbo Guo
Sui Huang
Yunzhe Tao
Masashi Sugiyama
Lin Lin
BDL
16
47
0
07 Nov 2018
Identifying and Controlling Important Neurons in Neural Machine
  Translation
Identifying and Controlling Important Neurons in Neural Machine Translation
A. Bau
Yonatan Belinkov
Hassan Sajjad
Nadir Durrani
Fahim Dalvi
James R. Glass
MILM
21
180
0
03 Nov 2018
Attack Graph Convolutional Networks by Adding Fake Nodes
Attack Graph Convolutional Networks by Adding Fake Nodes
Xiaoyun Wang
Minhao Cheng
Joe Eaton
Cho-Jui Hsieh
S. F. Wu
AAML
GNN
33
78
0
25 Oct 2018
What made you do this? Understanding black-box decisions with sufficient
  input subsets
What made you do this? Understanding black-box decisions with sufficient input subsets
Brandon Carter
Jonas W. Mueller
Siddhartha Jain
David K Gifford
FAtt
42
77
0
09 Oct 2018
Training Machine Learning Models by Regularizing their Explanations
Training Machine Learning Models by Regularizing their Explanations
A. Ross
FaML
26
0
0
29 Sep 2018
Assessing Composition in Sentence Vector Representations
Assessing Composition in Sentence Vector Representations
Allyson Ettinger
Ahmed Elgohary
C. Phillips
Philip Resnik
CoGe
20
78
0
11 Sep 2018
Interpreting Neural Networks With Nearest Neighbors
Interpreting Neural Networks With Nearest Neighbors
Eric Wallace
Shi Feng
Jordan L. Boyd-Graber
AAML
FAtt
MILM
23
53
0
08 Sep 2018
Extractive Adversarial Networks: High-Recall Explanations for
  Identifying Personal Attacks in Social Media Posts
Extractive Adversarial Networks: High-Recall Explanations for Identifying Personal Attacks in Social Media Posts
Samuel Carton
Qiaozhu Mei
Paul Resnick
FAtt
AAML
21
34
0
01 Sep 2018
Dissecting Contextual Word Embeddings: Architecture and Representation
Dissecting Contextual Word Embeddings: Architecture and Representation
Matthew E. Peters
Mark Neumann
Luke Zettlemoyer
Wen-tau Yih
35
428
0
27 Aug 2018
Interpreting Recurrent and Attention-Based Neural Models: a Case Study
  on Natural Language Inference
Interpreting Recurrent and Attention-Based Neural Models: a Case Study on Natural Language Inference
Reza Ghaeini
Xiaoli Z. Fern
Prasad Tadepalli
MILM
20
97
0
12 Aug 2018
Hierarchical interpretations for neural network predictions
Hierarchical interpretations for neural network predictions
Chandan Singh
W. James Murdoch
Bin Yu
31
145
0
14 Jun 2018
SoPa: Bridging CNNs, RNNs, and Weighted Finite-State Machines
SoPa: Bridging CNNs, RNNs, and Weighted Finite-State Machines
Roy Schwartz
Sam Thomson
Noah A. Smith
33
24
0
15 May 2018
State Gradients for RNN Memory Analysis
State Gradients for RNN Memory Analysis
Lyan Verwimp
Hugo Van hamme
Vincent Renkens
P. Wambacq
13
6
0
11 May 2018
Behavior Analysis of NLI Models: Uncovering the Influence of Three
  Factors on Robustness
Behavior Analysis of NLI Models: Uncovering the Influence of Three Factors on Robustness
V. Carmona
Jeff Mitchell
Sebastian Riedel
35
44
0
11 May 2018
Seq2Seq-Vis: A Visual Debugging Tool for Sequence-to-Sequence Models
Seq2Seq-Vis: A Visual Debugging Tool for Sequence-to-Sequence Models
Hendrik Strobelt
Sebastian Gehrmann
M. Behrisch
Adam Perer
Hanspeter Pfister
Alexander M. Rush
VLM
HAI
31
239
0
25 Apr 2018
Explanation Methods in Deep Learning: Users, Values, Concerns and
  Challenges
Explanation Methods in Deep Learning: Users, Values, Concerns and Challenges
Gabrielle Ras
Marcel van Gerven
W. Haselager
XAI
17
217
0
20 Mar 2018
IcoRating: A Deep-Learning System for Scam ICO Identification
IcoRating: A Deep-Learning System for Scam ICO Identification
Shuqing Bian
Zhenpeng Deng
F. Li
Will Monroe
Peng Shi
...
Sikuang Wang
William Yang Wang
Arianna Yuan
Tianwei Zhang
Jiwei Li
38
37
0
08 Mar 2018
Seq2Sick: Evaluating the Robustness of Sequence-to-Sequence Models with
  Adversarial Examples
Seq2Sick: Evaluating the Robustness of Sequence-to-Sequence Models with Adversarial Examples
Minhao Cheng
Jinfeng Yi
Pin-Yu Chen
Huan Zhang
Cho-Jui Hsieh
SILM
AAML
54
242
0
03 Mar 2018
Before Name-calling: Dynamics and Triggers of Ad Hominem Fallacies in
  Web Argumentation
Before Name-calling: Dynamics and Triggers of Ad Hominem Fallacies in Web Argumentation
Ivan Habernal
Henning Wachsmuth
Iryna Gurevych
Benno Stein
44
83
0
19 Feb 2018
Generating Natural Adversarial Examples
Generating Natural Adversarial Examples
Zhengli Zhao
Dheeru Dua
Sameer Singh
GAN
AAML
40
596
0
31 Oct 2017
Learning to Generate Reviews and Discovering Sentiment
Learning to Generate Reviews and Discovering Sentiment
Alec Radford
Rafal Jozefowicz
Ilya Sutskever
44
506
0
05 Apr 2017
Right for the Right Reasons: Training Differentiable Models by
  Constraining their Explanations
Right for the Right Reasons: Training Differentiable Models by Constraining their Explanations
A. Ross
M. C. Hughes
Finale Doshi-Velez
FAtt
51
583
0
10 Mar 2017
Representations of language in a model of visually grounded speech
  signal
Representations of language in a model of visually grounded speech signal
Grzegorz Chrupała
Lieke Gelderloos
Afra Alishahi
41
131
0
07 Feb 2017
Deep Reinforcement Learning: An Overview
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,505
0
25 Jan 2017
Representation of linguistic form and function in recurrent neural
  networks
Representation of linguistic form and function in recurrent neural networks
Ákos Kádár
Grzegorz Chrupała
Afra Alishahi
27
162
0
29 Feb 2016
Effective Approaches to Attention-based Neural Machine Translation
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
220
7,930
0
17 Aug 2015
Previous
123