Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1610.01644
Cited By
Understanding intermediate layers using linear classifier probes
5 October 2016
Guillaume Alain
Yoshua Bengio
FAtt
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Understanding intermediate layers using linear classifier probes"
50 / 158 papers shown
Title
Visualizing Deep Neural Networks with Topographic Activation Maps
A. Krug
Raihan Kabir Ratul
Christopher Olson
Sebastian Stober
FAtt
AI4CE
25
3
0
07 Apr 2022
Mind the gap: Challenges of deep learning approaches to Theory of Mind
Jaan Aru
Aqeel Labash
Oriol Corcoll
Raul Vicente
20
26
0
30 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
40
11
0
01 Mar 2022
Screening Gender Transfer in Neural Machine Translation
Guillaume Wisniewski
Lichao Zhu
Nicolas Bailler
François Yvon
6
4
0
25 Feb 2022
Indiscriminate Poisoning Attacks on Unsupervised Contrastive Learning
Hao He
Kaiwen Zha
Dina Katabi
AAML
26
32
0
22 Feb 2022
Probing Pretrained Models of Source Code
Sergey Troshin
Nadezhda Chirkova
ELM
25
38
0
16 Feb 2022
Navigating Neural Space: Revisiting Concept Activation Vectors to Overcome Directional Divergence
Frederik Pahde
Maximilian Dreyer
Leander Weber
Moritz Weckbecker
Christopher J. Anders
Thomas Wiegand
Wojciech Samek
Sebastian Lapuschkin
57
7
0
07 Feb 2022
What Has Been Enhanced in my Knowledge-Enhanced Language Model?
Yifan Hou
Guoji Fu
Mrinmaya Sachan
KELM
33
1
0
02 Feb 2022
Deconfounded Representation Similarity for Comparison of Neural Networks
Tianyu Cui
Yogesh Kumar
Pekka Marttinen
Samuel Kaski
CML
24
13
0
31 Jan 2022
Ensembling Off-the-shelf Models for GAN Training
Nupur Kumari
Richard Y. Zhang
Eli Shechtman
Jun-Yan Zhu
19
86
0
16 Dec 2021
Sparse Interventions in Language Models with Differentiable Masking
Nicola De Cao
Leon Schmid
Dieuwke Hupkes
Ivan Titov
30
27
0
13 Dec 2021
Diffusion Autoencoders: Toward a Meaningful and Decodable Representation
Konpat Preechakul
Nattanat Chatthee
Suttisak Wizadwongsa
Supasorn Suwajanakorn
SyDa
DiffM
27
415
0
30 Nov 2021
Neural Networks as Kernel Learners: The Silent Alignment Effect
Alexander B. Atanasov
Blake Bordelon
C. Pehlevan
MLT
13
74
0
29 Oct 2021
Interpretation of Emergent Communication in Heterogeneous Collaborative Embodied Agents
Shivansh Patel
Saim Wani
Unnat Jain
A. Schwing
Svetlana Lazebnik
Manolis Savva
Angel X. Chang
LM&Ro
24
25
0
12 Oct 2021
Conditional probing: measuring usable information beyond a baseline
John Hewitt
Kawin Ethayarajh
Percy Liang
Christopher D. Manning
31
55
0
19 Sep 2021
What do pre-trained code models know about code?
Anjan Karmakar
Romain Robbes
ELM
24
87
0
25 Aug 2021
Do Vision Transformers See Like Convolutional Neural Networks?
M. Raghu
Thomas Unterthiner
Simon Kornblith
Chiyuan Zhang
Alexey Dosovitskiy
ViT
41
922
0
19 Aug 2021
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Team
Adam Stooke
Anuj Mahajan
Catarina Barros
Charlie Deck
...
Nicolas Porcel
Roberta Raileanu
Steph Hughes-Fitt
Valentin Dalibard
Wojciech M. Czarnecki
26
181
0
27 Jul 2021
Codified audio language modeling learns useful representations for music information retrieval
Rodrigo Castellon
Chris Donahue
Percy Liang
78
86
0
12 Jul 2021
A Closer Look at How Fine-tuning Changes BERT
Yichu Zhou
Vivek Srikumar
24
63
0
27 Jun 2021
Poisoning and Backdooring Contrastive Learning
Nicholas Carlini
Andreas Terzis
10
156
0
17 Jun 2021
Deep Learning Through the Lens of Example Difficulty
R. Baldock
Hartmut Maennel
Behnam Neyshabur
19
155
0
17 Jun 2021
Revisiting Model Stitching to Compare Neural Representations
Yamini Bansal
Preetum Nakkiran
Boaz Barak
FedML
22
104
0
14 Jun 2021
Do Syntactic Probes Probe Syntax? Experiments with Jabberwocky Probing
Rowan Hall Maudslay
Ryan Cotterell
26
33
0
04 Jun 2021
Did I do that? Blame as a means to identify controlled effects in reinforcement learning
Oriol Corcoll
Youssef Mohamed
Raul Vicente
18
3
0
01 Jun 2021
A multilabel approach to morphosyntactic probing
Naomi Tachikawa Shapiro
Amandalynne Paullada
Shane Steinert-Threlkeld
27
10
0
17 Apr 2021
Does BERT Pretrained on Clinical Notes Reveal Sensitive Data?
Eric P. Lehman
Sarthak Jain
Karl Pichotta
Yoav Goldberg
Byron C. Wallace
OOD
MIACV
22
117
0
15 Apr 2021
Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little
Koustuv Sinha
Robin Jia
Dieuwke Hupkes
J. Pineau
Adina Williams
Douwe Kiela
43
243
0
14 Apr 2021
DirectProbe: Studying Representations without Classifiers
Yichu Zhou
Vivek Srikumar
27
27
0
13 Apr 2021
First Align, then Predict: Understanding the Cross-Lingual Ability of Multilingual BERT
Benjamin Muller
Yanai Elazar
Benoît Sagot
Djamé Seddah
LRM
21
71
0
26 Jan 2021
Categorical Perception: A Groundwork for Deep Learning
L. Bonnasse-Gahot
Jean-Pierre Nadal
14
7
0
10 Dec 2020
Examining the rhetorical capacities of neural language models
Zining Zhu
Chuer Pan
Mohamed Abdalla
Frank Rudzicz
28
10
0
01 Oct 2020
Gaussian Gated Linear Networks
David Budden
Adam H. Marblestone
Eren Sezener
Tor Lattimore
Greg Wayne
J. Veness
BDL
AI4CE
14
12
0
10 Jun 2020
ReLU Code Space: A Basis for Rating Network Quality Besides Accuracy
Natalia Shepeleva
Werner Zellinger
Michal Lewandowski
Bernhard A. Moser
9
3
0
20 May 2020
Explainable Deep Learning: A Field Guide for the Uninitiated
Gabrielle Ras
Ning Xie
Marcel van Gerven
Derek Doran
AAML
XAI
29
371
0
30 Apr 2020
Gradient-Adjusted Neuron Activation Profiles for Comprehensive Introspection of Convolutional Speech Recognition Models
A. Krug
Sebastian Stober
14
0
0
19 Feb 2020
Contrastive Representation Distillation
Yonglong Tian
Dilip Krishnan
Phillip Isola
39
1,029
0
23 Oct 2019
Unsupervised State Representation Learning in Atari
Ankesh Anand
Evan Racah
Sherjil Ozair
Yoshua Bengio
Marc-Alexandre Côté
R. Devon Hjelm
SSL
27
253
0
19 Jun 2019
Interpretable Neural Network Decoupling
Yuchao Li
R. Ji
Shaohui Lin
Baochang Zhang
Chenqian Yan
Yongjian Wu
Feiyue Huang
Ling Shao
28
2
0
04 Jun 2019
Wasserstein Dependency Measure for Representation Learning
Sherjil Ozair
Corey Lynch
Yoshua Bengio
Aaron van den Oord
Sergey Levine
P. Sermanet
SSL
DRL
19
115
0
28 Mar 2019
On the Pitfalls of Measuring Emergent Communication
Ryan J. Lowe
Jakob N. Foerster
Y-Lan Boureau
Joelle Pineau
Yann N. Dauphin
20
131
0
12 Mar 2019
Human-Centered Tools for Coping with Imperfect Algorithms during Medical Decision-Making
Carrie J. Cai
Emily Reif
Narayan Hegde
J. Hipp
Been Kim
...
Martin Wattenberg
F. Viégas
G. Corrado
Martin C. Stumpe
Michael Terry
30
395
0
08 Feb 2019
Ablation of a Robot's Brain: Neural Networks Under a Knife
Peter Lillian
Richard Meyes
Tobias Meisen
17
10
0
13 Dec 2018
Local Explanation Methods for Deep Neural Networks Lack Sensitivity to Parameter Values
Julius Adebayo
Justin Gilmer
Ian Goodfellow
Been Kim
FAtt
AAML
11
128
0
08 Oct 2018
Sanity Checks for Saliency Maps
Julius Adebayo
Justin Gilmer
M. Muelly
Ian Goodfellow
Moritz Hardt
Been Kim
FAtt
AAML
XAI
26
1,926
0
08 Oct 2018
Interpreting Layered Neural Networks via Hierarchical Modular Representation
C. Watanabe
8
19
0
03 Oct 2018
Visualizing and Understanding Deep Neural Networks in CTR Prediction
Lin Guo
Hui Ye
Wenbo Su
He Liu
Kai Sun
Hang Xiang
FAtt
HAI
8
7
0
22 Jun 2018
How Important Is a Neuron?
Kedar Dhamdhere
Mukund Sundararajan
Qiqi Yan
FAtt
GNN
14
128
0
30 May 2018
Fast Dynamic Routing Based on Weighted Kernel Density Estimation
Suofei Zhang
Wei-Ye Zhao
Xiaofu Wu
Quan Zhou
9
61
0
28 May 2018
Confidence Scoring Using Whitebox Meta-models with Linear Classifier Probes
Tongfei Chen
Jirí Navrátil
Vijay Iyengar
Karthikeyan Shanmugam
6
42
0
14 May 2018
Previous
1
2
3
4
Next