Network Dissection: Quantifying Interpretability of Deep Visual Representations

19 April 2017
David Bau
Bolei Zhou
A. Khosla
A. Oliva
Antonio Torralba
    MILM, FAtt
arXiv: 1704.05796

Papers citing "Network Dissection: Quantifying Interpretability of Deep Visual Representations"

50 / 842 papers shown
Mechanistic Finetuning of Vision-Language-Action Models via Few-Shot Demonstrations
Chancharik Mitra
Yusen Luo
Raj Saravanan
Dantong Niu
Anirudh Pai
Jesse Thomason
Trevor Darrell
Abrar Anwar
Deva Ramanan
Roei Herzig
52
0
0
27 Nov 2025
Auxiliary Metrics Help Decoding Skill Neurons in the Wild
Yixiu Zhao
Xiaozhi Wang
Zijun Yao
Lei Hou
Juanzi Li
345
0
0
26 Nov 2025
Guaranteed Optimal Compositional Explanations for Neurons
Biagio La Rosa
Leilani H. Gilpin
76
0
0
25 Nov 2025
Open Vocabulary Compositional Explanations for Neuron Alignment
Biagio La Rosa
Leilani H. Gilpin
OCL
336
0
0
25 Nov 2025
Interpreting GFlowNets for Drug Discovery: Extracting Actionable Insights for Medicinal Chemistry
Amirtha Varshini A S
Duminda S. Ranasinghe
Hok Hei Tam
70
0
0
24 Nov 2025
LAYA: Layer-wise Attention Aggregation for Interpretable Depth-Aware Neural Networks
Gennaro Vessio
FAtt
183
0
0
16 Nov 2025
Probing the Probes: Methods and Metrics for Concept Alignment
Jacob Lysnæs-Larsen
Marte Eggen
Inga Strümke
LLMSV
174
0
0
06 Nov 2025
LLEXICORP: End-user Explainability of Convolutional Neural Networks
Vojtěch Kůr
Adam Bajger
Adam Kukučka
Marek Hradil
Vít Musil
Tomáš Brázdil
89
0
0
04 Nov 2025
Atlas-Alignment: Making Interpretability Transferable Across Language Models
Bruno Puri
J. Berend
Sebastian Lapuschkin
Wojciech Samek
LLMSV
417
0
0
31 Oct 2025
ConceptScope: Characterizing Dataset Bias via Disentangled Visual Concepts
Jinho Choi
Hyesu Lim
Steffen Schneider
Jaegul Choo
144
0
0
30 Oct 2025
Finding Culture-Sensitive Neurons in Vision-Language Models
Xiutian Zhao
Rochelle Choenni
Rohit Saxena
Ivan Titov
VLM
248
0
0
28 Oct 2025
Enhancing Pre-trained Representation Classifiability can Boost its Interpretability
International Conference on Learning Representations (ICLR), 2025
Shufan Shen
Zhaobo Qi
Junshu Sun
Qingming Huang
Qi Tian
Shuhui Wang
FAtt
417
4
0
28 Oct 2025
A Video Is Not Worth a Thousand Words
Sam Pollard
Michael Wray
108
0
0
27 Oct 2025
Scaling Non-Parametric Sampling with Representation
Vincent Lu
Aaron Truong
Zeyu Yun
Yubei Chen
DiffM
128
0
0
25 Oct 2025
Automated Detection of Visual Attribute Reliance with a Self-Reflective Agent
Christy Li
Josep Lopez Camunas
Jake Thomas Touchet
Jacob Andreas
Àgata Lapedriza
Antonio Torralba
Tamar Rott Shaham
195
0
0
24 Oct 2025
EdgeSync: Accelerating Edge-Model Updates for Data Drift through Adaptive Continuous Learning
Runchu Donga
Peng Zhao
Guiqin Wang
Nan Qi
Jie Lin
109
0
0
18 Oct 2025
Neologism Learning for Controllability and Self-Verbalization
John Hewitt
Oyvind Tafjord
Robert Geirhos
Been Kim
NAI
87
1
0
09 Oct 2025
Encode, Think, Decode: Scaling test-time reasoning with recursive latent thoughts
Yeskendir Koishekenov
Aldo Lipani
Nicola Cancedda
LRM
150
1
0
08 Oct 2025
Mysteries of the Deep: Role of Intermediate Representations in Out of Distribution Detection
I. M. De la Jara
C. Rodriguez-Opazo
D. Teney
D. Ranasinghe
E. Abbasnejad
OODD
351
0
0
07 Oct 2025
Semantic Regexes: Auto-Interpreting LLM Features with a Structured Language
Angie Boggust
Donghao Ren
Yannick Assogba
Dominik Moritz
Arvind Satyanarayan
Fred Hohman
144
0
0
07 Oct 2025
Take Goodhart Seriously: Principled Limit on General-Purpose AI Optimization
Antoine Maier
Aude Maier
Tom David
96
0
0
03 Oct 2025
Attack logics, not outputs: Towards efficient robustification of deep neural networks by falsifying concept-based properties
Raik Dankworth
Gesina Schwalbe
AAML
120
0
0
01 Oct 2025
Mechanistic Interpretability as Statistical Estimation: A Variance Analysis of EAP-IG
Maxime Méloux
François Portet
Maxime Peyrard
166
1
0
01 Oct 2025
TextCAM: Explaining Class Activation Map with Text
Qiming Zhao
Xingjian Li
Xiaoyu Cao
Xiaolong Wu
Min Xu
VLM
121
0
0
01 Oct 2025
Object-Centric Case-Based Reasoning via Argumentation
Gabriel de Olim Gaul
Adam Gould
Avinash Kori
Francesca Toni
93
0
0
30 Sep 2025
Nonparametric Identification of Latent Concepts
Yujia Zheng
Shaoan Xie
Kun Zhang
224
1
0
30 Sep 2025
Interpret, prune and distill Donut : towards lightweight VLMs for VQA on document
Adnan Ben Mansour
Ayoub Karine
D. Naccache
130
0
0
30 Sep 2025
CE-FAM: Concept-Based Explanation via Fusion of Activation Maps
Michihiro Kuroki
T. Yamasaki
152
0
0
28 Sep 2025
On The Variability of Concept Activation Vectors
Julia Wenkmann
Damien Garreau
AAML
126
0
0
28 Sep 2025
REMA: A Unified Reasoning Manifold Framework for Interpreting Large Language Model
Bo Li
Guanzhi Deng
Ronghao Chen
Junrong Yue
Shuo Zhang
Qinghua Zhao
Linqi Song
Lijie Wen
LRM
109
0
0
26 Sep 2025
Interpreting ResNet-based CLIP via Neuron-Attention Decomposition
Edmund Bu
Yossi Gandelsman
221
0
0
24 Sep 2025
Redefining Experts: Interpretable Decomposition of Language Models for Toxicity Mitigation
Zuhair Hasan Shaik
Abdullah Mazhar
Aseem Srivastava
Md. Shad Akhtar
114
0
0
20 Sep 2025
V-CECE: Visual Counterfactual Explanations via Conceptual Edits
Nikolaos Spanos
Maria Lymperaiou
Giorgos Filandrianos
Konstantinos Thomas
Athanasios Voulodimos
Giorgos Stamou
254
0
0
20 Sep 2025
Which Direction to Choose? An Analysis on the Representation Power of Self-Supervised ViTs in Downstream Tasks
Yannis Kaltampanidis
Alexandros Doumanoglou
D. Zarpalas
144
0
0
18 Sep 2025
NeuroStrike: Neuron-Level Attacks on Aligned LLMs
Lichao Wu
Sasha Behrouzi
Mohamadreza Rostami
Maximilian Thang
S. Picek
A. Sadeghi
AAML
239
1
0
15 Sep 2025
Discovering Divergent Representations between Text-to-Image Models
Lisa Dunlap
Joseph E. Gonzalez
Trevor Darrell
Fabian Caba Heilbron
Josef Sivic
Bryan C. Russell
EGVM
126
0
0
10 Sep 2025
Superposition in Graph Neural Networks
Lukas Pertl
Han Xuanyuan
Pietro Lio
GNN
152
0
0
31 Aug 2025
GCAV: A Global Concept Activation Vector Framework for Cross-Layer Consistency in Interpretability
Zhenghao He
Sanchit Sinha
Guangzhi Xiong
Aidong Zhang
163
0
0
28 Aug 2025
NM-Hebb: Coupling Local Hebbian Plasticity with Metric Learning for More Accurate and Interpretable CNNs
Davorin Miličević
Ratko Grbić
100
0
0
27 Aug 2025
Disentangling Polysemantic Neurons with a Null-Calibrated Polysemanticity Index and Causal Patch Interventions
Manan Gupta
Dhruv Kumar
MILM
93
0
0
23 Aug 2025
Evaluating Sparse Autoencoders for Monosemantic Representation
Moghis Fereidouni
Muhammad Umair Haider
Peizhong Ju
A.B. Siddique
136
0
0
20 Aug 2025
Integrating attention into explanation frameworks for language and vision transformers
Marte Eggen
Jacob Lysnæs-Larsen
Inga Strümke
83
0
0
12 Aug 2025
Granular Concept Circuits: Toward a Fine-Grained Circuit Discovery for Concept Representations
Dahee Kwon
Sehyun Lee
Jaesik Choi
168
1
0
03 Aug 2025
Eigen Neural Network: Unlocking Generalizable Vision with Eigenbasis
Anzhe Cheng
Chenzhong Yin
Mingxi Cheng
Shukai Duan
Shahin Nazarian
Paul Bogdan
223
0
0
02 Aug 2025
Detection Transformers Under the Knife: A Neuroscience-Inspired Approach to Ablations
Nils Hütten
Florian Hölken
Hasan Tercan
Tobias Meisen
MedIm
172
0
0
29 Jul 2025
Compositional Function Networks: A High-Performance Alternative to Deep Neural Networks with Built-in Interpretability
Fang Li
216
0
0
28 Jul 2025
Emergence of Quantised Representations Isolated to Anisotropic Functions
George Bird
154
1
0
16 Jul 2025
Escaping Plato's Cave: JAM for Aligning Independently Trained Vision and Language Models
Lauren Hyoseo Yoon
Yisong Yue
Been Kim
379
0
0
01 Jul 2025
When concept-based XAI is imprecise: Do people distinguish between generalisations and misrepresentations?
Romy Müller
172
1
0
22 Jun 2025
From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers
Jingtong Su
Julia Kempe
Karen Ullrich
270
3
0
20 Jun 2025