Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2406.08074
Cited By

A Concept-Based Explainability Framework for Large Multimodal Models

A Concept-Based Explainability Framework for Large Multimodal Models

12 June 2024

ArXiv (abs)PDF HTML Github

Papers citing "A Concept-Based Explainability Framework for Large Multimodal Models"

21 / 21 papers shown

Head Pursuit: Probing Attention Specialization in Multimodal Transformers

Head Pursuit: Probing Attention Specialization in Multimodal Transformers

Valentino Maiorca

Francesco Locatello

Alberto Cazzaniga

165

8

0

24 Oct 2025

VL-SAE: Interpreting and Enhancing Vision-Language Alignment with a Unified Concept Set

VL-SAE: Interpreting and Enhancing Vision-Language Alignment with a Unified Concept Set

206

2

0

24 Oct 2025

Learning to Steer: Input-dependent Steering for Multimodal LLMs

Learning to Steer: Input-dependent Steering for Multimodal LLMs

438

5

0

18 Aug 2025

Probing the Representational Power of Sparse Autoencoders in Vision Models

Probing the Representational Power of Sparse Autoencoders in Vision Models

Matthew Lyle Olson

239

1

0

15 Aug 2025

TARS: MinMax Token-Adaptive Preference Strategy for MLLM Hallucination Reduction

TARS: MinMax Token-Adaptive Preference Strategy for MLLM Hallucination Reduction

331

0

0

29 Jul 2025

Architecting Clinical Collaboration: Multi-Agent Reasoning Systems for Multimodal Medical VQA

Architecting Clinical Collaboration: Multi-Agent Reasoning Systems for Multimodal Medical VQA

Karishma Thakrar

Shreyas Basavatia

Akshay Daftardar

326

1

0

07 Jul 2025

From Flat to Hierarchical: Extracting Sparse Representations with Matching Pursuit

From Flat to Hierarchical: Extracting Sparse Representations with Matching Pursuit

Ekdeep Singh Lubana

Bahareh Tolooshams

439

19

0

03 Jun 2025

Interpreting the linear structure of vision-language model embedding spaces

Interpreting the linear structure of vision-language model embedding spaces

Isabel Papadimitriou

577

16

0

16 Apr 2025

Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models

Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models

Shyamgopal Karthik

Quentin Bouniot

710

25

0

03 Apr 2025

Conceptualizing Uncertainty: A Concept-based Approach to Explaining Uncertainty

Conceptualizing Uncertainty: A Concept-based Approach to Explaining Uncertainty

Alexander Schulz

Sarah Schroeder

474

0

0

05 Mar 2025

Re-Imagining Multimodal Instruction Tuning: A Representation View

Re-Imagining Multimodal Instruction Tuning: A Representation ViewInternational Conference on Learning Representations (ICLR), 2025

...

Raghuveer M. Rao

1.2K

13

0

02 Mar 2025

Causality Is Key to Understand and Balance Multiple Goals in Trustworthy ML and Foundation Models

Causality Is Key to Understand and Balance Multiple Goals in Trustworthy ML and Foundation Models

Mohammad Havaei

Bernhard Schölkopf

1.4K

6

0

28 Feb 2025

Archetypal SAE: Adaptive and Stable Dictionary Learning for Concept Extraction in Large Vision Models

Archetypal SAE: Adaptive and Stable Dictionary Learning for Concept Extraction in Large Vision Models

Ekdeep Singh Lubana

Jacob S. Prince

Isabel Papadimitriou

Martin Wattenberg

381

38

0

18 Feb 2025

Universal Sparse Autoencoders: Interpretable Cross-Model Concept Alignment

Universal Sparse Autoencoders: Interpretable Cross-Model Concept Alignment

Harrish Thasarathan

Konstantinos G. Derpanis

385

29

0

06 Feb 2025

Visual Large Language Models for Generalized and Specialized Applications

Visual Large Language Models for Generalized and Specialized Applications

504

37

0

06 Jan 2025

Analyzing Finetuning Representation Shift for Multimodal LLMs Steering

Analyzing Finetuning Representation Shift for Multimodal LLMs Steering

472

8

0

06 Jan 2025

Explainable and Interpretable Multimodal Large Language Models: A
Comprehensive Survey

Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey

...

463

60

0

03 Dec 2024

Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens

Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention LensComputer Vision and Pattern Recognition (CVPR), 2024

629

79

0

23 Nov 2024

MINER: Mining the Underlying Pattern of Modality-Specific Neurons in
Multimodal Large Language Models

MINER: Mining the Underlying Pattern of Modality-Specific Neurons in Multimodal Large Language Models

Kun Wang

Xuming Hu

357

4

0

07 Oct 2024

Concept-Based Explanations in Computer Vision: Where Are We and Where
Could We Go?

Concept-Based Explanations in Computer Vision: Where Are We and Where Could We Go?

Georgii Mikriukov

Gesina Schwalbe

Stefan Wermter

363

12

0

20 Sep 2024

Referential communication in heterogeneous communities of pre-trained
visual deep networks

Referential communication in heterogeneous communities of pre-trained visual deep networksAdaptive Agents and Multi-Agent Systems (AAMAS), 2023

Francesca Franzon

502

10

0

04 Feb 2023

Page 1 of 1