v1v2v3 (latest)

Explainability for Large Language Models: A Survey

ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023

2 September 2023

Haiyan Zhao

Hanjie Chen

Fan Yang

Ninghao Liu

Papers citing "Explainability for Large Language Models: A Survey"

50 / 288 papers shown

Using LLMs for Automated Privacy Policy Analysis: Prompt Engineering, Fine-Tuning and Explainability

172

16 Mar 2025

Reasoning-Grounded Natural Language Explanations for Language Models

279

14 Mar 2025

Hoi2Threat: An Interpretable Threat Detection Method for Human Violence Scenarios Guided by Human-Object Interaction

375

13 Mar 2025

Advanced Tool Learning and Selection System (ATLASS): A Closed-Loop Framework Using LLMInternational Symposium on Service Oriented Software Engineering (ISSOSE), 2025

263

13 Mar 2025

TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention

990

13 Mar 2025

I Predict Therefore I Am: Is Next Token Prediction Enough to Learn Human-Interpretable Concepts from Data?

1.1K

12 Mar 2025

Statistical Deficiency for Task Inclusion EstimationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

1.2K

07 Mar 2025

A Zero-shot Explainable Doctor Ranking Framework with Large Language Models

340

04 Mar 2025

Can LLMs Explain Themselves Counterfactually?

Zahra Dehghanighobadi

Asja Fischer

Muhammad Bilal Zafar

LRM

422

25 Feb 2025

VeriPlan: Integrating Formal Verification and LLMs into End-User PlanningInternational Conference on Human Factors in Computing Systems (CHI), 2025

526

25 Feb 2025

Unveiling Scoring Processes: Dissecting the Differences between LLMs and Human Graders in Automatic ScoringTechnology, Knowledge and Learning (TKL), 2024

340

24 Feb 2025

Representation Engineering for Large-Language Models: Survey and Research Challenges

410

24 Feb 2025

What are Models Thinking about? Understanding Large Language Model Hallucinations "Psychology" through Model Inner State Analysis

265

20 Feb 2025

Adaptive Tool Use in Large Language Models with Meta-Cognition TriggerAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

212

18 Feb 2025

Exploring the Translation Mechanism of Large Language Models

410

17 Feb 2025

Brain-Inspired Exploration of Functional Networks and Key Neurons in Large Language Models

...

Ning Qiang

Bao Ge

Tianming Liu

Junwei Han

Xintao Hu

164

13 Feb 2025

Fostering Appropriate Reliance on Large Language Models: The Role of Explanations, Sources, and InconsistenciesInternational Conference on Human Factors in Computing Systems (CHI), 2025

582

12 Feb 2025

Finding Words Associated with DIF: Predicting Differential Item Functioning using LLMs and Explainable AI

Hotaka Maeda

Yikai Lu

117

10 Feb 2025

Survey on AI-Generated Media Detection: From Non-MLLM to MLLM

700

07 Feb 2025

CueTip: An Interactive and Explainable Physics-aware Pool Assistant

310

30 Jan 2025

Clinical Insights: A Comprehensive Review of Language Models in MedicinePLOS Digital Health (PDH), 2024

567

08 Jan 2025

Putnam's Critical and Explanatory Tendencies Interpreted from a Machine Learning Perspective

Sheldon Z. Soudin

FAtt

110

06 Jan 2025

Embedding Style Beyond Topics: Analyzing Dispersion Effects Across Different Language ModelsInternational Conference on Computational Linguistics (COLING), 2025

Jean-Gabriel Ganascia

199

03 Jan 2025

Citations and Trust in LLM Generated Responses

Sanmitra Bhattacharya

Tim Weninger

HILM

319

03 Jan 2025

Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

461

02 Jan 2025

How Do Artificial Intelligences Think? The Three Mathematico-Cognitive Factors of Categorical Segmentation Operated by Synthetic Neurons

Michael Veillet-Guillem

276

26 Dec 2024

A Review of Multimodal Explainable Artificial Intelligence: Past, Present and Future

387

18 Dec 2024

The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human MotionComputer Vision and Pattern Recognition (CVPR), 2024

356

13 Dec 2024

A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future DirectionsACM Computing Surveys (ACM CSUR), 2024

432

07 Dec 2024

Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey

...

426

03 Dec 2024

Is Training Data Quality or Quantity More Impactful to Small Language Model Performance?

Aryan Sajith

Krishna Chaitanya Rao Kathala

261

24 Nov 2024

When Backdoors Speak: Understanding LLM Backdoor Attacks Through Model-Generated ExplanationsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

534

19 Nov 2024

ToxiLab: How Well Do Open-Source LLMs Generate Synthetic Toxicity Data?

432

18 Nov 2024

Education in the Era of Neurosymbolic AIJournal of Web Semantics (JWS), 2024

260

16 Nov 2024

The Systems Engineering Approach in Times of Large Language ModelsHawaii International Conference on System Sciences (HICSS), 2024

226

13 Nov 2024

Concept Bottleneck Language Models For protein design

Aya Abdelsalam Ismail

...

342

09 Nov 2024

AI Should Challenge, Not ObeyCommunications of the ACM (CACM), 2024

Advait Sarkar

362

04 Nov 2024

Attention Tracker: Detecting Prompt Injection Attacks in LLMsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

411

01 Nov 2024

Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers

1.2K

30 Oct 2024

Large Language Model-assisted Speech and Pointing Benefits Multiple 3D Object Selection in Virtual Reality

Junlong Chen

Jens Grubert

Per Ola Kristensson

133

28 Oct 2024

Brain-like Functional Organization within Large Language Models

228

25 Oct 2024

CogSteer: Cognition-Inspired Selective Layer Intervention for Efficiently Steering Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

265

23 Oct 2024

Enhancing Answer Attribution for Faithful Text Generation with Large Language ModelsInternational Conference on Knowledge Discovery and Information Retrieval (KDIR), 2024

Juraj Vladika

Luca Mülln

Florian Matthes

229

22 Oct 2024

On the Role of Attention Heads in Large Language Model SafetyInternational Conference on Learning Representations (ICLR), 2024

Kun Wang

Yang Liu

Cunchun Li

Yongbin Li

489

17 Oct 2024

PromptExp: Multi-granularity Prompt Explanation of Large Language Models

Ximing Dong

Shaowei Wang

Dayi Lin

Gopi Krishnan Rajbahadur

396

16 Oct 2024

Pixology: Probing the Linguistic and Visual Capabilities of Pixel-based Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

239

15 Oct 2024

Investigating Human-Computer Interaction and Visual Comprehension in Text Generation Process of Natural Language Generation Models

152

11 Oct 2024

Neuropsychology of AI: Relationship Between Activation Proximity and Categorical Proximity Within Neural Categories of Synthetic Cognition

Michael Veillet-Guillem

178

08 Oct 2024

Stereotype or Personalization? User Identity Biases Chatbot RecommendationsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

322

08 Oct 2024

MINER: Mining the Underlying Pattern of Modality-Specific Neurons in Multimodal Large Language Models

Kun Wang

Xuming Hu

249

07 Oct 2024