Explainability for Large Language Models: A Survey

ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023
2 September 2023
Haiyan Zhao
Hanjie Chen
Fan Yang
Ninghao Liu
Huiqi Deng
Hengyi Cai
Shuaiqiang Wang
D. Yin
Jundong Li
    LRM

Papers citing "Explainability for Large Language Models: A Survey"

50 / 288 papers shown
Using LLMs for Automated Privacy Policy Analysis: Prompt Engineering, Fine-Tuning and Explainability
Yuxin Chen
Peng Tang
Weidong Qiu
Shujun Li
172
2
0
16 Mar 2025
Reasoning-Grounded Natural Language Explanations for Language Models
Vojtech Cahlik
Rodrigo Alves
Pavel Kordík
LRM
279
3
0
14 Mar 2025
Hoi2Threat: An Interpretable Threat Detection Method for Human Violence Scenarios Guided by Human-Object Interaction
Yuhan Wang
Cheng Liu
Daou Zhang
Zihan Zhao
Jinyang Chen
Purui Dong
Zuyuan Yu
Ziru Wang
Weichao Wu
375
0
0
13 Mar 2025
Advanced Tool Learning and Selection System (ATLASS): A Closed-Loop Framework Using LLM
International Symposium on Service Oriented Software Engineering (ISSOSE), 2025
Mohd Ariful Haque
Justin Williams
Sunzida Siddique
Md. Hujaifa Islam
Hasmot Ali
Kishor Datta Gupta
Roy George
263
5
0
13 Mar 2025
TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention
Jinhao Duan
Fei Kong
Hao-Ran Cheng
James Diffenderfer
B. Kailkhura
Lichao Sun
Xiaofeng Zhu
Xiaoshuang Shi
Kaidi Xu
990
7
0
13 Mar 2025
I Predict Therefore I Am: Is Next Token Prediction Enough to Learn Human-Interpretable Concepts from Data?
Yuhang Liu
Dong Gong
Erdun Gao
Zhen Zhang
Biwei Huang
Anton van den Hengel
Javen Qinfeng Shi
1.1K
5
0
12 Mar 2025
Statistical Deficiency for Task Inclusion Estimation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Loïc Fosse
Frédéric Béchet
Benoit Favre
Géraldine Damnati
Gwénolé Lecorvé
Maxime Darrin
Philippe Formont
Pablo Piantanida
1.2K
1
0
07 Mar 2025
A Zero-shot Explainable Doctor Ranking Framework with Large Language Models
Ziyang Zeng
Dongyuan Li
Yuqing Yang
LM&MA
AI4TS
340
0
0
04 Mar 2025
Can LLMs Explain Themselves Counterfactually?
Zahra Dehghanighobadi
Asja Fischer
Muhammad Bilal Zafar
LRM
422
2
0
25 Feb 2025
VeriPlan: Integrating Formal Verification and LLMs into End-User Planning
International Conference on Human Factors in Computing Systems (CHI), 2025
Christine P. Lee
David J. Porfirio
Xinyu Jessica Wang
Kevin Zhao
Bilge Mutlu
526
21
0
25 Feb 2025
Unveiling Scoring Processes: Dissecting the Differences between LLMs and Human Graders in Automatic Scoring
Technology, Knowledge and Learning (TKL), 2024
Xuansheng Wu
Padmaja Pravin Saraf
Gyeong-Geon Lee
Ehsan Latif
Ninghao Liu
Xiaoming Zhai
340
26
0
24 Feb 2025
Representation Engineering for Large-Language Models: Survey and Research Challenges
Lukasz Bartoszcze
Sarthak Munshi
Bryan Sukidi
Jennifer Yen
Zejia Yang
David Williams-King
Linh Le
Kosi Asuzu
Carsten Maple
410
5
0
24 Feb 2025
What are Models Thinking about? Understanding Large Language Model Hallucinations "Psychology" through Model Inner State Analysis
Peiran Wang
Yang Liu
Yunfei Lu
Jue Hong
Ye Wu
HILM
LRM
265
1
0
20 Feb 2025
Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Wenjun Li
Dexun Li
Kuicai Dong
Cong Zhang
Hao Zhang
Weiwen Liu
Yasheng Wang
Ruiming Tang
Yong Liu
LLMAG
KELM
212
12
0
18 Feb 2025
Exploring the Translation Mechanism of Large Language Models
Hongbin Zhang
Kehai Chen
Xuefeng Bai
Xiucheng Li
Yang Xiang
Min Zhang
410
2
0
17 Feb 2025
Brain-Inspired Exploration of Functional Networks and Key Neurons in Large Language Models
Yiheng Liu
Xiaohui Gao
Haiyang Sun
Bao Ge
Tianming Liu
...
Ning Qiang
Bao Ge
Tianming Liu
Junwei Han
Xintao Hu
164
2
0
13 Feb 2025
Fostering Appropriate Reliance on Large Language Models: The Role of Explanations, Sources, and Inconsistencies
International Conference on Human Factors in Computing Systems (CHI), 2025
Sunnie S. Y. Kim
J. Vaughan
Q. V. Liao
Tania Lombrozo
Olga Russakovsky
582
26
0
12 Feb 2025
Finding Words Associated with DIF: Predicting Differential Item Functioning using LLMs and Explainable AI
Hotaka Maeda
Yikai Lu
117
0
0
10 Feb 2025
Survey on AI-Generated Media Detection: From Non-MLLM to MLLM
Yueying Zou
Peipei Li
Zekun Li
Huaibo Huang
Xing Cui
Xuannan Liu
Chenghanyu Zhang
Ran He
DeLMO
700
11
0
07 Feb 2025
CueTip: An Interactive and Explainable Physics-aware Pool Assistant
Sean Memery
Kevin Denamganai
Jiaxin Zhang
Zehai Tu
Yiwen Guo
Kartic Subr
LRM
310
1
0
30 Jan 2025
Clinical Insights: A Comprehensive Review of Language Models in Medicine
PLOS Digital Health (PDH), 2024
Nikita Neveditsin
Pawan Lingras
V. Mago
LM&MA
567
17
0
08 Jan 2025
Putnam's Critical and Explanatory Tendencies Interpreted from a Machine Learning Perspective
Sheldon Z. Soudin
FAtt
110
0
0
06 Jan 2025
Embedding Style Beyond Topics: Analyzing Dispersion Effects Across Different Language Models
International Conference on Computational Linguistics (COLING), 2025
Benjamin Icard
Evangelia Zve
Lila Sainero
Alice Breton
Jean-Gabriel Ganascia
199
2
0
03 Jan 2025
Citations and Trust in LLM Generated Responses
Yifan Ding
Matthew Facciani
Amrit Poudel
Ellen Joyce
Salvador Aguiñaga
Balaji Veeramani
Sanmitra Bhattacharya
Tim Weninger
HILM
319
11
0
03 Jan 2025
Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yanwen Huang
Yong Zhang
Ning Cheng
Zhitao Li
Shaojun Wang
Jing Xiao
461
4
0
02 Jan 2025
How Do Artificial Intelligences Think? The Three Mathematico-Cognitive Factors of Categorical Segmentation Operated by Synthetic Neurons
Michael Pichat
William Pogrund
Armanush Gasparian
Paloma Pichat
Samuel Demarchi
Michael Veillet-Guillem
276
3
0
26 Dec 2024
A Review of Multimodal Explainable Artificial Intelligence: Past, Present and Future
Shilin Sun
Wenbin An
Feng Tian
Fang Nan
Qidong Liu
Jing Liu
N. Shah
Ping Chen
387
20
0
18 Dec 2024
The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion
Computer Vision and Pattern Recognition (CVPR), 2024
Changan Chen
Juze Zhang
S. K. Lakshmikanth
Yusu Fang
Ruizhi Shao
Gordon Wetzstein
L. Fei-Fei
Ehsan Adeli
VGen
356
16
0
13 Dec 2024
A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions
ACM Computing Surveys (ACM CSUR), 2024
Ola Shorinwa
Zhiting Mei
Justin Lidard
Allen Z. Ren
Anirudha Majumdar
HILM
LRM
432
19
0
07 Dec 2024
Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey
Yunkai Dang
Kaichen Huang
Jiahao Huo
Yibo Yan
Shijie Huang
...
Kun Wang
Yong Liu
Jing Shao
Hui Xiong
Xuming Hu
LRM
426
51
0
03 Dec 2024
Is Training Data Quality or Quantity More Impactful to Small Language Model Performance?
Aryan Sajith
Krishna Chaitanya Rao Kathala
261
4
0
24 Nov 2024
When Backdoors Speak: Understanding LLM Backdoor Attacks Through Model-Generated Explanations
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Huaizhi Ge
Yiming Li
Qifan Wang
Yongfeng Zhang
Ruixiang Tang
AAML
SILM
534
11
0
19 Nov 2024
ToxiLab: How Well Do Open-Source LLMs Generate Synthetic Toxicity Data?
Zheng Hui
Zhaoxiao Guo
Hang Zhao
Juanyong Duan
Lin Ai
Yinheng Li
Julia Hirschberg
Congrui Huang
432
1
0
18 Nov 2024
Education in the Era of Neurosymbolic AI
Journal of Web Semantics (JWS), 2024
Chris Davis Jaldi
Eleni Ilkou
Noah Schroeder
Cogan Shimizu
260
16
0
16 Nov 2024
The Systems Engineering Approach in Times of Large Language Models
Hawaii International Conference on System Sciences (HICSS), 2024
Christian Cabrera
Viviana Bastidas
Jennifer Schooling
Neil D. Lawrence
226
2
0
13 Nov 2024
Concept Bottleneck Language Models For protein design
Aya Abdelsalam Ismail
Tuomas Oikarinen
Amy Wang
Julius Adebayo
Samuel Stanton
...
J. Kleinhenz
Allen Goodman
H. C. Bravo
Dong Wang
Nathan C. Frey
342
13
0
09 Nov 2024
AI Should Challenge, Not Obey
Communications of the ACM (CACM), 2024
Advait Sarkar
362
29
0
04 Nov 2024
Attention Tracker: Detecting Prompt Injection Attacks in LLMs
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Kuo-Han Hung
Ching-Yun Ko
Ambrish Rawat
I-Hsin Chung
Winston H. Hsu
Pin-Yu Chen
411
54
0
01 Nov 2024
Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers
Lam Nguyen Tung
Steven Cho
Xiaoning Du
Neelofar Neelofar
Valerio Terragni
Stefano Ruberto
Aldeida Aleti
1.2K
3
0
30 Oct 2024
Large Language Model-assisted Speech and Pointing Benefits Multiple 3D Object Selection in Virtual Reality
Junlong Chen
Jens Grubert
Per Ola Kristensson
133
1
0
28 Oct 2024
Brain-like Functional Organization within Large Language Models
Haiyang Sun
Lin Zhao
Zihao Wu
Xiaohui Gao
Yutao Hu
Mengfei Zuo
Weinan Zhang
Junwei Han
Tianming Liu
X. Hu
228
2
0
25 Oct 2024
CogSteer: Cognition-Inspired Selective Layer Intervention for Efficiently Steering Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Xintong Wang
Jingheng Pan
Longqin Jiang
Liang Ding
Xingshan Li
Chris Biemann
LLMSV
265
0
0
23 Oct 2024
Enhancing Answer Attribution for Faithful Text Generation with Large Language Models
International Conference on Knowledge Discovery and Information Retrieval (KDIR), 2024
Juraj Vladika
Luca Mülln
Florian Matthes
229
0
0
22 Oct 2024
On the Role of Attention Heads in Large Language Model Safety
International Conference on Learning Representations (ICLR), 2024
Zhenhong Zhou
Haiyang Yu
Xinghua Zhang
Rongwu Xu
Fei Huang
Kun Wang
Yang Liu
Cunchun Li
Yongbin Li
489
37
0
17 Oct 2024
PromptExp: Multi-granularity Prompt Explanation of Large Language Models
Ximing Dong
Shaowei Wang
Dayi Lin
Gopi Krishnan Rajbahadur
Boquan Zhou
Shichao Liu
Ahmed E. Hassan
AAML
LRM
396
4
0
16 Oct 2024
Pixology: Probing the Linguistic and Visual Capabilities of Pixel-based Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Kushal Tatariya
Vladimir Araujo
Thomas Bauwens
Miryam de Lhoneux
VLM
239
1
0
15 Oct 2024
Investigating Human-Computer Interaction and Visual Comprehension in Text Generation Process of Natural Language Generation Models
Yunchao Wang
Zihang Fu
Chaoqing Xu
Guodao Sun
Ronghua Liang
152
0
0
11 Oct 2024
Neuropsychology of AI: Relationship Between Activation Proximity and Categorical Proximity Within Neural Categories of Synthetic Cognition
Michael Pichat
Enola Campoli
William Pogrund
Jourdan Wilson
Michael Veillet-Guillem
Anton Melkozerov
Paloma Pichat
Armanouche Gasparian
Samuel Demarchi
Judicael Poumay
NAI
178
3
0
08 Oct 2024
Stereotype or Personalization? User Identity Biases Chatbot Recommendations
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Anjali Kantharuban
Jeremiah Milbauer
Emma Strubell
Graham Neubig
322
26
0
08 Oct 2024
MINER: Mining the Underlying Pattern of Modality-Specific Neurons in Multimodal Large Language Models
Kaichen Huang
Jiahao Huo
Yibo Yan
Kun Wang
Yutao Yue
Xuming Hu
249
2
0
07 Oct 2024