ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.01029
  4. Cited By
Explainability for Large Language Models: A Survey
v1v2v3 (latest)

Explainability for Large Language Models: A Survey

ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023
2 September 2023
Haiyan Zhao
Hanjie Chen
Fan Yang
Ninghao Liu
Huiqi Deng
Hengyi Cai
Shuaiqiang Wang
D. Yin
Jundong Li
    LRM
ArXiv (abs)PDFHTML

Papers citing "Explainability for Large Language Models: A Survey"

50 / 287 papers shown
Title
AI-based Traffic Modeling for Network Security and Privacy: Challenges Ahead
AI-based Traffic Modeling for Network Security and Privacy: Challenges Ahead
Dinil Mon Divakaran
AAML
244
2
0
24 Dec 2025
Neural Architecture Search for Quantum Autoencoders
Neural Architecture Search for Quantum AutoencodersInternational Conference on Quantum Computing and Engineering (QCE), 2025
Hibah Agha
Samuel Yen-Chi Chen
Huan-Hsin Tseng
Shinjae Yoo
AI4CE
232
0
0
24 Nov 2025
Alignment Faking - the Train -> Deploy Asymmetry: Through a Game-Theoretic Lens with Bayesian-Stackelberg Equilibria
Alignment Faking - the Train -> Deploy Asymmetry: Through a Game-Theoretic Lens with Bayesian-Stackelberg Equilibria
Kartik Garg
Shourya Mishra
Kartikeya Sinha
Ojaswi Pratap Singh
Ayush Chopra
...
Ammar Sheikh
Raghav Maheshwari
Aman Chadha
Vinija Jain
Amitava Das
OffRL
145
0
0
22 Nov 2025
Patient-level Information Extraction by Consistent Integration of Textual and Tabular Evidence with Bayesian Networks
Patient-level Information Extraction by Consistent Integration of Textual and Tabular Evidence with Bayesian Networks
Paloma Rabaey
Adrick Tench
Stefan Heytens
Thomas Demeester
60
0
0
21 Nov 2025
Robustness of LLM-enabled vehicle trajectory prediction under data security threats
Robustness of LLM-enabled vehicle trajectory prediction under data security threats
Feilong Wang
Fuqiang Liu
AAML
85
0
0
14 Nov 2025
Beyond Redundancy: Diverse and Specialized Multi-Expert Sparse Autoencoder
Beyond Redundancy: Diverse and Specialized Multi-Expert Sparse Autoencoder
Zhen Xu
Zhen Tan
Song Wang
Kaidi Xu
Tianlong Chen
MoE
246
0
0
07 Nov 2025
Ask WhAI:Probing Belief Formation in Role-Primed LLM Agents
Ask WhAI:Probing Belief Formation in Role-Primed LLM Agents
Keith Moore
Jun W. Kim
David Lyu
Jeffrey Heo
Ehsan Adeli
65
0
0
06 Nov 2025
KnowThyself: An Agentic Assistant for LLM Interpretability
KnowThyself: An Agentic Assistant for LLM Interpretability
Suraj Prasai
Mengnan Du
Y. Zhang
Fan Yang
77
1
0
05 Nov 2025
Explainability of Large Language Models: Opportunities and Challenges toward Generating Trustworthy Explanations
Explainability of Large Language Models: Opportunities and Challenges toward Generating Trustworthy Explanations
Shahin Atakishiyev
H. Babiker
Jiayi Dai
Nawshad Farruque
Teruaki Hayashi
...
Md Abed Rahman
Iain Smith
Mi-Young Kim
Osmar R. Zaïane
Randy Goebel
LRM
133
0
0
20 Oct 2025
QLENS: Towards A Quantum Perspective of Language Transformers
QLENS: Towards A Quantum Perspective of Language Transformers
Aditya Gupta
Kirandeep Kaur
Manya Chadha
Chirag Shah
AI4CE
96
0
0
13 Oct 2025
From Explainability to Action: A Generative Operational Framework for Integrating XAI in Clinical Mental Health Screening
From Explainability to Action: A Generative Operational Framework for Integrating XAI in Clinical Mental Health Screening
Ratna Kandala
Akshata Kishore Moharir
Divya Arvinda Nayak
89
0
0
10 Oct 2025
FaithCoT-Bench: Benchmarking Instance-Level Faithfulness of Chain-of-Thought Reasoning
FaithCoT-Bench: Benchmarking Instance-Level Faithfulness of Chain-of-Thought Reasoning
Xu Shen
Song Wang
Zhen Tan
Laura Yao
Xinyu Zhao
Kaidi Xu
X. Wang
Tianlong Chen
LRM
172
0
0
05 Oct 2025
LLM Chemistry Estimation for Multi-LLM Recommendation
LLM Chemistry Estimation for Multi-LLM Recommendation
H. Sánchez
Briland Hitaj
84
1
0
04 Oct 2025
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information
Jiaxi Li
Yucheng Shi
Jin Lu
Ninghao Liu
LRM
120
0
0
04 Oct 2025
A Qualitative Comparative Evaluation of Cognitive and Generative Theories
A Qualitative Comparative Evaluation of Cognitive and Generative Theories
Paul S. Rosenbloom
ELM
44
0
0
03 Oct 2025
Understanding the Dilemma of Unlearning for Large Language Models
Understanding the Dilemma of Unlearning for Large Language Models
Qingjie Zhang
Haoting Qian
Zhicong Huang
Cheng Hong
Shiyu Huang
Ke Xu
Chao Zhang
Han Qiu
MU
232
1
0
29 Sep 2025
Explaining Fine Tuned LLMs via Counterfactuals A Knowledge Graph Driven Framework
Explaining Fine Tuned LLMs via Counterfactuals A Knowledge Graph Driven Framework
Y Samuel Wang
Ziyang Chen
Md Faisal Kabir
OffRL
92
0
0
25 Sep 2025
Beyond Stars: Bridging the Gap Between Ratings and Review Sentiment with LLM
Beyond Stars: Bridging the Gap Between Ratings and Review Sentiment with LLM
Najla Zuhir
Amna Mohammad Salim
Parvathy Premkumar
Moshiur Farazi
92
0
0
25 Sep 2025
Towards Transparent AI: A Survey on Explainable Language Models
Towards Transparent AI: A Survey on Explainable Language Models
Avash Palikhe
Sribala Vidyadhari Chinta
Zhipeng Yin
Rui Guo
Qiang Duan
Jie Yang
Wenbin Zhang
148
1
0
25 Sep 2025
Uncovering Graph Reasoning in Decoder-only Transformers with Circuit Tracing
Uncovering Graph Reasoning in Decoder-only Transformers with Circuit Tracing
Xinnan Dai
Chung-Hsiang Lo
Kai Guo
Shenglai Zeng
Dongsheng Luo
Shucheng Zhou
105
1
0
24 Sep 2025
Revealing Adversarial Smart Contracts through Semantic Interpretation and Uncertainty Estimation
Revealing Adversarial Smart Contracts through Semantic Interpretation and Uncertainty Estimation
Yating Liu
Xing Su
Hao Wu
Sijin Li
Y. Cheng
Fengyuan Xu
Sheng Zhong
AAML
143
0
0
23 Sep 2025
Leveraging NTPs for Efficient Hallucination Detection in VLMs
Leveraging NTPs for Efficient Hallucination Detection in VLMs
Ofir Azachi
Kfir Eliyahu
Eyal El Ani
Rom Himelstein
Roi Reichart
Yuval Pinter
Nitay Calderon
VLM
109
0
0
20 Sep 2025
Defining and Monitoring Complex Robot Activities via LLMs and Symbolic Reasoning
Defining and Monitoring Complex Robot Activities via LLMs and Symbolic Reasoning
F. Argenziano
Elena Umili
Francesco Leotta
Daniele Nardi
LLMAG
128
0
0
19 Sep 2025
REFER: Mitigating Bias in Opinion Summarisation via Frequency Framed Prompting
REFER: Mitigating Bias in Opinion Summarisation via Frequency Framed Prompting
Nannan Huang
Haytham M. Fayek
Xiuzhen Zhang
92
0
0
19 Sep 2025
V-SEAM: Visual Semantic Editing and Attention Modulating for Causal Interpretability of Vision-Language Models
V-SEAM: Visual Semantic Editing and Attention Modulating for Causal Interpretability of Vision-Language Models
Qidong Wang
Junjie Hu
Ming Jiang
72
0
0
18 Sep 2025
From Embeddings to Equations: Genetic-Programming Surrogates for Interpretable Transformer Classification
From Embeddings to Equations: Genetic-Programming Surrogates for Interpretable Transformer Classification
M. S. Khorshidi
Navid Yazdanjue
Hassan Gharoun
M. Nikoo
Fang Chen
Amir H. Gandomi
120
1
0
16 Sep 2025
SME-TEAM: Leveraging Trust and Ethics for Secure and Responsible Use of AI and LLMs in SMEs
SME-TEAM: Leveraging Trust and Ethics for Secure and Responsible Use of AI and LLMs in SMEs
Iqbal H. Sarker
Helge Janicke
Ahmad Mohsin
Leandros A. Maglaras
59
0
0
12 Sep 2025
Can AI Make Energy Retrofit Decisions? An Evaluation of Large Language Models
Can AI Make Energy Retrofit Decisions? An Evaluation of Large Language Models
Lei Shu
Dong Zhao
83
1
0
08 Sep 2025
Triadic Fusion of Cognitive, Functional, and Causal Dimensions for Explainable LLMs: The TAXAL Framework
Triadic Fusion of Cognitive, Functional, and Causal Dimensions for Explainable LLMs: The TAXAL Framework
David Herrera-Poyatos
Carlos Peláez-González
Cristina Zuheros
Virilo Tejedor
Rosana Montes
F. Herrera
72
0
0
05 Sep 2025
NeuroBreak: Unveil Internal Jailbreak Mechanisms in Large Language Models
NeuroBreak: Unveil Internal Jailbreak Mechanisms in Large Language Models
Chuhan Zhang
Ye Zhang
Bowen Shi
Yuyou Gan
Xuhong Zhang
S. Ji
Dazhan Deng
Yingcai Wu
AAML
108
0
0
04 Sep 2025
Improving Narrative Classification and Explanation via Fine Tuned Language Models
Improving Narrative Classification and Explanation via Fine Tuned Language Models
Rishit Tyagi
Rahul Bouri
Mohit Gupta
70
1
0
04 Sep 2025
Do LLMs Adhere to Label Definitions? Examining Their Receptivity to External Label Definitions
Do LLMs Adhere to Label Definitions? Examining Their Receptivity to External Label Definitions
Seyedali Mohammadi
Bhaskara Hanuma Vedula
Hemank Lamba
Edward Raff
Ponnurangam Kumaraguru
Francis Ferraro
Manas Gaur
155
0
0
02 Sep 2025
AHAMask: Reliable Task Specification for Large Audio Language Models without Instructions
AHAMask: Reliable Task Specification for Large Audio Language Models without Instructions
Yiwei Guo
Bohan Li
Hankun Wang
Zhihan Li
Shuai Wang
Xie Chen
K. Yu
AuLLM
411
1
0
01 Sep 2025
Safety Alignment Should Be Made More Than Just A Few Attention Heads
Safety Alignment Should Be Made More Than Just A Few Attention Heads
Chao Huang
Zefeng Zhang
Juewei Yue
Quangang Li
Chuang Zhang
Tingwen Liu
AAML
97
0
0
27 Aug 2025
A Case Study on the Effectiveness of LLMs in Verification with Proof Assistants
A Case Study on the Effectiveness of LLMs in Verification with Proof Assistants
Barış Bayazıt
Yao Li
Xujie Si
68
1
0
26 Aug 2025
AdaptiveK Sparse Autoencoders: Dynamic Sparsity Allocation for Interpretable LLM Representations
AdaptiveK Sparse Autoencoders: Dynamic Sparsity Allocation for Interpretable LLM Representations
Yifei Yao
Mengnan Du
115
0
0
24 Aug 2025
Foundational Design Principles and Patterns for Building Robust and Adaptive GenAI-Native Systems
Foundational Design Principles and Patterns for Building Robust and Adaptive GenAI-Native Systems
Frederik Vandeputte
AI4TS
112
2
0
21 Aug 2025
Overcoming Knowledge Discrepancies: Structuring Reasoning Threads through Knowledge Balancing in Interactive Scenarios
Overcoming Knowledge Discrepancies: Structuring Reasoning Threads through Knowledge Balancing in Interactive Scenarios
Daniel Burkhardt
Xiangwei Cheng
LRM
81
0
0
16 Aug 2025
Learning Marked Temporal Point Process Explanations based on Counterfactual and Factual Reasoning
Learning Marked Temporal Point Process Explanations based on Counterfactual and Factual Reasoning
Sishun Liu
Ke Deng
Xiuzhen Zhang
Yan Wang
AI4TSLRM
89
0
0
16 Aug 2025
When Explainability Meets Privacy: An Investigation at the Intersection of Post-hoc Explainability and Differential Privacy in the Context of Natural Language Processing
When Explainability Meets Privacy: An Investigation at the Intersection of Post-hoc Explainability and Differential Privacy in the Context of Natural Language Processing
Mahdi Dhaini
Stephen Meisenbacher
Ege Erdogan
Florian Matthes
Gjergji Kasneci
SILM
181
0
0
14 Aug 2025
Adoption of Explainable Natural Language Processing: Perspectives from Industry and Academia on Practices and Challenges
Adoption of Explainable Natural Language Processing: Perspectives from Industry and Academia on Practices and Challenges
Mahdi Dhaini
Tobias Müller
Roksoliana Rabets
Gjergji Kasneci
48
0
0
13 Aug 2025
SMA: Who Said That? Auditing Membership Leakage in Semi-Black-box RAG Controlling
SMA: Who Said That? Auditing Membership Leakage in Semi-Black-box RAG Controlling
Shixuan Sun
Yaning Tan
Ruoyu Chen
Jianjie Huang
Jingzhi Li
Xiaochun Cao
226
0
0
12 Aug 2025
Attribution Explanations for Deep Neural Networks: A Theoretical Perspective
Attribution Explanations for Deep Neural Networks: A Theoretical Perspective
Huiqi Deng
Hongbin Pei
Quanshi Zhang
Mengnan Du
FAtt
154
1
0
11 Aug 2025
A Multi-Stage Large Language Model Framework for Extracting Suicide-Related Social Determinants of Health
A Multi-Stage Large Language Model Framework for Extracting Suicide-Related Social Determinants of HealthCommunications Medicine (Commun. Med.), 2025
Song Wang
Yishu Wei
Haotian Ma
Max Lovitt
Kelly Deng
...
Yunyu Xiao
Ying Ding
Xuhai Xu
Joydeep Ghosh
Yifan Peng
49
0
0
07 Aug 2025
Word Overuse and Alignment in Large Language Models: The Influence of Learning from Human Feedback
Word Overuse and Alignment in Large Language Models: The Influence of Learning from Human Feedback
Tom S. Juzek
Zina B. Ward
110
0
0
03 Aug 2025
Comparison of Large Language Models for Deployment Requirements
Comparison of Large Language Models for Deployment Requirements
Alper Yaman
Jannik Schwab
C. Nitsche
Abhirup Sinha
Marco F. Huber
ELM
84
0
0
31 Jul 2025
Multimodal Behavioral Patterns Analysis with Eye-Tracking and LLM-Based Reasoning
Multimodal Behavioral Patterns Analysis with Eye-Tracking and LLM-Based ReasoningInternational Conference on Multimodal Interaction (ICMI), 2025
Dongyang Guo
Yasmeen Abdrabou
Enkeleda Thaqi
Enkelejda Kasneci
105
0
0
24 Jul 2025
Speaker Disentanglement of Speech Pre-trained Model Based on Interpretability
Speaker Disentanglement of Speech Pre-trained Model Based on Interpretability
Xiaoxu Zhu
Junhua Li
Aaron J. Li
Yiming Ren
Baoxiang Li
136
0
0
19 Jul 2025
Assessing the Reliability of LLMs Annotations in the Context of Demographic Bias and Model Explanation
Assessing the Reliability of LLMs Annotations in the Context of Demographic Bias and Model Explanation
Hadi Mohammadi
Tina Shahedi
Pablo Mosteiro
Massimo Poesio
Ayoub Bagheri
Anastasia Giachanou
185
1
0
17 Jul 2025
Let's Measure the Elephant in the Room: Facilitating Personalized Automated Analysis of Privacy Policies at Scale
Let's Measure the Elephant in the Room: Facilitating Personalized Automated Analysis of Privacy Policies at Scale
Rui Zhao
Vladyslav Melnychuk
Jun Zhao
Jesse Wright
N. Shadbolt
88
1
0
15 Jul 2025
123456
Next