Title
AI-based Traffic Modeling for Network Security and Privacy: Challenges Ahead Dinil Mon Divakaran AAML 244 2 0 24 Dec 2025
Neural Architecture Search for Quantum AutoencodersInternational Conference on Quantum Computing and Engineering (QCE), 2025 Hibah Agha Samuel Yen-Chi Chen Huan-Hsin Tseng Shinjae Yoo AI4CE 232 0 0 24 Nov 2025
Alignment Faking - the Train -> Deploy Asymmetry: Through a Game-Theoretic Lens with Bayesian-Stackelberg Equilibria Kartik Garg Shourya Mishra Kartikeya Sinha Ojaswi Pratap Singh Ayush Chopra ... Ammar Sheikh Raghav Maheshwari Aman Chadha Vinija Jain Amitava Das OffRL 145 0 0 22 Nov 2025
Patient-level Information Extraction by Consistent Integration of Textual and Tabular Evidence with Bayesian Networks Paloma Rabaey Adrick Tench Stefan Heytens Thomas Demeester 60 0 0 21 Nov 2025
Robustness of LLM-enabled vehicle trajectory prediction under data security threats Feilong Wang Fuqiang Liu AAML 85 0 0 14 Nov 2025
Beyond Redundancy: Diverse and Specialized Multi-Expert Sparse Autoencoder Zhen Xu Zhen Tan Song Wang Kaidi Xu Tianlong Chen MoE 246 0 0 07 Nov 2025
Ask WhAI:Probing Belief Formation in Role-Primed LLM Agents Keith Moore Jun W. Kim David Lyu Jeffrey Heo Ehsan Adeli 65 0 0 06 Nov 2025
KnowThyself: An Agentic Assistant for LLM Interpretability Suraj Prasai Mengnan Du Y. Zhang Fan Yang 77 1 0 05 Nov 2025
Explainability of Large Language Models: Opportunities and Challenges toward Generating Trustworthy Explanations Shahin Atakishiyev H. Babiker Jiayi Dai Nawshad Farruque Teruaki Hayashi ... Md Abed Rahman Iain Smith Mi-Young Kim Osmar R. Zaïane Randy Goebel LRM 133 0 0 20 Oct 2025
QLENS: Towards A Quantum Perspective of Language Transformers Aditya Gupta Kirandeep Kaur Manya Chadha Chirag Shah AI4CE 96 0 0 13 Oct 2025
From Explainability to Action: A Generative Operational Framework for Integrating XAI in Clinical Mental Health Screening Ratna Kandala Akshata Kishore Moharir Divya Arvinda Nayak 89 0 0 10 Oct 2025
FaithCoT-Bench: Benchmarking Instance-Level Faithfulness of Chain-of-Thought Reasoning Xu Shen Song Wang Zhen Tan Laura Yao Xinyu Zhao Kaidi Xu X. Wang Tianlong Chen LRM 172 0 0 05 Oct 2025
LLM Chemistry Estimation for Multi-LLM Recommendation H. Sánchez Briland Hitaj 84 1 0 04 Oct 2025
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information Jiaxi Li Yucheng Shi Jin Lu Ninghao Liu LRM 120 0 0 04 Oct 2025
A Qualitative Comparative Evaluation of Cognitive and Generative Theories Paul S. Rosenbloom ELM 44 0 0 03 Oct 2025
Understanding the Dilemma of Unlearning for Large Language Models Qingjie Zhang Haoting Qian Zhicong Huang Cheng Hong Shiyu Huang Ke Xu Chao Zhang Han Qiu MU 232 1 0 29 Sep 2025
Explaining Fine Tuned LLMs via Counterfactuals A Knowledge Graph Driven Framework Y Samuel Wang Ziyang Chen Md Faisal Kabir OffRL 92 0 0 25 Sep 2025
Beyond Stars: Bridging the Gap Between Ratings and Review Sentiment with LLM Najla Zuhir Amna Mohammad Salim Parvathy Premkumar Moshiur Farazi 92 0 0 25 Sep 2025
Towards Transparent AI: A Survey on Explainable Language Models Avash Palikhe Sribala Vidyadhari Chinta Zhipeng Yin Rui Guo Qiang Duan Jie Yang Wenbin Zhang 148 1 0 25 Sep 2025
Uncovering Graph Reasoning in Decoder-only Transformers with Circuit Tracing Xinnan Dai Chung-Hsiang Lo Kai Guo Shenglai Zeng Dongsheng Luo Shucheng Zhou 105 1 0 24 Sep 2025
Revealing Adversarial Smart Contracts through Semantic Interpretation and Uncertainty Estimation Yating Liu Xing Su Hao Wu Sijin Li Y. Cheng Fengyuan Xu Sheng Zhong AAML 143 0 0 23 Sep 2025
Leveraging NTPs for Efficient Hallucination Detection in VLMs Ofir Azachi Kfir Eliyahu Eyal El Ani Rom Himelstein Roi Reichart Yuval Pinter Nitay Calderon VLM 109 0 0 20 Sep 2025
Defining and Monitoring Complex Robot Activities via LLMs and Symbolic Reasoning F. Argenziano Elena Umili Francesco Leotta Daniele Nardi LLMAG 128 0 0 19 Sep 2025
REFER: Mitigating Bias in Opinion Summarisation via Frequency Framed Prompting Nannan Huang Haytham M. Fayek Xiuzhen Zhang 92 0 0 19 Sep 2025
V-SEAM: Visual Semantic Editing and Attention Modulating for Causal Interpretability of Vision-Language Models Qidong Wang Junjie Hu Ming Jiang 72 0 0 18 Sep 2025
From Embeddings to Equations: Genetic-Programming Surrogates for Interpretable Transformer Classification M. S. Khorshidi Navid Yazdanjue Hassan Gharoun M. Nikoo Fang Chen Amir H. Gandomi 120 1 0 16 Sep 2025
SME-TEAM: Leveraging Trust and Ethics for Secure and Responsible Use of AI and LLMs in SMEs Iqbal H. Sarker Helge Janicke Ahmad Mohsin Leandros A. Maglaras 59 0 0 12 Sep 2025
Can AI Make Energy Retrofit Decisions? An Evaluation of Large Language Models Lei Shu Dong Zhao 83 1 0 08 Sep 2025
Triadic Fusion of Cognitive, Functional, and Causal Dimensions for Explainable LLMs: The TAXAL Framework David Herrera-Poyatos Carlos Peláez-González Cristina Zuheros Virilo Tejedor Rosana Montes F. Herrera 72 0 0 05 Sep 2025
NeuroBreak: Unveil Internal Jailbreak Mechanisms in Large Language Models Chuhan Zhang Ye Zhang Bowen Shi Yuyou Gan Xuhong Zhang S. Ji Dazhan Deng Yingcai Wu AAML 108 0 0 04 Sep 2025
Improving Narrative Classification and Explanation via Fine Tuned Language Models Rishit Tyagi Rahul Bouri Mohit Gupta 70 1 0 04 Sep 2025
Do LLMs Adhere to Label Definitions? Examining Their Receptivity to External Label Definitions Seyedali Mohammadi Bhaskara Hanuma Vedula Hemank Lamba Edward Raff Ponnurangam Kumaraguru Francis Ferraro Manas Gaur 155 0 0 02 Sep 2025
AHAMask: Reliable Task Specification for Large Audio Language Models without Instructions Yiwei Guo Bohan Li Hankun Wang Zhihan Li Shuai Wang Xie Chen K. Yu AuLLM 411 1 0 01 Sep 2025
Safety Alignment Should Be Made More Than Just A Few Attention Heads Chao Huang Zefeng Zhang Juewei Yue Quangang Li Chuang Zhang Tingwen Liu AAML 97 0 0 27 Aug 2025
A Case Study on the Effectiveness of LLMs in Verification with Proof Assistants Barış Bayazıt Yao Li Xujie Si 68 1 0 26 Aug 2025
AdaptiveK Sparse Autoencoders: Dynamic Sparsity Allocation for Interpretable LLM Representations Yifei Yao Mengnan Du 115 0 0 24 Aug 2025
Foundational Design Principles and Patterns for Building Robust and Adaptive GenAI-Native Systems Frederik Vandeputte AI4TS 112 2 0 21 Aug 2025
Overcoming Knowledge Discrepancies: Structuring Reasoning Threads through Knowledge Balancing in Interactive Scenarios Daniel Burkhardt Xiangwei Cheng LRM 81 0 0 16 Aug 2025
Learning Marked Temporal Point Process Explanations based on Counterfactual and Factual Reasoning Sishun Liu Ke Deng Xiuzhen Zhang Yan Wang AI4TS LRM 89 0 0 16 Aug 2025
When Explainability Meets Privacy: An Investigation at the Intersection of Post-hoc Explainability and Differential Privacy in the Context of Natural Language Processing Mahdi Dhaini Stephen Meisenbacher Ege Erdogan Florian Matthes Gjergji Kasneci SILM 181 0 0 14 Aug 2025
Adoption of Explainable Natural Language Processing: Perspectives from Industry and Academia on Practices and Challenges Mahdi Dhaini Tobias Müller Roksoliana Rabets Gjergji Kasneci 48 0 0 13 Aug 2025
SMA: Who Said That? Auditing Membership Leakage in Semi-Black-box RAG Controlling Shixuan Sun Yaning Tan Ruoyu Chen Jianjie Huang Jingzhi Li Xiaochun Cao 226 0 0 12 Aug 2025
Attribution Explanations for Deep Neural Networks: A Theoretical Perspective Huiqi Deng Hongbin Pei Quanshi Zhang Mengnan Du FAtt 154 1 0 11 Aug 2025
A Multi-Stage Large Language Model Framework for Extracting Suicide-Related Social Determinants of HealthCommunications Medicine (Commun. Med.), 2025 Song Wang Yishu Wei Haotian Ma Max Lovitt Kelly Deng ... Yunyu Xiao Ying Ding Xuhai Xu Joydeep Ghosh Yifan Peng 49 0 0 07 Aug 2025
Word Overuse and Alignment in Large Language Models: The Influence of Learning from Human Feedback Tom S. Juzek Zina B. Ward 110 0 0 03 Aug 2025
Comparison of Large Language Models for Deployment Requirements Alper Yaman Jannik Schwab C. Nitsche Abhirup Sinha Marco F. Huber ELM 84 0 0 31 Jul 2025
Multimodal Behavioral Patterns Analysis with Eye-Tracking and LLM-Based ReasoningInternational Conference on Multimodal Interaction (ICMI), 2025 Dongyang Guo Yasmeen Abdrabou Enkeleda Thaqi Enkelejda Kasneci 105 0 0 24 Jul 2025
Speaker Disentanglement of Speech Pre-trained Model Based on Interpretability Xiaoxu Zhu Junhua Li Aaron J. Li Yiming Ren Baoxiang Li 136 0 0 19 Jul 2025
Assessing the Reliability of LLMs Annotations in the Context of Demographic Bias and Model Explanation Hadi Mohammadi Tina Shahedi Pablo Mosteiro Massimo Poesio Ayoub Bagheri Anastasia Giachanou 185 1 0 17 Jul 2025
Let's Measure the Elephant in the Room: Facilitating Personalized Automated Analysis of Privacy Policies at Scale Rui Zhao Vladyslav Melnychuk Jun Zhao Jesse Wright N. Shadbolt 88 1 0 15 Jul 2025