VISIT: Visualizing and Interpreting the Semantic Information Flow of Transformers

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
22 May 2023
Shahar Katz
Yonatan Belinkov

Papers citing "VISIT: Visualizing and Interpreting the Semantic Information Flow of Transformers"

26 / 26 papers shown
Understanding Post-Training Structural Changes in Large Language Models
Xinyu He
Xianghui Cao
158
0
0
22 Sep 2025
Context Copying Modulation: The Role of Entropy Neurons in Managing Parametric and Contextual Knowledge Conflicts
Zineddine Tighidet
Andrea Mogini
Hedi Ben-younes
Jiali Mei
Patrick Gallinari
Benjamin Piwowarski
226
2
0
12 Sep 2025
HiProbe-VAD: Video Anomaly Detection via Hidden States Probing in Tuning-Free Multimodal LLMs
Zhaolin Cai
Fan Li
Ziwei Zheng
Yanjun Qin
154
3
0
23 Jul 2025
AbbIE: Autoregressive Block-Based Iterative Encoder for Efficient Sequence Modeling
Preslav Aleksandrov
Meghdad Kurmanji
Fernando Garcia Redondo
David O'Shea
William F. Shen
Alex Iacob
Lorenzo Sani
Xinchi Qiu
Nicola Cancedda
Nicholas D. Lane
186
4
0
11 Jul 2025
Know-MRI: A Knowledge Mechanisms Revealer&Interpreter for Large Language Models
Jiaxiang Liu
Boxuan Xing
Chenhao Yuan
Chenxiang Zhang
Di Wu
...
Haida Yu
Chuhan Lang
Pengfei Cao
Jun Zhao
Kang Liu
180
0
0
10 Jun 2025
Interpretation Meets Safety: A Survey on Interpretation Methods and Tools for Improving LLM Safety
Seongmin Lee
Aeree Cho
Grace C. Kim
ShengYun Peng
Mansi Phute
Duen Horng Chau
LM&MA
AI4CE
273
3
0
05 Jun 2025
Identifying and Mitigating the Influence of the Prior Distribution in Large Language Models
Liyi Zhang
Veniamin Veselovsky
R. Thomas McCoy
Thomas Griffiths
189
1
0
17 Apr 2025
How Post-Training Reshapes LLMs: A Mechanistic View on Knowledge, Truthfulness, Refusal, and Confidence
SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), 2025
Hongzhe Du
Weikai Li
Min Cai
Jiaqi W. Ma
Zimin Zhang
Himabindu Lakkaraju
Tony Nowatzki
Shichang Zhang
KELM
302
4
0
03 Apr 2025
Reverse-Engineering the Retrieval Process in GenIR Models
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Anja Reusch
Yonatan Belinkov
RALM
208
0
0
25 Mar 2025
Back Attention: Understanding and Enhancing Multi-Hop Reasoning in Large Language Models
Zeping Yu
Yonatan Belinkov
Sophia Ananiadou
LRM
233
12
0
15 Feb 2025
Reversed Attention: On The Gradient Descent Of Attention Layers In GPT
Shahar Katz
Lior Wolf
143
0
0
22 Dec 2024
Revealing the Barriers of Language Agents in Planning
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Jian Xie
Kexun Zhang
Jiangjie Chen
Siyu Yuan
Kai Zhang
Yikai Zhang
Lei Li
Yanghua Xiao
LM&Ro
AIFin
LRM
258
17
0
16 Oct 2024
Optimal ablation for interpretability
Neural Information Processing Systems (NeurIPS), 2024
Maximilian Li
Lucas Janson
FAtt
343
12
0
16 Sep 2024
Relation Also Knows: Rethinking the Recall and Editing of Factual Associations in Auto-Regressive Transformer Language Models
AAAI Conference on Artificial Intelligence (AAAI), 2024
Xiyu Liu
Zhengxiao Liu
Naibin Gu
Zheng Lin
Wanli Ma
Ji Xiang
Weiping Wang
KELM
427
3
0
27 Aug 2024
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs
Nitay Calderon
Roi Reichart
358
24
0
27 Jul 2024
Answer, Assemble, Ace: Understanding How LMs Answer Multiple Choice Questions
Sarah Wiegreffe
Oyvind Tafjord
Yonatan Belinkov
Hanna Hajishirzi
Ashish Sabharwal
218
3
0
21 Jul 2024
Confidence Regulation Neurons in Language Models
Alessandro Stolfo
Ben Wu
Wes Gurnee
Yonatan Belinkov
Xingyi Song
Mrinmaya Sachan
Neel Nanda
242
39
0
24 Jun 2024
Finding Transformer Circuits with Edge Pruning
Adithya Bhaskar
Alexander Wettig
Dan Friedman
Danqi Chen
468
34
0
24 Jun 2024
Knowledge Circuits in Pretrained Transformers
Yunzhi Yao
Ningyu Zhang
Zekun Xi
Meng Wang
Ziwen Xu
Shumin Deng
Huajun Chen
KELM
436
43
0
28 May 2024
InversionView: A General-Purpose Method for Reading Information from Neural Activations
Xinting Huang
Madhur Panwar
Navin Goyal
Michael Hahn
351
9
0
27 May 2024
LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models
Igor Tufanov
Karen Hambardzumyan
Javier Ferrando
Elena Voita
KELM
232
14
0
10 Apr 2024
Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Michael Toker
Hadas Orgad
Mor Ventura
Dana Arad
Yonatan Belinkov
DiffM
284
20
0
09 Mar 2024
Understanding and Patching Compositional Reasoning in LLMs
Zhaoyi Li
Gangwei Jiang
Hong Xie
Linqi Song
Defu Lian
Ying Wei
LRM
246
43
0
22 Feb 2024
Backward Lens: Projecting Language Model Gradients into the Vocabulary Space
Shahar Katz
Yonatan Belinkov
Mor Geva
Lior Wolf
267
24
1
20 Feb 2024
A Comprehensive Study of Knowledge Editing for Large Language Models
Ningyu Zhang
Yunzhi Yao
Bo Tian
Peng Wang
Shumin Deng
...
Lei Liang
Qing Cui
Xiao-Jun Zhu
Jun Zhou
Huajun Chen
KELM
493
126
0
02 Jan 2024
"Why Should I Trust You?": Explaining the Predictions of Any Classifier
Marco Tulio Ribeiro
Sameer Singh
Carlos Guestrin
FAtt
FaML
2.5K
19,701
0
16 Feb 2016