ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.00928
  4. Cited By
Quantifying Attention Flow in Transformers

Quantifying Attention Flow in Transformers

2 May 2020
Samira Abnar
Willem H. Zuidema
ArXivPDFHTML

Papers citing "Quantifying Attention Flow in Transformers"

50 / 403 papers shown
Title
Towards Opening the Black Box of Neural Machine Translation: Source and
  Target Interpretations of the Transformer
Towards Opening the Black Box of Neural Machine Translation: Source and Target Interpretations of the Transformer
Javier Ferrando
Gerard I. Gállego
Belen Alastruey
Carlos Escolano
Marta R. Costa-jussá
30
44
0
23 May 2022
Foundation Posteriors for Approximate Probabilistic Inference
Foundation Posteriors for Approximate Probabilistic Inference
Mike Wu
Noah D. Goodman
UQCV
25
6
0
19 May 2022
A graph-transformer for whole slide image classification
A graph-transformer for whole slide image classification
Yi Zheng
R. Gindra
Emily J. Green
E. Burks
Margrit Betke
J. Beane
V. Kolachalama
MedIm
45
123
0
19 May 2022
A Generalist Agent
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&Ro
LLMAG
AI4CE
59
787
0
12 May 2022
A Song of (Dis)agreement: Evaluating the Evaluation of Explainable
  Artificial Intelligence in Natural Language Processing
A Song of (Dis)agreement: Evaluating the Evaluation of Explainable Artificial Intelligence in Natural Language Processing
Michael Neely
Stefan F. Schouten
Maurits J. R. Bleeker
Ana Lucic
XAI
21
16
0
09 May 2022
GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole
  Encoder Layer in Transformers
GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers
Ali Modarressi
Mohsen Fayyaz
Yadollah Yaghoobzadeh
Mohammad Taher Pilehvar
ViT
19
33
0
06 May 2022
Dual Cross-Attention Learning for Fine-Grained Visual Categorization and
  Object Re-Identification
Dual Cross-Attention Learning for Fine-Grained Visual Categorization and Object Re-Identification
Haowei Zhu
Wenjing Ke
Dong Li
Ji Liu
Lu Tian
Yi Shan
29
134
0
04 May 2022
BERTops: Studying BERT Representations under a Topological Lens
BERTops: Studying BERT Representations under a Topological Lens
Jatin Chauhan
Manohar Kaul
24
3
0
02 May 2022
Visualizing and Explaining Language Models
Visualizing and Explaining Language Models
Adrian M. P. Braşoveanu
Razvan Andonie
MILM
VLM
29
4
0
30 Apr 2022
StorSeismic: A new paradigm in deep learning for seismic processing
StorSeismic: A new paradigm in deep learning for seismic processing
R. Harsuko
T. Alkhalifah
29
38
0
30 Apr 2022
Do Transformer Models Show Similar Attention Patterns to Task-Specific
  Human Gaze?
Do Transformer Models Show Similar Attention Patterns to Task-Specific Human Gaze?
Stephanie Brandl
Oliver Eberle
Jonas Pilot
Anders Søgaard
72
33
0
25 Apr 2022
Grad-SAM: Explaining Transformers via Gradient Self-Attention Maps
Grad-SAM: Explaining Transformers via Gradient Self-Attention Maps
Oren Barkan
Edan Hauon
Avi Caciularu
Ori Katz
Itzik Malkiel
Omri Armstrong
Noam Koenigstein
34
37
0
23 Apr 2022
Revealing Occlusions with 4D Neural Fields
Revealing Occlusions with 4D Neural Fields
Basile Van Hoorick
Purva Tendulkar
Dídac Surís
Dennis Park
Simon Stent
Carl Vondrick
30
16
0
22 Apr 2022
Diverse Instance Discovery: Vision-Transformer for Instance-Aware
  Multi-Label Image Recognition
Diverse Instance Discovery: Vision-Transformer for Instance-Aware Multi-Label Image Recognition
Yunqing Hu
Xuan Jin
Yin Zhang
Haiwen Hong
Jingfeng Zhang
Feihu Yan
Yuan He
Hui Xue
ViT
11
1
0
22 Apr 2022
An Attention-Based Model for Predicting Contextual Informativeness and
  Curriculum Learning Applications
An Attention-Based Model for Predicting Contextual Informativeness and Curriculum Learning Applications
Sungjin Nam
David Jurgens
Gwen Frishkoff
Kevyn Collins-Thompson
10
0
0
21 Apr 2022
MVSTER: Epipolar Transformer for Efficient Multi-View Stereo
MVSTER: Epipolar Transformer for Efficient Multi-View Stereo
Xiaofen Wang
Zheng Hua Zhu
Fangbo Qin
Yun Ye
Guan Huang
Xu Chi
Yijia He
Xingang Wang
26
79
0
15 Apr 2022
ViTOL: Vision Transformer for Weakly Supervised Object Localization
ViTOL: Vision Transformer for Weakly Supervised Object Localization
Saurav Gupta
Sourav Lakhotia
Abhay Rawat
Rahul Tallamraju
WSOL
32
21
0
14 Apr 2022
How Conservative are Language Models? Adapting to the Introduction of
  Gender-Neutral Pronouns
How Conservative are Language Models? Adapting to the Introduction of Gender-Neutral Pronouns
Stephanie Brandl
Ruixiang Cui
Anders Søgaard
25
20
0
11 Apr 2022
No Token Left Behind: Explainability-Aided Image Classification and
  Generation
No Token Left Behind: Explainability-Aided Image Classification and Generation
Roni Paiss
Hila Chefer
Lior Wolf
VLM
34
29
0
11 Apr 2022
Learning Program Representations for Food Images and Cooking Recipes
Learning Program Representations for Food Images and Cooking Recipes
Dim P. Papadopoulos
Enrique Mora
Nadiia Chepurko
Kuan-Wei Huang
Ferda Ofli
Antonio Torralba
17
31
0
30 Mar 2022
ViT-FOD: A Vision Transformer based Fine-grained Object Discriminator
ViT-FOD: A Vision Transformer based Fine-grained Object Discriminator
Zi-Chao Zhang
Zhen-Duo Chen
Yongxin Wang
Xin Luo
Xin-Shun Xu
ViT
30
6
0
24 Mar 2022
On Robust Prefix-Tuning for Text Classification
On Robust Prefix-Tuning for Text Classification
Zonghan Yang
Yang Liu
VLM
21
20
0
19 Mar 2022
Goal-conditioned dual-action imitation learning for dexterous dual-arm
  robot manipulation
Goal-conditioned dual-action imitation learning for dexterous dual-arm robot manipulation
Heecheol Kim
Y. Ohmura
Y. Kuniyoshi
27
27
0
18 Mar 2022
Are Vision Transformers Robust to Spurious Correlations?
Are Vision Transformers Robust to Spurious Correlations?
Soumya Suvra Ghosal
Yifei Ming
Yixuan Li
ViT
30
28
0
17 Mar 2022
WegFormer: Transformers for Weakly Supervised Semantic Segmentation
WegFormer: Transformers for Weakly Supervised Semantic Segmentation
Chunmeng Liu
Enze Xie
Wenjia Wang
Wenhai Wang
Guangya Li
Ping Luo
ViT
24
6
0
16 Mar 2022
TrueType Transformer: Character and Font Style Recognition in Outline
  Format
TrueType Transformer: Character and Font Style Recognition in Outline Format
Yusuke Nagata
Jinki Otao
Daichi Haraguchi
S. Uchida
6
3
0
10 Mar 2022
Measuring the Mixing of Contextual Information in the Transformer
Measuring the Mixing of Contextual Information in the Transformer
Javier Ferrando
Gerard I. Gállego
Marta R. Costa-jussá
26
49
0
08 Mar 2022
CTformer: Convolution-free Token2Token Dilated Vision Transformer for
  Low-dose CT Denoising
CTformer: Convolution-free Token2Token Dilated Vision Transformer for Low-dose CT Denoising
Dayang Wang
Fenglei Fan
Zhan Wu
R. Liu
Fei Wang
Hengyong Yu
ViT
MedIm
35
122
0
28 Feb 2022
Training Robots without Robots: Deep Imitation Learning for
  Master-to-Robot Policy Transfer
Training Robots without Robots: Deep Imitation Learning for Master-to-Robot Policy Transfer
Heecheol Kim
Y. Ohmura
Akihiko Nagakubo
Y. Kuniyoshi
13
23
0
19 Feb 2022
Modelling the semantics of text in complex document layouts using graph
  transformer networks
Modelling the semantics of text in complex document layouts using graph transformer networks
T. Barillot
Jacob Saks
Polena Lilyanova
Edward Torgas
Yachen Hu
Yuanqing Liu
Varun Balupuri
P. Gaskell
GNN
26
0
0
18 Feb 2022
XAI for Transformers: Better Explanations through Conservative
  Propagation
XAI for Transformers: Better Explanations through Conservative Propagation
Ameen Ali
Thomas Schnake
Oliver Eberle
G. Montavon
Klaus-Robert Muller
Lior Wolf
FAtt
15
89
0
15 Feb 2022
BViT: Broad Attention based Vision Transformer
BViT: Broad Attention based Vision Transformer
Nannan Li
Yaran Chen
Weifan Li
Zixiang Ding
Dong Zhao
ViT
38
23
0
13 Feb 2022
TorchMD-NET: Equivariant Transformers for Neural Network based Molecular
  Potentials
TorchMD-NET: Equivariant Transformers for Neural Network based Molecular Potentials
Philipp Thölke
Gianni De Fabritiis
AI4CE
34
186
0
05 Feb 2022
Rethinking Attention-Model Explainability through Faithfulness Violation
  Test
Rethinking Attention-Model Explainability through Faithfulness Violation Test
Y. Liu
Haoliang Li
Yangyang Guo
Chen Kong
Jing Li
Shiqi Wang
FAtt
121
43
0
28 Jan 2022
LAP: An Attention-Based Module for Concept Based Self-Interpretation and
  Knowledge Injection in Convolutional Neural Networks
LAP: An Attention-Based Module for Concept Based Self-Interpretation and Knowledge Injection in Convolutional Neural Networks
Rassa Ghavami Modegh
Ahmadali Salimi
Alireza Dizaji
Hamid R. Rabiee
FAtt
32
0
0
27 Jan 2022
Diagnosing AI Explanation Methods with Folk Concepts of Behavior
Diagnosing AI Explanation Methods with Folk Concepts of Behavior
Alon Jacovi
Jasmijn Bastings
Sebastian Gehrmann
Yoav Goldberg
Katja Filippova
36
15
0
27 Jan 2022
Video Transformers: A Survey
Video Transformers: A Survey
Javier Selva
A. S. Johansen
Sergio Escalera
Kamal Nasrollahi
T. Moeslund
Albert Clapés
ViT
22
103
0
16 Jan 2022
Learning Generalizable Vision-Tactile Robotic Grasping Strategy for
  Deformable Objects via Transformer
Learning Generalizable Vision-Tactile Robotic Grasping Strategy for Deformable Objects via Transformer
Yunhai Han
Kelin Yu
Rahul Batra
Nathan Boyd
Chaitanya Mehta
T. Zhao
Y. She
S. Hutchinson
Ye Zhao
ViT
29
43
0
13 Dec 2021
DBIA: Data-free Backdoor Injection Attack against Transformer Networks
DBIA: Data-free Backdoor Injection Attack against Transformer Networks
Peizhuo Lv
Hualong Ma
Jiachen Zhou
Ruigang Liang
Kai Chen
Shengzhi Zhang
Yunfei Yang
26
15
0
22 Nov 2021
Are Vision Transformers Robust to Patch Perturbations?
Are Vision Transformers Robust to Patch Perturbations?
Jindong Gu
Volker Tresp
Yao Qin
AAML
ViT
38
60
0
20 Nov 2021
Discrete Representations Strengthen Vision Transformer Robustness
Discrete Representations Strengthen Vision Transformer Robustness
Chengzhi Mao
Lu Jiang
Mostafa Dehghani
Carl Vondrick
Rahul Sukthankar
Irfan Essa
ViT
27
44
0
20 Nov 2021
TransMix: Attend to Mix for Vision Transformers
TransMix: Attend to Mix for Vision Transformers
Jieneng Chen
Shuyang Sun
Ju He
Philip Torr
Alan Yuille
S. Bai
ViT
28
103
0
18 Nov 2021
TRIG: Transformer-Based Text Recognizer with Initial Embedding Guidance
TRIG: Transformer-Based Text Recognizer with Initial Embedding Guidance
Yuefeng Tao
Zhiwei Jia
Runze Ma
Shugong Xu
ViT
19
6
0
16 Nov 2021
Temporal Knowledge Distillation for On-device Audio Classification
Temporal Knowledge Distillation for On-device Audio Classification
Kwanghee Choi
Martin Kersner
Jacob Morton
Buru Chang
21
26
0
27 Oct 2021
Look at What I'm Doing: Self-Supervised Spatial Grounding of Narrations
  in Instructional Videos
Look at What I'm Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos
Reuben Tan
Bryan A. Plummer
Kate Saenko
Hailin Jin
Bryan C. Russell
SSL
37
27
0
20 Oct 2021
News-based Business Sentiment and its Properties as an Economic Index
News-based Business Sentiment and its Properties as an Economic Index
Kazuhiro Seki
Y. Ikuta
Yoichi Matsubayashi
16
23
0
20 Oct 2021
Schrödinger's Tree -- On Syntax and Neural Language Models
Schrödinger's Tree -- On Syntax and Neural Language Models
Artur Kulmizev
Joakim Nivre
35
6
0
17 Oct 2021
Evaluating the Faithfulness of Importance Measures in NLP by Recursively
  Masking Allegedly Important Tokens and Retraining
Evaluating the Faithfulness of Importance Measures in NLP by Recursively Masking Allegedly Important Tokens and Retraining
Andreas Madsen
Nicholas Meade
Vaibhav Adlakha
Siva Reddy
111
35
0
15 Oct 2021
On Neurons Invariant to Sentence Structural Changes in Neural Machine
  Translation
On Neurons Invariant to Sentence Structural Changes in Neural Machine Translation
Gal Patel
Leshem Choshen
Omri Abend
36
2
0
06 Oct 2021
Putting Words in BERT's Mouth: Navigating Contextualized Vector Spaces
  with Pseudowords
Putting Words in BERT's Mouth: Navigating Contextualized Vector Spaces with Pseudowords
Taelin Karidi
Yichu Zhou
Nathan Schneider
Omri Abend
Vivek Srikumar
86
13
0
23 Sep 2021
Previous
123456789
Next