Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1702.00887
Cited By
Structured Attention Networks
3 February 2017
Yoon Kim
Carl Denton
Luong Hoang
Alexander M. Rush
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Structured Attention Networks"
50 / 68 papers shown
Title
AsymRnR: Video Diffusion Transformers Acceleration with Asymmetric Reduction and Restoration
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Zhao Jin
Dacheng Tao
VGen
99
1
0
16 Dec 2024
Dissecting the Interplay of Attention Paths in a Statistical Mechanics Theory of Transformers
Lorenzo Tiberi
Francesca Mignacco
Kazuki Irie
H. Sompolinsky
44
6
0
24 May 2024
Transformer-based Stagewise Decomposition for Large-Scale Multistage Stochastic Optimization
Chanyeon Kim
Jongwoon Park
Hyun-sool Bae
Woo Chang Kim
44
3
0
03 Apr 2024
Explaining Probabilistic Models with Distributional Values
Luca Franceschi
Michele Donini
Cédric Archambeau
Matthias Seeger
FAtt
37
2
0
15 Feb 2024
EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism
Yanxi Chen
Xuchen Pan
Yaliang Li
Bolin Ding
Jingren Zhou
LRM
41
31
0
08 Dec 2023
SynJax: Structured Probability Distributions for JAX
Miloš Stanojević
Laurent Sartran
SyDa
13
4
0
07 Aug 2023
Reconstruct Before Summarize: An Efficient Two-Step Framework for Condensing and Summarizing Meeting Transcripts
Haochen Tan
Han Wu
Wei Shao
Xinyun Zhang
Mingjie Zhan
Zhaohui Hou
Ding Liang
Linqi Song
39
0
0
13 May 2023
Self-attention in Vision Transformers Performs Perceptual Grouping, Not Attention
Paria Mehrani
John K. Tsotsos
25
24
0
02 Mar 2023
Recent advances in artificial intelligence for retrosynthesis
Zipeng Zhong
Jie Song
Zunlei Feng
Tiantao Liu
Lingxiang Jia
Shaolun Yao
Tingjun Hou
Mingli Song
29
5
0
14 Jan 2023
Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning
Xu Yang
Hanwang Zhang
Chongyang Gao
Jianfei Cai
MLLM
40
10
0
04 Oct 2022
A Neural Model for Regular Grammar Induction
Peter Belcak
David K. Hofer
Roger Wattenhofer
NAI
24
1
0
23 Sep 2022
UniColor: A Unified Framework for Multi-Modal Colorization with Transformer
Zhitong Huang
Nanxuan Zhao
Jing Liao
ViT
20
16
0
22 Sep 2022
Momentum Transformer: Closing the Performance Gap Between Self-attention and Its Linearization
T. Nguyen
Richard G. Baraniuk
Robert M. Kirby
Stanley J. Osher
Bao Wang
23
9
0
01 Aug 2022
Structured Attention Composition for Temporal Action Localization
Le Yang
Junwei Han
Tao Zhao
Nian Liu
Dingwen Zhang
37
17
0
20 May 2022
Twitter-Based Gender Recognition Using Transformers
Z. Nia
A. Ahmadi
Bruce Mellado
Jianhong Wu
J. Orbinski
A. Asgary
J. Kong
ViT
16
5
0
24 Apr 2022
Classification of Long Sequential Data using Circular Dilated Convolutional Neural Networks
Lei Cheng
Ruslan Khalitov
Tong Yu
Zhirong Yang
25
32
0
06 Jan 2022
Miti-DETR: Object Detection based on Transformers with Mitigatory Self-Attention Convergence
Wenchi Ma
Tianxiao Zhang
Guanghui Wang
ViT
33
14
0
26 Dec 2021
DRF Codes: Deep SNR-Robust Feedback Codes
Mahdi Boloursaz Mashhadi
Deniz Gunduz
A. Perotti
B. Popović
19
10
0
22 Dec 2021
Understanding Interlocking Dynamics of Cooperative Rationalization
Mo Yu
Yang Zhang
Shiyu Chang
Tommi Jaakkola
20
41
0
26 Oct 2021
Semantic Role Labeling as Dependency Parsing: Exploring Latent Tree Structures Inside Arguments
Yu Zhang
Qingrong Xia
Shilin Zhou
Yong-jia Jiang
G. Fu
Min Zhang
43
27
0
13 Oct 2021
A Review of Text Style Transfer using Deep Learning
Martina Toshevska
Sonja Gievska
CLIP
45
43
0
30 Sep 2021
Recommender systems based on graph embedding techniques: A comprehensive review
Yue Deng
42
22
0
20 Sep 2021
Searching for More Efficient Dynamic Programs
Tim Vieira
Ryan Cotterell
Jason Eisner
24
3
0
14 Sep 2021
Excited state, non-adiabatic dynamics of large photoswitchable molecules using a chemically transferable machine learning potential
Simon Axelrod
E. Shakhnovich
Rafael Gómez-Bombarelli
29
49
0
10 Aug 2021
Shellcode_IA32: A Dataset for Automatic Shellcode Generation
Pietro Liguori
Erfan Al-Hossami
Domenico Cotroneo
R. Natella
B. Cukic
Samira Shaikh
34
27
0
27 Apr 2021
OperA: Attention-Regularized Transformers for Surgical Phase Recognition
Tobias Czempiel
Magdalini Paschali
D. Ostler
S. T. Kim
Benjamin Busam
Nassir Navab
MedIm
36
85
0
05 Mar 2021
Entity Structure Within and Throughout: Modeling Mention Dependencies for Document-Level Relation Extraction
Benfeng Xu
Quan Wang
Yajuan Lyu
Yong Zhu
Zhendong Mao
27
166
0
20 Feb 2021
Taming Transformers for High-Resolution Image Synthesis
Patrick Esser
Robin Rombach
Bjorn Ommer
ViT
64
2,814
0
17 Dec 2020
Molecular machine learning with conformer ensembles
Simon Axelrod
Rafael Gómez-Bombarelli
AI4CE
20
49
0
15 Dec 2020
Vertical-Horizontal Structured Attention for Generating Music with Chords
Yizhou Zhao
Liang Qiu
Wensi Ai
Feng Shi
Song-Chun Zhu
MGen
27
2
0
18 Nov 2020
A Differentiable Relaxation of Graph Segmentation and Alignment for AMR Parsing
Chunchuan Lyu
Shay B. Cohen
Ivan Titov
35
11
0
23 Oct 2020
A Survey of Unsupervised Dependency Parsing
Wenjuan Han
Yong-jia Jiang
Hwee Tou Ng
Kewei Tu
SSL
24
10
0
04 Oct 2020
Looking for change? Roll the Dice and demand Attention
F. Diakogiannis
F. Waldner
P. Caccetta
19
66
0
04 Sep 2020
FTRANS: Energy-Efficient Acceleration of Transformers using FPGA
Bingbing Li
Santosh Pandey
Haowen Fang
Yanjun Lyv
Ji Li
Jieyang Chen
Mimi Xie
Lipeng Wan
Hang Liu
Caiwen Ding
AI4CE
16
168
0
16 Jul 2020
Speaker-Conditional Chain Model for Speech Separation and Extraction
Jing Shi
Jiaming Xu
Yusuke Fujita
Shinji Watanabe
Bo Xu
BDL
41
20
0
25 Jun 2020
Preserving Dynamic Attention for Long-Term Spatial-Temporal Prediction
Haoxing Lin
Rufan Bai
Weijia Jia
Xinyu Yang
Yongjian You
HAI
AI4TS
23
63
0
16 Jun 2020
Rationalizing Text Matching: Learning Sparse Alignments via Optimal Transport
Kyle Swanson
L. Yu
Tao Lei
OT
29
37
0
27 May 2020
Reasoning with Latent Structure Refinement for Document-Level Relation Extraction
Guoshun Nan
Zhijiang Guo
Ivan Sekulić
Wei Lu
39
273
0
13 May 2020
Deep Adaptive Semantic Logic (DASL): Compiling Declarative Knowledge into Deep Neural Networks
Karan Sikka
Andrew Silberfarb
John Byrnes
Indranil Sur
Edmond Chow
Ajay Divakaran
R. Rohwer
NAI
11
11
0
16 Mar 2020
Biologically-inspired Salience Affected Artificial Neural Network (SANN)
Leendert A. Remmelzwaal
George F. R. Ellis
J. Tapson
Amit K Mishra
13
3
0
09 Aug 2019
Structure-Invariant Testing for Machine Translation
Pinjia He
Clara Meister
Z. Su
24
104
0
19 Jul 2019
Augmenting Neural Networks with First-order Logic
Tao Li
Vivek Srikumar
16
109
0
14 Jun 2019
Factor Graph Attention
Idan Schwartz
Seunghak Yu
Tamir Hazan
A. Schwing
21
110
0
11 Apr 2019
Inferring Which Medical Treatments Work from Reports of Clinical Trials
Eric P. Lehman
Jay DeYoung
Regina Barzilay
Byron C. Wallace
18
114
0
02 Apr 2019
Bridging the Gap: Attending to Discontinuity in Identification of Multiword Expressions
Omid Rohanian
Shiva Taslimipoor
Samaneh Kouchaki
L. Ha
R. Mitkov
19
26
0
27 Feb 2019
Learning Hierarchical Discourse-level Structure for Fake News Detection
Hamid Karimi
Jiliang Tang
11
127
0
27 Feb 2019
Attention is not Explanation
Sarthak Jain
Byron C. Wallace
FAtt
29
1,298
0
26 Feb 2019
Dynamic Graph Modules for Modeling Object-Object Interactions in Activity Recognition
Hao Huang
Luowei Zhou
Wei Zhang
Jason J. Corso
Chenliang Xu
18
3
0
13 Dec 2018
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering
Peng Gao
Zhengkai Jiang
Haoxuan You
Pan Lu
Steven C. H. Hoi
Xiaogang Wang
Hongsheng Li
AIMat
19
362
0
13 Dec 2018
An Introductory Survey on Attention Mechanisms in NLP Problems
Dichao Hu
AIMat
19
246
0
12 Nov 2018
1
2
Next