Structured Attention Networks

3 February 2017
Yoon Kim
Carl Denton
Luong Hoang
Alexander M. Rush

Papers citing "Structured Attention Networks"

50 / 69 papers shown
AsymRnR: Video Diffusion Transformers Acceleration with Asymmetric Reduction and Restoration
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Zhao Jin
Dacheng Tao
VGen
99
1
0
16 Dec 2024
Dissecting the Interplay of Attention Paths in a Statistical Mechanics Theory of Transformers
Lorenzo Tiberi
Francesca Mignacco
Kazuki Irie
H. Sompolinsky
44
6
0
24 May 2024
Transformer-based Stagewise Decomposition for Large-Scale Multistage Stochastic Optimization
Chanyeon Kim
Jongwoon Park
Hyun-sool Bae
Woo Chang Kim
44
3
0
03 Apr 2024
Explaining Probabilistic Models with Distributional Values
Luca Franceschi
Michele Donini
Cédric Archambeau
Matthias Seeger
FAtt
37
2
0
15 Feb 2024
EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism
Yanxi Chen
Xuchen Pan
Yaliang Li
Bolin Ding
Jingren Zhou
LRM
41
31
0
08 Dec 2023
SynJax: Structured Probability Distributions for JAX
Miloš Stanojević
Laurent Sartran
SyDa
13
4
0
07 Aug 2023
Reconstruct Before Summarize: An Efficient Two-Step Framework for Condensing and Summarizing Meeting Transcripts
Haochen Tan
Han Wu
Wei Shao
Xinyun Zhang
Mingjie Zhan
Zhaohui Hou
Ding Liang
Linqi Song
39
0
0
13 May 2023
Self-attention in Vision Transformers Performs Perceptual Grouping, Not Attention
Paria Mehrani
John K. Tsotsos
25
24
0
02 Mar 2023
Recent advances in artificial intelligence for retrosynthesis
Zipeng Zhong
Jie Song
Zunlei Feng
Tiantao Liu
Lingxiang Jia
Shaolun Yao
Tingjun Hou
Mingli Song
29
5
0
14 Jan 2023
Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning
Xu Yang
Hanwang Zhang
Chongyang Gao
Jianfei Cai
MLLM
40
10
0
04 Oct 2022
A Neural Model for Regular Grammar Induction
Peter Belcak
David K. Hofer
Roger Wattenhofer
NAI
27
1
0
23 Sep 2022
UniColor: A Unified Framework for Multi-Modal Colorization with Transformer
Zhitong Huang
Nanxuan Zhao
Jing Liao
ViT
20
16
0
22 Sep 2022
Momentum Transformer: Closing the Performance Gap Between Self-attention and Its Linearization
T. Nguyen
Richard G. Baraniuk
Robert M. Kirby
Stanley J. Osher
Bao Wang
26
9
0
01 Aug 2022
Structured Attention Composition for Temporal Action Localization
Le Yang
Junwei Han
Tao Zhao
Nian Liu
Dingwen Zhang
37
17
0
20 May 2022
Twitter-Based Gender Recognition Using Transformers
Z. Nia
A. Ahmadi
Bruce Mellado
Jianhong Wu
J. Orbinski
A. Asgary
J. Kong
ViT
16
5
0
24 Apr 2022
Classification of Long Sequential Data using Circular Dilated Convolutional Neural Networks
Lei Cheng
Ruslan Khalitov
Tong Yu
Zhirong Yang
25
32
0
06 Jan 2022
Miti-DETR: Object Detection based on Transformers with Mitigatory Self-Attention Convergence
Wenchi Ma
Tianxiao Zhang
Guanghui Wang
ViT
33
14
0
26 Dec 2021
DRF Codes: Deep SNR-Robust Feedback Codes
Mahdi Boloursaz Mashhadi
Deniz Gunduz
A. Perotti
B. Popović
19
10
0
22 Dec 2021
Understanding Interlocking Dynamics of Cooperative Rationalization
Mo Yu
Yang Zhang
Shiyu Chang
Tommi Jaakkola
20
41
0
26 Oct 2021
Semantic Role Labeling as Dependency Parsing: Exploring Latent Tree Structures Inside Arguments
Yu Zhang
Qingrong Xia
Shilin Zhou
Yong-jia Jiang
Guohong Fu
Min Zhang
43
27
0
13 Oct 2021
A Review of Text Style Transfer using Deep Learning
Martina Toshevska
Sonja Gievska
CLIP
45
43
0
30 Sep 2021
Recommender systems based on graph embedding techniques: A comprehensive review
Yue Deng
42
22
0
20 Sep 2021
Searching for More Efficient Dynamic Programs
Tim Vieira
Ryan Cotterell
Jason Eisner
26
3
0
14 Sep 2021
Excited state, non-adiabatic dynamics of large photoswitchable molecules using a chemically transferable machine learning potential
Simon Axelrod
E. Shakhnovich
Rafael Gómez-Bombarelli
29
49
0
10 Aug 2021
Shellcode_IA32: A Dataset for Automatic Shellcode Generation
Pietro Liguori
Erfan Al-Hossami
Domenico Cotroneo
R. Natella
B. Cukic
Samira Shaikh
34
27
0
27 Apr 2021
OperA: Attention-Regularized Transformers for Surgical Phase Recognition
Tobias Czempiel
Magdalini Paschali
D. Ostler
S. T. Kim
Benjamin Busam
Nassir Navab
MedIm
39
85
0
05 Mar 2021
Entity Structure Within and Throughout: Modeling Mention Dependencies for Document-Level Relation Extraction
Benfeng Xu
Quan Wang
Yajuan Lyu
Yong Zhu
Zhendong Mao
27
166
0
20 Feb 2021
Taming Transformers for High-Resolution Image Synthesis
Patrick Esser
Robin Rombach
Bjorn Ommer
ViT
64
2,819
0
17 Dec 2020
Molecular machine learning with conformer ensembles
Simon Axelrod
Rafael Gómez-Bombarelli
AI4CE
20
49
0
15 Dec 2020
Vertical-Horizontal Structured Attention for Generating Music with Chords
Yizhou Zhao
Liang Qiu
Wensi Ai
Feng Shi
Song-Chun Zhu
MGen
27
2
0
18 Nov 2020
A Differentiable Relaxation of Graph Segmentation and Alignment for AMR Parsing
Chunchuan Lyu
Shay B. Cohen
Ivan Titov
35
11
0
23 Oct 2020
A Survey of Unsupervised Dependency Parsing
Wenjuan Han
Yong-jia Jiang
Hwee Tou Ng
Kewei Tu
SSL
24
10
0
04 Oct 2020
Looking for change? Roll the Dice and demand Attention
F. Diakogiannis
F. Waldner
P. Caccetta
19
66
0
04 Sep 2020
FTRANS: Energy-Efficient Acceleration of Transformers using FPGA
Bingbing Li
Santosh Pandey
Haowen Fang
Yanjun Lyv
Ji Li
Jieyang Chen
Mimi Xie
Lipeng Wan
Hang Liu
Caiwen Ding
AI4CE
16
168
0
16 Jul 2020
Speaker-Conditional Chain Model for Speech Separation and Extraction
Jing Shi
Jiaming Xu
Yusuke Fujita
Shinji Watanabe
Bo Xu
BDL
41
20
0
25 Jun 2020
Preserving Dynamic Attention for Long-Term Spatial-Temporal Prediction
Haoxing Lin
Rufan Bai
Weijia Jia
Xinyu Yang
Yongjian You
HAI
AI4TS
23
63
0
16 Jun 2020
Rationalizing Text Matching: Learning Sparse Alignments via Optimal Transport
Kyle Swanson
L. Yu
Tao Lei
OT
29
37
0
27 May 2020
Reasoning with Latent Structure Refinement for Document-Level Relation Extraction
Guoshun Nan
Zhijiang Guo
Ivan Sekulić
Wei Lu
39
273
0
13 May 2020
Deep Adaptive Semantic Logic (DASL): Compiling Declarative Knowledge into Deep Neural Networks
Karan Sikka
Andrew Silberfarb
John Byrnes
Indranil Sur
Edmond Chow
Ajay Divakaran
R. Rohwer
NAI
11
11
0
16 Mar 2020
Biologically-inspired Salience Affected Artificial Neural Network (SANN)
Leendert A. Remmelzwaal
George F. R. Ellis
J. Tapson
Amit K Mishra
15
3
0
09 Aug 2019
Structure-Invariant Testing for Machine Translation
Pinjia He
Clara Meister
Z. Su
24
104
0
19 Jul 2019
Augmenting Neural Networks with First-order Logic
Tao Li
Vivek Srikumar
16
109
0
14 Jun 2019
Factor Graph Attention
Idan Schwartz
Seunghak Yu
Tamir Hazan
A. Schwing
21
110
0
11 Apr 2019
Inferring Which Medical Treatments Work from Reports of Clinical Trials
Eric P. Lehman
Jay DeYoung
Regina Barzilay
Byron C. Wallace
18
114
0
02 Apr 2019
Bridging the Gap: Attending to Discontinuity in Identification of Multiword Expressions
Omid Rohanian
Shiva Taslimipoor
Samaneh Kouchaki
L. Ha
R. Mitkov
21
26
0
27 Feb 2019
Learning Hierarchical Discourse-level Structure for Fake News Detection
Hamid Karimi
Jiliang Tang
13
127
0
27 Feb 2019
Attention is not Explanation
Sarthak Jain
Byron C. Wallace
FAtt
31
1,298
0
26 Feb 2019
Dynamic Graph Modules for Modeling Object-Object Interactions in Activity Recognition
Hao Huang
Luowei Zhou
Wei Zhang
Jason J. Corso
Chenliang Xu
18
3
0
13 Dec 2018
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering
Peng Gao
Zhengkai Jiang
Haoxuan You
Pan Lu
Steven C. H. Hoi
Xiaogang Wang
Hongsheng Li
AIMat
19
362
0
13 Dec 2018
An Introductory Survey on Attention Mechanisms in NLP Problems
Dichao Hu
AIMat
21
246
0
12 Nov 2018