1904.09925
Cited By
Attention Augmented Convolutional Networks
22 April 2019
Irwan Bello
Barret Zoph
Ashish Vaswani
Jonathon Shlens
Quoc V. Le
Papers citing
"Attention Augmented Convolutional Networks"
50 / 427 papers shown
Title
InvPT++: Inverted Pyramid Multi-Task Transformer for Visual Scene Understanding
Hanrong Ye
Dan Xu
ViT
25
10
0
08 Jun 2023
Object Detection with Transformers: A Review
Tahira Shehzadi
K. Hashmi
D. Stricker
Muhammad Zeshan Afzal
ViT
MU
15
28
0
07 Jun 2023
Sparsity and Coefficient Permutation Based Two-Domain AMP for Image Block Compressed Sensing
Junhui Li
Xingsong Hou
Huake Wang
Shuhao Bi
13
0
0
22 May 2023
Towards End-to-End Semi-Supervised Table Detection with Deformable Transformer
Tahira Shehzadi
K. Hashmi
D. Stricker
Marcus Liwicki
Muhammad Zeshan Afzal
ViT
LMTD
34
10
0
04 May 2023
Early Detection of Alzheimer's Disease using Bottleneck Transformers
Arunima Jaiswal
Ananya Sadana
MedIm
18
2
0
01 May 2023
Multi-cropping Contrastive Learning and Domain Consistency for Unsupervised Image-to-Image Translation
Chen Zhao
Weixi Cai
Zheng Yuan
Cheng Hu
23
0
0
24 Apr 2023
Omni Aggregation Networks for Lightweight Image Super-Resolution
Hang Wang
Xuanhong Chen
Bingbing Ni
Yutian Liu
Jinfan Liu
SupR
22
77
0
20 Apr 2023
Deep Unrestricted Document Image Rectification
Hao Feng
Shaokai Liu
Jiajun Deng
Wen-gang Zhou
Houqiang Li
ViT
21
13
0
18 Apr 2023
A CTC Alignment-based Non-autoregressive Transformer for End-to-end Automatic Speech Recognition
Ruchao Fan
Wei Chu
Peng Chang
Abeer Alwan
16
10
0
15 Apr 2023
ASR: Attention-alike Structural Re-parameterization
Shan Zhong
Zhongzhan Huang
Wushao Wen
Jinghui Qin
Liang Lin
33
6
0
13 Apr 2023
RFAConv: Innovating Spatial Attention and Standard Convolutional Operation
X. Zhang
Chen Liu
Degang Yang
Tingting Song
Yichen Ye
Ke Li
Ying Song
21
110
0
06 Apr 2023
PointCAT: Cross-Attention Transformer for point cloud
Xincheng Yang
Mingze Jin
Weiji He
Qian Chen
3DPC
ViT
19
3
0
06 Apr 2023
Astroformer: More Data Might not be all you need for Classification
Rishit Dagli
28
7
0
03 Apr 2023
InceptionNeXt: When Inception Meets ConvNeXt
Weihao Yu
Pan Zhou
Shuicheng Yan
Xinchao Wang
48
117
0
29 Mar 2023
Exemplar-based Video Colorization with Long-term Spatiotemporal Dependency
Si-Yu Chen
Xueming Li
Xianlin Zhang
Mingdao Wang
Yu Zhang
Jiatong Han
Yue Zhang
13
10
0
27 Mar 2023
Pyramid Multi-branch Fusion DCNN with Multi-Head Self-Attention for Mandarin Speech Recognition
Kai Liu
Hailiang Xiong
Gangqiang Yang
Zhengfeng Du
Yewen Cao
D. Shah
13
0
0
23 Mar 2023
URM4DMU: an user represention model for darknet markets users
Hongmeng Liu
Jiapeng Zhao
Yixuan Huo
Yuyan Wang
Chun Liao
Liyan Shen
Shiyao Cui
Jinqiao Shi
19
3
0
19 Mar 2023
CerviFormer: A Pap-smear based cervical cancer classification method using cross attention and latent transformer
Bhaswati Singha Deo
M. Pal
P. Panigrahi
A. Pradhan
MedIm
28
22
0
17 Mar 2023
Efficient Computation Sharing for Multi-Task Visual Scene Understanding
Sara Shoouri
Mingyu Yang
Zichen Fan
Hun-Seok Kim
MoE
26
3
0
16 Mar 2023
SPOTR: Spatio-temporal Pose Transformers for Human Motion Prediction
A. A. Nargund
Misha Sra
ViT
34
2
0
11 Mar 2023
Pyramid Pixel Context Adaption Network for Medical Image Classification with Supervised Contrastive Learning
Xiaoqin Zhang
Zunjie Xiao
Xiao Wu
Jiansheng Fang
Junyong Shen
Yan Hu
Jiang-Dong Liu
24
10
0
03 Mar 2023
Hyneter: Hybrid Network Transformer for Object Detection
Dong Chen
Duoqian Miao
Xuepeng Zhao
ViT
27
3
0
18 Feb 2023
Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Ondrej Biza
Sjoerd van Steenkiste
Mehdi S. M. Sajjadi
Gamaleldin F. Elsayed
Aravindh Mahendran
Thomas Kipf
OCL
43
32
0
09 Feb 2023
Semantic Diffusion Network for Semantic Segmentation
Hao Hao Tan
Sitong Wu
Jimin Pi
DiffM
39
33
0
04 Feb 2023
Efficient Scopeformer: Towards Scalable and Rich Feature Extraction for Intracranial Hemorrhage Detection
Yassine Barhoumi
N. Bouaynaya
Ghulam Rasool
MedIm
14
5
0
01 Feb 2023
Flow-guided Semi-supervised Video Object Segmentation
Yushan Zhang
Andreas Robinson
M. Magnusson
M. Felsberg
VOS
12
1
0
25 Jan 2023
ViTs for SITS: Vision Transformers for Satellite Image Time Series
Michail Tarasiou
Erik Chavez
S. Zafeiriou
ViT
6
48
0
12 Jan 2023
AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers
Xumin Yu
Yongming Rao
Ziyi Wang
Jiwen Lu
Jie Zhou
ViT
30
55
0
11 Jan 2023
DuAT: Dual-Aggregation Transformer Network for Medical Image Segmentation
Feilong Tang
Q. Huang
Jinfeng Wang
Xianxu Hou
Jionglong Su
Jingxin Liu
ViT
MedIm
27
49
0
21 Dec 2022
LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer
Ning Yu
Chia-Chih Chen
Zeyuan Chen
Rui Meng
Ganglu Wu
P. Josel
Juan Carlos Niebles
Caiming Xiong
Ran Xu
ViT
DiffM
24
6
0
19 Dec 2022
Convolution-enhanced Evolving Attention Networks
Yujing Wang
Yaming Yang
Zhuowan Li
Jiangang Bai
Mingliang Zhang
Xiangtai Li
J. Yu
Ce Zhang
Gao Huang
Yu Tong
ViT
19
6
0
16 Dec 2022
Position Embedding Needs an Independent Layer Normalization
Runyi Yu
Zhennan Wang
Yinhuai Wang
Kehan Li
Yian Zhao
Jian Andrew Zhang
Guoli Song
Jie Chen
26
1
0
10 Dec 2022
Neural Fourier Filter Bank
Zhijie Wu
Yuhe Jin
K. M. Yi
11
27
0
04 Dec 2022
Degenerate Swin to Win: Plain Window-based Transformer without Sophisticated Operations
Tan Yu
Ping Li
ViT
46
5
0
25 Nov 2022
Comparison Study Between Token Classification and Sequence Classification In Text Classification
Amir Jafari
22
5
0
25 Nov 2022
Detecting Anomalies using Generative Adversarial Networks on Images
Rushikesh Zawar
Krupa Bhayani
Neelanjan Bhowmik
K. Tiwari
Dhiraj Sangwan
GAN
23
2
0
24 Nov 2022
Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition
Qibin Hou
Cheng Lu
Ming-Ming Cheng
Jiashi Feng
ViT
28
129
0
22 Nov 2022
CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow
Philippe Weinzaepfel
Thomas Lucas
Vincent Leroy
Yohann Cabon
Vaibhav Arora
Romain Brégier
G. Csurka
L. Antsfeld
Boris Chidlovskii
Jérôme Revaud
ViT
18
80
0
18 Nov 2022
Dual Complementary Dynamic Convolution for Image Recognition
Longbin Yan
Yunxiao Qin
Shumin Liu
Jie Chen
10
0
0
11 Nov 2022
Linear Self-Attention Approximation via Trainable Feedforward Kernel
Uladzislau Yorsh
Alexander Kovalenko
27
0
0
08 Nov 2022
Studying inductive biases in image classification task
N. Arizumi
21
1
0
31 Oct 2022
Grafting Vision Transformers
Jong Sung Park
Kumara Kahatapitiya
Donghyun Kim
Shivchander Sudalairaj
Quanfu Fan
Michael S. Ryoo
ViT
23
2
0
28 Oct 2022
VLT: Vision-Language Transformer and Query Generation for Referring Segmentation
Henghui Ding
Chang Liu
Suchen Wang
Xudong Jiang
68
115
0
28 Oct 2022
Towards Improving Workers' Safety and Progress Monitoring of Construction Sites Through Construction Site Understanding
Mahdi Bonyani
Maryam Soleymani
19
0
0
27 Oct 2022
ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
Haoran You
Zhanyi Sun
Huihong Shi
Zhongzhi Yu
Yang Katie Zhao
Yongan Zhang
Chaojian Li
Baopu Li
Yingyan Lin
ViT
17
76
0
18 Oct 2022
SWFormer: Sparse Window Transformer for 3D Object Detection in Point Clouds
Pei Sun
Mingxing Tan
Weiyue Wang
Chenxi Liu
Fei Xia
Zhaoqi Leng
Drago Anguelov
ViT
21
114
0
13 Oct 2022
Vision Transformers provably learn spatial structure
Samy Jelassi
Michael E. Sander
Yuan-Fang Li
ViT
MLT
32
73
0
13 Oct 2022
Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge
Shuhao Deng
Chengfei Li
Jinfeng Bai
Qingqing Zhang
Weiqiang Zhang
Runyan Yang
Gaofeng Cheng
Pengyuan Zhang
Yonghong Yan
15
1
0
12 Oct 2022
Understanding Embodied Reference with Touch-Line Transformer
Y. Li
Xiaoxue Chen
Hao Zhao
Jiangtao Gong
Guyue Zhou
Federico Rossano
Yixin Zhu
158
15
0
11 Oct 2022
Compressed Vision for Efficient Video Understanding
Olivia Wiles
João Carreira
Iain Barr
Andrew Zisserman
Mateusz Malinowski
14
7
0
06 Oct 2022