Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2107.12292
Cited By
Contextual Transformer Networks for Visual Recognition
26 July 2021
Yehao Li
Ting Yao
Yingwei Pan
Tao Mei
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Contextual Transformer Networks for Visual Recognition"
32 / 32 papers shown
Title
A Decade of You Only Look Once (YOLO) for Object Detection
Leo Thomas Ramos
Angel D. Sappa
66
0
0
24 Apr 2025
Quantum Complex-Valued Self-Attention Model
Fu Chen
Qinglin Zhao
Li Feng
Longfei Tang
Yangbin Lin
Haitao Huang
MQ
56
0
0
24 Mar 2025
Breaking the Low-Rank Dilemma of Linear Attention
Qihang Fan
Huaibo Huang
Ran He
40
0
0
12 Nov 2024
Brain Tumor Segmentation in MRI Images with 3D U-Net and Contextual Transformer
Thien-Qua T. Nguyen
Hieu-Nghia Nguyen
Thanh-Hieu Bui
Thien B. Nguyen-Tat
V. M. Ngo
ViT
MedIm
25
1
0
11 Jul 2024
HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs
Ting Yao
Yehao Li
Yingwei Pan
Tao Mei
ViT
25
15
0
18 Mar 2024
Approximate Nullspace Augmented Finetuning for Robust Vision Transformers
Haoyang Liu
Aditya Singh
Yijiang Li
Haohan Wang
AAML
ViT
36
1
0
15 Mar 2024
Progressive Feedback-Enhanced Transformer for Image Forgery Localization
Haochen Zhu
Gang Cao
Xianglin Huang
ViT
29
7
0
15 Nov 2023
3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Models
Haibo Yang
Yang Chen
Yingwei Pan
Ting Yao
Zhineng Chen
Tao Mei
19
19
0
09 Nov 2023
ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors
Jingwen Chen
Yingwei Pan
Ting Yao
Tao Mei
DiffM
31
38
0
09 Nov 2023
Control3D: Towards Controllable Text-to-3D Generation
Yang Chen
Yingwei Pan
Yehao Li
Ting Yao
Tao Mei
DiffM
25
45
0
09 Nov 2023
Progressive Sub-Graph Clustering Algorithm for Semi-Supervised Domain Adaptation Speaker Verification
Zhuo Li
Jingze Lu
Z. Zhao
Wenchao Wang
Pengyuan Zhang
16
1
0
22 May 2023
Efficiency 360: Efficient Vision Transformers
Badri N. Patro
Vijay Srinivas Agneeswaran
26
6
0
16 Feb 2023
Rega-Net:Retina Gabor Attention for Deep Convolutional Neural Networks
Chun Bao
Jie Cao
Yaqian Ning
Yang Cheng
Q. Hao
26
1
0
23 Nov 2022
3D Cascade RCNN: High Quality Object Detection in Point Clouds
Qi Cai
Yingwei Pan
Ting Yao
Tao Mei
3DPC
20
21
0
15 Nov 2022
Contextual Learning in Fourier Complex Field for VHR Remote Sensing Images
Yan Zhang
Xiyuan Gao
Qingyan Duan
Jiaxu Leng
Xiao Pu
Xinbo Gao
ViT
16
1
0
28 Oct 2022
EmbryosFormer: Deformable Transformer and Collaborative Encoding-Decoding for Embryos Stage Development Classification
Tien-Phat Nguyen
Trong-Thang Pham
Tri Minh Nguyen
H. Le
Dung Nguyen
Hau Lam
Phong H. Nguyen
Jennifer Fowler
Minh-Triet Tran
Ngan Le
ViT
30
13
0
07 Oct 2022
APAUNet: Axis Projection Attention UNet for Small Target in 3D Medical Segmentation
Yuncheng Jiang
Zixun Zhang
Shixi Qin
Yao Guo
Zhuguo Li
Shuguang Cui
35
12
0
04 Oct 2022
Hierarchical Local-Global Transformer for Temporal Sentence Grounding
Xiang Fang
Daizong Liu
Pan Zhou
Zichuan Xu
Rui Li
12
28
0
31 Aug 2022
V
2
^2
2
L: Leveraging Vision and Vision-language Models into Large-scale Product Retrieval
Wenhao Wang
Yifan Sun
Zongxin Yang
Yi Yang
VLM
16
3
0
26 Jul 2022
Jigsaw-ViT: Learning Jigsaw Puzzles in Vision Transformer
Yingyi Chen
Xiaoke Shen
Yahui Liu
Qinghua Tao
Johan A. K. Suykens
AAML
ViT
21
22
0
25 Jul 2022
Behind Every Domain There is a Shift: Adapting Distortion-aware Vision Transformers for Panoramic Semantic Segmentation
Jiaming Zhang
Kailun Yang
Haowen Shi
Simon Reiß
Kunyu Peng
Chaoxiang Ma
Haodong Fu
Philip H. S. Torr
Kaiwei Wang
Rainer Stiefelhagen
ViT
MDE
31
35
0
25 Jul 2022
Dual Vision Transformer
Ting Yao
Yehao Li
Yingwei Pan
Yu Wang
Xiaoping Zhang
Tao Mei
ViT
141
75
0
11 Jul 2022
Comprehending and Ordering Semantics for Image Captioning
Yehao Li
Yingwei Pan
Ting Yao
Tao Mei
17
87
0
14 Jun 2022
MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Tao Mei
ViT
24
15
0
13 Jun 2022
Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection
Y. Zhang
Yingwei Pan
Ting Yao
Rui Huang
Tao Mei
C. Chen
ViT
21
68
0
13 Jun 2022
DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers
Xianing Chen
Qiong Cao
Yujie Zhong
Jing Zhang
Shenghua Gao
Dacheng Tao
ViT
32
76
0
27 Apr 2022
Transformer in Transformer
Kai Han
An Xiao
Enhua Wu
Jianyuan Guo
Chunjing Xu
Yunhe Wang
ViT
284
1,524
0
27 Feb 2021
LambdaNetworks: Modeling Long-Range Interactions Without Attention
Irwan Bello
269
179
0
17 Feb 2021
Bottleneck Transformers for Visual Recognition
A. Srinivas
Tsung-Yi Lin
Niki Parmar
Jonathon Shlens
Pieter Abbeel
Ashish Vaswani
SLR
290
979
0
27 Jan 2021
Bag of Tricks for Image Classification with Convolutional Neural Networks
Tong He
Zhi-Li Zhang
Hang Zhang
Zhongyue Zhang
Junyuan Xie
Mu Li
221
1,399
0
04 Dec 2018
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
297
10,216
0
16 Nov 2016
Semantic Understanding of Scenes through the ADE20K Dataset
Bolei Zhou
Hang Zhao
Xavier Puig
Tete Xiao
Sanja Fidler
Adela Barriuso
Antonio Torralba
SSeg
253
1,827
0
18 Aug 2016
1