Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.05328
Cited By
Backbone is All Your Need: A Simplified Architecture for Visual Object Tracking
10 March 2022
Boyu Chen
Peixia Li
Lei Bai
Leixian Qiao
Qiuhong Shen
Bo-wen Li
Weihao Gan
Wei Wu
Wanli Ouyang
ViT
VOT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Backbone is All Your Need: A Simplified Architecture for Visual Object Tracking"
31 / 81 papers shown
Title
Joint Modeling of Feature, Correspondence, and a Compressed Memory for Video Object Segmentation
Jiaming Zhang
Yutao Cui
Gangshan Wu
Limin Wang
VOS
55
10
0
25 Aug 2023
Synchronize Feature Extracting and Matching: A Single Branch Framework for 3D Object Tracking
Teli Ma
Mengmeng Wang
Jimin Xiao
Hui-Ru Wu
Yong-Jin Liu
3DPC
17
12
0
24 Aug 2023
CiteTracker: Correlating Image and Text for Visual Tracking
Xin Li
Yuqing Huang
Zhenyu He
Yaowei Wang
Huchuan Lu
Ming-Hsuan Yang
24
28
0
22 Aug 2023
Scalable Video Object Segmentation with Simplified Framework
Qiangqiang Wu
Tianyu Yang
WU Wei
Antoni B. Chan
VOS
11
20
0
19 Aug 2023
Multi-scale Target-Aware Framework for Constrained Image Splicing Detection and Localization
Yuxuan Tan
Yuanman Li
Li Zeng
J. Ye
W. Wang
Xia Li
22
5
0
18 Aug 2023
Exploring Lightweight Hierarchical Vision Transformers for Efficient Visual Tracking
Ben Kang
Xin Chen
D. Wang
Houwen Peng
Huchuan Lu
8
46
0
14 Aug 2023
Robust Object Modeling for Visual Tracking
Y. Cai
Jie Liu
Jie Tang
Gangshan Wu
17
53
0
09 Aug 2023
360VOT: A New Benchmark Dataset for Omnidirectional Visual Object Tracking
Huajian Huang
Yin Xu
Yingshu Chen
Sai-Kit Yeung
8
6
0
27 Jul 2023
Cross-modal Orthogonal High-rank Augmentation for RGB-Event Transformer-trackers
Zhiyu Zhu
Junhui Hou
Dapeng Oliver Wu
ViT
19
21
0
09 Jul 2023
All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment
Chunhui Zhang
Xin Sun
Li Liu
Yiqian Yang
Qiong Liu
Xiaoping Zhou
Yanfeng Wang
33
15
0
07 Jul 2023
MixFormerV2: Efficient Fully Transformer Tracking
Yutao Cui
Tian-Shu Song
Gangshan Wu
Liming Wang
13
24
0
25 May 2023
Correlation Pyramid Network for 3D Single Object Tracking
Mengmeng Wang
Teli Ma
Xingxing Zuo
Jiajun Lv
Yong-Jin Liu
3DPC
23
10
0
16 May 2023
Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking
Xin Chen
Houwen Peng
Jiawen Zhu
Dong Wang
Han Hu
Huchuan Lu
61
22
0
27 Apr 2023
RGB-T Tracking Based on Mixed Attention
Yang Luo
Xiqing Guo
Ming Dong
Jin-xia Yu
26
15
0
09 Apr 2023
Generalized Relation Modeling for Transformer Tracking
Shenyuan Gao
Chunluan Zhou
Jun Zhang
ViT
19
50
0
29 Mar 2023
OmniTracker: Unifying Object Tracking by Tracking-with-Detection
Junke Wang
Dongdong Chen
Zuxuan Wu
Chong Luo
Xiyang Dai
Lu Yuan
Yu-Gang Jiang
VOT
12
12
0
21 Mar 2023
PlanarTrack: A Large-scale Challenging Benchmark for Planar Object Tracking
Xinran Liu
Xiaoqiong Liu
Ziruo Yi
Xinyi Zhou
Thanh Le
Libo Zhang
Yanling Huang
Q. Yang
Heng Fan
4
0
0
14 Mar 2023
Universal Instance Perception as Object Discovery and Retrieval
B. Yan
Yi-Xin Jiang
Jiannan Wu
D. Wang
Ping Luo
Zehuan Yuan
Huchuan Lu
VOS
VLM
LRM
19
161
0
12 Mar 2023
Transformers in Single Object Tracking: An Experimental Survey
Janani Kugarajeevan
T. Kokul
A. Ramanan
Subha Fernando
30
35
0
23 Feb 2023
ProContEXT: Exploring Progressive Context Transformer for Tracking
Jinpeng Lan
Zhi-Qi Cheng
Ju He
Chenyang Li
Bin Luo
Xueting Bao
Wangmeng Xiang
Yifeng Geng
Xuansong Xie
33
29
0
27 Oct 2022
High-Performance Transformer Tracking
Xin Chen
B. Yan
Jiawen Zhu
Huchuan Lu
Xiang Ruan
D. Wang
ViT
19
33
0
25 Mar 2022
MixFormer: End-to-End Tracking with Iterative Mixed Attention
Yutao Cui
Jiang Cheng
Limin Wang
Gangshan Wu
VOT
23
437
0
21 Mar 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,337
0
11 Nov 2021
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling
Renrui Zhang
Rongyao Fang
Wei Zhang
Peng Gao
Kunchang Li
Jifeng Dai
Yu Qiao
Hongsheng Li
VLM
178
281
0
06 Nov 2021
CPT: Colorful Prompt Tuning for Pre-trained Vision-Language Models
Yuan Yao
Ao Zhang
Zhengyan Zhang
Zhiyuan Liu
Tat-Seng Chua
Maosong Sun
MLLM
VPVLM
VLM
194
218
0
24 Sep 2021
Pix2seq: A Language Modeling Framework for Object Detection
Ting-Li Chen
Saurabh Saxena
Lala Li
David J. Fleet
Geoffrey E. Hinton
MLLM
ViT
VLM
233
341
0
22 Sep 2021
PSViT: Better Vision Transformer via Token Pooling and Attention Sharing
Boyu Chen
Peixia Li
Baopu Li
Chuming Li
Lei Bai
Chen Lin
Ming-hui Sun
Junjie Yan
Wanli Ouyang
ViT
57
33
0
07 Aug 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
263
3,538
0
24 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
293
2,875
0
11 Feb 2021
Siamese Box Adaptive Network for Visual Tracking
Zedu Chen
Bineng Zhong
Guorong Li
Shengping Zhang
Rongrong Ji
83
659
0
15 Mar 2020
TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in the Wild
Matthias Muller
Adel Bibi
Silvio Giancola
Salman Al-Subaihi
Bernard Ghanem
203
785
0
28 Mar 2018
Previous
1
2