Backbone is All Your Need: A Simplified Architecture for Visual Object Tracking
arXiv: 2203.05328

10 March 2022
Boyu Chen
Peixia Li
Lei Bai
Leixian Qiao
Qiuhong Shen
Bo-wen Li
Weihao Gan
Wei Wu
Wanli Ouyang
    ViT
    VOT

Papers citing "Backbone is All Your Need: A Simplified Architecture for Visual Object Tracking"

31 / 81 papers shown
Joint Modeling of Feature, Correspondence, and a Compressed Memory for Video Object Segmentation
Jiaming Zhang
Yutao Cui
Gangshan Wu
Limin Wang
VOS
55
10
0
25 Aug 2023
Synchronize Feature Extracting and Matching: A Single Branch Framework for 3D Object Tracking
Teli Ma
Mengmeng Wang
Jimin Xiao
Hui-Ru Wu
Yong-Jin Liu
3DPC
17
12
0
24 Aug 2023
CiteTracker: Correlating Image and Text for Visual Tracking
Xin Li
Yuqing Huang
Zhenyu He
Yaowei Wang
Huchuan Lu
Ming-Hsuan Yang
24
28
0
22 Aug 2023
Scalable Video Object Segmentation with Simplified Framework
Qiangqiang Wu
Tianyu Yang
Wei Wu
Antoni B. Chan
VOS
11
20
0
19 Aug 2023
Multi-scale Target-Aware Framework for Constrained Image Splicing Detection and Localization
Yuxuan Tan
Yuanman Li
Li Zeng
J. Ye
W. Wang
Xia Li
22
5
0
18 Aug 2023
Exploring Lightweight Hierarchical Vision Transformers for Efficient Visual Tracking
Ben Kang
Xin Chen
D. Wang
Houwen Peng
Huchuan Lu
8
46
0
14 Aug 2023
Robust Object Modeling for Visual Tracking
Y. Cai
Jie Liu
Jie Tang
Gangshan Wu
17
53
0
09 Aug 2023
360VOT: A New Benchmark Dataset for Omnidirectional Visual Object Tracking
Huajian Huang
Yin Xu
Yingshu Chen
Sai-Kit Yeung
8
6
0
27 Jul 2023
Cross-modal Orthogonal High-rank Augmentation for RGB-Event Transformer-trackers
Zhiyu Zhu
Junhui Hou
Dapeng Oliver Wu
ViT
19
21
0
09 Jul 2023
All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment
Chunhui Zhang
Xin Sun
Li Liu
Yiqian Yang
Qiong Liu
Xiaoping Zhou
Yanfeng Wang
33
15
0
07 Jul 2023
MixFormerV2: Efficient Fully Transformer Tracking
Yutao Cui
Tian-Shu Song
Gangshan Wu
Limin Wang
13
24
0
25 May 2023
Correlation Pyramid Network for 3D Single Object Tracking
Mengmeng Wang
Teli Ma
Xingxing Zuo
Jiajun Lv
Yong-Jin Liu
3DPC
23
10
0
16 May 2023
Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking
Xin Chen
Houwen Peng
Jiawen Zhu
Dong Wang
Han Hu
Huchuan Lu
61
22
0
27 Apr 2023
RGB-T Tracking Based on Mixed Attention
Yang Luo
Xiqing Guo
Ming Dong
Jin-xia Yu
26
15
0
09 Apr 2023
Generalized Relation Modeling for Transformer Tracking
Shenyuan Gao
Chunluan Zhou
Jun Zhang
ViT
19
50
0
29 Mar 2023
OmniTracker: Unifying Object Tracking by Tracking-with-Detection
Junke Wang
Dongdong Chen
Zuxuan Wu
Chong Luo
Xiyang Dai
Lu Yuan
Yu-Gang Jiang
VOT
12
12
0
21 Mar 2023
PlanarTrack: A Large-scale Challenging Benchmark for Planar Object Tracking
Xinran Liu
Xiaoqiong Liu
Ziruo Yi
Xinyi Zhou
Thanh Le
Libo Zhang
Yanling Huang
Q. Yang
Heng Fan
4
0
0
14 Mar 2023
Universal Instance Perception as Object Discovery and Retrieval
B. Yan
Yi-Xin Jiang
Jiannan Wu
D. Wang
Ping Luo
Zehuan Yuan
Huchuan Lu
VOS
VLM
LRM
19
161
0
12 Mar 2023
Transformers in Single Object Tracking: An Experimental Survey
Janani Kugarajeevan
T. Kokul
A. Ramanan
Subha Fernando
30
35
0
23 Feb 2023
ProContEXT: Exploring Progressive Context Transformer for Tracking
Jinpeng Lan
Zhi-Qi Cheng
Ju He
Chenyang Li
Bin Luo
Xueting Bao
Wangmeng Xiang
Yifeng Geng
Xuansong Xie
33
29
0
27 Oct 2022
High-Performance Transformer Tracking
Xin Chen
B. Yan
Jiawen Zhu
Huchuan Lu
Xiang Ruan
D. Wang
ViT
19
33
0
25 Mar 2022
MixFormer: End-to-End Tracking with Iterative Mixed Attention
Yutao Cui
Cheng Jiang
Limin Wang
Gangshan Wu
VOT
23
437
0
21 Mar 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,337
0
11 Nov 2021
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling
Renrui Zhang
Rongyao Fang
Wei Zhang
Peng Gao
Kunchang Li
Jifeng Dai
Yu Qiao
Hongsheng Li
VLM
178
281
0
06 Nov 2021
CPT: Colorful Prompt Tuning for Pre-trained Vision-Language Models
Yuan Yao
Ao Zhang
Zhengyan Zhang
Zhiyuan Liu
Tat-Seng Chua
Maosong Sun
MLLM
VPVLM
VLM
194
218
0
24 Sep 2021
Pix2seq: A Language Modeling Framework for Object Detection
Ting Chen
Saurabh Saxena
Lala Li
David J. Fleet
Geoffrey E. Hinton
MLLM
ViT
VLM
233
341
0
22 Sep 2021
PSViT: Better Vision Transformer via Token Pooling and Attention Sharing
Boyu Chen
Peixia Li
Baopu Li
Chuming Li
Lei Bai
Chen Lin
Ming-hui Sun
Junjie Yan
Wanli Ouyang
ViT
57
33
0
07 Aug 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
263
3,538
0
24 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
293
2,875
0
11 Feb 2021
Siamese Box Adaptive Network for Visual Tracking
Zedu Chen
Bineng Zhong
Guorong Li
Shengping Zhang
Rongrong Ji
83
659
0
15 Mar 2020
TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in the Wild
Matthias Muller
Adel Bibi
Silvio Giancola
Salman Al-Subaihi
Bernard Ghanem
203
785
0
28 Mar 2018