2207.01076
Cited By
Divert More Attention to Vision-Language Tracking
3 July 2022
Mingzhe Guo
Zhipeng Zhang
Heng Fan
Li Jing
Papers citing "Divert More Attention to Vision-Language Tracking"
31 / 31 papers shown
Title
COST: Contrastive One-Stage Transformer for Vision-Language Small Object Tracking
Chunhui Zhang
Li Liu
Jialin Gao
Xin Sun
Hao Wen
Xi Zhou
Shiming Ge
Y. Wang
33
0
0
02 Apr 2025
SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Tracking
Wenrui Cai
Qingjie Liu
Y. Wang
MoE
55
0
0
24 Mar 2025
Towards General Multimodal Visual Tracking
Andong Lu
Mai Wen
Jinhu Wang
Yuanzhi Guo
Chenglong Li
Jin Tang
Bin Luo
36
0
0
14 Mar 2025
Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding
Xin Gu
Yaojie Shen
Chenxi Luo
Tiejian Luo
Yan Huang
Yuewei Lin
Heng Fan
L. Zhang
50
1
0
16 Feb 2025
Enhancing Vision-Language Tracking by Effectively Converting Textual Cues into Visual Cues
X. Feng
D. Zhang
Shuyan Hu
X. Li
M. Wu
Jie Zhang
Xiaojing Chen
K. Huang
38
0
0
27 Dec 2024
MambaTrack: Exploiting Dual-Enhancement for Night UAV Tracking
Chunhui Zhang
Li Liu
Hao-Kai Wen
Xi Zhou
Y. Wang
Mamba
95
2
0
24 Nov 2024
How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking
Xuchen Li
Shiyu Hu
Xiaokun Feng
Dailing Zhang
Meiqi Wu
Jing Zhang
Kaiqi Huang
56
0
0
23 Nov 2024
DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Xuchen Li
Shiyu Hu
Xiaokun Feng
Dailing Zhang
Meiqi Wu
Jing Zhang
Kaiqi Huang
24
5
0
03 Oct 2024
Improving Visual Object Tracking through Visual Prompting
Shih-Fang Chen
Jun-Cheng Chen
I-Hong Jhuo
Yen-Yu Lin
VLM
23
1
0
27 Sep 2024
Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark
Xuchen Li
Shiyu Hu
Xiaokun Feng
Dailing Zhang
Meiqi Wu
Jing Zhang
Kaiqi Huang
VLM
MLLM
19
6
0
13 Sep 2024
Autogenic Language Embedding for Coherent Point Tracking
Zikai Song
Ying Tang
Run Luo
Lintao Ma
Junqing Yu
Yi-Ping Phoebe Chen
Wei Yang
39
3
0
30 Jul 2024
WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark
Chunhui Zhang
Li Liu
Guanjie Huang
Hao-Kai Wen
Xi Zhou
Yanfeng Wang
38
8
0
30 May 2024
DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM
Xuchen Li
Xiaokun Feng
Shiyu Hu
Meiqi Wu
Dailing Zhang
Jing Zhang
Kaiqi Huang
VLM
28
16
0
20 May 2024
VastTrack: Vast Category Visual Object Tracking
Liang Peng
Junyuan Gao
Xinran Liu
Weihong Li
Shaohua Dong
Zhipeng Zhang
Heng Fan
Libo Zhang
VLM
32
6
0
06 Mar 2024
Unifying Visual and Vision-Language Tracking via Contrastive Learning
Yinchao Ma
Yuyang Tang
Wenfei Yang
Tianzhu Zhang
Jinpeng Zhang
Mengxue Kang
ObjD
8
12
0
20 Jan 2024
Context-Guided Spatio-Temporal Video Grounding
Xin Gu
Hengrui Fan
Yan Huang
Tiejian Luo
Libo Zhang
18
13
0
03 Jan 2024
Tracking with Human-Intent Reasoning
Jiawen Zhu
Zhi-Qi Cheng
Jun-Yan He
Chenyang Li
Bin Luo
Huchuan Lu
Yifeng Geng
Xuansong Xie
LRM
VOS
27
6
0
29 Dec 2023
Beyond Visual Cues: Synchronously Exploring Target-Centric Semantics for Vision-Language Tracking
Jiawei Ge
Xiangmei Chen
Jiuxin Cao
Xueling Zhu
Bo Liu
VLM
27
2
0
28 Nov 2023
Towards Unified Token Learning for Vision-Language Tracking
Yaozong Zheng
Bineng Zhong
Qihua Liang
Guorong Li
R. Ji
Xianxian Li
19
28
0
27 Aug 2023
CiteTracker: Correlating Image and Text for Visual Tracking
Xin Li
Yuqing Huang
Zhenyu He
Yaowei Wang
Huchuan Lu
Ming-Hsuan Yang
22
28
0
22 Aug 2023
Divert More Attention to Vision-Language Object Tracking
Mingzhe Guo
Zhipeng Zhang
Li Jing
Haibin Ling
Heng Fan
VLM
22
3
0
19 Jul 2023
All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment
Chunhui Zhang
Xin Sun
Li Liu
Yiqian Yang
Qiong Liu
Xiaoping Zhou
Yanfeng Wang
30
15
0
07 Jul 2023
Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking
Xin Chen
Houwen Peng
Jiawen Zhu
Dong Wang
Han Hu
Huchuan Lu
61
22
0
27 Apr 2023
Joint Visual Grounding and Tracking with Natural Language Specification
Li Zhou
Zikun Zhou
Kaige Mao
Zhenyu He
17
56
0
21 Mar 2023
WebUAV-3M: A Benchmark for Unveiling the Power of Million-Scale Deep UAV Tracking
Chunhui Zhang
Guanjie Huang
Li Liu
Shan Huang
Yinan Yang
Xiang Wan
Shiming Ge
Dacheng Tao
22
22
0
19 Jan 2022
TrackFormer: Multi-Object Tracking with Transformers
Tim Meinhardt
A. Kirillov
Laura Leal-Taixe
Christoph Feichtenhofer
VOT
208
732
0
07 Jan 2021
TransTrack: Multiple Object Tracking with Transformer
Pei Sun
Jinkun Cao
Yi-Xin Jiang
Rufeng Zhang
Enze Xie
Zehuan Yuan
Changhu Wang
Ping Luo
ViT
VOT
241
555
0
31 Dec 2020
Siamese Box Adaptive Network for Visual Tracking
Zedu Chen
Bineng Zhong
Guorong Li
Shengping Zhang
Rongrong Ji
83
580
0
15 Mar 2020
A Survey on Bias and Fairness in Machine Learning
Ninareh Mehrabi
Fred Morstatter
N. Saxena
Kristina Lerman
Aram Galstyan
SyDa
FaML
286
4,143
0
23 Aug 2019
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
264
5,290
0
05 Nov 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
141
1,458
0
06 Jun 2016