ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2201.05047
  4. Cited By
TransVOD: End-to-End Video Object Detection with Spatial-Temporal
  Transformers

TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers

13 January 2022
Qianyu Zhou
Xiangtai Li
Lu He
Li Niu
Guangliang Cheng
Yunhai Tong
Lizhuang Ma
Liqing Zhang
    ViT
ArXivPDFHTML

Papers citing "TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers"

17 / 17 papers shown
Title
Infrared Small Target Detection in Satellite Videos: A New Dataset and A Novel Recurrent Feature Refinement Framework
Infrared Small Target Detection in Satellite Videos: A New Dataset and A Novel Recurrent Feature Refinement Framework
Xinyi Ying
Li Liu
Zaipin Lin
Yangsi Shi
Y. Wang
Ruojing Li
Xu Cao
Boyang Li
Shilin Zhou
Wei An
122
1
0
21 Feb 2025
GloTSFormer: Global Video Text Spotting Transformer
GloTSFormer: Global Video Text Spotting Transformer
Hang Wang
Yanjie Wang
Yang Li
Can Huang
25
0
0
08 Jan 2024
Diverse Target and Contribution Scheduling for Domain Generalization
Diverse Target and Contribution Scheduling for Domain Generalization
Shaocong Long
Qianyu Zhou
Soham Dan
Lizhuang Ma
Yuan Luo
47
8
0
28 Sep 2023
NOVIS: A Case for End-to-End Near-Online Video Instance Segmentation
NOVIS: A Case for End-to-End Near-Online Video Instance Segmentation
Tim Meinhardt
Matt Feiszli
Yuchen Fan
Laura Leal-Taixe
Rakesh Ranjan
ViT
19
5
0
29 Aug 2023
Object Detection Difficulty: Suppressing Over-aggregation for Faster and
  Better Video Object Detection
Object Detection Difficulty: Suppressing Over-aggregation for Faster and Better Video Object Detection
Bin Zhang
Sen Wang
Yifan Liu
Brano Kusy
Xue Li
Jiajun Liu
ObjD
25
0
0
22 Aug 2023
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance
  Segmentation
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation
Yue Han
Jiangning Zhang
Zhucun Xue
Chao Xu
Xintian Shen
Yabiao Wang
Chengjie Wang
Yong Liu
Xiangtai Li
27
16
0
03 Jan 2023
PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part
  Segmentation
PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation
Xiangtai Li
Shilin Xu
Yibo Yang
Haobo Yuan
Guangliang Cheng
Yu Tong
Zhouchen Lin
Ming-Hsuan Yang
Dacheng Tao
ViT
26
21
0
03 Jan 2023
Betrayed by Captions: Joint Caption Grounding and Generation for Open
  Vocabulary Instance Segmentation
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
Jianzong Wu
Xiangtai Li
Henghui Ding
Xia Li
Guangliang Cheng
Yu Tong
Chen Change Loy
VLM
75
31
0
02 Jan 2023
BoxMask: Revisiting Bounding Box Supervision for Video Object Detection
BoxMask: Revisiting Bounding Box Supervision for Video Object Detection
K. Hashmi
A. Pagani
D. Stricker
Muhammad Zeshan Afzal
VOS
25
10
0
12 Oct 2022
Spatio-Temporal Learnable Proposals for End-to-End Video Object
  Detection
Spatio-Temporal Learnable Proposals for End-to-End Video Object Detection
K. Hashmi
D. Stricker
Muhammamd Zeshan Afzal
21
7
0
05 Oct 2022
Fashionformer: A simple, Effective and Unified Baseline for Human
  Fashion Segmentation and Recognition
Fashionformer: A simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition
Shilin Xu
Xiangtai Li
Jingbo Wang
Guangliang Cheng
Yunhai Tong
Dacheng Tao
ViT
19
27
0
10 Apr 2022
TF-Blender: Temporal Feature Blender for Video Object Detection
TF-Blender: Temporal Feature Blender for Video Object Detection
Yiming Cui
Liqi Yan
Zhiwen Cao
Dongfang Liu
ViT
48
97
0
12 Aug 2021
End-to-End Video Object Detection with Spatial-Temporal Transformers
End-to-End Video Object Detection with Spatial-Temporal Transformers
Lu He
Qianyu Zhou
Xiangtai Li
Li Niu
Guangliang Cheng
Xiao Li
Wenxuan Liu
Yu Tong
Lizhuang Ma
Liqing Zhang
ViT
39
94
0
23 May 2021
TrackFormer: Multi-Object Tracking with Transformers
TrackFormer: Multi-Object Tracking with Transformers
Tim Meinhardt
A. Kirillov
Laura Leal-Taixe
Christoph Feichtenhofer
VOT
208
732
0
07 Jan 2021
TransTrack: Multiple Object Tracking with Transformer
TransTrack: Multiple Object Tracking with Transformer
Pei Sun
Jinkun Cao
Yi-Xin Jiang
Rufeng Zhang
Enze Xie
Zehuan Yuan
Changhu Wang
Ping Luo
ViT
VOT
241
555
0
31 Dec 2020
Memory Enhanced Global-Local Aggregation for Video Object Detection
Memory Enhanced Global-Local Aggregation for Video Object Detection
Yihong Chen
Yue Cao
Han Hu
Liwei Wang
105
261
0
26 Mar 2020
Relation Distillation Networks for Video Object Detection
Relation Distillation Networks for Video Object Detection
Jiajun Deng
Yingwei Pan
Ting Yao
Wen-gang Zhou
Houqiang Li
Tao Mei
ObjD
92
191
0
26 Aug 2019
1