ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2101.08833
  4. Cited By
SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation

SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation

21 January 2021
Brendan Duke
Abdalla Ahmed
Christian Wolf
P. Aarabi
Graham W. Taylor
    VOS
ArXivPDFHTML

Papers citing "SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation"

32 / 32 papers shown
Title
MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection
MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection
Q. Yang
Yuan Yao
Miaomiao Cui
Liefeng Bo
VLM
61
0
0
30 Apr 2025
OpenAVS: Training-Free Open-Vocabulary Audio Visual Segmentation with Foundational Models
OpenAVS: Training-Free Open-Vocabulary Audio Visual Segmentation with Foundational Models
Shengkai Chen
Yifang Yin
Jinming Cao
Shili Xiang
Zhenguang Liu
Roger Zimmermann
VOS
VLM
39
0
0
30 Apr 2025
Learning Spatial-Semantic Features for Robust Video Object Segmentation
Learning Spatial-Semantic Features for Robust Video Object Segmentation
Xin Li
Deshui Miao
Zhenyu He
Y. Wang
Huchuan Lu
Ming Yang
VOS
49
4
0
10 Jul 2024
RMem: Restricted Memory Banks Improve Video Object Segmentation
RMem: Restricted Memory Banks Improve Video Object Segmentation
Junbao Zhou
Ziqi Pang
Yu-xiong Wang
VOS
55
7
0
12 Jun 2024
Self-supervised Video Object Segmentation with Distillation Learning of
  Deformable Attention
Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention
Quang-Trung Truong
Duc Thanh Nguyen
Binh-Son Hua
Sai-Kit Yeung
VOS
34
1
0
25 Jan 2024
Multimodal Variational Auto-encoder based Audio-Visual Segmentation
Multimodal Variational Auto-encoder based Audio-Visual Segmentation
Yuxin Mao
Jing Zhang
Mochu Xiang
Yiran Zhong
Yuchao Dai
35
34
0
12 Oct 2023
Cross-modal Cognitive Consensus guided Audio-Visual Segmentation
Cross-modal Cognitive Consensus guided Audio-Visual Segmentation
Zhaofeng Shi
Qingbo Wu
Fanman Meng
Linfeng Xu
Hongliang Li
VOS
25
3
0
10 Oct 2023
Contrastive Conditional Latent Diffusion for Audio-visual Segmentation
Contrastive Conditional Latent Diffusion for Audio-visual Segmentation
Yuxin Mao
Jing Zhang
Mochu Xiang
Yun-Qiu Lv
Yiran Zhong
Yuchao Dai
DiffM
31
28
0
31 Jul 2023
Hierarchical Spatiotemporal Transformers for Video Object Segmentation
Hierarchical Spatiotemporal Transformers for Video Object Segmentation
Jun-Sang Yoo
H. Lee
Seung‐Won Jung
VOS
26
1
0
17 Jul 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video
  Object Segmentation
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Hongyang Li
Yu Qiao
Hao Dong
Zhongjiang He
Peng Gao
VOS
16
30
0
25 May 2023
Transavs: End-To-End Audio-Visual Segmentation With Transformer
Transavs: End-To-End Audio-Visual Segmentation With Transformer
Yuhang Ling
Yuxi Li
Zhenye Gan
Jiangning Zhang
M. Chi
Yabiao Wang
VOS
ViT
29
1
0
12 May 2023
Co-attention Propagation Network for Zero-Shot Video Object Segmentation
Co-attention Propagation Network for Zero-Shot Video Object Segmentation
Gensheng Pei
Yazhou Yao
Fumin Shen
Daniel Huang
Xing-Rui Huang
Hengtao Shen
VOS
28
11
0
08 Apr 2023
Online Lane Graph Extraction from Onboard Video
Online Lane Graph Extraction from Onboard Video
Y. Can
Alexander Liniger
D. Paudel
Luc Van Gool
19
2
0
03 Apr 2023
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
Henghui Ding
Chang Liu
Shuting He
Xudong Jiang
Philip H. S. Torr
S. Bai
VOS
25
132
0
03 Feb 2023
Look Before You Match: Instance Understanding Matters in Video Object
  Segmentation
Look Before You Match: Instance Understanding Matters in Video Object Segmentation
Junke Wang
Dongdong Chen
Zuxuan Wu
Chong Luo
Chuanxin Tang
Xiyang Dai
Yucheng Zhao
Yujia Xie
Lu Yuan
Yu-Gang Jiang
VOS
28
39
0
13 Dec 2022
Breaking the "Object" in Video Object Segmentation
Breaking the "Object" in Video Object Segmentation
P. Tokmakov
Jie Li
Adrien Gaidon
VOS
24
39
0
12 Dec 2022
Video Object of Interest Segmentation
Video Object of Interest Segmentation
Siyuan Zhou
Chunru Zhan
Biao Wang
T. Ge
Yuning Jiang
Li Niu
VOS
18
0
0
06 Dec 2022
Grafting Vision Transformers
Grafting Vision Transformers
Jong Sung Park
Kumara Kahatapitiya
Donghyun Kim
Shivchander Sudalairaj
Quanfu Fan
Michael S. Ryoo
ViT
21
2
0
28 Oct 2022
Per-Clip Video Object Segmentation
Per-Clip Video Object Segmentation
Kwanyong Park
Sanghyun Woo
Seoung Wug Oh
In So Kweon
Joon-Young Lee
VLM
VOS
27
50
0
03 Aug 2022
BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring
  Space for Video Object Segmentation
BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation
Ye Yu
Jialing Yuan
Gaurav Mittal
Fuxin Li
Mei Chen
VOS
45
28
0
01 Aug 2022
Region Aware Video Object Segmentation with Deep Motion Modeling
Region Aware Video Object Segmentation with Deep Motion Modeling
Bo Miao
Bennamoun
Yongsheng Gao
Ajmal Saeed Mian
VOS
12
16
0
21 Jul 2022
Hierarchical Feature Alignment Network for Unsupervised Video Object
  Segmentation
Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation
Gensheng Pei
Fumin Shen
Yazhou Yao
G. Xie
Zhenmin Tang
Jinhui Tang
VOS
23
51
0
18 Jul 2022
Learning Quality-aware Dynamic Memory for Video Object Segmentation
Learning Quality-aware Dynamic Memory for Video Object Segmentation
Yong Liu
R. Yu
Fei Yin
Xinyuan Zhao
Wei-Ye Zhao
Weihao Xia
Yujiu Yang
VOS
19
47
0
16 Jul 2022
Audio-Visual Segmentation
Audio-Visual Segmentation
Jinxing Zhou
Jianyuan Wang
J. Zhang
Weixuan Sun
Jing Zhang
Stan Birchfield
Dan Guo
Lingpeng Kong
Meng Wang
Yiran Zhong
VOS
28
110
0
11 Jul 2022
Recurrent Dynamic Embedding for Video Object Segmentation
Recurrent Dynamic Embedding for Video Object Segmentation
Mingxing Li
Liucheng Hu
Zhiwei Xiong
Bang Zhang
Pan Pan
Dong Liu
VOS
59
61
0
08 May 2022
Temporal Context for Robust Maritime Obstacle Detection
Temporal Context for Robust Maritime Obstacle Detection
Lojze Žust
Matej Kristan
21
14
0
10 Mar 2022
UniFormer: Unifying Convolution and Self-attention for Visual
  Recognition
UniFormer: Unifying Convolution and Self-attention for Visual Recognition
Kunchang Li
Yali Wang
Junhao Zhang
Peng Gao
Guanglu Song
Yu Liu
Hongsheng Li
Yu Qiao
ViT
142
361
0
24 Jan 2022
Video Transformers: A Survey
Video Transformers: A Survey
Javier Selva
A. S. Johansen
Sergio Escalera
Kamal Nasrollahi
T. Moeslund
Albert Clapés
ViT
20
102
0
16 Jan 2022
Reliable Propagation-Correction Modulation for Video Object Segmentation
Reliable Propagation-Correction Modulation for Video Object Segmentation
Xiaohao Xu
Jinglu Wang
Xiao Li
Yan Lu
VOS
38
61
0
06 Dec 2021
SWAT: Spatial Structure Within and Among Tokens
SWAT: Spatial Structure Within and Among Tokens
Kumara Kahatapitiya
Michael S. Ryoo
23
6
0
26 Nov 2021
Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer
Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer
Yifan Xu
Zhijie Zhang
Mengdan Zhang
Kekai Sheng
Ke Li
Weiming Dong
Liqing Zhang
Changsheng Xu
Xing Sun
ViT
18
201
0
03 Aug 2021
A Survey on Deep Learning Technique for Video Segmentation
A Survey on Deep Learning Technique for Video Segmentation
Tianfei Zhou
Fatih Porikli
David J. Crandall
Luc Van Gool
Wenguan Wang
VOS
20
230
0
02 Jul 2021
1