Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2101.08833
Cited By
SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation
21 January 2021
Brendan Duke
Abdalla Ahmed
Christian Wolf
P. Aarabi
Graham W. Taylor
VOS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation"
32 / 32 papers shown
Title
MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection
Q. Yang
Yuan Yao
Miaomiao Cui
Liefeng Bo
VLM
61
0
0
30 Apr 2025
OpenAVS: Training-Free Open-Vocabulary Audio Visual Segmentation with Foundational Models
Shengkai Chen
Yifang Yin
Jinming Cao
Shili Xiang
Zhenguang Liu
Roger Zimmermann
VOS
VLM
39
0
0
30 Apr 2025
Learning Spatial-Semantic Features for Robust Video Object Segmentation
Xin Li
Deshui Miao
Zhenyu He
Y. Wang
Huchuan Lu
Ming Yang
VOS
49
4
0
10 Jul 2024
RMem: Restricted Memory Banks Improve Video Object Segmentation
Junbao Zhou
Ziqi Pang
Yu-xiong Wang
VOS
55
7
0
12 Jun 2024
Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention
Quang-Trung Truong
Duc Thanh Nguyen
Binh-Son Hua
Sai-Kit Yeung
VOS
34
1
0
25 Jan 2024
Multimodal Variational Auto-encoder based Audio-Visual Segmentation
Yuxin Mao
Jing Zhang
Mochu Xiang
Yiran Zhong
Yuchao Dai
35
34
0
12 Oct 2023
Cross-modal Cognitive Consensus guided Audio-Visual Segmentation
Zhaofeng Shi
Qingbo Wu
Fanman Meng
Linfeng Xu
Hongliang Li
VOS
25
3
0
10 Oct 2023
Contrastive Conditional Latent Diffusion for Audio-visual Segmentation
Yuxin Mao
Jing Zhang
Mochu Xiang
Yun-Qiu Lv
Yiran Zhong
Yuchao Dai
DiffM
31
28
0
31 Jul 2023
Hierarchical Spatiotemporal Transformers for Video Object Segmentation
Jun-Sang Yoo
H. Lee
Seung‐Won Jung
VOS
26
1
0
17 Jul 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Hongyang Li
Yu Qiao
Hao Dong
Zhongjiang He
Peng Gao
VOS
16
30
0
25 May 2023
Transavs: End-To-End Audio-Visual Segmentation With Transformer
Yuhang Ling
Yuxi Li
Zhenye Gan
Jiangning Zhang
M. Chi
Yabiao Wang
VOS
ViT
29
1
0
12 May 2023
Co-attention Propagation Network for Zero-Shot Video Object Segmentation
Gensheng Pei
Yazhou Yao
Fumin Shen
Daniel Huang
Xing-Rui Huang
Hengtao Shen
VOS
28
11
0
08 Apr 2023
Online Lane Graph Extraction from Onboard Video
Y. Can
Alexander Liniger
D. Paudel
Luc Van Gool
19
2
0
03 Apr 2023
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
Henghui Ding
Chang Liu
Shuting He
Xudong Jiang
Philip H. S. Torr
S. Bai
VOS
25
132
0
03 Feb 2023
Look Before You Match: Instance Understanding Matters in Video Object Segmentation
Junke Wang
Dongdong Chen
Zuxuan Wu
Chong Luo
Chuanxin Tang
Xiyang Dai
Yucheng Zhao
Yujia Xie
Lu Yuan
Yu-Gang Jiang
VOS
28
39
0
13 Dec 2022
Breaking the "Object" in Video Object Segmentation
P. Tokmakov
Jie Li
Adrien Gaidon
VOS
24
39
0
12 Dec 2022
Video Object of Interest Segmentation
Siyuan Zhou
Chunru Zhan
Biao Wang
T. Ge
Yuning Jiang
Li Niu
VOS
18
0
0
06 Dec 2022
Grafting Vision Transformers
Jong Sung Park
Kumara Kahatapitiya
Donghyun Kim
Shivchander Sudalairaj
Quanfu Fan
Michael S. Ryoo
ViT
21
2
0
28 Oct 2022
Per-Clip Video Object Segmentation
Kwanyong Park
Sanghyun Woo
Seoung Wug Oh
In So Kweon
Joon-Young Lee
VLM
VOS
27
50
0
03 Aug 2022
BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation
Ye Yu
Jialing Yuan
Gaurav Mittal
Fuxin Li
Mei Chen
VOS
45
28
0
01 Aug 2022
Region Aware Video Object Segmentation with Deep Motion Modeling
Bo Miao
Bennamoun
Yongsheng Gao
Ajmal Saeed Mian
VOS
12
16
0
21 Jul 2022
Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation
Gensheng Pei
Fumin Shen
Yazhou Yao
G. Xie
Zhenmin Tang
Jinhui Tang
VOS
23
51
0
18 Jul 2022
Learning Quality-aware Dynamic Memory for Video Object Segmentation
Yong Liu
R. Yu
Fei Yin
Xinyuan Zhao
Wei-Ye Zhao
Weihao Xia
Yujiu Yang
VOS
19
47
0
16 Jul 2022
Audio-Visual Segmentation
Jinxing Zhou
Jianyuan Wang
J. Zhang
Weixuan Sun
Jing Zhang
Stan Birchfield
Dan Guo
Lingpeng Kong
Meng Wang
Yiran Zhong
VOS
28
110
0
11 Jul 2022
Recurrent Dynamic Embedding for Video Object Segmentation
Mingxing Li
Liucheng Hu
Zhiwei Xiong
Bang Zhang
Pan Pan
Dong Liu
VOS
59
61
0
08 May 2022
Temporal Context for Robust Maritime Obstacle Detection
Lojze Žust
Matej Kristan
21
14
0
10 Mar 2022
UniFormer: Unifying Convolution and Self-attention for Visual Recognition
Kunchang Li
Yali Wang
Junhao Zhang
Peng Gao
Guanglu Song
Yu Liu
Hongsheng Li
Yu Qiao
ViT
142
361
0
24 Jan 2022
Video Transformers: A Survey
Javier Selva
A. S. Johansen
Sergio Escalera
Kamal Nasrollahi
T. Moeslund
Albert Clapés
ViT
20
102
0
16 Jan 2022
Reliable Propagation-Correction Modulation for Video Object Segmentation
Xiaohao Xu
Jinglu Wang
Xiao Li
Yan Lu
VOS
38
61
0
06 Dec 2021
SWAT: Spatial Structure Within and Among Tokens
Kumara Kahatapitiya
Michael S. Ryoo
23
6
0
26 Nov 2021
Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer
Yifan Xu
Zhijie Zhang
Mengdan Zhang
Kekai Sheng
Ke Li
Weiming Dong
Liqing Zhang
Changsheng Xu
Xing Sun
ViT
18
201
0
03 Aug 2021
A Survey on Deep Learning Technique for Video Segmentation
Tianfei Zhou
Fatih Porikli
David J. Crandall
Luc Van Gool
Wenguan Wang
VOS
20
230
0
02 Jul 2021
1