2302.01872
Cited By
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
3 February 2023
Henghui Ding
Chang Liu
Shuting He
Xudong Jiang
Philip H. S. Torr
S. Bai
VOS
Papers citing
"MOSE: A New Dataset for Video Object Segmentation in Complex Scenes"
50 / 108 papers shown
Title
UncertainSAM: Fast and Efficient Uncertainty Quantification of the Segment Anything Model
T. Kaiser
Thomas Norrenbrock
Bodo Rosenhahn
42
0
0
08 May 2025
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency
Mengshi Qi
Pengfei Zhu
X. Li
Xiaoyang Bi
Lu Qi
Huadong Ma
Ming Yang
VOS
VLM
42
0
0
16 Apr 2025
PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
Henghui Ding
Chang Liu
Nikhila Ravi
Shuting He
Y. Wei
...
Haobo Yuan
X. Li
Tao Zhang
Lu Qi
Ming Yang
25
0
0
15 Apr 2025
MASSeg : 2nd Technical Report for 4th PVUW MOSE Track
Xuqiang Cao
Linnan Zhao
Jiaxuan Zhao
Fang Liu
Puhua Chen
Wenping Ma
37
0
0
14 Apr 2025
FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution
Mengjiao Wang
Junpei Zhang
Xu Liu
Yuting Yang
Mengru Ma
VOS
50
0
0
13 Apr 2025
STSeg-Complex Video Object Segmentation: The 1st Solution for 4th PVUW MOSE Challenge
Kehuan Song
Xinglin Xie
Kexin Zhang
Licheng Jiao
Lingling Li
S. M. I. Simon X. Yang
VOS
45
0
0
11 Apr 2025
Studying Image Diffusion Features for Zero-Shot Video Object Segmentation
Thanos Delatolas
Vicky S. Kalogeiton
Dim P. Papadopoulos
DiffM
VOS
43
1
0
07 Apr 2025
The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video Segmentation
Hao Fang
Runmin Cong
Xiankai Lu
Z. Chen
Wei Zhang
29
0
0
07 Apr 2025
REEF: Relevance-Aware and Efficient LLM Adapter for Video Understanding
Sakib Reza
Xiyun Song
Heather Yu
Zongfang Lin
Mohsen Moghaddam
Octavia Camps
23
0
0
07 Apr 2025
Exploiting Temporal State Space Sharing for Video Semantic Segmentation
Syed Ariff Syed Hesham
Yun Liu
Guolei Sun
Henghui Ding
Jing Yang
Ender Konukoglu
Xue Geng
Xudong Jiang
49
1
0
26 Mar 2025
TransAnimate: Taming Layer Diffusion to Generate RGBA Video
Xuewei Chen
Zhimin Chen
Yiren Song
VGen
61
0
0
23 Mar 2025
InstructVEdit: A Holistic Approach for Instructional Video Editing
Chi Zhang
C. Feng
Feng Yan
Qiming Zhang
Mingjin Zhang
Yujie Zhong
Jing Zhang
Lin Ma
DiffM
VGen
39
0
0
22 Mar 2025
MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance
Quanhao Li
Zhen Xing
Rui Wang
Hui Zhang
Qi Dai
Zuxuan Wu
VGen
61
0
0
20 Mar 2025
QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge
Xuan Shen
Weize Ma
Jing Liu
Changdi Yang
Rui Ding
...
Wei Niu
Yanzhi Wang
Pu Zhao
Jun Lin
Jiuxiang Gu
MQ
52
0
0
20 Mar 2025
SAM2 for Image and Video Segmentation: A Comprehensive Survey
Zhang Jiaxing
Tang Hao
VLM
50
0
0
17 Mar 2025
VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control
Yuxuan Bian
Zhaoyang Zhang
Xuan Ju
Mingdeng Cao
Liangbin Xie
Ying Shan
Qiang Xu
VGen
DiffM
68
1
0
07 Mar 2025
Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation
Suhwan Cho
Seunghoon Lee
Minhyeok Lee
Jungho Lee
Sangyoun Lee
VOS
77
0
0
05 Mar 2025
Object-Aware Video Matting with Cross-Frame Guidance
H. Zhang
Dongyue Wu
Yuanjie Shao
Nong Sang
Changxin Gao
VOS
69
0
0
03 Mar 2025
SMITE: Segment Me In TimE
Amirhossein Alimohammadi
Sauradip Nag
Saeid Asgari Taghanaki
Andrea Tagliasacchi
Ghassan Hamarneh
Ali Mahdavi-Amiri
VLM
VOS
77
2
0
20 Feb 2025
E-MD3C: Taming Masked Diffusion Transformers for Efficient Zero-Shot Object Customization
T. Pham
Zhang Kang
Ji Woo Hong
Xuran Zheng
Chang D. Yoo
75
0
0
13 Feb 2025
EdgeTAM: On-Device Track Anything Model
Chong Zhou
Chenchen Zhu
Yunyang Xiong
Saksham Suri
Fanyi Xiao
...
Raghuraman Krishnamoorthi
Bo Dai
Chen Change Loy
Vikas Chandra
Bilge Soran
VLM
58
0
0
13 Jan 2025
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control
Yuanpeng Tu
Hao Luo
Xi Chen
S. Ji
Xiang Bai
Hengshuang Zhao
VGen
DiffM
42
3
0
08 Jan 2025
Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension
Yaxian Wang
Henghui Ding
Shuting He
Xudong Jiang
Bifan Wei
Jun Liu
ObjD
30
1
0
03 Jan 2025
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation
Zixuan Chen
Jiaxin Li
Liming Tan
Yejie Guo
Junxuan Liang
Cewu Lu
Y. Li
VOS
63
0
0
18 Dec 2024
A Distractor-Aware Memory for Visual Object Tracking with SAM2
Jovana Videnovic
A. Lukežič
Matej Kristan
VLM
81
1
0
26 Nov 2024
Context-Aware Input Orchestration for Video Inpainting
Hoyoung Kim
Azimbek Khudoyberdiev
Seonghwan Jeong
Jihoon Ryoo
76
0
0
25 Nov 2024
IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos
Yunong Liu
Cristobal Eyzaguirre
Manling Li
Shubh Khanna
Juan Carlos Niebles
Vineeth Ravi
Saumitra Mishra
Weiyu Liu
Jiajun Wu
76
0
0
18 Nov 2024
LiVOS: Light Video Object Segmentation with Gated Linear Matching
Qin Liu
Jianfeng Wang
Z. Yang
Linjie Li
Kevin Qinghong Lin
Marc Niethammer
Lijuan Wang
VOS
42
1
0
05 Nov 2024
Addressing Issues with Working Memory in Video Object Segmentation
Clayton Bromley
Alexander Moore
Amar Saini
Douglas Poland
Carmen Carrano
VOS
34
0
0
29 Oct 2024
BIFRÖST: 3D-Aware Image compositing with Language Instructions
Lingxiao Li
Kaixiong Gong
Weihong Li
Xili Dai
Tao Chen
Xiaojun Yuan
Xiangyu Yue
16
2
0
24 Oct 2024
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree
Shuangrui Ding
Rui Qian
Xiaoyi Dong
Pan Zhang
Yuhang Zang
Yuhang Cao
Yuwei Guo
Dahua Lin
Jiaqi Wang
VLM
VOS
26
8
0
21 Oct 2024
Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation
Muzhi Zhu
Yang Liu
Zekai Luo
Chenchen Jing
Hao Chen
Guangkai Xu
Xinlong Wang
Chunhua Shen
DiffM
VLM
29
3
0
03 Oct 2024
ChatVTG: Video Temporal Grounding via Chat with Video Dialogue Large Language Models
Mengxue Qu
Xiaodong Chen
Wu Liu
Alicia Li
Yao Zhao
37
13
0
01 Oct 2024
One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
Zechen Bai
Tong He
Haiyang Mei
Pichao Wang
Ziteng Gao
Joya Chen
Lei Liu
Zheng Zhang
Mike Zheng Shou
VLM
VOS
MLLM
32
17
0
29 Sep 2024
X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation
Pinxue Guo
Wanyun Li
Hao Huang
Lingyi Hong
Xinyu Zhou
Zhaoyu Chen
Jinglun Li
Kaixun Jiang
Wei Zhang
Wenqiang Zhang
VLM
VOS
26
2
0
28 Sep 2024
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding
Wenbo Wei
Jun Wang
Abhir Bhalerao
30
0
0
19 Sep 2024
LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation
Henghui Ding
Lingyi Hong
Chang Liu
Ning Xu
L. Yang
...
Bin Cao
Yisi Zhang
Hanyi Wang
Xingjian He
Jing Liu
VOS
24
2
0
09 Sep 2024
Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS
Deshui Miao
Yameng Gu
Xin Li
Zhenyu He
Yaowei Wang
Ming-Hsuan Yang
24
0
0
29 Aug 2024
CSS-Segment: 2nd Place Report of LSVOS Challenge VOS Track
Jinming Chai
Qin Ma
Junpei Zhang
Licheng Jiao
Fang Liu
VOS
26
0
0
24 Aug 2024
The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation
Tuyen Tran
29
2
0
22 Aug 2024
LSVOS Challenge 3rd Place Report: SAM2 and Cutie based VOS
Xinyu Liu
Jing Zhang
Kexin Zhang
Xu Liu
Lingling Li
18
1
0
20 Aug 2024
UNINEXT-Cutie: The 1st Solution for LSVOS Challenge RVOS Track
Hao Fang
Feiyu Pan
Xiankai Lu
Wei Zhang
Runmin Cong
23
3
0
19 Aug 2024
Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track
Feiyu Pan
Hao Fang
Runmin Cong
Wei Zhang
Xiankai Lu
VOS
30
3
0
19 Aug 2024
SAM 2: Segment Anything in Images and Videos
Nikhila Ravi
Valentin Gabeur
Yuan-Ting Hu
Ronghang Hu
Chaitanya K. Ryali
...
Nicolas Carion
Chao-Yuan Wu
Ross B. Girshick
Piotr Dollár
Christoph Feichtenhofer
VLM
MLLM
31
676
0
01 Aug 2024
Strike the Balance: On-the-Fly Uncertainty based User Interactions for Long-Term Video Object Segmentation
Stéphane Vujasinović
Kamil Dreczkowski
Sebastian Bullinger
Norbert Scherer-Negenborn
Michael Arens
VOS
17
1
0
31 Jul 2024
SegPoint: Segment Any Point Cloud via Large Language Model
Shuting He
Henghui Ding
Xudong Jiang
Bihan Wen
3DV
MLLM
3DPC
35
17
0
18 Jul 2024
VISA: Reasoning Video Object Segmentation via Large Language Models
Cilin Yan
Haochen Wang
Shilin Yan
Xiaolong Jiang
Yao Hu
Guoliang Kang
Weidi Xie
E. Gavves
LRM
VLM
VOS
32
28
0
16 Jul 2024
Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes
Yaoting Wang
Peiwen Sun
Dongzhan Zhou
Guangyao Li
Honggang Zhang
Di Hu
VOS
32
5
0
15 Jul 2024
Learning Spatial-Semantic Features for Robust Video Object Segmentation
Xin Li
Deshui Miao
Zhenyu He
Y. Wang
Huchuan Lu
Ming Yang
VOS
38
4
0
10 Jul 2024
Video Inpainting Localization with Contrastive Learning
Zijie Lou
Gang Cao
Man Lin
28
1
0
25 Jun 2024