2204.04656
Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation
Computer Vision and Pattern Recognition (CVPR), 2022
10 April 2022
Xiangtai Li
Wenwei Zhang
Jiangmiao Pang
Kai Chen
Guangliang Cheng
Yunhai Tong
Chen Change Loy
VOS
Github (153★)
Papers citing
"Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation"
50 / 74 papers shown
Click2Graph: Interactive Panoptic Video Scene Graphs from a Single Click
Raphael Ruschel
Hardikkumar Prajapati
Awsafur Rahman
B. S. Manjunath
261
0
0
20 Nov 2025
Explicit Memory through Online 3D Gaussian Splatting Improves Class-Agnostic Video Segmentation
IEEE Robotics and Automation Letters (IEEE RA-L), 2025
Anthony Opipari
Aravindhan K. Krishnan
Shreekant Gayaka
Min Sun
Cheng-Hao Kuo
Arnie Sen
Odest Chadwicke Jenkins
VOS
3DGS
338
0
0
27 Oct 2025
SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding
Zhiliu Yang
Jinyu Dai
Jianyuan Zhang
Zhu Yang
120
0
0
14 Oct 2025
UNO: Unifying One-stage Video Scene Graph Generation via Object-Centric Visual Representation Learning
Huy Le
Nhat Chung
Tung Kieu
Jingkang Yang
Ngan Le
VOS
OCL
480
1
0
07 Sep 2025
Autoregressive Universal Video Segmentation Model
Miran Heo
Sukjun Hwang
Min-Hung Chen
Y. Wang
Albert Gu
Seon Joo Kim
Ryo Hachiuma
VOS
242
1
0
26 Aug 2025
Local2Global query Alignment for Video Instance Segmentation
Rajat Koner
Zhipeng Wang
Srinivas Parthasarathy
Chinghang Chen
209
0
0
27 Jul 2025
CRUISE: Cooperative Reconstruction and Editing in V2X Scenarios using Gaussian Splatting
Haoran Xu
Saining Zhang
Peishuo Li
Baijun Ye
Xiaoxue Chen
...
Hang Zhao
Hao Tang
Hongyang Li
Kaicheng Yu
Hao Zhao
220
2
0
24 Jul 2025
A Comprehensive Survey on Video Scene Parsing: Advances, Challenges, and Prospects
Guohuan Xie
Syed Ariff Syed Hesham
Wenya Guo
Bing Li
Ming-Ming Cheng
Guolei Sun
Yun-Hai Liu
177
1
0
16 Jun 2025
Exploiting Temporal State Space Sharing for Video Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2025
Syed Ariff Syed Hesham
Yun-Hai Liu
Guolei Sun
Henghui Ding
Jing Yang
Ender Konukoglu
Xue Geng
Xudong Jiang
261
6
0
26 Mar 2025
Learning Appearance and Motion Cues for Panoptic Tracking
Juana Valeria Hurtado
Sajad Marvi
Rohit Mohan
Abhinav Valada
320
0
0
12 Mar 2025
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
Neural Information Processing Systems (NeurIPS), 2024
Hao Fei
Shengqiong Wu
Hao Zhang
Tat-Seng Chua
Shuicheng Yan
512
81
0
31 Dec 2024
Towards Open-Vocabulary Video Semantic Segmentation
IEEE Transactions on Multimedia (IEEE TMM), 2024
Xuelong Li
Yun-Hai Liu
Guolei Sun
Min Wu
Le Zhang
Ce Zhu
VLM
VOS
349
3
0
12 Dec 2024
Event-guided Low-light Video Semantic Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Zhen Yao
Mooi Choo Chuah
240
13
0
01 Nov 2024
Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation
IEEE Robotics and Automation Letters (RA-L), 2024
Anthony Opipari
Aravindhan K. Krishnan
Shreekant Gayaka
Min Sun
Cheng-Hao Kuo
Arnie Sen
Odest Chadwicke Jenkins
VOS
266
1
0
16 Oct 2024
Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?
Chen Liang
Qiang Guo
Xiaochao Qu
Luoqi Liu
Ting Liu
VOS
178
0
0
20 Aug 2024
Segment Anything for Videos: A Systematic Survey
Chunhui Zhang
Yawen Cui
Weilin Lin
Guanjie Huang
Yan Rong
Li Liu
Shiguang Shan
VLM
255
11
0
31 Jul 2024
PiPa++: Towards Unification of Domain Adaptive Semantic Segmentation via Self-supervised Learning
Mu Chen
Zhedong Zheng
Yi Yang
301
1
0
24 Jul 2024
ViLLa: Video Reasoning Segmentation with Large Language Model
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
VOS
LRM
519
16
0
18 Jul 2024
General and Task-Oriented Video Segmentation
Mu Chen
Liulei Li
Wenguan Wang
Ruijie Quan
Yi Yang
VOS
395
13
0
09 Jul 2024
CAVIS: Context-Aware Video Instance Segmentation
Seunghun Lee
Jiwan Seo
Kiljoon Han
Minwoo Choi
S. Im
VOS
380
4
0
03 Jul 2024
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding
Tao Zhang
Xiangtai Li
Hao Fei
Haobo Yuan
Shengqiong Wu
Shunping Ji
Chen Change Loy
Shuicheng Yan
LRM
MLLM
VLM
338
124
0
27 Jun 2024
1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation
Qingfeng Liu
Mostafa El-Khamy
Kee-Bong Song
258
0
0
08 Jun 2024
Semantic Segmentation on VSPW Dataset through Masked Video Consistency
Chen Liang
Qiang Guo
Chongkai Yu
Chengjing Wu
Ting Liu
Luoqi Liu
230
2
0
07 Jun 2024
3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
Ruipu Wu
Jifei Che
Han Li
Chengjing Wu
Ting Liu
Luoqi Liu
254
0
0
06 Jun 2024
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
Chaoyang Wang
Xiangtai Li
Lu Qi
Henghui Ding
Yunhai Tong
Ming-Hsuan Yang
DiffM
304
14
0
30 May 2024
4D Panoptic Scene Graph Generation
Neural Information Processing Systems (NeurIPS), 2024
Jingkang Yang
Jun Cen
Wenxuan Peng
Shuai Liu
Fangzhou Hong
Xiangtai Li
Kaiyang Zhou
Qifeng Chen
Ziwei Liu
191
23
0
16 May 2024
An Integrated Framework for Multi-Granular Explanation of Video Summarization
Frontiers in Signal Processing (FSP), 2024
K. Tsigos
Evlampios Apostolidis
Vasileios Mezaris
196
2
0
16 May 2024
Multi-Space Alignments Towards Universal LiDAR Segmentation
Computer Vision and Pattern Recognition (CVPR), 2024
You-Chen Liu
Lingdong Kong
Xiaoyang Wu
Runnan Chen
Xin Li
Liang Pan
Ziwei Liu
Yuexin Ma
3DPC
293
30
0
02 May 2024
Explore In-Context Segmentation via Latent Diffusion Models
AAAI Conference on Artificial Intelligence (AAAI), 2024
Chaoyang Wang
Xiangtai Li
Henghui Ding
Lu Qi
Jiangning Zhang
Yunhai Tong
Chen Change Loy
Shuicheng Yan
DiffM
386
14
0
14 Mar 2024
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
Ming-hui Li
Shuai Li
Xindong Zhang
Lei Zhang
VOS
255
31
0
28 Feb 2024
Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes
Computer Vision and Pattern Recognition (CVPR), 2024
Diandian Guo
Deng-Ping Fan
Tongyu Lu
Daniel Gehrig
Luc Van Gool
VOS
316
10
0
27 Jan 2024
OMG-Seg: Is One Model Good Enough For All Segmentation?
Xiangtai Li
Haobo Yuan
Wei Li
Henghui Ding
Size Wu
Wenwei Zhang
Yining Li
Kai Chen
Chen Change Loy
VLM
MLLM
ViT
311
107
0
18 Jan 2024
RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything
Shilin Xu
Haobo Yuan
Qingyu Shi
Lu Qi
Jingbo Wang
...
Kai Chen
Yunhai Tong
Guohao Li
Xiangtai Li
Ming-Hsuan Yang
VLM
118
8
0
18 Jan 2024
Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting
European Conference on Computer Vision (ECCV), 2024
Yunzhi Yan
Haotong Lin
Chenxu Zhou
Weijie Wang
Haiyang Sun
Kun Zhan
Xianpeng Lang
Xiaowei Zhou
Sida Peng
3DGS
339
153
0
02 Jan 2024
A Simple Video Segmenter by Tracking Objects Along Axial Trajectories
Ju He
Qihang Yu
Inkyu Shin
XueQing Deng
Yaoyao Liu
Xiaohui Shen
Liang-Chieh Chen
VOS
335
3
0
30 Nov 2023
Panoptic Video Scene Graph Generation
Computer Vision and Pattern Recognition (CVPR), 2023
Jingkang Yang
Wen-Hsiao Peng
Xiangtai Li
Zujin Guo
Liangyu Chen
...
Zheng Ma
Kaiyang Zhou
Wayne Zhang
Chen Change Loy
Ziwei Liu
VOS
324
54
0
28 Nov 2023
Enriching Phrases with Coupled Pixel and Object Contexts for Panoptic Narrative Grounding
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Tianrui Hui
Zihan Ding
Junshi Huang
Xiaoming Wei
Xiaolin K. Wei
Jiao Dai
Jizhong Han
Si Liu
317
7
0
02 Nov 2023
OV-VG: A Benchmark for Open-Vocabulary Visual Grounding
Chunlei Wang
Wenquan Feng
Xiangtai Li
Guangliang Cheng
Shuchang Lyu
Binghao Liu
Lijiang Chen
Qi Zhao
ObjD
VLM
278
14
0
22 Oct 2023
Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation
British Machine Vision Conference (BMVC), 2023
Zhaochong An
Guolei Sun
Zongwei Wu
Hao Tang
Luc Van Gool
VOS
219
6
0
14 Sep 2023
Tracking Anything with Decoupled Video Segmentation
IEEE International Conference on Computer Vision (ICCV), 2023
Ho Kei Cheng
Seoung Wug Oh
Brian L. Price
Alexander Schwing
Joon-Young Lee
VOS
VLM
312
209
0
07 Sep 2023
Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation
IEEE International Conference on Computer Vision (ICCV), 2023
Yuanyou Xu
Zongxin Yang
Yi Yang
VOS
315
17
0
25 Aug 2023
Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations
IEEE International Conference on Computer Vision (ICCV), 2023
Mohammadreza Salehi
E. Gavves
Cees G. M. Snoek
Yuki M. Asano
VOS
240
30
0
22 Aug 2023
Prototypical Kernel Learning and Open-set Foreground Perception for Generalized Few-shot Semantic Segmentation
IEEE International Conference on Computer Vision (ICCV), 2023
Kai Huang
Haiwei Yang
Ye Xi
Yutao Gao
248
15
0
09 Aug 2023
Incorporating Pre-training Data Matters in Unsupervised Domain Adaptation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yinsong Xu
Aidong Men
Yang Liu
Xiahai Zhuang
Qingchao Chen
322
3
0
06 Aug 2023
CTVIS: Consistent Training for Online Video Instance Segmentation
IEEE International Conference on Computer Vision (ICCV), 2023
Kaining Ying
Qing Zhong
Wei Mao
Zhenhua Wang
Hao Chen
Lin Yuanbo Wu
Yifan Liu
Chengxiang Fan
Yunzhi Zhuge
Chunhua Shen
338
63
0
24 Jul 2023
Iterative Robust Visual Grounding with Masked Reference based Centerpoint Supervision
Menghao Li
Chunlei Wang
W. Feng
Shuchang Lyu
Guangliang Cheng
Xiangtai Li
Binghao Liu
Qi Zhao
277
7
0
23 Jul 2023
Pair then Relation: Pair-Net for Panoptic Scene Graph Generation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jinghao Wang
Zhengyu Wen
Xiangtai Li
Zujin Guo
Jingkang Yang
Ziwei Liu
228
28
0
17 Jul 2023
Dense Video Object Captioning from Disjoint Supervision
International Conference on Learning Representations (ICLR), 2023
Xingyi Zhou
Anurag Arnab
Chen Sun
Cordelia Schmid
288
7
0
20 Jun 2023
3rd Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation
Jinming Su
Wangwang Yang
Junfeng Luo
Xiaolin K. Wei
347
1
0
11 Jun 2023
1st Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation
Zhang Tao
Xingye Tian
Haoran Wei
Yuehua Wu
Shunping Ji
Xuebo Wang
Xin Tao
Yuanhui Zhang
Pengfei Wan
303
7
0
07 Jun 2023