Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.18115
Cited By
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
28 February 2024
Ming-hui Li
Shuai Li
Xindong Zhang
Lei Zhang
VOS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"UniVS: Unified and Universal Video Segmentation with Prompts as Queries"
17 / 17 papers shown
Title
Foundation Model-Driven Framework for Human-Object Interaction Prediction with Segmentation Mask Integration
Juhan Park
Kyungjae Lee
Hyung Jin Chang
Jungchan Cho
VLM
66
0
0
28 Apr 2025
VRMDiff: Text-Guided Video Referring Matting Generation of Diffusion
Lehan Yang
Jincen Song
Tianlong Wang
Daiqing Qi
Weili Shi
Yuheng Liu
Sheng Li
DiffM
VOS
VGen
69
0
0
11 Mar 2025
SMITE: Segment Me In TimE
Amirhossein Alimohammadi
Sauradip Nag
Saeid Asgari Taghanaki
Andrea Tagliasacchi
Ghassan Hamarneh
Ali Mahdavi-Amiri
VLM
VOS
77
2
0
20 Feb 2025
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Haobo Yuan
X. Li
Tao Zhang
Zilong Huang
Shilin Xu
S. Ji
Yunhai Tong
Lu Qi
Jiashi Feng
Ming Yang
VLM
85
11
0
07 Jan 2025
LQ-Adapter: ViT-Adapter with Learnable Queries for Gallbladder Cancer Detection from Ultrasound Image
Chetan Madan
Mayuna Gupta
Soumen Basu
Pankaj Gupta
Chetan Arora
78
0
0
30 Nov 2024
Open-Vocabulary Audio-Visual Semantic Segmentation
Zhenghao Zhang
Junchao Liao
Dantong Niu
Yanyu Qi
Menghao Li
Ji Shi
Bowei Xing
Xianghua Ying
VOS
VLM
24
7
0
31 Jul 2024
A Unified Framework for 3D Scene Understanding
Wei Xu
Chunsheng Shi
Sifan Tu
Xin Zhou
Dingkang Liang
Xiang Bai
VOS
21
3
0
03 Jul 2024
3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
Ruipu Wu
Jifei Che
Han Li
Chengjing Wu
Ting Liu
Luoqi Liu
36
0
0
06 Jun 2024
Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models
Qian Wang
Abdelrahman Eldesokey
Mohit Mendiratta
Fangneng Zhan
Adam Kortylewski
Christian Theobalt
Peter Wonka
DiffM
34
4
0
27 May 2024
Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models
Yabin Zhang
Wen-Qing Zhu
Hui Tang
Zhiyuan Ma
Kaiyang Zhou
Lei Zhang
VLM
29
20
0
26 Mar 2024
R
2
\text{R}^2
R
2
-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations
Xiang Li
Kai Qiu
Jinglu Wang
Xiaohao Xu
Rita Singh
Kashu Yamazaki
Hao Chen
Xiaonan Huang
Bhiksha Raj
VOS
29
1
0
07 Mar 2024
OpenSD: Unified Open-Vocabulary Segmentation and Detection
Shuai Li
Ming-hui Li
Pengfei Wang
Lei Zhang
ObjD
VLM
24
6
0
10 Dec 2023
SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
Rongyuan Wu
Tao Yang
Lingchen Sun
Zhengqiang Zhang
Shuai Li
Lei Zhang
DiffM
SupR
19
124
0
27 Nov 2023
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Bin Lin
Yang Ye
Bin Zhu
Jiaxi Cui
Munan Ning
Peng Jin
Li-ming Yuan
VLM
MLLM
185
576
0
16 Nov 2023
TubeFormer-DeepLab: Video Mask Transformer
Dahun Kim
Jun Xie
Huiyu Wang
Siyuan Qiao
Qihang Yu
Hong-Seok Kim
Hartwig Adam
In So Kweon
Liang-Chieh Chen
ViT
MedIm
75
40
0
30 May 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
382
4,010
0
28 Jan 2022
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
229
74,467
0
18 May 2015
1