ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.03903
  4. Cited By
Tracking Anything with Decoupled Video Segmentation

Tracking Anything with Decoupled Video Segmentation

7 September 2023
Ho Kei Cheng
Seoung Wug Oh
Brian L. Price
Alexander Schwing
Joon-Young Lee
    VOS
    VLM
ArXivPDFHTML

Papers citing "Tracking Anything with Decoupled Video Segmentation"

48 / 98 papers shown
Title
Strike the Balance: On-the-Fly Uncertainty based User Interactions for
  Long-Term Video Object Segmentation
Strike the Balance: On-the-Fly Uncertainty based User Interactions for Long-Term Video Object Segmentation
Stéphane Vujasinović
Kamil Dreczkowski
Sebastian Bullinger
Norbert Scherer-Negenborn
Michael Arens
VOS
17
1
0
31 Jul 2024
Segment Anything for Videos: A Systematic Survey
Segment Anything for Videos: A Systematic Survey
Chunhui Zhang
Yawen Cui
Weilin Lin
Guanjie Huang
Yan Rong
Li Liu
Shiguang Shan
VLM
39
6
0
31 Jul 2024
Click-Gaussian: Interactive Segmentation to Any 3D Gaussians
Click-Gaussian: Interactive Segmentation to Any 3D Gaussians
Seokhun Choi
H. Song
Jaechul Kim
Taehyeong Kim
Hoseok Do
3DGS
33
18
0
16 Jul 2024
FoodMem: Near Real-time and Precise Food Video Segmentation
FoodMem: Near Real-time and Precise Food Video Segmentation
Ahmad AlMughrabi
Adrián Galán
Ricardo Marques
P. Radeva
VOS
29
0
0
16 Jul 2024
Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes
Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes
Yaoting Wang
Peiwen Sun
Dongzhan Zhou
Guangyao Li
Honggang Zhang
Di Hu
VOS
35
5
0
15 Jul 2024
StyleSplat: 3D Object Style Transfer with Gaussian Splatting
StyleSplat: 3D Object Style Transfer with Gaussian Splatting
Sahil Jain
Avik Kuthiala
P. Sethi
Prakanshul Saxena
3DGS
22
4
0
12 Jul 2024
ActionVOS: Actions as Prompts for Video Object Segmentation
ActionVOS: Actions as Prompts for Video Object Segmentation
Liangyang Ouyang
Ruicong Liu
Yifei Huang
Ryosuke Furuta
Yoichi Sato
VOS
31
2
0
10 Jul 2024
Learning Spatial-Semantic Features for Robust Video Object Segmentation
Learning Spatial-Semantic Features for Robust Video Object Segmentation
Xin Li
Deshui Miao
Zhenyu He
Y. Wang
Huchuan Lu
Ming Yang
VOS
41
4
0
10 Jul 2024
General and Task-Oriented Video Segmentation
General and Task-Oriented Video Segmentation
Mu Chen
Liulei Li
Wenguan Wang
Ruijie Quan
Yi Yang
VOS
48
4
0
09 Jul 2024
Addressing single object tracking in satellite imagery through
  prompt-engineered solutions
Addressing single object tracking in satellite imagery through prompt-engineered solutions
Athena Psalta
Vasileios Tsironis
Andreas El Saer
K. Karantzalos
28
0
0
07 Jul 2024
Segment Any 4D Gaussians
Segment Any 4D Gaussians
Shengxiang Ji
Guanjun Wu
Jiemin Fang
Jiazhong Cen
Taoran Yi
Wenyu Liu
Qi Tian
Xinggang Wang
3DGS
28
7
0
05 Jul 2024
EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data
  Efficient Learning
EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data Efficient Learning
Jingyun Yang
Zi-ang Cao
Congyue Deng
Rika Antonova
Shuran Song
Jeannette Bohg
DiffM
44
2
0
01 Jul 2024
PanopticRecon: Leverage Open-vocabulary Instance Segmentation for
  Zero-shot Panoptic Reconstruction
PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction
Xuan Yu
Yili Liu
Chenrui Han
Sitong Mao
Shunbo Zhou
R. Xiong
Yiyi Liao
Yue Wang
ISeg
36
2
0
01 Jul 2024
Tarsier: Recipes for Training and Evaluating Large Video Description
  Models
Tarsier: Recipes for Training and Evaluating Large Video Description Models
Jiawei Wang
Liping Yuan
Yuchen Zhang
29
52
0
30 Jun 2024
CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative
  Object REarrangement
CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangement
Chengwen Zhang
Yun-Hai Liu
Ruofan Xing
Bingda Tang
Li Yi
33
10
0
27 Jun 2024
VLM Agents Generate Their Own Memories: Distilling Experience into Embodied Programs of Thought
VLM Agents Generate Their Own Memories: Distilling Experience into Embodied Programs of Thought
Gabriel H. Sarch
Lawrence Jang
Michael J. Tarr
William W. Cohen
Kenneth Marino
Katerina Fragkiadaki
LLMAG
31
0
0
20 Jun 2024
GroPrompt: Efficient Grounded Prompting and Adaptation for Referring
  Video Object Segmentation
GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation
Ci-Siang Lin
I-Jieh Liu
Min-Hung Chen
Chien-Yi Wang
Sifei Liu
Yu-Chiang Frank Wang
VOS
34
0
0
18 Jun 2024
Zero-Shot Scene Change Detection
Zero-Shot Scene Change Detection
Kyusik Cho
Dong Yeop Kim
Euntai Kim
28
1
0
17 Jun 2024
GPT-4o: Visual perception performance of multimodal large language
  models in piglet activity understanding
GPT-4o: Visual perception performance of multimodal large language models in piglet activity understanding
Yiqi Wu
Xiaodan Hu
Ziming Fu
Siling Zhou
Jiangong Li
MLLM
22
9
0
14 Jun 2024
2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex
  Video Object Segmentation
2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation
Zhensong Xu
Jiangtao Yao
Chengjing Wu
Ting Liu
Luoqi Liu
18
1
0
12 Jun 2024
InfoGaussian: Structure-Aware Dynamic Gaussians through Lightweight
  Information Shaping
InfoGaussian: Structure-Aware Dynamic Gaussians through Lightweight Information Shaping
Yunchao Zhang
Guandao Yang
Leonidas J. Guibas
Yanchao Yang
3DGS
31
1
0
09 Jun 2024
3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
Ruipu Wu
Jifei Che
Han Li
Chengjing Wu
Ting Liu
Luoqi Liu
36
0
0
06 Jun 2024
Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting
Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting
Inkyu Shin
Qihang Yu
Xiaohui Shen
In So Kweon
KuK-Jin Yoon
Liang-Chieh Chen
VGen
DiffM
63
1
0
04 Jun 2024
MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion
  Scaffolds
MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion Scaffolds
Jiahui Lei
Yijia Weng
Adam W. Harley
Leonidas J. Guibas
Kostas Daniilidis
3DGS
40
37
0
27 May 2024
LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation
LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation
Lingyi Hong
Zhongying Liu
Wenchao Chen
Chenzhi Tan
Yuang Feng
...
Jinglun Li
Zhaoyu Chen
Shuyong Gao
Wei Zhang
Wenqiang Zhang
VLM
VOS
29
12
0
30 Apr 2024
OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian
  Segmentation
OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian Segmentation
Lizhi Wang
Feng Zhou
Jianqin Yin
3DGS
29
0
0
24 Apr 2024
CLIP-GS: CLIP-Informed Gaussian Splatting for Real-time and
  View-consistent 3D Semantic Understanding
CLIP-GS: CLIP-Informed Gaussian Splatting for Real-time and View-consistent 3D Semantic Understanding
Guibiao Liao
Jiankun Li
Zhenyu Bao
Xiaoqing Ye
Jingdong Wang
Qing Li
Kanglin Liu
3DGS
27
13
0
22 Apr 2024
Moving Object Segmentation: All You Need Is SAM (and Flow)
Moving Object Segmentation: All You Need Is SAM (and Flow)
Junyu Xie
Charig Yang
Weidi Xie
Andrew Zisserman
39
9
0
18 Apr 2024
Empowering Embodied Visual Tracking with Visual Foundation Models and
  Offline RL
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL
Fangwei Zhong
Kui Wu
Hai Ci
Churan Wang
Hao Chen
OffRL
29
2
0
15 Apr 2024
Deep Instruction Tuning for Segment Anything Model
Deep Instruction Tuning for Segment Anything Model
Xiaorui Huang
Gen Luo
Chaoyang Zhu
Bo Tong
Yiyi Zhou
Xiaoshuai Sun
Rongrong Ji
VLM
36
1
0
31 Mar 2024
Annolid: Annotate, Segment, and Track Anything You Need
Annolid: Annotate, Segment, and Track Anything You Need
Chen Yang
Thomas A. Cleland
VOS
18
2
0
27 Mar 2024
Efficient Video Object Segmentation via Modulated Cross-Attention Memory
Efficient Video Object Segmentation via Modulated Cross-Attention Memory
Abdelrahman M. Shaker
Syed Talal Wasim
Martin Danelljan
Salman Khan
Ming-Hsuan Yang
Fahad Shahbaz Khan
VOS
18
3
0
26 Mar 2024
Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian
  Splatting
Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting
Jun Guo
Xiaojian Ma
Yue Fan
Huaping Liu
Qing Li
3DGS
36
26
0
22 Mar 2024
Video Object Segmentation with Dynamic Query Modulation
Video Object Segmentation with Dynamic Query Modulation
Hantao Zhou
Runze Hu
Xiu Li
VOS
28
1
0
18 Mar 2024
Augmenting Efficient Real-time Surgical Instrument Segmentation in Video
  with Point Tracking and Segment Anything
Augmenting Efficient Real-time Surgical Instrument Segmentation in Video with Point Tracking and Segment Anything
Zijian Wu
Adam Schmidt
Peter Kazanzides
Septimiu E. Salcudean
35
2
0
12 Mar 2024
$\text{R}^2$-Bench: Benchmarking the Robustness of Referring Perception
  Models under Perturbations
R2\text{R}^2R2-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations
Xiang Li
Kai Qiu
Jinglu Wang
Xiaohao Xu
Rita Singh
Kashu Yamazaki
Hao Chen
Xiaonan Huang
Bhiksha Raj
VOS
32
1
0
07 Mar 2024
BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning
  of SAM
BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning of SAM
Li Zhang
Youwei Liang
Ruiyi Zhang
Amirhosein Javadi
Pengtao Xie
VLM
19
8
0
26 Feb 2024
Boximator: Generating Rich and Controllable Motions for Video Synthesis
Boximator: Generating Rich and Controllable Motions for Video Synthesis
Jiawei Wang
Yuchen Zhang
Jiaxin Zou
Yan Zeng
Guoqiang Wei
Liping Yuan
Hang Li
DiffM
VGen
19
42
0
02 Feb 2024
SAGD: Boundary-Enhanced Segment Anything in 3D Gaussian via Gaussian Decomposition
SAGD: Boundary-Enhanced Segment Anything in 3D Gaussian via Gaussian Decomposition
Xu Hu
Yuxi Wang
Lue Fan
Junsong Fan
Junran Peng
Zhen Lei
Qing Li
Zhaoxiang Zhang
Zhaoxiang Zhang
3DGS
39
7
0
31 Jan 2024
Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks
Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks
Tianhe Ren
Shilong Liu
Ailing Zeng
Jing Lin
Kunchang Li
...
Feng Li
Jie-jin Yang
Hongyang Li
Qing Jiang
Lei Zhang
VLM
35
358
0
25 Jan 2024
Gaussian Grouping: Segment and Edit Anything in 3D Scenes
Gaussian Grouping: Segment and Edit Anything in 3D Scenes
Mingqiao Ye
Martin Danelljan
Fisher Yu
Lei Ke
3DGS
DiffM
14
61
0
01 Dec 2023
Audio-Visual Instance Segmentation
Audio-Visual Instance Segmentation
Ruohao Guo
Yaru Chen
Yanyu Qi
Wenzhen Yue
Dantong Niu
...
Wenzhen Yue
Ji Shi
Qixun Wang
Peiliang Zhang
Buwen Liang
VLM
VOS
26
2
0
28 Oct 2023
EdgeCalib: Multi-Frame Weighted Edge Features for Automatic Targetless
  LiDAR-Camera Calibration
EdgeCalib: Multi-Frame Weighted Edge Features for Automatic Targetless LiDAR-Camera Calibration
Xingchen Li
Yifan Duan
Beibei Wang
Haojie Ren
Guoliang You
Yu Sheng
Jianmin Ji
Yanyong Zhang
14
3
0
25 Oct 2023
CoralVOS: Dataset and Benchmark for Coral Video Segmentation
CoralVOS: Dataset and Benchmark for Coral Video Segmentation
Ziqiang Zheng
Yaofeng Xie
Haixin Liang
Zhibin Yu
Sai-Kit Yeung
VOS
28
7
0
03 Oct 2023
Weak Supervision for Label Efficient Visual Bug Detection
Weak Supervision for Label Efficient Visual Bug Detection
F. Rahman
11
2
0
20 Sep 2023
VLT: Vision-Language Transformer and Query Generation for Referring
  Segmentation
VLT: Vision-Language Transformer and Query Generation for Referring Segmentation
Henghui Ding
Chang Liu
Suchen Wang
Xudong Jiang
63
115
0
28 Oct 2022
Unsupervised Video Object Segmentation via Prototype Memory Network
Unsupervised Video Object Segmentation via Prototype Memory Network
Minhyeok Lee
Suhwan Cho
Seung-Hyun Lee
Chaewon Park
Sangyoun Lee
VOS
31
35
0
08 Sep 2022
UnOVOST: Unsupervised Offline Video Object Segmentation and Tracking
UnOVOST: Unsupervised Offline Video Object Segmentation and Tracking
Jonathon Luiten
Idil Esen Zulfikar
Bastian Leibe
VOS
122
62
0
15 Jan 2020
Previous
12