ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.03903
  4. Cited By
Tracking Anything with Decoupled Video Segmentation

Tracking Anything with Decoupled Video Segmentation

7 September 2023
Ho Kei Cheng
Seoung Wug Oh
Brian L. Price
Alexander Schwing
Joon-Young Lee
    VOS
    VLM
ArXivPDFHTML

Papers citing "Tracking Anything with Decoupled Video Segmentation"

50 / 98 papers shown
Title
Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model
Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model
Navin Ranjan
Andreas E. Savakis
MQ
VLM
61
0
0
08 May 2025
Grounding Task Assistance with Multimodal Cues from a Single Demonstration
Grounding Task Assistance with Multimodal Cues from a Single Demonstration
Gabriel Sarch
Balasaravanan Thoravi Kumaravel
Sahithya Ravi
Vibhav Vineet
A. D. Wilson
55
0
0
02 May 2025
CIVIL: Causal and Intuitive Visual Imitation Learning
CIVIL: Causal and Intuitive Visual Imitation Learning
Yinlong Dai
Robert Ramirez Sanchez
Ryan Jeronimus
Shahabedin Sagheb
Cara M. Nunez
Heramb Nemlekar
Dylan P. Losey
58
0
0
24 Apr 2025
Visibility-Uncertainty-guided 3D Gaussian Inpainting via Scene Conceptional Learning
Visibility-Uncertainty-guided 3D Gaussian Inpainting via Scene Conceptional Learning
Mingxuan Cui
Qing Guo
Y. Wang
Hongkai Yu
D. Lin
Q. Zou
Ming-Ming Cheng
X. Li
3DGS
43
0
0
23 Apr 2025
Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting
Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting
Daiwei Zhang
Joaquin Gajardo
Tomislav Medic
Isinsu Katircioglu
Mike Boss
Norbert Kirchgessner
Achim Walter
Lukas Roth
27
0
0
09 Apr 2025
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting
Yunlong Tang
Jing Bi
Chao Huang
Susan Liang
Daiki Shimada
...
Jinxi He
Liu He
Zeliang Zhang
Jiebo Luo
Chenliang Xu
34
0
0
07 Apr 2025
Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments
Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments
Chenyu Zhang
Daniil Cherniavskii
Andrii Zadaianchuk
Antonios Tragoudaras
Antonios Vozikis
Thijmen Nijdam
Derck W. E. Prinzhorn
Mark Bodracska
N. Sebe
E. Gavves
EGVM
VGen
46
0
0
03 Apr 2025
EndoLRMGS: Complete Endoscopic Scene Reconstruction combining Large Reconstruction Modelling and Gaussian Splatting
EndoLRMGS: Complete Endoscopic Scene Reconstruction combining Large Reconstruction Modelling and Gaussian Splatting
X. Wang
Shuai Zhang
Baoru Huang
Danail Stoyanov
E. Mazomenos
3DGS
26
1
0
28 Mar 2025
Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video
Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video
David Yifan Yao
Albert Zhai
Shenlong Wang
VGen
46
1
0
27 Mar 2025
Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting
Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting
Runsong Zhu
Shi Qiu
Zhengzhe Liu
Ka-Hei Hui
Qianyi Wu
Pheng Ann Heng
Chi-Wing Fu
3DGS
3DV
88
1
0
18 Mar 2025
SAM2 for Image and Video Segmentation: A Comprehensive Survey
SAM2 for Image and Video Segmentation: A Comprehensive Survey
Zhang Jiaxing
Tang Hao
VLM
50
0
0
17 Mar 2025
SPOC: Spatially-Progressing Object State Change Segmentation in Video
SPOC: Spatially-Progressing Object State Change Segmentation in Video
Priyanka Mandikal
Tushar Nagarajan
Alex Stoken
Zihui Xue
Kristen Grauman
44
0
0
15 Mar 2025
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models
Wanhua Li
Renping Zhou
Jiawei Zhou
Yingwei Song
Johannes Herter
Minghan Qin
Gao Huang
Hanspeter Pfister
3DGS
VLM
66
0
0
13 Mar 2025
GaussianGraph: 3D Gaussian-based Scene Graph Generation for Open-world Scene Understanding
Xihan Wang
Dianyi Yang
Yu Gao
Yufeng Yue
Yi Yang
M. Fu
3DGS
49
0
0
06 Mar 2025
OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding
Dianyi Yang
Yu Gao
Xihan Wang
Yufeng Yue
Yi Yang
M. Fu
3DGS
64
1
0
03 Mar 2025
Gaussian Difference: Find Any Change Instance in 3D Scenes
Gaussian Difference: Find Any Change Instance in 3D Scenes
Binbin Jiang
Rui Huang
Qingyi Zhao
Yuxiang Zhang
38
0
0
24 Feb 2025
Pointmap Association and Piecewise-Plane Constraint for Consistent and Compact 3D Gaussian Segmentation Field
Pointmap Association and Piecewise-Plane Constraint for Consistent and Compact 3D Gaussian Segmentation Field
Wenhao Hu
Wenhao Chai
Shengyu Hao
Xiaotong Cui
Xuexiang Wen
Jenq-Neng Hwang
Gaoang Wang
3DV
53
0
0
22 Feb 2025
Slot-BERT: Self-supervised Object Discovery in Surgical Video
Slot-BERT: Self-supervised Object Discovery in Surgical Video
Guiqiu Liao
M. Jogan
Marcel Hussing
Kenta Nakahashi
Kazuhiro Yasufuku
Amin Madani
Eric Eaton
Daniel A. Hashimoto
51
0
0
21 Jan 2025
OmniSplat: Taming Feed-Forward 3D Gaussian Splatting for Omnidirectional Images with Editable Capabilities
OmniSplat: Taming Feed-Forward 3D Gaussian Splatting for Omnidirectional Images with Editable Capabilities
Suyoung Lee
Jaeyoung Chung
Kihoon Kim
Jaeyoo Huh
G. Lee
Minsoo Lee
Kyoung Mu Lee
3DGS
84
0
0
21 Dec 2024
Bootstraping Clustering of Gaussians for View-consistent 3D Scene
  Understanding
Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding
W. Zhang
Lu Zhang
Ping Hu
Liqian Ma
Yunzhi Zhuge
Huchuan Lu
3DGS
68
2
0
29 Nov 2024
T-3DGS: Removing Transient Objects for 3D Scene Reconstruction
T-3DGS: Removing Transient Objects for 3D Scene Reconstruction
Vadim Pryadilshchikov
Alexander Markin
Artem Komarichev
Ruslan Rakhimov
Peter Wonka
Evgeny Burnaev
3DGS
74
1
0
29 Nov 2024
SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking
  with Motion-Aware Memory
SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory
Cheng-Yen Yang
Hsiang-Wei Huang
Wenhao Chai
Zhongyu Jiang
Jenq-Neng Hwang
VLM
85
16
0
18 Nov 2024
DGS-SLAM: Gaussian Splatting SLAM in Dynamic Environment
Mangyu Kong
Jaewon Lee
Seongwon Lee
Euntai Kim
3DGS
26
1
0
16 Nov 2024
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
Andong Deng
Tongjia Chen
Shoubin Yu
Taojiannan Yang
Lincoln Spencer
Yapeng Tian
Ajmal Saeed Mian
Mohit Bansal
Chen Chen
LRM
54
1
0
15 Nov 2024
LiVOS: Light Video Object Segmentation with Gated Linear Matching
LiVOS: Light Video Object Segmentation with Gated Linear Matching
Qin Liu
Jianfeng Wang
Z. Yang
Linjie Li
Kevin Qinghong Lin
Marc Niethammer
Lijuan Wang
VOS
42
1
0
05 Nov 2024
AutoVFX: Physically Realistic Video Editing from Natural Language
  Instructions
AutoVFX: Physically Realistic Video Editing from Natural Language Instructions
Hao-Yu Hsu
Zhi-Hao Lin
Albert Zhai
Hongchi Xia
Shenlong Wang
VGen
40
9
0
04 Nov 2024
GenXD: Generating Any 3D and 4D Scenes
GenXD: Generating Any 3D and 4D Scenes
Yuyang Zhao
Chung-Ching Lin
Kevin Qinghong Lin
Zhiwen Yan
Linjie Li
Z. Yang
Jianfeng Wang
G. Lee
Lijuan Wang
VGen
43
14
0
04 Nov 2024
Event-guided Low-light Video Semantic Segmentation
Event-guided Low-light Video Semantic Segmentation
Zhen Yao
Mooi Choo Choo Chuah
43
6
0
01 Nov 2024
ReferEverything: Towards Segmenting Everything We Can Speak of in Videos
ReferEverything: Towards Segmenting Everything We Can Speak of in Videos
Anurag Bagchi
Zhipeng Bao
Yu-xiong Wang
P. Tokmakov
Martial Hebert
VOS
25
0
0
30 Oct 2024
Scaling Robot Policy Learning via Zero-Shot Labeling with Foundation
  Models
Scaling Robot Policy Learning via Zero-Shot Labeling with Foundation Models
Nils Blank
Moritz Reuss
Marcel Rühle
Ömer Erdinç Yagmurlu
Fabian Wenzel
Oier Mees
Rudolf Lioutikov
LM&Ro
OffRL
29
3
0
23 Oct 2024
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a
  Training-Free Memory Tree
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree
Shuangrui Ding
Rui Qian
Xiaoyi Dong
Pan Zhang
Yuhang Zang
Yuhang Cao
Yuwei Guo
Dahua Lin
Jiaqi Wang
VLM
VOS
29
8
0
21 Oct 2024
BYOCL: Build Your Own Consistent Latent with Hierarchical Representative Latent Clustering
BYOCL: Build Your Own Consistent Latent with Hierarchical Representative Latent Clustering
Jiayue Dai
Yunya Wang
Yihan Fang
Yuetong Chen
Butian Xiong
VLM
24
0
0
19 Oct 2024
DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise
  Motion Control
DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
Yujie Wei
Shiwei Zhang
Hangjie Yuan
Xiang Wang
Haonan Qiu
...
F. Liu
Zhizhong Huang
Jiaxin Ye
Yingya Zhang
Hongming Shan
DiffM
VGen
67
14
0
17 Oct 2024
Configurable Embodied Data Generation for Class-Agnostic RGB-D Video
  Segmentation
Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation
Anthony Opipari
Aravindhan K. Krishnan
Shreekant Gayaka
Min Sun
Cheng-Hao Kuo
Arnie Sen
Odest Chadwicke Jenkins
VOS
30
0
0
16 Oct 2024
3D Gaussian Splatting in Robotics: A Survey
3D Gaussian Splatting in Robotics: A Survey
Siting Zhu
Guangming Wang
Dezhi Kong
Hesheng Wang
3DGS
38
6
0
16 Oct 2024
Multiview Scene Graph
Multiview Scene Graph
Juexiao Zhang
Gao Zhu
Sihang Li
Xinhao Liu
Haorui Song
Xinran Tang
Chen Feng
3DV
18
1
0
15 Oct 2024
VideoSAM: Open-World Video Segmentation
VideoSAM: Open-World Video Segmentation
Pinxue Guo
Zixu Zhao
Jianxiong Gao
Chongruo Wu
Tong He
Zheng Zhang
Tianjun Xiao
Wenqiang Zhang
VOS
21
0
0
11 Oct 2024
Learning to Generate Diverse Pedestrian Movements from Web Videos with
  Noisy Labels
Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels
Zhizheng Liu
Joe Lin
Wayne Wu
Bolei Zhou
VGen
55
0
0
10 Oct 2024
On Efficient Variants of Segment Anything Model: A Survey
On Efficient Variants of Segment Anything Model: A Survey
Xiaorui Sun
J. Liu
H. Shen
Xiaofeng Zhu
Ping Hu
VLM
43
4
0
07 Oct 2024
Unpacking Failure Modes of Generative Policies: Runtime Monitoring of
  Consistency and Progress
Unpacking Failure Modes of Generative Policies: Runtime Monitoring of Consistency and Progress
Christopher Agia
Rohan Sinha
Jingyun Yang
Zi-ang Cao
Rika Antonova
Marco Pavone
Jeannette Bohg
26
6
0
06 Oct 2024
EAGLE: Egocentric AGgregated Language-video Engine
EAGLE: Egocentric AGgregated Language-video Engine
Jing Bi
Yunlong Tang
Luchuan Song
A. Vosoughi
Nguyen Nguyen
Chenliang Xu
25
8
0
26 Sep 2024
Semantics-Controlled Gaussian Splatting for Outdoor Scene Reconstruction
  and Rendering in Virtual Reality
Semantics-Controlled Gaussian Splatting for Outdoor Scene Reconstruction and Rendering in Virtual Reality
Hannah Schieber
Jacob Young
Tobias Langlotz
Stefanie Zollmann
Daniel Roth
3DGS
23
0
0
24 Sep 2024
CloudTrack: Scalable UAV Tracking with Cloud Semantics
CloudTrack: Scalable UAV Tracking with Cloud Semantics
Yannik Blei
Michael Krawez
Nisarga Nilavadi
Tanja Katharina Kaiser
Wolfram Burgard
39
1
0
24 Sep 2024
Learning Keypoints for Multi-Agent Behavior Analysis using
  Self-Supervision
Learning Keypoints for Multi-Agent Behavior Analysis using Self-Supervision
Daniel Khalil
Christina Liu
Pietro Perona
Jennifer J. Sun
Markus Marks
14
1
0
14 Sep 2024
FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally
FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally
Qiuhong Shen
Xingyi Yang
Xinchao Wang
3DGS
29
18
0
12 Sep 2024
GraspSplats: Efficient Manipulation with 3D Feature Splatting
GraspSplats: Efficient Manipulation with 3D Feature Splatting
Mazeyu Ji
Ri-Zhao Qiu
Xueyan Zou
Xiaolong Wang
3DGS
31
18
0
03 Sep 2024
TrackGo: A Flexible and Efficient Method for Controllable Video Generation
TrackGo: A Flexible and Efficient Method for Controllable Video Generation
Haitao Zhou
Chuang Wang
Rui Nie
Jinxiao Lin
Dongdong Yu
Qian Yu
Changhu Wang
VGen
DiffM
46
14
0
21 Aug 2024
3D-Aware Instance Segmentation and Tracking in Egocentric Videos
3D-Aware Instance Segmentation and Tracking in Egocentric Videos
Yash Bhalgat
Vadim Tschernezki
Iro Laina
João F. Henriques
Andrea Vedaldi
Andrew Zisserman
VOS
30
0
0
19 Aug 2024
SpectralGaussians: Semantic, spectral 3D Gaussian splatting for
  multi-spectral scene representation, visualization and analysis
SpectralGaussians: Semantic, spectral 3D Gaussian splatting for multi-spectral scene representation, visualization and analysis
Saptarshi Neil Sinha
Holger Graf
Michael Weinmann
3DGS
26
1
0
13 Aug 2024
SAM 2: Segment Anything in Images and Videos
SAM 2: Segment Anything in Images and Videos
Nikhila Ravi
Valentin Gabeur
Yuan-Ting Hu
Ronghang Hu
Chaitanya K. Ryali
...
Nicolas Carion
Chao-Yuan Wu
Ross B. Girshick
Piotr Dollár
Christoph Feichtenhofer
VLM
MLLM
31
676
0
01 Aug 2024
12
Next