ResearchTrend.AI

arXiv: 2305.06558

Segment and Track Anything

11 May 2023
Yangming Cheng
Liulei Li
Yuanyou Xu
Xiaodi Li
Zongxin Yang
Wenguan Wang
Yi Yang
    VOS

Papers citing "Segment and Track Anything"

37 / 37 papers shown
Title
Segment Any RGB-Thermal Model with Language-aided Distillation
Dong Xing
Xianxun Zhu
Wei Zhou
Qika Lin
Hang Yang
Yuqing Wang
VLM
56
0
0
04 May 2025
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency
Mengshi Qi
Pengfei Zhu
X. Li
Xiaoyang Bi
Lu Qi
Huadong Ma
Ming Yang
VOS
VLM
42
0
0
16 Apr 2025
L4P: Low-Level 4D Vision Perception Unified
Abhishek Badki
Hang Su
Bowen Wen
Orazio Gallo
VLM
78
1
0
18 Feb 2025
COMBO-Grasp: Learning Constraint-Based Manipulation for Bimanual Occluded Grasping
Jun Yamada
Alexander L. Mitchell
Jack Collins
Ingmar Posner
OffRL
81
0
0
17 Feb 2025
Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering
Yanpeng Zhao
Yiwei Hao
Siyu Gao
Yunbo Wang
Xiaokang Yang
OCL
120
1
0
17 Feb 2025
ZISVFM: Zero-Shot Object Instance Segmentation in Indoor Robotic Environments with Vision Foundation Models
Ying Zhang
Maoliang Yin
Wenfu Bi
Haibao Yan
Shaohan Bian
Cui-Hua Zhang
C. Hua
73
2
0
05 Feb 2025
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
Fu Rong
Meng Lan
Q. Zhang
L. Zhang
VOS
VGen
65
1
0
23 Jan 2025
Enhancing Skin Disease Diagnosis: Interpretable Visual Concept Discovery with SAM
Xin Hu
Janet Wang
Jihun Hamm
R. Yotsu
Zhengming Ding
92
0
0
17 Jan 2025
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
Andong Deng
Tongjia Chen
Shoubin Yu
Taojiannan Yang
Lincoln Spencer
Yapeng Tian
Ajmal Saeed Mian
Mohit Bansal
Chen Chen
LRM
54
1
0
15 Nov 2024
BYOCL: Build Your Own Consistent Latent with Hierarchical Representative Latent Clustering
Jiayue Dai
Yunya Wang
Yihan Fang
Yuetong Chen
Butian Xiong
VLM
29
0
0
19 Oct 2024
VideoSAM: Open-World Video Segmentation
Pinxue Guo
Zixu Zhao
Jianxiong Gao
Chongruo Wu
Tong He
Zheng Zhang
Tianjun Xiao
Wenqiang Zhang
VOS
26
0
0
11 Oct 2024
Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model
Hongliang Zhong
Can Wang
Jingbo Zhang
Jing Liao
3DGS
DiffM
33
2
0
25 Sep 2024
BihoT: A Large-Scale Dataset and Benchmark for Hyperspectral Camouflaged Object Tracking
Hanzheng Wang
Wei Li
X. Xia
Qian Du
55
1
0
22 Aug 2024
AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos
Feichi Lu
Zijian Dong
Jie Song
Otmar Hilliges
3DH
26
0
0
04 Aug 2024
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation
Pengfei Chen
Lingxi Xie
Xinyue Huo
Xuehui Yu
Xiaopeng Zhang
Yingfei Sun
Zhenjun Han
Qi Tian
VLM
58
1
0
23 Jul 2024
ViLLa: Video Reasoning Segmentation with Large Language Model
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
VOS
LRM
52
2
0
18 Jul 2024
Zero-Shot Scene Change Detection
Kyusik Cho
Dong Yeop Kim
Euntai Kim
36
1
0
17 Jun 2024
Matching Anything by Segmenting Anything
Siyuan Li
Lei Ke
Martin Danelljan
Luigi Piccinelli
Mattia Segu
Luc Van Gool
Fisher Yu
VOS
29
22
0
06 Jun 2024
Innovative Integration of Visual Foundation Model with a Robotic Arm on a Mobile Platform
Shimian Zhang
Qiuhong Lu
26
1
0
29 Apr 2024
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL
Fangwei Zhong
Kui Wu
Hai Ci
Churan Wang
Hao Chen
OffRL
34
2
0
15 Apr 2024
Object Segmentation-Assisted Inter Prediction for Versatile Video Coding
Zhuoyuan Li
Zikun Yuan
Li Li
Dong Liu
Xiaohu Tang
Feng Wu
VOS
29
7
0
18 Mar 2024
TriSAM: Tri-Plane SAM for zero-shot cortical blood vessel segmentation in VEM images
Jia Wan
Wanhua Li
Jason Ken Adhinarta
Atmadeep Banerjee
Evelina Sjostedt
Jingpeng Wu
J. Lichtman
Hanspeter Pfister
D. Wei
26
6
0
25 Jan 2024
DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)
Zongxin Yang
Guikun Chen
Xiaodi Li
Wenguan Wang
Yi Yang
LM&Ro
LLMAG
48
35
0
16 Jan 2024
HOLD: Category-agnostic 3D Reconstruction of Interacting Hands and Objects from Video
Zicong Fan
Maria Parelli
Maria Eleni Kadoglou
Muhammed Kocabas
Xu Chen
Michael J. Black
Otmar Hilliges
3DH
18
28
0
30 Nov 2023
MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing
Haoyu Zhao
Tianyi Lu
Jiaxi Gu
Xing Zhang
Qingping Zheng
Zuxuan Wu
Hang Xu
Yu-Gang Jiang
VGen
DiffM
27
10
0
29 Nov 2023
Tracking Anything with Decoupled Video Segmentation
Ho Kei Cheng
Seoung Wug Oh
Brian L. Price
Alexander Schwing
Joon-Young Lee
VOS
VLM
30
121
0
07 Sep 2023
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Ouyang Hao
Qiuyu Wang
Yuxi Xiao
Qingyan Bai
Juntao Zhang
Kecheng Zheng
Xiaowei Zhou
Qifeng Chen
Yujun Shen
DiffM
VGen
41
81
0
15 Aug 2023
A One Stop 3D Target Reconstruction and multilevel Segmentation Method
J. Xu
Wei-Ye Zhao
Zhiyan Tang
X. Gan
3DV
16
2
0
14 Aug 2023
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Muhammad Awais
Muzammal Naseer
Salman Khan
Rao Muhammad Anwer
Hisham Cholakkal
M. Shah
Ming Yang
F. Khan
VLM
18
117
0
25 Jul 2023
ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: TREK-150 Single Object Tracking
Yuanyou Xu
Jiahao Li
Zongxin Yang
Yi Yang
Yueting Zhuang
14
1
0
05 Jul 2023
ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: Semi-Supervised Video Object Segmentation
Jiahao Li
Yuanyou Xu
Zongxin Yang
Yi Yang
Yueting Zhuang
VOS
28
0
0
05 Jul 2023
RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation
Yonglin Li
Jing Zhang
Xiao Teng
Long Lan
VOS
VLM
19
17
0
03 Jul 2023
Matte Anything: Interactive Natural Image Matting with Segment Anything Models
J. Yao
Xinggang Wang
Lang Ye
Wenyu Liu
13
38
0
07 Jun 2023
Restore Anything Pipeline: Segment Anything Meets Image Restoration
Jiaxi Jiang
Christian Holz
VLM
27
8
0
22 May 2023
A Comprehensive Survey on Segment Anything Model for Vision and Beyond
Chunhui Zhang
Li Liu
Yawen Cui
Guanjie Huang
Weilin Lin
Yiqian Yang
Yuehong Hu
VLM
32
89
0
14 May 2023
Treating Motion as Option to Reduce Motion Dependency in Unsupervised Video Object Segmentation
Suhwan Cho
Minhyeok Lee
Seung-Hyun Lee
Chaewon Park
Donghyeon Kim
Sangyoun Lee
VOS
60
39
0
04 Sep 2022
Collaborative Video Object Segmentation by Multi-Scale Foreground-Background Integration
Zongxin Yang
Yunchao Wei
Yi Yang
VOS
33
163
0
13 Oct 2020