SAM 2: Segment Anything in Images and Videos

International Conference on Learning Representations (ICLR), 2024

1 August 2024

Roman Rädle

Kalyan Vasudev Alwala

Nicolas Carion

Chao-Yuan Wu

Ross B. Girshick

Piotr Dollár

Christoph Feichtenhofer

VLM

MLLM

ArXiv (abs)PDF HTML HuggingFace (116 upvotes)

Papers citing "SAM 2: Segment Anything in Images and Videos"

50 / 863 papers shown

Masquerade: Learning from In-the-wild Human Videos using Data-Editing

220

13 Aug 2025

A Survey on 3D Gaussian Splatting Applications: Segmentation, Editing, and Generation

312

13 Aug 2025

Designing Memory-Augmented AR Agents for Spatiotemporal Reasoning in Personalized Task Assistance

159

12 Aug 2025

HumanOLAT: A Large-Scale Dataset for Full-Body Human Relighting and Novel-View Synthesis

172

12 Aug 2025

Correspondence as Video: Test-Time Adaption on SAM2 for Reference Segmentation in the Wild

214

11 Aug 2025

ReferSplat: Referring Segmentation in 3D Gaussian Splatting

179

11 Aug 2025

Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing

123

11 Aug 2025

NeeCo: Image Synthesis of Novel Instrument States Based on Dynamic and Deformable 3D Gaussian Reconstruction

11 Aug 2025

SAGOnline: Segment Any Gaussians Online

268

11 Aug 2025

OctreeNCA: Single-Pass 184 MP Segmentation on Consumer Hardware

09 Aug 2025

CannyEdit: Selective Canny Control and Dual-Prompt Guidance for Training-Free Image Editing

206

09 Aug 2025

NEP: Autoregressive Image Editing via Next Editing Token Prediction

153

08 Aug 2025

F2PASeg: Feature Fusion for Pituitary Anatomy Segmentation in Endoscopic SurgeryInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025

101

07 Aug 2025

Segmenting the Complex and Irregular in Two-Phase Flows: A Real-World Empirical Study with SAM2

Semanur Küçük

Cosimo Della Santina

Angeliki Laskari

07 Aug 2025

MOSEv2: A More Challenging Dataset for Video Object Segmentation in Complex Scenes

352

07 Aug 2025

Segment Any Vehicle: Semantic and Visual Context Driven SAM and A Benchmark

198

06 Aug 2025

SAM2-UNeXT: An Improved High-Resolution Baseline for Adapting Foundation Models to Downstream Segmentation Tasks

191

05 Aug 2025

ActionSink: Toward Precise Robot Manipulation with Dynamic Integration of Action Flow

162

05 Aug 2025

Trace3D: Consistent Segmentation Lifting via Gaussian Instance Tracing

161

05 Aug 2025

H3R: Hybrid Multi-view Correspondence for Generalizable 3D Reconstruction

190

05 Aug 2025

Towards Stealthy and Effective Backdoor Attacks on Lane Detection: A Naturalistic Data Poisoning Approach

141

04 Aug 2025

DreamPainter: Image Background Inpainting for E-commerce Scenarios

114

04 Aug 2025

Multimodal Referring Segmentation: A Survey

395

01 Aug 2025

SDMatte: Grafting Diffusion Models for Interactive Matting

251

01 Aug 2025

Video Color Grading via Look-Up Table Generation

123

01 Aug 2025

Omni-Scan: Creating Visually-Accurate Digital Twin Object Models Using a Bimanual Robot with Handover and Gaussian Splat Merging

182

01 Aug 2025

SAMSA 2.0: Prompting Segment Anything with Spectral Angles for Hyperspectral Interactive Medical Image Segmentation

126

01 Aug 2025

Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution

126

01 Aug 2025

AniMer+: Unified Pose and Shape Estimation Across Mammalia and Aves via Family-Aware TransformerIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025

170

01 Aug 2025

Fine-grained Spatiotemporal Grounding on Egocentric Videos

288

01 Aug 2025

Contact-Aware Amodal Completion for Human-Object Interaction via Multi-Regional Inpainting

123

01 Aug 2025

Towards Affordable Tumor Segmentation and Visualization for 3D Breast MRI Using SAM2

137

31 Jul 2025

Enhanced Velocity Field Modeling for Gaussian Video Reconstruction

190

31 Jul 2025

RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping

...

121

31 Jul 2025

SAMSA: Segment Anything Model Enhanced with Spectral Angles for Hyperspectral Interactive Medical Image SegmentationInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025

31 Jul 2025

Robust and Efficient 3D Gaussian Splatting for Urban Scene Reconstruction

145

30 Jul 2025

Beyond Rigid AI: Towards Natural Human-Machine Symbiosis for Interoperative Surgical Assistance

Lalithkumar Seenivasan

117

30 Jul 2025

Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation

335

30 Jul 2025

Neural Multi-View Self-Calibrated Photometric Stereo without Photometric Stereo Cues

Xu Cao

Takafumi Taketomi

3DV

187

30 Jul 2025

HRVVS: A High-resolution Video Vasculature Segmentation Network via Hierarchical Autoregressive Residual PriorsInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025

238

30 Jul 2025

From Waveforms to Pixels: A Survey on Audio-Visual Segmentation

Jia Li

Yapeng Tian

VOS

219

29 Jul 2025

MOVE: Motion-Guided Few-Shot Video Object Segmentation

244

29 Jul 2025

Semantic Segmentation of iPS Cells: Case Study on Model Complexity in Biomedical Imaging

128

29 Jul 2025

SAMITE: Position Prompted SAM2 with Calibrated Memory for Visual Object Tracking

129

29 Jul 2025

RIS-LAD: A Benchmark and Model for Referring Low-Altitude Drone Image Segmentation

197

28 Jul 2025

SAMwave: Wavelet-Driven Feature Enrichment for Effective Adaptation of Segment Anything Model

Saurabh Yadav

Avi Gupta

Koteswar Rao Jerripothula

VLM

168

27 Jul 2025

Latest Object Memory Management for Temporally Consistent Video Instance Segmentation

220

26 Jul 2025

HumanSAM: Classifying Human-centric Forgery Videos in Human Spatial, Appearance, and Motion Anomaly

253

26 Jul 2025

HQ-SMem: Video Segmentation and Tracking Using Memory Efficient Object Embedding With Selective Update and Self-Supervised Distillation Feedback

196

25 Jul 2025

Object-centric Video Question Answering with Visual Grounding and Referring

267

25 Jul 2025