SAM 2: Segment Anything in Images and Videos

International Conference on Learning Representations (ICLR), 2024
1 August 2024
Nikhila Ravi
Valentin Gabeur
Yuan-Ting Hu
Ronghang Hu
Chaitanya K. Ryali
Tengyu Ma
Haitham Khedr
Roman Rädle
Chloe Rolland
Laura Gustafson
Eric Mintun
Junting Pan
Kalyan Vasudev Alwala
Nicolas Carion
Chao-Yuan Wu
Ross B. Girshick
Piotr Dollár
Christoph Feichtenhofer
VLM, MLLM

Papers citing "SAM 2: Segment Anything in Images and Videos"

9 / 859 papers shown
Long-horizon Locomotion and Manipulation on a Quadrupedal Robot with Large Language Models
Yutao Ouyang
Jinhan Li
Yunfei Li
Zhongyu Li
Chao Yu
Koushil Sreenath
Yi Wu
393
23
0
08 Apr 2024
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Neural Information Processing Systems (NeurIPS), 2024
Dongzhi Jiang
Guanglu Song
Xiaoshi Wu
Renrui Zhang
Dazhong Shen
Zhuofan Zong
Yu Liu
Jiaming Song
VLM
458
53
0
04 Apr 2024
Crafting Dynamic Virtual Activities with Advanced Multimodal Models
International Symposium on Mixed and Augmented Reality (ISMAR), 2024
Changyang Li
Qingan Yan
Minyoung Kim
Z. Li
Yi Tian Xu
Lap-Fai Yu
182
0
0
15 Mar 2024
Renovating Names in Open-Vocabulary Segmentation Benchmarks
Neural Information Processing Systems (NeurIPS), 2024
Haiwen Huang
Songyou Peng
Dan Zhang
Andreas Geiger
VLM
228
5
0
14 Mar 2024
PointSeg: A Training-Free Paradigm for 3D Scene Segmentation via Foundation Models
Qingdong He
Jinlong Peng
Zhengkai Jiang
Xiaobin Hu
Jiangning Zhang
3DPC, VLM
510
9
0
11 Mar 2024
Grounding LLMs For Robot Task Planning Using Closed-loop State Feedback
V. Bhat
Ali Umut Kaypak
Prashanth Krishnamurthy
Ramesh Karri
Farshad Khorrami
LM&Ro
386
32
0
13 Feb 2024
Promoting Segment Anything Model towards Highly Accurate Dichotomous Image Segmentation
Xianjie Liu
Keren Fu
Qijun Zhao
VLM
596
1
0
30 Dec 2023
Audio-Visual Instance Segmentation
Computer Vision and Pattern Recognition (CVPR), 2023
Ruohao Guo
Yaru Chen
Yanyu Qi
Wenzhen Yue
Dantong Niu
Ji Shi
Qixun Wang
Peiliang Zhang
Buwen Liang
VLM, VOS
358
11
0
28 Oct 2023
Temporal Transductive Inference for Few-Shot Video Object Segmentation
International Journal of Computer Vision (IJCV), 2022
Mennatullah Siam
Konstantinos G. Derpanis
Richard P. Wildes
VOS
296
10
0
27 Mar 2022