ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.02643
  4. Cited By
Segment Anything

Segment Anything

5 April 2023
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
Laura Gustafson
Tete Xiao
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
    MLLM
    VLM
ArXivPDFHTML

Papers citing "Segment Anything"

50 / 4,188 papers shown
Title
Efficient Adaptation For Remote Sensing Visual Grounding
Efficient Adaptation For Remote Sensing Visual Grounding
Hasan Moughnieh
Mohamad Chalhoub
Hasan Nasrallah
Cristiano Nattero
Paolo Campanella
Giovanni Nico
A. Ghandour
51
0
0
29 Mar 2025
Empowering Large Language Models with 3D Situation Awareness
Empowering Large Language Models with 3D Situation Awareness
Zhihao Yuan
Yibo Peng
Jinke Ren
Yinghong Liao
Yatong Han
Chun-Mei Feng
Hengshuang Zhao
G. Li
Shuguang Cui
Zhen Li
51
0
0
29 Mar 2025
A GAN-Enhanced Deep Learning Framework for Rooftop Detection from Historical Aerial Imagery
A GAN-Enhanced Deep Learning Framework for Rooftop Detection from Historical Aerial Imagery
Pengyu Chen
Sicheng Wang
Cuizhen Wang
Senrong Wang
Beiao Huang
Lu Huang
Zhe Zang
40
0
0
29 Mar 2025
Multi-label classification for multi-temporal, multi-spatial coral reef condition monitoring using vision foundation model with adapter learning
Multi-label classification for multi-temporal, multi-spatial coral reef condition monitoring using vision foundation model with adapter learning
Xinlei Shao
Hongruixuan Chen
Fan Zhao
Kirsty Magson
Jundong Chen
Peiran Li
Jingchao Wang
Jun Sasaki
44
0
0
29 Mar 2025
Open-Vocabulary Semantic Segmentation with Uncertainty Alignment for Robotic Scene Understanding in Indoor Building Environments
Open-Vocabulary Semantic Segmentation with Uncertainty Alignment for Robotic Scene Understanding in Indoor Building Environments
Yifan Xu
V. Kamat
Carol Menassa
51
0
0
29 Mar 2025
A Survey on Remote Sensing Foundation Models: From Vision to Multimodality
A Survey on Remote Sensing Foundation Models: From Vision to Multimodality
Ziyue Huang
Hongxi Yan
Qiqi Zhan
Shuai Yang
Mingming Zhang
Chenkai Zhang
Yiming Lei
Zeming Liu
Qingjie Liu
Yansen Wang
49
0
0
28 Mar 2025
Zero-shot Domain Generalization of Foundational Models for 3D Medical Image Segmentation: An Experimental Study
Zero-shot Domain Generalization of Foundational Models for 3D Medical Image Segmentation: An Experimental Study
Soumitri Chattopadhyay
Basar Demir
Marc Niethammer
VLM
69
0
0
28 Mar 2025
FLAM: Foundation Model-Based Body Stabilization for Humanoid Locomotion and Manipulation
FLAM: Foundation Model-Based Body Stabilization for Humanoid Locomotion and Manipulation
Xianqi Zhang
Hongliang Wei
Wenrui Wang
Xingtao Wang
Xiaopeng Fan
Debin Zhao
34
0
0
28 Mar 2025
Segment then Splat: A Unified Approach for 3D Open-Vocabulary Segmentation based on Gaussian Splatting
Segment then Splat: A Unified Approach for 3D Open-Vocabulary Segmentation based on Gaussian Splatting
Yiren Lu
Yunlai Zhou
Yiran Qiao
Chaoda Song
Tuo Liang
Jing Ma
Yu Yin
3DGS
39
0
0
28 Mar 2025
Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis
Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis
J. Huang
Baoxiong Jia
Yansen Wang
Ziyu Zhu
Xiongkun Linghu
Qing Li
Song-Chun Zhu
Siyuan Huang
87
3
0
28 Mar 2025
SCHNet: SAM Marries CLIP for Human Parsing
SCHNet: SAM Marries CLIP for Human Parsing
Kunliang Liu
Jianming Wang
Rize Jin
Wonjun Hwang
Tae-Sun Chung
VLM
68
0
0
28 Mar 2025
TranSplat: Lighting-Consistent Cross-Scene Object Transfer with 3D Gaussian Splatting
TranSplat: Lighting-Consistent Cross-Scene Object Transfer with 3D Gaussian Splatting
Boyang
Yanlin Jin
Ashok Veeraraghavan
Akshat Dave
Guha Balakrishnan
3DGS
64
0
0
28 Mar 2025
ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation
ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation
Yunhong Min
Daehyeon Choi
Kyeongmin Yeo
Jihyun Lee
Minhyuk Sung
54
0
0
28 Mar 2025
Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Challenges
Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Challenges
Ukcheol Shin
Jinsun Park
3DV
MDE
44
0
0
28 Mar 2025
ABC-GS: Alignment-Based Controllable Style Transfer for 3D Gaussian Splatting
ABC-GS: Alignment-Based Controllable Style Transfer for 3D Gaussian Splatting
Wei Liu
Ziqiang Liu
Xiaoyan Yang
Man Sha
Yang Li
3DGS
35
0
0
28 Mar 2025
MedCL: Learning Consistent Anatomy Distribution for Scribble-supervised Medical Image Segmentation
MedCL: Learning Consistent Anatomy Distribution for Scribble-supervised Medical Image Segmentation
Ke Zhang
Vishal M. Patel
49
0
0
28 Mar 2025
Synergistic Bleeding Region and Point Detection in Surgical Videos
Synergistic Bleeding Region and Point Detection in Surgical Videos
Jialun Pei
Zhangjun Zhou
Diandian Guo
Zhixi Li
Jing Qin
Bo Du
Pheng-Ann Heng
47
0
0
28 Mar 2025
DuckSegmentation: A segmentation model based on the AnYue Hemp Duck Dataset
DuckSegmentation: A segmentation model based on the AnYue Hemp Duck Dataset
Ling Feng
Tianyu Xie
Wei Ma
Ruijie Fu
Yuyao Zhang
Jun Li
Bei Zhou
41
0
0
27 Mar 2025
Stable-SCore: A Stable Registration-based Framework for 3D Shape Correspondence
Stable-SCore: A Stable Registration-based Framework for 3D Shape Correspondence
Haolin Liu
Xiaohang Zhan
Zizheng Yan
Zhongjin Luo
Yuxin Wen
Xiaoguang Han
63
0
0
27 Mar 2025
A Unified Image-Dense Annotation Generation Model for Underwater Scenes
A Unified Image-Dense Annotation Generation Model for Underwater Scenes
Hongkai Lin
Dingkang Liang
Zhenghao Qi
X. Bai
DiffM
46
0
0
27 Mar 2025
Evaluating Text-to-Image Synthesis with a Conditional Fréchet Distance
Evaluating Text-to-Image Synthesis with a Conditional Fréchet Distance
Jaywon Koo
J. Hernandez
Moayed Haji-Ali
Ziyan Yang
Vicente Ordonez
EGVM
72
0
0
27 Mar 2025
Can Video Diffusion Model Reconstruct 4D Geometry?
Can Video Diffusion Model Reconstruct 4D Geometry?
Jinjie Mai
Wenxuan Zhu
Haozhe Liu
Bing Li
Cheng Zheng
Jürgen Schmidhuber
Bernard Ghanem
VGen
MDE
77
0
0
27 Mar 2025
Orange Quality Grading with Deep Learning
Orange Quality Grading with Deep Learning
M. L. Mekhalfi
P. Chippendale
Francisco Fraile
Marcos Rico
42
0
0
27 Mar 2025
HyperFree: A Channel-adaptive and Tuning-free Foundation Model for Hyperspectral Remote Sensing Imagery
HyperFree: A Channel-adaptive and Tuning-free Foundation Model for Hyperspectral Remote Sensing Imagery
Jingtao Li
Y. Liu
Xinyu Wang
Yunning Peng
Chen Sun
...
Tian Ke
Xiao Jiang
Tangwei Lu
Anran Zhao
Yanfei Zhong
VLM
60
0
0
27 Mar 2025
Multimodal Data Integration for Sustainable Indoor Gardening: Tracking Anyplant with Time Series Foundation Model
Multimodal Data Integration for Sustainable Indoor Gardening: Tracking Anyplant with Time Series Foundation Model
Seyed Hamidreza Nabaei
Zeyang Zheng
Dong Chen
Arsalan Heydarian
41
0
0
27 Mar 2025
STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM
STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM
Yanjie Wang
Xu Cao
Weiyun Yi
Zhaoxin Fan
48
0
0
27 Mar 2025
Semantic Consistent Language Gaussian Splatting for Point-Level Open-vocabulary Querying
Semantic Consistent Language Gaussian Splatting for Point-Level Open-vocabulary Querying
Hairong Yin
Huangying Zhan
Yi Tian Xu
Raymond A. Yeh
48
0
0
27 Mar 2025
Foveated Instance Segmentation
Foveated Instance Segmentation
Hongyi Zeng
Wenxuan Liu
Tianhua Xia
Jintai Chen
Ziyun Li
Sai Qian Zhang
ISeg
70
0
0
27 Mar 2025
Efficient Multi-Instance Generation with Janus-Pro-Dirven Prompt Parsing
Efficient Multi-Instance Generation with Janus-Pro-Dirven Prompt Parsing
Fan Qi
Yu Duan
Changsheng Xu
DiffM
60
0
0
27 Mar 2025
Online Reasoning Video Segmentation with Just-in-Time Digital Twins
Online Reasoning Video Segmentation with Just-in-Time Digital Twins
Yiqing Shen
Bohan Liu
Chenjia Li
Lalithkumar Seenivasan
Mathias Unberath
VOS
83
2
0
27 Mar 2025
Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video
Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video
David Yifan Yao
Albert Zhai
Shenlong Wang
VGen
57
1
0
27 Mar 2025
Reasoning and Learning a Perceptual Metric for Self-Training of Reflective Objects in Bin-Picking with a Low-cost Camera
Reasoning and Learning a Perceptual Metric for Self-Training of Reflective Objects in Bin-Picking with a Low-cost Camera
Peiyuan Ni
Chee Meng Chew
Marcelo H. Ang Jr.
Gregory S. Chirikjian
66
0
0
26 Mar 2025
Assessing SAM for Tree Crown Instance Segmentation from Drone Imagery
Assessing SAM for Tree Crown Instance Segmentation from Drone Imagery
Mélisande Teng
Arthur Ouaknine
Etienne Laliberté
Yoshua Bengio
David Rolnick
Hugo Larochelle
57
0
0
26 Mar 2025
UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines
UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines
Chen Tang
Xinzhu Ma
Encheng Su
Xiufeng Song
Xiaohong Liu
Wei-Hong Li
Lei Bai
Wanli Ouyang
Xiangyu Yue
3DGS
AI4TS
72
0
0
26 Mar 2025
Operating Room Workflow Analysis via Reasoning Segmentation over Digital Twins
Operating Room Workflow Analysis via Reasoning Segmentation over Digital Twins
Yiqing Shen
Chenjia Li
Bohan Liu
Cheng-Yi Li
Tito Porras
Mathias Unberath
62
2
0
26 Mar 2025
GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection
GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection
Xingyu Peng
Si Liu
Chen Gao
Yan Bai
Beipeng Mu
Xiaofei Wang
Huaxia Xia
67
0
0
26 Mar 2025
Unlocking the Value of Decentralized Data: A Federated Dual Learning Approach for Model Aggregation
Unlocking the Value of Decentralized Data: A Federated Dual Learning Approach for Model Aggregation
Junyi Zhu
Ruicong Yao
Taha Ceritli
Savas Ozkan
Matthew B. Blaschko
Eunchung Noh
Jeongwon Min
Cho Jung Min
Mete Ozay
FedML
103
0
0
26 Mar 2025
Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields
Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields
Shijie Zhou
Hui Ren
Yijia Weng
Shuwang Zhang
Zhen Wang
...
Zhiwen Fan
Suya You
Ziyi Wang
Leonidas J. Guibas
A. Kadambi
VGen
3DGS
88
0
0
26 Mar 2025
Context-Aware Weakly Supervised Image Manipulation Localization with SAM Refinement
Context-Aware Weakly Supervised Image Manipulation Localization with SAM Refinement
Xinghao Wang
Changtao Miao
Dianmo Sheng
Tao Gong
Qi Chu
82
0
0
26 Mar 2025
EVPGS: Enhanced View Prior Guidance for Splatting-based Extrapolated View Synthesis
EVPGS: Enhanced View Prior Guidance for Splatting-based Extrapolated View Synthesis
Jiajun Li
Feiyu Wang
Xiaochao Qu
Chengjing Wu
Luoqi Liu
Ting Liu
3DGS
34
0
0
26 Mar 2025
Context-Aware Semantic Segmentation: Enhancing Pixel-Level Understanding with Large Language Models for Advanced Vision Applications
Context-Aware Semantic Segmentation: Enhancing Pixel-Level Understanding with Large Language Models for Advanced Vision Applications
Ben Rahman
VLM
42
1
0
25 Mar 2025
LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text
LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text
Weizhi Chen
Jingbo Chen
Yupeng Deng
Jiansheng Chen
Yuman Feng
Zhihao Xi
Diyou Liu
Kai Li
Yu Meng
VLM
51
0
0
25 Mar 2025
RP-SAM2: Refining Point Prompts for Stable Surgical Instrument Segmentation
RP-SAM2: Refining Point Prompts for Stable Surgical Instrument Segmentation
Nuren Zhaksylyk
Ibrahim Almakky
Jay N. Paranjape
S. Vedula
S. Sikder
Vishal M. Patel
Mohammad Yaqub
39
0
0
25 Mar 2025
Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models
Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models
Sangwon Beak
Hyeonwoo Kim
Hanbyul Joo
46
0
0
25 Mar 2025
Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing
Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing
Jaihoon Kim
Taehoon Yoon
Jisung Hwang
Minhyuk Sung
DiffM
54
1
0
25 Mar 2025
The Coralscapes Dataset: Semantic Scene Understanding in Coral Reefs
The Coralscapes Dataset: Semantic Scene Understanding in Coral Reefs
Jonathan Sauder
Viktor Domazetoski
G. Banc-Prandi
Gabriela Perna
Anders Meibom
D. Tuia
58
0
0
25 Mar 2025
OpenLex3D: A New Evaluation Benchmark for Open-Vocabulary 3D Scene Representations
OpenLex3D: A New Evaluation Benchmark for Open-Vocabulary 3D Scene Representations
Christina Kassab
Sacha Morin
Martin Buchner
Matías Mattamala
Kumaraditya Gupta
Abhinav Valada
Liam Paull
Maurice F. Fallon
3DV
ELM
46
0
0
25 Mar 2025
Show and Segment: Universal Medical Image Segmentation via In-Context Learning
Show and Segment: Universal Medical Image Segmentation via In-Context Learning
Yunhe Gao
Di Liu
Zhuowei Li
Yongbin Li
Dongdong Chen
Mu Zhou
Dimitris N. Metaxas
VLM
53
0
0
25 Mar 2025
CamSAM2: Segment Anything Accurately in Camouflaged Videos
CamSAM2: Segment Anything Accurately in Camouflaged Videos
Yuli Zhou
Guolei Sun
Yawei Li
Yuqian Fu
Luca Benini
Ender Konukoglu
47
0
0
25 Mar 2025
PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model
PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model
Mingju Gao
Yike Pan
Huan-ang Gao
Zongzheng Zhang
Wenyi Li
Hao Dong
Hao Tang
Li Yi
Hao Zhao
VGen
47
0
0
25 Mar 2025
Previous
123...789...828384
Next