2304.02643
Cited By
Segment Anything
5 April 2023
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
Laura Gustafson
Tete Xiao
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM
VLM
Papers citing "Segment Anything"
50 / 4,194 papers shown
Title
LIFT-GS: Cross-Scene Render-Supervised Distillation for 3D Language Grounding
Ang Cao
Sergio Arnaud
Oleksandr Maksymets
Jianing Yang
Ayush Jain
...
Aravind Rajeswaran
Franziska Meier
Justin Johnson
Jeong Joon Park
Alexander Sax
73
0
0
27 Feb 2025
SeisMoLLM: Advancing Seismic Monitoring via Cross-modal Transfer with Pre-trained Large Language Model
Xinyu Wang
Feng Liu
Rui Su
Zhilin Wang
Junlin Wu
Wanli Ouyang
VLM
190
0
0
27 Feb 2025
Open-Vocabulary Semantic Part Segmentation of 3D Human
Keito Suzuki
Bang Du
Girish Krishnan
Kunyao Chen
Runfa Li
Truong Thao Nguyen
3DH
VLM
103
0
0
27 Feb 2025
Multi-Keypoint Affordance Representation for Functional Dexterous Grasping
Fan Yang
DongSheng Luo
Wenrui Chen
Jiacheng Lin
Junjie Cai
Kailun Yang
Zehan Li
Yaonan Wang
56
0
0
27 Feb 2025
C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation
Yuhao Li
Mirana Claire Angel
Salman Khan
Yu Zhu
Jinqiu Sun
Yanning Zhang
Fahad Shahbaz Khan
VGen
51
0
0
27 Feb 2025
Climate And Resource Awareness is Imperative to Achieving Sustainable AI (and Preventing a Global AI Arms Race)
Pedram Bakhtiarifard
Pınar Tözün
Christian Igel
Raghavendra Selvan
62
0
0
27 Feb 2025
QPM: Discrete Optimization for Globally Interpretable Image Classification
Thomas Norrenbrock
Timo Kaiser
Sovan Biswas
R. Manuvinakurike
Bodo Rosenhahn
64
0
0
27 Feb 2025
You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving
Guangfeng Jiang
Jun Liu
Yongxuan Lv
Yongpeng Wu
Xianfei Li
Wenlong Liao
Tao He
Pai Peng
3DPC
57
0
0
27 Feb 2025
When does a predictor know its own loss?
Aravind Gollakota
Parikshit Gopalan
Aayush Karan
Charlotte Peale
Udi Wieder
UQCV
FaML
67
0
0
27 Feb 2025
Tell me why: Visual foundation models as self-explainable classifiers
Hugues Turbé
Mina Bjelogrlic
G. Mengaldo
Christian Lovis
69
0
0
26 Feb 2025
ArtGS: Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting
Yu Liu
Baoxiong Jia
Ruijie Lu
Junfeng Ni
Song-Chun Zhu
Siyuan Huang
3DGS
85
8
0
26 Feb 2025
Efficient Reinforcement Learning by Guiding Generalist World Models with Non-Curated Data
Yi Zhao
Aidan Scannell
Wenshuai Zhao
Yuxin Hou
Tianyu Cui
Le Chen
Dieter Büchler
Arno Solin
Juho Kannala
Joni Pajarinen
OffRL
OnRL
96
1
0
26 Feb 2025
A Survey on Foundation-Model-Based Industrial Defect Detection
Tianle Yang
Luyao Chang
Jiadong Yan
Jiyang Li
Zhi Wang
Ke Zhang
AI4CE
94
2
0
26 Feb 2025
QueryAdapter: Rapid Adaptation of Vision-Language Models in Response to Natural Language Queries
N. H. Chapman
Feras Dayoub
Will N. Browne
Christopher F. Lehnert
VLM
82
0
0
26 Feb 2025
BarkXAI: A Lightweight Post-Hoc Explainable Method for Tree Species Classification with Quantifiable Concepts
Yunmei Huang
Songlin Hou
Zachary Nelson Horve
Songlin Fei
69
0
0
26 Feb 2025
Attention-Guided Integration of CLIP and SAM for Precise Object Masking in Robotic Manipulation
Muhammad A. Muttaqien
Tomohiro Motoda
Ryo Hanai
Domae Yukiyasu
46
0
0
26 Feb 2025
Self-Supervised Data Generation for Precision Agriculture: Blending Simulated Environments with Real Imagery
Leonardo Saraceni
I. M. Motoi
Daniele Nardi
Thomas Alessandro Ciarfuglia
64
1
0
25 Feb 2025
OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference
Xiangyu Zhao
Shengyuan Ding
Zicheng Zhang
Haian Huang
Maosong Cao
...
Wenhai Wang
Guangtao Zhai
Haodong Duan
Hua Yang
Kai Chen
126
7
0
25 Feb 2025
Bayesian Computation in Deep Learning
Wenlong Chen
Bolian Li
Ruqi Zhang
Yingzhen Li
BDL
75
0
0
25 Feb 2025
VesselSAM: Leveraging SAM for Aortic Vessel Segmentation with LoRA and Atrous Attention
Adnan Iltaf
Rayan Merghani Ahmed
Bin Li
Bin Li
Shoujun Zhou
55
0
0
25 Feb 2025
DUNIA: Pixel-Sized Embeddings via Cross-Modal Alignment for Earth Observation Applications
Ibrahim Fayad
Max Zimmer
Martin Schwartz
P. Ciais
Fabian Gieseke
Gabriel Belouze
Sarah Brood
A. D. Truchis
Alexandre d’Aspremont
AI4TS
48
0
0
24 Feb 2025
Vision Language Models in Medicine
Beria Chingnabe Kalpelbe
Angel Gabriel Adaambiik
Wei Peng
VLM
LM&MA
91
2
0
24 Feb 2025
UrbanSAM: Learning Invariance-Inspired Adapters for Segment Anything Models in Urban Construction
Chenyu Li
Danfeng Hong
Bing Zhang
Yuxuan Li
Gustau Camps-Valls
X. Zhu
J. Chanussot
71
1
0
24 Feb 2025
Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review
Ufaq Khan
Umair Nawaz
A. Qayyum
Shazad Ashraf
Muhammad Bilal
Junaid Qadir
78
0
0
24 Feb 2025
LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Shuai Yang
Jing Tan
Mengchen Zhang
Tong Wu
Yong Li
Gordon Wetzstein
Ziwei Liu
Dahua Lin
MDE
VGen
59
6
0
24 Feb 2025
A large-scale multicenter breast cancer DCE-MRI benchmark dataset with expert segmentations
Lidia Garrucho
C. Reidel
Kaisar Kushibar
Smriti Joshi
Richard Osuala
...
M. P. Starmans
Fredrik Strand
Oliver Díaz
Laura Igual
Karim Lekadir
76
5
0
24 Feb 2025
Semantic Neural Radiance Fields for Multi-Date Satellite Data
Valentin Wagner
Sebastian Bullinger
C. Bodensteiner
Michael Arens
36
0
0
24 Feb 2025
Soybean pod and seed counting in both outdoor fields and indoor laboratories using unions of deep neural networks
Tianyou Jiang
Mingshun Shao
Tianyi Zhang
Xiaoyu Liu
Qun Yu
65
0
0
24 Feb 2025
Tidiness Score-Guided Monte Carlo Tree Search for Visual Tabletop Rearrangement
Hogun Kee
Wooseok Oh
Minjae Kang
Hyemin Ahn
Songhwai Oh
62
0
0
24 Feb 2025
Vision-LSTM: xLSTM as Generic Vision Backbone
Benedikt Alkin
M. Beck
Korbinian Poppel
Sepp Hochreiter
Johannes Brandstetter
VLM
69
44
0
24 Feb 2025
Anatomy-Informed Deep Learning and Radiomics for Automated Neurofibroma Segmentation in Whole-Body MRI
Georgii Kolokolnikov
Marie-Lena Schmalhofer
Lennart Well
Said Farschtschi
Victor-Felix Mautner
Inka Ristow
Rene Werner
AI4CE
45
0
0
24 Feb 2025
FlipConcept: Tuning-Free Multi-Concept Personalization for Text-to-Image Generation
Young Beom Woo
Sun Eung Kim
DiffM
50
0
0
24 Feb 2025
ZeroPS: High-quality Cross-modal Knowledge Transfer for Zero-Shot 3D Part Segmentation
Yuheng Xue
Nenglun Chen
Jun Liu
Wenyun Sun
3DPC
80
7
0
24 Feb 2025
A Closer Look at TabPFN v2: Strength, Limitation, and Extension
Han-Jia Ye
Si-Yang Liu
Wei-Lun Chao
46
4
0
24 Feb 2025
Vision Foundation Models in Medical Image Analysis: Advances and Challenges
Pengchen Liang
Bin Pu
Haishan Huang
Yiwei Li
Haoran Wang
Weibo Ma
Qing Chang
VLM
MedIm
106
0
0
24 Feb 2025
IBURD: Image Blending for Underwater Robotic Detection
Jungseok Hong
Sakshi Singh
Junaed Sattar
62
1
0
24 Feb 2025
MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs
Jiarui Zhang
Mahyar Khayatkhoei
P. Chhikara
Filip Ilievski
LRM
43
6
0
24 Feb 2025
SpecDM: Hyperspectral Dataset Synthesis with Pixel-level Semantic Annotations
Wen Liu
Pei Yang
Wenhui Hong
Xiaoguang Mei
Jiayi Ma
DiffM
66
0
0
24 Feb 2025
MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations
Benedikt Alkin
Lukas Miklautz
Sepp Hochreiter
Johannes Brandstetter
VLM
78
8
0
24 Feb 2025
Introducing Visual Perception Token into Multimodal Large Language Model
Runpeng Yu
Xinyin Ma
Xinchao Wang
MLLM
LRM
86
0
0
24 Feb 2025
Gaussian Difference: Find Any Change Instance in 3D Scenes
Binbin Jiang
Rui Huang
Qingyi Zhao
Yuxiang Zhang
46
0
0
24 Feb 2025
Deep learning approaches to surgical video segmentation and object detection: A Scoping Review
Devanish N. Kamtam
Joseph B. Shrager
Satya Deepya Malla
Nicole Lin
Juan J. Cardona
Jake J. Kim
Clarence Hu
47
1
0
23 Feb 2025
OpenVox: Real-time Instance-level Open-vocabulary Probabilistic Voxel Representation
Yinan Deng
Bicheng Yao
Yihang Tang
Yi Yang
Yufeng Yue
43
0
0
23 Feb 2025
Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration
Kim Jun-Seong
GeonU Kim
Kim Yu-Ji
Yu-Chun Wang
Jaesung Choe
Tae-Hyun Oh
3DGS
69
1
0
23 Feb 2025
Multimodal Large Language Models for Text-rich Image Understanding: A Comprehensive Review
Pei Fu
Tongkun Guan
Zining Wang
Zhentao Guo
Chen Duan
...
Boming Chen
Jiayao Ma
Qianyi Jiang
Kai Zhou
Junfeng Luo
VLM
66
0
0
23 Feb 2025
Audio Visual Segmentation Through Text Embeddings
Kyungbok Lee
You Zhang
Z. Duan
43
0
0
22 Feb 2025
Pointmap Association and Piecewise-Plane Constraint for Consistent and Compact 3D Gaussian Segmentation Field
Wenhao Hu
Wenhao Chai
Shengyu Hao
Xiaotong Cui
Xuexiang Wen
Lei Li
Gaoang Wang
3DV
60
0
0
22 Feb 2025
USegMix: Unsupervised Segment Mix for Efficient Data Augmentation in Pathology Images
Jiamu Wang
Jin Tae Kwak
MedIm
47
1
0
22 Feb 2025
DynamicGSG: Dynamic 3D Gaussian Scene Graphs for Environment Adaptation
Luzhou Ge
Xiangyu Zhu
Zhuo Yang
Xuesong Li
3DGS
72
0
0
21 Feb 2025
Structurally Disentangled Feature Fields Distillation for 3D Understanding and Editing
Yoel Levy
David Shavin
Itai Lang
Sagie Benaim
88
0
0
21 Feb 2025