2406.20076
EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
28 June 2024
Yuxuan Zhang
Tianheng Cheng
Lianghui Zhu
Lei Liu
Heng Liu
Longjin Ran
Xiaoxin Chen
Wenyu Liu
Xinggang Wang
VLM
Papers citing
"EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"
25 / 25 papers shown
Title
RESAnything: Attribute Prompting for Arbitrary Referring Segmentation
Ruiqi Wang
Hao Zhang
VLM
41
0
0
03 May 2025
Follow Everything: A Leader-Following and Obstacle Avoidance Framework with Goal-Aware Adaptation
Qianyi Zhang
Shijian Ma
Boyi Liu
J. H. Liu
Jianhao Jiao
20
0
0
28 Apr 2025
AffordanceSAM: Segment Anything Once More in Affordance Grounding
D. Jiang
Mengmeng Wang
Teli Ma
H. Li
Y. Liu
Guang Dai
L. Zhang
22
0
0
22 Apr 2025
Zero-Shot Industrial Anomaly Segmentation with Image-Aware Prompt Generation
SoYoung Park
Hyewon Lee
M. Choi
Seunghoon Han
Jong-Ryul Lee
Sungsu Lim
Tae-Ho Kim
VLM
41
0
0
18 Apr 2025
SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model
Kaiyu Li
Zepeng Xin
Li Pang
Chao Pang
Yupeng Deng
Jing Yao
Guisong Xia
Deyu Meng
Zhi Wang
Xiangyong Cao
VLM
LRM
37
0
0
13 Apr 2025
BiPrompt-SAM: Enhancing Image Segmentation via Explicit Selection between Point and Text Prompts
Suzhe Xu
Jialin Peng
Chengyuan Zhang
VLM
41
0
0
25 Mar 2025
SparseGS-W: Sparse-View 3D Gaussian Splatting in the Wild with Generative Priors
Yiqing Li
X. Wang
Jiawei Wu
Yikun Ma
Zhi Jin
3DGS
34
0
0
25 Mar 2025
RoboEngine: Plug-and-Play Robot Data Augmentation with Semantic Robot Segmentation and Background Generation
Chengbo Yuan
Suraj Joshi
Shaoting Zhu
Hang Su
Hang Zhao
Yang Gao
VGen
34
3
0
24 Mar 2025
SAM2 for Image and Video Segmentation: A Comprehensive Survey
Zhang Jiaxing
Tang Hao
VLM
47
0
0
17 Mar 2025
GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding
R. Hu
Lianghui Zhu
Yuxuan Zhang
Tianheng Cheng
Lei Liu
Heng Liu
Longjin Ran
Xiaoxin Chen
Wenyu Liu
Xinggang Wang
ObjD
56
0
0
13 Mar 2025
Customized SAM 2 for Referring Remote Sensing Image Segmentation
Fu Rong
Meng Lan
Q. Zhang
L. Zhang
37
0
0
10 Mar 2025
Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation
Suhwan Cho
Seunghoon Lee
Minhyeok Lee
Jungho Lee
Sangyoun Lee
VOS
72
0
0
05 Mar 2025
ACCORD: Alleviating Concept Coupling through Dependence Regularization for Text-to-Image Diffusion Personalization
Shizhan Liu
Hao Zheng
Hang Yu
Jianguo Li
DiffM
58
0
0
03 Mar 2025
NavG: Risk-Aware Navigation in Crowded Environments Based on Reinforcement Learning with Guidance Points
Qianyi Zhang
Wentao Luo
Boyi Liu
Ziyang Zhang
Yaoyuan Wang
J. H. Liu
26
0
0
03 Mar 2025
Audio Visual Segmentation Through Text Embeddings
Kyungbok Lee
You Zhang
Z. Duan
33
0
0
22 Feb 2025
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
Fu Rong
Meng Lan
Q. Zhang
L. Zhang
VOS
VGen
52
1
0
23 Jan 2025
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Haobo Yuan
X. Li
Tao Zhang
Zilong Huang
Shilin Xu
S. Ji
Yunhai Tong
Lu Qi
Jiashi Feng
Ming Yang
VLM
59
11
0
07 Jan 2025
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation
Claudia Cuttano
Gabriele Trivigno
Gabriele Rosi
Carlo Masone
Giuseppe Averta
VOS
79
1
0
26 Nov 2024
SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation
Xinyu Xiong
Zihuang Wu
Shuangyi Tan
Wenxue Li
Feilong Tang
Ying Chen
Siying Li
Jie Ma
Guanbin Li
VLM
38
11
0
16 Aug 2024
Domain-invariant Representation Learning via Segment Anything Model for Blood Cell Classification
Yongcheng Li
Lingcong Cai
Ying Lu
Cheng Lin
Yupeng Zhang
...
Genan Dai
Bowen Zhang
Jingzhou Cao
Xiangzhong Zhang
Xiaomao Fan
25
1
0
14 Aug 2024
Segment Anything in Medical Images and Videos: Benchmark and Deployment
Jun Ma
Sumin Kim
Feifei Li
Mohammed Baharoon
Reza Asakereh
Hongwei Lyu
Bo Wang
VLM
MedIm
29
27
0
06 Aug 2024
SAM 2: Segment Anything in Images and Videos
Nikhila Ravi
Valentin Gabeur
Yuan-Ting Hu
Ronghang Hu
Chaitanya K. Ryali
...
Nicolas Carion
Chao-Yuan Wu
Ross B. Girshick
Piotr Dollár
Christoph Feichtenhofer
VLM
MLLM
29
676
0
01 Aug 2024
Universal Segmentation at Arbitrary Granularity with Language Instruction
Yong Liu
Cairong Zhang
Yitong Wang
Jiahao Wang
Yujiu Yang
Yansong Tang
VLM
VOS
30
5
0
04 Dec 2023
DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies
S. Song
Bonnie Kruft
Minjia Zhang
Conglong Li
Shiyang Chen
...
Arash Vahdat
Chaowei Xiao
Thomas Gibbs
Anima Anandkumar
R. Stevens
29
4
0
06 Oct 2023
LAVT: Language-Aware Vision Transformer for Referring Image Segmentation
Zhao Yang
Jiaqi Wang
Yansong Tang
Kai-xiang Chen
Hengshuang Zhao
Philip H. S. Torr
115
308
0
04 Dec 2021