2411.19331
Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation
28 November 2024
Luca Barsellotti
Lorenzo Bianchi
Nicola Messina
F. Carrara
Marcella Cornia
Lorenzo Baraldi
Fabrizio Falchi
Rita Cucchiara
VLM
Papers citing
"Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation"
9 / 9 papers shown
Title
RADSeg: Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglomerative Models
Omar Alama
Darshil Jariwala
A. Bhattacharya
Seungchan Kim
Wenshan Wang
Sebastian A. Scherer
VLM
40
0
0
24 Nov 2025
SuperQuadricOcc: Multi-Layer Gaussian Approximation of Superquadrics for Real-Time Self-Supervised Occupancy Estimation
Seamie Hayes
Reenu Mohandas
Tim Brophy
Alexandre Boulch
Ganesh Sistu
Ciarán Eising
188
0
0
21 Nov 2025
FarSLIP: Discovering Effective CLIP Adaptation for Fine-Grained Remote Sensing Understanding
Z. Li
W. Yu
Dilxat Muhtar
X. Zhang
Pengfeng Xiao
Pedram Ghamisi
Xiao Xiang Zhu
CLIP
VLM
144
0
0
18 Nov 2025
One Patch to Caption Them All: A Unified Zero-Shot Captioning Framework
Lorenzo Bianchi
Giacomo Pacini
F. Carrara
Nicola Messina
Giuseppe Amato
Fabrizio Falchi
VLM
98
0
0
03 Oct 2025
EasyOcc: 3D Pseudo-Label Supervision for Fully Self-Supervised Semantic Occupancy Prediction Models
Seamie Hayes
Ganesh Sistu
Ciarán Eising
111
1
0
30 Sep 2025
LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM
Roman Titkov
Egor Zubkov
Dmitry A. Yudin
Jaafar Mahmoud
Malik Mohrat
Gennady Sidorov
3DGS
179
1
0
03 Jun 2025
Talk2SAM: Text-Guided Semantic Enhancement for Complex-Shaped Object Segmentation
Luka Vetoshkin
Dmitry Yudin
98
0
0
03 Jun 2025
Resource-Efficient Affordance Grounding with Complementary Depth and Semantic Prompts
Yizhou Huang
Fan Yang
Guoliang Zhu
Gen Li
Hao-miao Shi
Yukun Zuo
Wenrui Chen
Hui Yuan
Kailun Yang
357
0
0
04 Mar 2025
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
Computer Vision and Pattern Recognition (CVPR), 2025
Haoyi Jiang
Liu Liu
Tianheng Cheng
Xinjie Wang
Tianwei Lin
Zhizhong Su
Wen Liu
Xinyu Wang
3DGS
ViT
413
26
0
17 Dec 2024