ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.19331
  4. Cited By
Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation
v1v2v3 (latest)

Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation

28 November 2024
Luca Barsellotti
Lorenzo Bianchi
Nicola Messina
F. Carrara
Marcella Cornia
Lorenzo Baraldi
Fabrizio Falchi
Rita Cucchiara
    VLM
ArXiv (abs)PDFHTML

Papers citing "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation"

9 / 9 papers shown
Title
RADSeg: Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglomerative Models
RADSeg: Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglomerative Models
Omar Alama
Darshil Jariwala
A. Bhattacharya
Seungchan Kim
Wenshan Wang
Sebastian A. Scherer
VLM
40
0
0
24 Nov 2025
SuperQuadricOcc: Multi-Layer Gaussian Approximation of Superquadrics for Real-Time Self-Supervised Occupancy Estimation
SuperQuadricOcc: Multi-Layer Gaussian Approximation of Superquadrics for Real-Time Self-Supervised Occupancy Estimation
Seamie Hayes
Reenu Mohandas
Tim Brophy
Alexandre Boulch
Ganesh Sistu
Ciarán Eising
188
0
0
21 Nov 2025
FarSLIP: Discovering Effective CLIP Adaptation for Fine-Grained Remote Sensing Understanding
FarSLIP: Discovering Effective CLIP Adaptation for Fine-Grained Remote Sensing Understanding
Z. Li
W. Yu
Dilxat Muhtar
X. Zhang
Pengfeng Xiao
Pedram Ghamisi
Xiao Xiang Zhu
CLIPVLM
144
0
0
18 Nov 2025
One Patch to Caption Them All: A Unified Zero-Shot Captioning Framework
One Patch to Caption Them All: A Unified Zero-Shot Captioning Framework
Lorenzo Bianchi
Giacomo Pacini
F. Carrara
Nicola Messina
Giuseppe Amato
Fabrizio Falchi
VLM
98
0
0
03 Oct 2025
EasyOcc: 3D Pseudo-Label Supervision for Fully Self-Supervised Semantic Occupancy Prediction Models
EasyOcc: 3D Pseudo-Label Supervision for Fully Self-Supervised Semantic Occupancy Prediction Models
Seamie Hayes
Ganesh Sistu
Ciarán Eising
111
1
0
30 Sep 2025
LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM
LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM
Roman Titkov
Egor Zubkov
Dmitry A. Yudin
Jaafar Mahmoud
Malik Mohrat
Gennady Sidorov
3DGS
179
1
0
03 Jun 2025
Talk2SAM: Text-Guided Semantic Enhancement for Complex-Shaped Object Segmentation
Talk2SAM: Text-Guided Semantic Enhancement for Complex-Shaped Object Segmentation
Luka Vetoshkin
Dmitry Yudin
98
0
0
03 Jun 2025
Resource-Efficient Affordance Grounding with Complementary Depth and Semantic Prompts
Resource-Efficient Affordance Grounding with Complementary Depth and Semantic Prompts
Yizhou Huang
Fan Yang
Guoliang Zhu
Gen Li
Hao-miao Shi
Yukun Zuo
Wenrui Chen
Hui Yuan
Kailun Yang
357
0
0
04 Mar 2025
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial UnderstandingComputer Vision and Pattern Recognition (CVPR), 2024
Haoyi Jiang
Liu Liu
Tianheng Cheng
Xinjie Wang
Tianwei Lin
Zhizhong Su
Wen Liu
Xinyu Wang
3DGSViT
413
26
0
17 Dec 2024
1