arXiv: 2411.19331

Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation

28 November 2024

Luca Barsellotti

Lorenzo Bianchi

Marcella Cornia

Lorenzo Baraldi

Fabrizio Falchi

Papers citing "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation"

11 / 11 papers shown

One Patch to Caption Them All: A Unified Zero-Shot Captioning Framework

Lorenzo Bianchi

Fabrizio Falchi

237

1

0

30 Mar 2026

Easy3D-Labels: Supervising Semantic Occupancy Estimation with 3D Pseudo-Labels for Automotive Perception

Ciaran Eising

322

3

0

27 Mar 2026

ShelfGaussian: Shelf-Supervised Open-Vocabulary Gaussian-based 3D Scene Understanding

201

1

0

03 Dec 2025

KM-ViPE: Online Tightly Coupled Vision-Language-Geometry Fusion for Open-Vocabulary Semantic SLAM

Mikhail Iumanov

Ekaterina Derevyanka

Sergey Kolyubin

170

0

0

01 Dec 2025

RADSeg: Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglomerative Models

Darshil Jariwala

A. Bhattacharya

Sebastian A. Scherer

250

2

0

24 Nov 2025

SuperQuadricOcc: Real-Time Self-Supervised Semantic Occupancy Estimation with Superquadric Volume Rendering

Alexandre Boulch

Ciaran Eising

392

1

0

21 Nov 2025

FarSLIP: Discovering Effective CLIP Adaptation for Fine-Grained Remote Sensing Understanding

254

1

0

18 Nov 2025

Talk2SAM: Text-Guided Semantic Enhancement for Complex-Shaped Object Segmentation

233

0

0

03 Jun 2025

LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM

Dmitry A. Yudin

Gennady Sidorov

269

2

0

03 Jun 2025

Resource-Efficient Affordance Grounding with Complementary Depth and Semantic Prompts

483

0

0

04 Mar 2025

GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding

Computer Vision and Pattern Recognition (CVPR), 2024

551

44

0

17 Dec 2024
