CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor

Computer Vision and Pattern Recognition (CVPR), 2023

12 December 2023

Shuyang Sun

Papers citing "CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor"

Target Refocusing via Attention Redistribution for Open-Vocabulary Semantic Segmentation: An Explainability Perspective

178

20 Nov 2025

NERVE: Neighbourhood & Entropy-guided Random-walk for training free open-Vocabulary sEgmentation

Lecture Notes in Social Networks (LNSN), 2025

120

11 Nov 2025

Personalizing Retrieval using Joint Embeddings or "the Return of Fluffy"

Bruno Korbar

Andrew Zisserman

106

06 Oct 2025

CoPatch: Zero-Shot Referring Image Segmentation by Leveraging Untapped Spatial Knowledge in CLIP

166

27 Sep 2025

RefAM: Attention Magnets for Zero-Shot Referral Segmentation

Anna Kukleva

Enis Simsar

A. Tonioni

Muhammad Ferjad Naeem

645

26 Sep 2025

DGL-RSIS: Decoupling Global Spatial Context and Local Class Semantics for Training-Free Remote Sensing Image Segmentation

113

30 Aug 2025

Annotation-Free Open-Vocabulary Segmentation for Remote-Sensing Images

123

25 Aug 2025

Beyond Human-prompting: Adaptive Prompt Tuning with Semantic Alignment for Anomaly Detection

131

22 Aug 2025

Multimodal Referring Segmentation: A Survey

387

01 Aug 2025

Training-Free Class Purification for Open-Vocabulary Semantic Segmentation

162

01 Aug 2025

A Survey on Training-free Open-Vocabulary Semantic Segmentation

224

28 May 2025

Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation

...

606

23 May 2025

RESAnything: Attribute Prompting for Arbitrary Referring Segmentation

Ruiqi Wang

Hao Zhang

VLM

276

03 May 2025

LGD: Leveraging Generative Descriptions for Zero-Shot Referring Image Segmentation

Pattern Recognition (Pattern Recogn.), 2025

477

20 Apr 2025

SPNeRF: Open Vocabulary 3D Neural Scene Segmentation with Superpoints

977

19 Mar 2025

Online Language Splatting

344

12 Mar 2025

IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis

AAAI Conference on Artificial Intelligence (AAAI), 2025

292

02 Mar 2025

RealSyn: An Effective and Scalable Multimodal Interleaved Document Transformation Paradigm

518

18 Feb 2025

Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation

473

28 Nov 2024

Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation

Computer Vision and Pattern Recognition (CVPR), 2024

776

26 Nov 2024

Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation

637

24 Nov 2024

Gotta Hear Them All: Towards Sound Source Aware Audio Generation

526

23 Nov 2024

ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements

645

18 Nov 2024

CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation

666

15 Nov 2024

Learning Visual Grounding from Generative Vision and Language Model

Shijie Wang

287

18 Jul 2024

A Simple Framework for Open-Vocabulary Zero-Shot Segmentation

434

23 Jun 2024

Annotation Free Semantic Segmentation with Vision Foundation Models

350

14 Mar 2024

A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

Chaoyang Zhu

Long Chen

ObjD VLM

511

18 Jul 2023