CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
Computer Vision and Pattern Recognition (CVPR), 2023
12 December 2023
Shuyang Sun
Runjia Li
Juil Sock
Xiuye Gu
Siyang Li
VLM CLIP

Papers citing "CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor"

28 / 28 papers shown
Target Refocusing via Attention Redistribution for Open-Vocabulary Semantic Segmentation: An Explainability Perspective
Jiahao Li
Yang Lu
Yachao Zhang
Yong Xie
Fangyong Wang
Yuan Xie
Yanyun Qu
VLM
171
0
0
20 Nov 2025
NERVE: Neighbourhood & Entropy-guided Random-walk for training free open-Vocabulary sEgmentation
Lecture Notes in Social Networks (LNSN), 2025
Kunal Mahatha
Jose Dolz
Christian Desrosiers
DiffM
106
0
0
11 Nov 2025
Personalizing Retrieval using Joint Embeddings or "the Return of Fluffy"
Bruno Korbar
Andrew Zisserman
102
0
0
06 Oct 2025
CoPatch: Zero-Shot Referring Image Segmentation by Leveraging Untapped Spatial Knowledge in CLIP
Na Min An
Inha Kang
Minhyun Lee
Hyunjung Shim
VLM
149
0
0
27 Sep 2025
RefAM: Attention Magnets for Zero-Shot Referral Segmentation
Anna Kukleva
Enis Simsar
A. Tonioni
Muhammad Ferjad Naeem
F. Tombari
J. E. Lenssen
Bernt Schiele
DiffM VLM
618
0
0
26 Sep 2025
DGL-RSIS: Decoupling Global Spatial Context and Local Class Semantics for Training-Free Remote Sensing Image Segmentation
Boyi Li
Ce Zhang
Richard M. Timmerman
Wenxuan Bao
84
0
0
30 Aug 2025
Annotation-Free Open-Vocabulary Segmentation for Remote-Sensing Images
Kaiyu Li
Xiangyong Cao
Ruixun Liu
Shihong Wang
Zixuan Jiang
Zhi Wang
Deyu Meng
109
2
0
25 Aug 2025
Beyond Human-prompting: Adaptive Prompt Tuning with Semantic Alignment for Anomaly Detection
Pi-Wei Chen
Jerry Chun-Wei Lin
Wei-Han Chen
Jia Ji
Zih-Ching Chen
Feng-Hao Yeh
Chao-Chun Chen
VLM
112
0
0
22 Aug 2025
Multimodal Referring Segmentation: A Survey
Henghui Ding
Song Tang
Shuting He
Chang-rui Liu
Zuxuan Wu
Yu-Gang Jiang
370
10
0
01 Aug 2025
Training-Free Class Purification for Open-Vocabulary Semantic Segmentation
Qi Chen
Lingxiao Yang
Yun Chen
Nailong Zhao
Jianhuang Lai
Jie Shao
Xiaohua Xie
VLM
139
1
0
01 Aug 2025
A Survey on Training-free Open-Vocabulary Semantic Segmentation
Naomi Kombol
Ivan Martinović
Sinisa Segvic
ObjD VLM
218
0
0
28 May 2025
Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation
Zhihua Liu
Amrutha Saseendran
Lei Tong
Xilin He
Fariba Yousefi
...
Dino Oglic
Tom Diethe
Philip Teare
Huiyu Zhou
Chen Jin
VLM
603
3
0
23 May 2025
RESAnything: Attribute Prompting for Arbitrary Referring Segmentation
Ruiqi Wang
Hao Zhang
VLM
265
2
0
03 May 2025
LGD: Leveraging Generative Descriptions for Zero-Shot Referring Image Segmentation
Pattern Recognition (Pattern Recogn.), 2025
Jiachen Li
Qing Xie
Xiaohan Yu
Hongyun Wang
Jinyu Xu
Yongjian Liu
ObjD
432
1
0
20 Apr 2025
SPNeRF: Open Vocabulary 3D Neural Scene Segmentation with Superpoints
Weiwen Hu
Niccolò Parodi
Marcus Zepp
I. Feldmann
O. Schreer
Peter Eisert
VLM
914
0
0
19 Mar 2025
Online Language Splatting
Saimouli Katragadda
Cho-Ying Wu
Yuliang Guo
Xinyu Huang
Guoquan Huang
Liu Ren
3DGS OffRL
325
2
0
12 Mar 2025
IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis
AAAI Conference on Artificial Intelligence (AAAI), 2025
Yun Wang
Jingchen Ni
Yong-Jin Liu
Chun Yuan
Yansong Tang
275
14
0
02 Mar 2025
RealSyn: An Effective and Scalable Multimodal Interleaved Document Transformation Paradigm
Tiancheng Gu
Kaicheng Yang
Chaoyi Zhang
Yin Xie
Xiang An
Ziyong Feng
Dongnan Liu
Weidong Cai
Jiankang Deng
CLIP VLM
475
5
0
18 Feb 2025
Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation
Luca Barsellotti
Lorenzo Bianchi
Nicola Messina
F. Carrara
Marcella Cornia
Lorenzo Baraldi
Fabrizio Falchi
Rita Cucchiara
VLM
409
17
0
28 Nov 2024
Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2024
Chanyoung Kim
Dayun Ju
Woojung Han
Ming-Hsuan Yang
Seong Jae Hwang
VLM VOS
699
8
0
26 Nov 2024
Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation
Sule Bai
Yong-Jin Liu
Yifei Han
Haoji Zhang
Yansong Tang
VLM
600
19
0
24 Nov 2024
Gotta Hear Them All: Towards Sound Source Aware Audio Generation
Wei Guo
Heng Wang
Jianbo Ma
Weidong Cai
DiffM
477
6
0
23 Nov 2024
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements
M. Arda Aydın
Efe Mert Çırpar
Elvin Abdinli
Gözde B. Ünal
Y. Sahin
VLM
603
3
0
18 Nov 2024
CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation
Dengke Zhang
Fagui Liu
Quan Tang
VLM
628
5
0
15 Nov 2024
Learning Visual Grounding from Generative Vision and Language Model
Shijie Wang
Dahun Kim
A. Taalimi
Chen Sun
Weicheng Kuo
ObjD
257
18
0
18 Jul 2024
A Simple Framework for Open-Vocabulary Zero-Shot Segmentation
Thomas Stegmüller
Tim Lebailly
Nikola Dukic
Behzad Bozorgtabar
Tinne Tuytelaars
Jean-Philippe Thiran
VLM
399
3
0
23 Jun 2024
Annotation Free Semantic Segmentation with Vision Foundation Models
Soroush Seifi
Daniel Olmeda Reino
Fabien Despinoy
Rahaf Aljundi
VLM
328
2
0
14 Mar 2024
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Chaoyang Zhu
Long Chen
ObjD VLM
503
64
0
18 Jul 2023