ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01518
  4. Cited By
DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

2 December 2021
Yongming Rao
Wenliang Zhao
Guangyi Chen
Yansong Tang
Zheng Zhu
Guan Huang
Jie Zhou
Jiwen Lu
    VLM
    CLIP
ArXivPDFHTML

Papers citing "DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting"

50 / 400 papers shown
Title
Boosting Single-domain Generalized Object Detection via Vision-Language Knowledge Interaction
Boosting Single-domain Generalized Object Detection via Vision-Language Knowledge Interaction
Xiaoran Xu
Jiangang Yang
Wenyue Chong
Wenhui Shi
S.
Jing Xing
Jian Liu
ObjD
VLM
79
0
0
27 Apr 2025
GenCLIP: Generalizing CLIP Prompts for Zero-shot Anomaly Detection
GenCLIP: Generalizing CLIP Prompts for Zero-shot Anomaly Detection
Donghyeong Kim
Chaewon Park
Suhwan Cho
Hyeonjeong Lim
Minseok Kang
Jungho Lee
Sangyoun Lee
VLM
44
0
0
21 Apr 2025
CLIP-Powered Domain Generalization and Domain Adaptation: A Comprehensive Survey
CLIP-Powered Domain Generalization and Domain Adaptation: A Comprehensive Survey
Jindong Li
Y. Li
Yali Fu
Jiahong Liu
Yixin Liu
Menglin Yang
Irwin King
VLM
36
0
0
19 Apr 2025
FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation
FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation
Yasser Benigmim
Mohammad Fahes
Tuan-Hung Vu
Andrei Bursuc
Raoul de Charette
VLM
32
0
0
14 Apr 2025
econSG: Efficient and Multi-view Consistent Open-Vocabulary 3D Semantic Gaussians
econSG: Efficient and Multi-view Consistent Open-Vocabulary 3D Semantic Gaussians
Can Zhang
G. Lee
3DV
50
0
0
08 Apr 2025
DA2Diff: Exploring Degradation-aware Adaptive Diffusion Priors for All-in-One Weather Restoration
DA2Diff: Exploring Degradation-aware Adaptive Diffusion Priors for All-in-One Weather Restoration
Jiamei Xiong
Xuefeng Yan
Yongzhen Wang
Wei Zhao
Xiao-Ping Zhang
Mingqiang Wei
DiffM
29
0
0
07 Apr 2025
UCS: A Universal Model for Curvilinear Structure Segmentation
UCS: A Universal Model for Curvilinear Structure Segmentation
Dianshuo Li
Li Chen
Y. Cao
Kai Zhu
Jun Cheng
33
0
0
05 Apr 2025
Simultaneous Learning of Optimal Transports for Training All-to-All Flow-Based Condition Transfer Model
Simultaneous Learning of Optimal Transports for Training All-to-All Flow-Based Condition Transfer Model
Kotaro Ikeda
Masanori Koyama
Jinzhe Zhang
Kohei Hayashi
Kenji Fukumizu
OT
75
0
0
04 Apr 2025
STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection
STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection
Divya Velayudhan
A. Ahmed
Mohamad Alansari
Neha Gour
Abderaouf Behouch
...
Muzammal Naseer
Juergen Gall
Mohammed Bennamoun
Ernesto Damiani
N. Werghi
42
0
0
03 Apr 2025
Is Temporal Prompting All We Need For Limited Labeled Action Recognition?
Is Temporal Prompting All We Need For Limited Labeled Action Recognition?
Shreyank N. Gowda
Boyan Gao
Xiao Gu
Xiaobo Jin
VLM
32
0
0
02 Apr 2025
Zero-Shot 4D Lidar Panoptic Segmentation
Zero-Shot 4D Lidar Panoptic Segmentation
Yushan Zhang
Aljosa Osep
Laura Leal-Taixé
Tim Meinhardt
3DPC
42
1
0
01 Apr 2025
Classifier-guided CLIP Distillation for Unsupervised Multi-label Classification
Classifier-guided CLIP Distillation for Unsupervised Multi-label Classification
Dongseob Kim
Hyunjung Shim
VLM
44
0
0
21 Mar 2025
Probabilistic Prompt Distribution Learning for Animal Pose Estimation
Probabilistic Prompt Distribution Learning for Animal Pose Estimation
Jiyong Rao
Brian Nlong Zhao
Yu Wang
VLM
VPVLM
64
0
0
20 Mar 2025
TLAC: Two-stage LMM Augmented CLIP for Zero-Shot Classification
TLAC: Two-stage LMM Augmented CLIP for Zero-Shot Classification
Ans Munir
Faisal Z. Qureshi
M. H. Khan
Mohsen Ali
VLM
70
0
0
15 Mar 2025
Bayesian Prompt Flow Learning for Zero-Shot Anomaly Detection
Bayesian Prompt Flow Learning for Zero-Shot Anomaly Detection
Zhen Qu
Xian Tao
Xinyi Gong
Shichen Qu
Qiyu Chen
Zhengtao Zhang
Xingang Wang
Guiguang Ding
VLM
59
0
0
13 Mar 2025
YOLOE: Real-Time Seeing Anything
Ao Wang
Lihao Liu
Hui Chen
Zijia Lin
J. Han
Guiguang Ding
VLM
ObjD
72
1
0
10 Mar 2025
Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization
Mihcael Green
Matan Levy
Issar Tzachor
Dvir Samuel
N. Darshan
Rami Ben-Ari
54
0
0
10 Mar 2025
Is CLIP ideal? No. Can we fix it? Yes!
Raphi Kang
Yue Song
Georgia Gkioxari
Pietro Perona
VLM
53
0
0
10 Mar 2025
OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
Huang Huang
Fangchen Liu
Letian Fu
Tingfan Wu
Mustafa Mukadam
Jitendra Malik
Ken Goldberg
Pieter Abbeel
LM&Ro
VLM
74
5
0
05 Mar 2025
ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification
ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification
Jiangbo Shi
Chen Li
Tieliang Gong
Yefeng Zheng
Huazhu Fu
VLM
60
5
0
12 Feb 2025
Vision-Language Models for Edge Networks: A Comprehensive Survey
Vision-Language Models for Edge Networks: A Comprehensive Survey
Ahmed Sharshar
Latif U. Khan
Waseem Ullah
Mohsen Guizani
VLM
62
3
0
11 Feb 2025
Natural Language Supervision for Low-light Image Enhancement
Natural Language Supervision for Low-light Image Enhancement
Jiahui Tang
Kaihua Zhou
Zhijian Luo
Yueen Hou
41
0
0
11 Jan 2025
Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning
Jianjie Luo
Jingwen Chen
Yehao Li
Yingwei Pan
Jianlin Feng
Hongyang Chao
Ting Yao
DiffM
VLM
45
0
0
03 Jan 2025
Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation
S. Park
Subeen Lee
Hyun Seok Seong
Jaejoon Yoo
Jae-Pil Heo
32
1
0
03 Jan 2025
Tuning Vision-Language Models with Candidate Labels by Prompt Alignment
Tuning Vision-Language Models with Candidate Labels by Prompt Alignment
Zhifang Zhang
Yuwei Niu
Xin Liu
Beibei Li
VPVLM
VLM
54
0
0
31 Dec 2024
Injecting Explainability and Lightweight Design into Weakly Supervised Video Anomaly Detection Systems
Injecting Explainability and Lightweight Design into Weakly Supervised Video Anomaly Detection Systems
Wen-Dong Jiang
Chih-Yung Chang
Hsiang-Chuan Chang
Ji-Yuan Chen
Diptendu Sinha Roy
31
0
0
31 Dec 2024
Defending Multimodal Backdoored Models by Repulsive Visual Prompt Tuning
Defending Multimodal Backdoored Models by Repulsive Visual Prompt Tuning
Zhifang Zhang
Shuo He
Bingquan Shen
Lei Feng
Lei Feng
AAML
38
0
0
29 Dec 2024
Improving Generated and Retrieved Knowledge Combination Through
  Zero-shot Generation
Improving Generated and Retrieved Knowledge Combination Through Zero-shot Generation
Xinkai Du
Quanjie Han
Chao Lv
Y. Liu
Yalin Sun
Hao Shu
Hongbo Shan
Maosong Sun
RALM
35
1
0
25 Dec 2024
AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot
  Semantic Segmentation
AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation
Jiaqi Ma
Guo-Sen Xie
Fang Zhao
Zechao Li
32
0
0
23 Dec 2024
DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level
  Vision-Language Alignment
DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment
Cijo Jose
Théo Moutakanni
Dahyun Kang
Federico Baldassarre
Timothée Darcet
...
Maxime Oquab
Oriane Siméoni
Huy V. Vo
Patrick Labatut
Piotr Bojanowski
CLIP
VLM
88
6
0
20 Dec 2024
Cross-Modal Few-Shot Learning with Second-Order Neural Ordinary
  Differential Equations
Cross-Modal Few-Shot Learning with Second-Order Neural Ordinary Differential Equations
Yi Zhang
Chun-Wun Cheng
Junyi He
Zhihai He
Carola-Bibiane Schonlieb
Yuyan Chen
Angelica I Aviles-Rivero
AI4TS
75
0
0
20 Dec 2024
A Decade of Deep Learning: A Survey on The Magnificent Seven
A Decade of Deep Learning: A Survey on The Magnificent Seven
Dilshod Azizov
Muhammad Arslan Manzoor
Velibor Bojkovic
Yingxu Wang
Z. Wang
...
Liang Li
Siwei Liu
Yu Zhong
Wei Liu
Shangsong Liang
OOD
AI4TS
MedIm
116
0
0
13 Dec 2024
DiffCLIP: Few-shot Language-driven Multimodal Classifier
DiffCLIP: Few-shot Language-driven Multimodal Classifier
Jiaqing Zhang
Mingxiang Cao
Xue Yang
Kai Jiang
Yunsong Li
VLM
66
0
0
10 Dec 2024
See What You Seek: Semantic Contextual Integration for Cloth-Changing
  Person Re-Identification
See What You Seek: Semantic Contextual Integration for Cloth-Changing Person Re-Identification
Xiyu Han
X. Zhong
Wenxin Huang
Xuemei Jia
Wenxuan Liu
Xiaohan Yu
Alex Chichung Kot
100
0
0
02 Dec 2024
A Study on Unsupervised Domain Adaptation for Semantic Segmentation in
  the Era of Vision-Language Models
A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models
Manuel Schwonberg
Claus Werner
Hanno Gottschalk
Carsten Meyer
VLM
85
0
0
25 Nov 2024
Style-Pro: Style-Guided Prompt Learning for Generalizable
  Vision-Language Models
Style-Pro: Style-Guided Prompt Learning for Generalizable Vision-Language Models
Niloufar Alipour Talemi
Hossein Kashiani
Fatemeh Afghah
CLIP
VLM
67
0
0
25 Nov 2024
ResCLIP: Residual Attention for Training-free Dense Vision-language
  Inference
ResCLIP: Residual Attention for Training-free Dense Vision-language Inference
Yuhang Yang
Jinhong Deng
Wen Li
Lixin Duan
VLM
71
0
0
24 Nov 2024
PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation
PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation
Ziyao Zeng
Jingcheng Ni
Daniel Wang
Patrick Rim
Younjoon Chung
Fengyu Yang
Byung-Woo Hong
A. Wong
DiffM
MDE
98
2
0
24 Nov 2024
Harnessing Vision Foundation Models for High-Performance, Training-Free
  Open Vocabulary Segmentation
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation
Yuheng Shi
Minjing Dong
Chang Xu
VLM
27
1
0
14 Nov 2024
LG-Gaze: Learning Geometry-aware Continuous Prompts for Language-Guided
  Gaze Estimation
LG-Gaze: Learning Geometry-aware Continuous Prompts for Language-Guided Gaze Estimation
Pengwei Yin
Jingjing Wang
Guanzhong Zeng
Di Xie
Jiang Zhu
26
3
0
13 Nov 2024
Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion
  Models
Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models
Shuhong Zheng
Zhipeng Bao
Ruoyu Zhao
Martial Hebert
Yu-xiong Wang
DiffM
35
0
0
07 Nov 2024
Multiple Information Prompt Learning for Cloth-Changing Person Re-Identification
Multiple Information Prompt Learning for Cloth-Changing Person Re-Identification
Shengxun Wei
Zan Gao
Yibo Zhao
Weili Guan
Weili Guan
Shengyong Chen
46
1
0
01 Nov 2024
Aggregate-and-Adapt Natural Language Prompts for Downstream
  Generalization of CLIP
Aggregate-and-Adapt Natural Language Prompts for Downstream Generalization of CLIP
Chen Huang
Skyler Seto
Samira Abnar
David Grangier
Navdeep Jaitly
J. Susskind
VLM
36
0
0
31 Oct 2024
Language-guided Hierarchical Fine-grained Image Forgery Detection and
  Localization
Language-guided Hierarchical Fine-grained Image Forgery Detection and Localization
Xiao Guo
Xiaohong Liu
I. Masi
Xiaoming Liu
90
9
0
31 Oct 2024
IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking
IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking
Run Luo
Zikai Song
Longze Chen
Yunshui Li
Min Yang
Wei-Guo Yang
33
0
0
30 Oct 2024
An Individual Identity-Driven Framework for Animal Re-Identification
An Individual Identity-Driven Framework for Animal Re-Identification
Yihao Wu
Di Zhao
Jingfeng Zhang
Yun Sing Koh
21
0
0
30 Oct 2024
From Explicit Rules to Implicit Reasoning in an Interpretable Violence
  Monitoring System
From Explicit Rules to Implicit Reasoning in an Interpretable Violence Monitoring System
Wen-Dong Jiang
Chih-Yung Chang
Ssu-Chi Kuai
Diptendu Sinha Roy
29
0
0
29 Oct 2024
Text-Guided Attention is All You Need for Zero-Shot Robustness in
  Vision-Language Models
Text-Guided Attention is All You Need for Zero-Shot Robustness in Vision-Language Models
Lu Yu
Haiyang Zhang
Changsheng Xu
AAML
VLM
21
3
0
29 Oct 2024
TIPS: Text-Image Pretraining with Spatial awareness
TIPS: Text-Image Pretraining with Spatial awareness
Kevis-Kokitsi Maninis
Kaifeng Chen
Soham Ghosh
Arjun Karpur
Koert Chen
...
Jan Dlabal
Dan Gnanapragasam
Mojtaba Seyedhosseini
Howard Zhou
Andre Araujo
VLM
30
3
0
21 Oct 2024
YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary
YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary
Hao-Tang Tsui
Chien-Yao Wang
H. Liao
ObjD
VLM
41
0
0
20 Oct 2024
12345678
Next