DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting


2 December 2021
Yongming Rao
Wenliang Zhao
Guangyi Chen
Yansong Tang
Zheng Zhu
Guan Huang
Jie Zhou
Jiwen Lu
    VLM
    CLIP

Papers citing "DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting"

50 / 400 papers shown
Analyzing the Roles of Language and Vision in Learning from Limited Data
Allison Chen
Ilia Sucholutsky
Olga Russakovsky
Thomas L. Griffiths
VLM
21
2
0
15 Feb 2024
LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors
Sheng Jin
Xue-Qiu Jiang
Jiaxing Huang
Lewei Lu
Shijian Lu
VLM
ObjD
21
21
0
07 Feb 2024
Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors
Shiyin Dong
Mingrui Zhu
Kun Cheng
Nannan Wang
Xinbo Gao
DiffM
8
3
0
29 Jan 2024
Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation
Ci-Siang Lin
Chien-Yi Wang
Yu-Chiang Frank Wang
Min-Hung Chen
VLM
21
0
0
22 Jan 2024
Concept-Guided Prompt Learning for Generalization in Vision-Language Models
Yi Zhang
Ce Zhang
Ke Yu
Yushun Tang
Zhihai He
VLM
MLLM
32
20
0
15 Jan 2024
APLe: Token-Wise Adaptive for Multi-Modal Prompt Learning
Guiming Cao
Kaize Shi
Hong Fu
Huaiwen Zhang
Guandong Xu
VLM
20
1
0
12 Jan 2024
CLIP-Driven Semantic Discovery Network for Visible-Infrared Person Re-Identification
Xiaoyan Yu
Neng Dong
Liehuang Zhu
Hao Peng
Dapeng Tao
21
6
0
11 Jan 2024
Revisiting Adversarial Training at Scale
Zeyu Wang
Xianhang Li
Hongru Zhu
Cihang Xie
21
15
0
09 Jan 2024
Learning to Prompt Segment Anything Models
Jiaxing Huang
Kai Jiang
Jingyi Zhang
Han Qiu
Lewei Lu
Shijian Lu
Eric P. Xing
VLM
LRM
40
7
0
09 Jan 2024
Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
Longtian Qiu
Shan Ning
Xuming He
VLM
33
3
0
04 Jan 2024
Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels
Rui Huang
Songyou Peng
Ayca Takmaz
Federico Tombari
Marc Pollefeys
Shiji Song
Gao Huang
Francis Engelmann
VLM
13
37
0
28 Dec 2023
Multi-Prompts Learning with Cross-Modal Alignment for Attribute-based Person Re-Identification
Yajing Zhai
Yawen Zeng
Zhiyong Huang
Zheng Qin
Xin Jin
Dandan Cao
23
12
0
28 Dec 2023
VLCounter: Text-aware Visual Representation for Zero-Shot Object Counting
Seunggu Kang
WonJun Moon
Euiyeon Kim
Jae-Pil Heo
13
21
0
27 Dec 2023
Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition
Qianrui Zhou
Hua Xu
Hao Li
Hanlei Zhang
Xiaohan Zhang
Yifan Wang
Kai Gao
28
12
0
22 Dec 2023
Spectral Prompt Tuning: Unveiling Unseen Classes for Zero-Shot Semantic Segmentation
Wenhao Xu
Rongtao Xu
Changwei Wang
Shibiao Xu
Li Guo
Man Zhang
Xiaopeng Zhang
VLM
20
10
0
20 Dec 2023
MetaSegNet: Metadata-collaborative Vision-Language Representation Learning for Semantic Segmentation of Remote Sensing Images
Libo Wang
Sijun Dong
Ying Chen
Xiaoliang Meng
Shenghui Fang
Ayman Habib
Songlin Fei
13
0
0
20 Dec 2023
CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation
Monika Wysoczańska
Oriane Siméoni
Michael Ramamonjisoa
Andrei Bursuc
Tomasz Trzciński
Patrick Pérez
VLM
CLIP
24
29
0
19 Dec 2023
Open Vocabulary Semantic Scene Sketch Understanding
Ahmed Bourouis
Judith E. Fan
Yulia Gryaditskaya
VLM
3DV
15
1
0
18 Dec 2023
Understanding the Multi-modal Prompts of the Pre-trained Vision-Language Model
Shuailei Ma
Chen-Wei Xie
Ying-yu Wei
Siyang Sun
Jiaqi Fan
Xiaoyi Bao
Yuxin Guo
Yun Zheng
VLM
VPVLM
17
2
0
18 Dec 2023
TF-CLIP: Learning Text-free CLIP for Video-based Person Re-Identification
Chenyang Yu
Xuehu Liu
Yingquan Wang
Pingping Zhang
Huchuan Lu
VLM
19
21
0
15 Dec 2023
WAVER: Writing-style Agnostic Text-Video Retrieval via Distilling Vision-Language Models Through Open-Vocabulary Knowledge
Huy Le
Tung Kieu
Anh Nguyen
Ngan Le
VGen
19
1
0
15 Dec 2023
MmAP : Multi-modal Alignment Prompt for Cross-domain Multi-task Learning
Yi Xin
Junlong Du
Qiang Wang
Ke Yan
Shouhong Ding
VLM
32
45
0
14 Dec 2023
CLIP in Medical Imaging: A Comprehensive Survey
Zihao Zhao
Yuxiao Liu
Han Wu
Yonghao Li
Sheng Wang
L. Teng
Disheng Liu
Zhiming Cui
Qian Wang
Dinggang Shen
CLIP
MedIm
LM&MA
VLM
21
2
0
12 Dec 2023
Domain Prompt Learning with Quaternion Networks
Qinglong Cao
Zhengqin Xu
Yuntian Chen
Chao Ma
Xiaokang Yang
VLM
24
10
0
12 Dec 2023
Compound Text-Guided Prompt Tuning via Image-Adaptive Cues
Hao Tan
Jun Li
Yi Zhou
Jun Wan
Zhen Lei
Xiangyu Zhang
VLM
33
4
0
11 Dec 2023
Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models
Yubin Wang
Xinyang Jiang
De Cheng
Dongsheng Li
Cairong Zhao
VLM
33
15
0
11 Dec 2023
OpenSD: Unified Open-Vocabulary Segmentation and Detection
Shuai Li
Ming-hui Li
Pengfei Wang
Lei Zhang
ObjD
VLM
24
6
0
10 Dec 2023
Foundation Model Assisted Weakly Supervised Semantic Segmentation
Xiaobo Yang
Xiaojin Gong
VLM
26
22
0
06 Dec 2023
SYNC-CLIP: Synthetic Data Make CLIP Generalize Better in Data-Limited Scenarios
Mushui Liu
Weijie He
Ziqian Lu
Yunlong Yu
VLM
22
1
0
06 Dec 2023
Universal Segmentation at Arbitrary Granularity with Language Instruction
Yong Liu
Cairong Zhang
Yitong Wang
Jiahao Wang
Yujiu Yang
Yansong Tang
VLM
VOS
47
15
0
04 Dec 2023
A Simple Recipe for Language-guided Domain Generalized Segmentation
Mohammad Fahes
Tuan-Hung Vu
Andrei Bursuc
Patrick Pérez
Raoul de Charette
VLM
16
14
0
29 Nov 2023
Align before Adapt: Leveraging Entity-to-Region Alignments for Generalizable Video Action Recognition
Yifei Chen
Dapeng Chen
Ruijin Liu
Sai Zhou
Wenyuan Xue
Wei Peng
25
6
0
27 Nov 2023
Spatially Covariant Image Registration with Text Prompts
Xiang Chen
Min Liu
Rongguang Wang
Renjiu Hu
Dongdong Liu
Gaolei Li
Hang Zhang
MedIm
35
7
0
27 Nov 2023
BadCLIP: Trigger-Aware Prompt Learning for Backdoor Attacks on CLIP
Jiawang Bai
Kuofeng Gao
Shaobo Min
Shu-Tao Xia
Zhifeng Li
Wei Liu
VLM
21
36
0
26 Nov 2023
Hardware Resilience Properties of Text-Guided Image Classifiers
Syed Talal Wasim
Kabila Haile Soboka
Abdulrahman Mahmoud
Salman Khan
David Brooks
Gu-Yeon Wei
VLM
20
1
0
23 Nov 2023
Language-guided Few-shot Semantic Segmentation
Jing Wang
Yuang Liu
Qiang-feng Zhou
Fan Wang
VLM
9
3
0
23 Nov 2023
Visual In-Context Prompting
Feng Li
Qing Jiang
Hao Zhang
Tianhe Ren
Shilong Liu
...
Hongyang Li
Chun-yue Li
Jianwei Yang
Lei Zhang
Jianfeng Gao
VLM
LRM
MLLM
27
30
0
22 Nov 2023
Generalized Category Discovery in Semantic Segmentation
Zhengyuan Peng
Qijian Tian
Jianqing Xu
Yizhang Jin
Xuequan Lu
Xin Tan
Yuan Xie
Lizhuang Ma
ISeg
12
2
0
20 Nov 2023
Open-Vocabulary Video Anomaly Detection
Peng Wu
Xuerong Zhou
Guansong Pang
Yujia Sun
Jing Liu
Peng Wang
Yanning Zhang
VLM
24
21
0
13 Nov 2023
Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification
Reza Esfandiarpoor
Stephen H. Bach
VLM
19
13
0
10 Nov 2023
GIPCOL: Graph-Injected Soft Prompting for Compositional Zero-Shot Learning
Guangyue Xu
Joyce Chai
Parisa Kordjamshidi
VLM
14
16
0
09 Nov 2023
Cross-modal Prompts: Adapting Large Pre-trained Models for Audio-Visual Downstream Tasks
Haoyi Duan
Yan Xia
Mingze Zhou
Li Tang
Jieming Zhu
Zhou Zhao
VLM
11
17
0
09 Nov 2023
HIPTrack: Visual Tracking with Historical Prompts
Wenrui Cai
Qingjie Liu
Yunhong Wang
VLM
15
30
0
03 Nov 2023
AnomalyCLIP: Object-agnostic Prompt Learning for Zero-shot Anomaly Detection
Qihang Zhou
Guansong Pang
Yu Tian
Shibo He
Jiming Chen
VLM
26
120
0
29 Oct 2023
Text Augmented Spatial-aware Zero-shot Referring Image Segmentation
Yuchen Suo
Linchao Zhu
Yi Yang
21
12
0
27 Oct 2023
What's Left? Concept Grounding with Logic-Enhanced Foundation Models
Joy Hsu
Jiayuan Mao
Joshua B. Tenenbaum
Jiajun Wu
VLM
ReLM
LRM
18
21
0
24 Oct 2023
CPSeg: Finer-grained Image Semantic Segmentation via Chain-of-Thought Language Prompting
Lei Li
9
23
0
24 Oct 2023
Learning with Noisy Labels Using Collaborative Sample Selection and Contrastive Semi-Supervised Learning
Qing Miao
Xiaohe Wu
Chao Xu
Yanli Ji
Wangmeng Zuo
Yiwen Guo
Zhaopeng Meng
NoLa
24
2
0
24 Oct 2023
Videoprompter: an ensemble of foundational models for zero-shot video understanding
Adeel Yousaf
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
Mubarak Shah
VLM
22
2
0
23 Oct 2023
DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing
Yueming Lyu
Kang Zhao
Bo Peng
Yue Jiang
Yingya Zhang
Jing Dong
17
2
0
12 Oct 2023