Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.01518
Cited By
DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting
2 December 2021
Yongming Rao
Wenliang Zhao
Guangyi Chen
Yansong Tang
Zheng Zhu
Guan Huang
Jie Zhou
Jiwen Lu
VLM
CLIP
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting"
50 / 400 papers shown
Title
Analyzing the Roles of Language and Vision in Learning from Limited Data
Allison Chen
Ilia Sucholutsky
Olga Russakovsky
Thomas L. Griffiths
VLM
21
2
0
15 Feb 2024
LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors
Sheng Jin
Xue-Qiu Jiang
Jiaxing Huang
Lewei Lu
Shijian Lu
VLM
ObjD
21
21
0
07 Feb 2024
Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors
Shiyin Dong
Mingrui Zhu
Kun Cheng
Nannan Wang
Xinbo Gao
DiffM
8
3
0
29 Jan 2024
Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation
Ci-Siang Lin
Chien-Yi Wang
Yu-Chiang Frank Wang
Min-Hung Chen
VLM
21
0
0
22 Jan 2024
Concept-Guided Prompt Learning for Generalization in Vision-Language Models
Yi Zhang
Ce Zhang
Ke Yu
Yushun Tang
Zhihai He
VLM
MLLM
32
20
0
15 Jan 2024
APLe: Token-Wise Adaptive for Multi-Modal Prompt Learning
Guiming Cao
Kaize Shi
Hong Fu
Huaiwen Zhang
Guandong Xu
VLM
20
1
0
12 Jan 2024
CLIP-Driven Semantic Discovery Network for Visible-Infrared Person Re-Identification
Xiaoyan Yu
Neng Dong
Liehuang Zhu
Hao Peng
Dapeng Tao
21
6
0
11 Jan 2024
Revisiting Adversarial Training at Scale
Zeyu Wang
Xianhang Li
Hongru Zhu
Cihang Xie
21
15
0
09 Jan 2024
Learning to Prompt Segment Anything Models
Jiaxing Huang
Kai Jiang
Jingyi Zhang
Han Qiu
Lewei Lu
Shijian Lu
Eric P. Xing
VLM
LRM
40
7
0
09 Jan 2024
Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
Longtian Qiu
Shan Ning
Xuming He
VLM
33
3
0
04 Jan 2024
Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels
Rui Huang
Songyou Peng
Ayca Takmaz
Federico Tombari
Marc Pollefeys
Shiji Song
Gao Huang
Francis Engelmann
VLM
13
37
0
28 Dec 2023
Multi-Prompts Learning with Cross-Modal Alignment for Attribute-based Person Re-Identification
Yajing Zhai
Yawen Zeng
Zhiyong Huang
Zheng Qin
Xin Jin
Dandan Cao
23
12
0
28 Dec 2023
VLCounter: Text-aware Visual Representation for Zero-Shot Object Counting
Seunggu Kang
WonJun Moon
Euiyeon Kim
Jae-Pil Heo
13
21
0
27 Dec 2023
Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition
Qianrui Zhou
Hua Xu
Hao Li
Hanlei Zhang
Xiaohan Zhang
Yifan Wang
Kai Gao
28
12
0
22 Dec 2023
Spectral Prompt Tuning:Unveiling Unseen Classes for Zero-Shot Semantic Segmentation
Wenhao Xu
Rongtao Xu
Changwei Wang
Shibiao Xu
Li Guo
Man Zhang
Xiaopeng Zhang
VLM
20
10
0
20 Dec 2023
MetaSegNet: Metadata-collaborative Vision-Language Representation Learning for Semantic Segmentation of Remote Sensing Images
Libo Wang
Sijun Dong
Ying Chen
Xiaoliang Meng
Shenghui Fang
Ayman Habib
Songlin Fei
13
0
0
20 Dec 2023
CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation
Monika Wysoczañska
Oriane Siméoni
Michael Ramamonjisoa
Andrei Bursuc
Tomasz Trzciñski
Patrick Pérez
VLM
CLIP
24
29
0
19 Dec 2023
Open Vocabulary Semantic Scene Sketch Understanding
Ahmed Bourouis
Judith E. Fan
Yulia Gryaditskaya
VLM
3DV
15
1
0
18 Dec 2023
Understanding the Multi-modal Prompts of the Pre-trained Vision-Language Model
Shuailei Ma
Chen-Wei Xie
Ying-yu Wei
Siyang Sun
Jiaqi Fan
Xiaoyi Bao
Yuxin Guo
Yun Zheng
VLM
VPVLM
17
2
0
18 Dec 2023
TF-CLIP: Learning Text-free CLIP for Video-based Person Re-Identification
Chenyang Yu
Xuehu Liu
Yingquan Wang
Pingping Zhang
Huchuan Lu
VLM
19
21
0
15 Dec 2023
WAVER: Writing-style Agnostic Text-Video Retrieval via Distilling Vision-Language Models Through Open-Vocabulary Knowledge
Huy Le
Tung Kieu
Anh Nguyen
Ngan Le
VGen
19
1
0
15 Dec 2023
MmAP : Multi-modal Alignment Prompt for Cross-domain Multi-task Learning
Yi Xin
Junlong Du
Qiang Wang
Ke Yan
Shouhong Ding
VLM
32
45
0
14 Dec 2023
CLIP in Medical Imaging: A Comprehensive Survey
Zihao Zhao
Yuxiao Liu
Han Wu
Yonghao Li
Sheng Wang
L. Teng
Disheng Liu
Zhiming Cui
Qian Wang
Dinggang Shen
CLIP
MedIm
LM&MA
VLM
21
2
0
12 Dec 2023
Domain Prompt Learning with Quaternion Networks
Qinglong Cao
Zhengqin Xu
Yuntian Chen
Chao Ma
Xiaokang Yang
VLM
24
10
0
12 Dec 2023
Compound Text-Guided Prompt Tuning via Image-Adaptive Cues
Hao Tan
Jun Li
Yi Zhou
Jun Wan
Zhen Lei
Xiangyu Zhang
VLM
33
4
0
11 Dec 2023
Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models
Yubin Wang
Xinyang Jiang
De Cheng
Dongsheng Li
Cairong Zhao
VLM
33
15
0
11 Dec 2023
OpenSD: Unified Open-Vocabulary Segmentation and Detection
Shuai Li
Ming-hui Li
Pengfei Wang
Lei Zhang
ObjD
VLM
24
6
0
10 Dec 2023
Foundation Model Assisted Weakly Supervised Semantic Segmentation
Xiaobo Yang
Xiaojin Gong
VLM
26
22
0
06 Dec 2023
SYNC-CLIP: Synthetic Data Make CLIP Generalize Better in Data-Limited Scenarios
Mushui Liu
Weijie He
Ziqian Lu
Yunlong Yu
VLM
22
1
0
06 Dec 2023
Universal Segmentation at Arbitrary Granularity with Language Instruction
Yong Liu
Cairong Zhang
Yitong Wang
Jiahao Wang
Yujiu Yang
Yansong Tang
VLM
VOS
47
15
0
04 Dec 2023
A Simple Recipe for Language-guided Domain Generalized Segmentation
Mohammad Fahes
Tuan-Hung Vu
Andrei Bursuc
Patrick Pérez
Raoul de Charette
VLM
16
14
0
29 Nov 2023
Align before Adapt: Leveraging Entity-to-Region Alignments for Generalizable Video Action Recognition
Yifei Chen
Dapeng Chen
Ruijin Liu
Sai Zhou
Wenyuan Xue
Wei Peng
25
6
0
27 Nov 2023
Spatially Covariant Image Registration with Text Prompts
Xiang Chen
Min Liu
Rongguang Wang
Renjiu Hu
Dongdong Liu
Gaolei Li
Hang Zhang
MedIm
35
7
0
27 Nov 2023
BadCLIP: Trigger-Aware Prompt Learning for Backdoor Attacks on CLIP
Jiawang Bai
Kuofeng Gao
Shaobo Min
Shu-Tao Xia
Zhifeng Li
Wei Liu
VLM
21
36
0
26 Nov 2023
Hardware Resilience Properties of Text-Guided Image Classifiers
Syed Talal Wasim
Kabila Haile Soboka
Abdulrahman Mahmoud
Salman Khan
David Brooks
Gu-Yeon Wei
VLM
20
1
0
23 Nov 2023
Language-guided Few-shot Semantic Segmentation
Jing Wang
Yuang Liu
Qiang-feng Zhou
Fan Wang
VLM
9
3
0
23 Nov 2023
Visual In-Context Prompting
Feng Li
Qing Jiang
Hao Zhang
Tianhe Ren
Shilong Liu
...
Hongyang Li
Chun-yue Li
Jianwei Yang
Lei Zhang
Jianfeng Gao
VLM
LRM
MLLM
27
30
0
22 Nov 2023
Generalized Category Discovery in Semantic Segmentation
Zhengyuan Peng
Qijian Tian
Jianqing Xu
Yizhang Jin
Xuequan Lu
Xin Tan
Yuan Xie
Lizhuang Ma
ISeg
12
2
0
20 Nov 2023
Open-Vocabulary Video Anomaly Detection
Peng Wu
Xuerong Zhou
Guansong Pang
Yujia Sun
Jing Liu
Peng Wang
Yanning Zhang
VLM
24
21
0
13 Nov 2023
Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification
Reza Esfandiarpoor
Stephen H. Bach
VLM
19
13
0
10 Nov 2023
GIPCOL: Graph-Injected Soft Prompting for Compositional Zero-Shot Learning
Guangyue Xu
Joyce Chai
Parisa Kordjamshidi
VLM
14
16
0
09 Nov 2023
Cross-modal Prompts: Adapting Large Pre-trained Models for Audio-Visual Downstream Tasks
Haoyi Duan
Yan Xia
Mingze Zhou
Li Tang
Jieming Zhu
Zhou Zhao
VLM
11
17
0
09 Nov 2023
HIPTrack: Visual Tracking with Historical Prompts
Wenrui Cai
Qingjie Liu
Yunhong Wang
VLM
15
30
0
03 Nov 2023
AnomalyCLIP: Object-agnostic Prompt Learning for Zero-shot Anomaly Detection
Qihang Zhou
Guansong Pang
Yu Tian
Shibo He
Jiming Chen
VLM
26
120
0
29 Oct 2023
Text Augmented Spatial-aware Zero-shot Referring Image Segmentation
Yuchen Suo
Linchao Zhu
Yi Yang
21
12
0
27 Oct 2023
What's Left? Concept Grounding with Logic-Enhanced Foundation Models
Joy Hsu
Jiayuan Mao
Joshua B. Tenenbaum
Jiajun Wu
VLM
ReLM
LRM
18
21
0
24 Oct 2023
CPSeg: Finer-grained Image Semantic Segmentation via Chain-of-Thought Language Prompting
Lei Li
9
23
0
24 Oct 2023
Learning with Noisy Labels Using Collaborative Sample Selection and Contrastive Semi-Supervised Learning
Qing Miao
Xiaohe Wu
Chao Xu
Yanli Ji
Wangmeng Zuo
Yiwen Guo
Zhaopeng Meng
NoLa
24
2
0
24 Oct 2023
Videoprompter: an ensemble of foundational models for zero-shot video understanding
Adeel Yousaf
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
Mubarak Shah
VLM
22
2
0
23 Oct 2023
DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing
Yueming Lyu
Kang Zhao
Bo Peng
Yue Jiang
Yingya Zhang
Jing Dong
17
2
0
12 Oct 2023
Previous
1
2
3
4
5
6
7
8
Next