ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01518
  4. Cited By
DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

2 December 2021
Yongming Rao
Wenliang Zhao
Guangyi Chen
Yansong Tang
Zheng Zhu
Guan Huang
Jie Zhou
Jiwen Lu
    VLM
    CLIP
ArXivPDFHTML

Papers citing "DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting"

50 / 400 papers shown
Title
Vision Language Modeling of Content, Distortion and Appearance for Image
  Quality Assessment
Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment
Fei Zhou
Zhicong Huang
Tianhao Gu
Guoping Qiu
CoGe
VLM
51
1
0
14 Jun 2024
Open-Vocabulary Semantic Segmentation with Image Embedding Balancing
Open-Vocabulary Semantic Segmentation with Image Embedding Balancing
Xiangheng Shan
Dongyue Wu
Guilin Zhu
Yuanjie Shao
Nong Sang
Changxin Gao
VLM
29
15
0
14 Jun 2024
Industrial Language-Image Dataset (ILID): Adapting Vision Foundation
  Models for Industrial Settings
Industrial Language-Image Dataset (ILID): Adapting Vision Foundation Models for Industrial Settings
Keno Moenck
Duc Trung Thieu
Julian Koch
Thorsten Schuppstuhl
VLM
27
0
0
14 Jun 2024
ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery
ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery
Kam Woh Ng
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
33
2
0
12 Jun 2024
AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video
  Grounding
AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding
Xing Zhang
Jiaxi Gu
Haoyu Zhao
Shicong Wang
Hang Xu
Renjing Pei
Songcen Xu
Zuxuan Wu
Yu-Gang Jiang
33
0
0
11 Jun 2024
A Recover-then-Discriminate Framework for Robust Anomaly Detection
A Recover-then-Discriminate Framework for Robust Anomaly Detection
Peng-Fei Xing
Dong Zhang
Jinhui Tang
Zechao li
29
1
0
07 Jun 2024
M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via
  Multimodal Denoising
M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via Multimodal Denoising
Chengjie Wang
Haokun Zhu
Jinlong Peng
Yue Wang
Ran Yi
Yunsheng Wu
Lizhuang Ma
J. J. Zhang
57
4
0
04 Jun 2024
ProGEO: Generating Prompts through Image-Text Contrastive Learning for
  Visual Geo-localization
ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization
Chen Mao
Jingqi Hu
24
4
0
04 Jun 2024
EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view
  Understanding
EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding
Thanh-Dat Truong
Utsav Prabhu
Dongyi Wang
Bhiksha Raj
Susan Gauch
J. Subbiah
Khoa Luu
46
2
0
03 Jun 2024
Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise
  Pseudo Labeling
Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise Pseudo Labeling
Jinxing Zhou
Dan Guo
Yiran Zhong
Meng Wang
VLM
53
18
0
03 Jun 2024
Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for
  Zero-Shot Semantic Segmentation
Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation
Yunheng Li
Zhongyu Li
Quansheng Zeng
Qibin Hou
Ming-Ming Cheng
VLM
30
8
0
02 Jun 2024
Universal and Extensible Language-Vision Models for Organ Segmentation
  and Tumor Detection from Abdominal Computed Tomography
Universal and Extensible Language-Vision Models for Organ Segmentation and Tumor Detection from Abdominal Computed Tomography
Jie Liu
Yixiao Zhang
Kang Wang
Mehmet Can Yavuz
Xiaoxi Chen
...
Haoliang Li
Yang Yang
Alan L. Yuille
Yucheng Tang
Zongwei Zhou
MedIm
VLM
40
16
0
28 May 2024
Promoting AI Equity in Science: Generalized Domain Prompt Learning for
  Accessible VLM Research
Promoting AI Equity in Science: Generalized Domain Prompt Learning for Accessible VLM Research
Qinglong Cao
Yuntian Chen
Lu Lu
Hao Sun
Zhenzhong Zeng
Xiaokang Yang
Dong-juan Zhang
VLM
16
1
0
14 May 2024
Can Better Text Semantics in Prompt Tuning Improve VLM Generalization?
Can Better Text Semantics in Prompt Tuning Improve VLM Generalization?
Hari Chandana Kuchibhotla
Sai Srinivas Kancheti
Abbavaram Gowtham Reddy
Vineeth N. Balasubramanian
VLM
29
0
0
13 May 2024
CLIP-Powered TASS: Target-Aware Single-Stream Network for Audio-Visual
  Question Answering
CLIP-Powered TASS: Target-Aware Single-Stream Network for Audio-Visual Question Answering
Yuanyuan Jiang
Jianqin Yin
38
1
0
13 May 2024
OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies
OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies
Lingdong Kong
You-Chen Liu
Lai Xing Ng
Benoit R. Cottereau
Wei Tsang Ooi
VLM
29
12
0
08 May 2024
G-Refine: A General Quality Refiner for Text-to-Image Generation
G-Refine: A General Quality Refiner for Text-to-Image Generation
Chunyi Li
Haoning Wu
Hongkun Hao
Zicheng Zhang
Tengchaun Kou
Chaofeng Chen
Lei Bai
Xiaohong Liu
Weisi Lin
Guangtao Zhai
25
4
0
29 Apr 2024
What does CLIP know about peeling a banana?
What does CLIP know about peeling a banana?
Claudia Cuttano
Gabriele Rosi
Gabriele Trivigno
Giuseppe Averta
21
2
0
18 Apr 2024
Progressive Multi-modal Conditional Prompt Tuning
Progressive Multi-modal Conditional Prompt Tuning
Xiaoyu Qiu
Hao Feng
Yuechen Wang
Wen-gang Zhou
Houqiang Li
VLM
29
1
0
18 Apr 2024
Single-temporal Supervised Remote Change Detection for Domain
  Generalization
Single-temporal Supervised Remote Change Detection for Domain Generalization
Qiangang Du
Jinlong Peng
Xu Chen
Qingdong He
Liren He
Qiang Nie
Wenbing Zhu
Mingmin Chi
Yabiao Wang
Chengjie Wang
30
1
0
17 Apr 2024
Unifying Global and Local Scene Entities Modelling for Precise Action
  Spotting
Unifying Global and Local Scene Entities Modelling for Precise Action Spotting
Kim Hoang Tran
Phuc Vuong Do
Ngoc Quoc Ly
Ngan Le
29
4
0
15 Apr 2024
kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually
  Expanding Large Vocabularies
kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies
Zhongrui Gui
Shuyang Sun
Runjia Li
Jianhao Yuan
Zhaochong An
Karsten Roth
Ameya Prabhu
Philip H. S. Torr
VLM
CLL
24
6
0
15 Apr 2024
Implicit and Explicit Language Guidance for Diffusion-based Visual
  Perception
Implicit and Explicit Language Guidance for Diffusion-based Visual Perception
Hefeng Wang
Jiale Cao
Jin Xie
Aiping Yang
Yanwei Pang
VLM
DiffM
32
2
0
11 Apr 2024
Retrieval-Augmented Open-Vocabulary Object Detection
Retrieval-Augmented Open-Vocabulary Object Detection
Jooyeon Kim
Eulrang Cho
Sehyung Kim
Hyunwoo J. Kim
VLM
ObjD
30
8
0
08 Apr 2024
Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation
Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation
Ji-Jia Wu
Andy Chia-Hao Chang
Chieh-Yu Chuang
Chun-Pei Chen
Yu-Lun Liu
Min-Hung Chen
Hou-Ning Hu
Yung-Yu Chuang
Yen-Yu Lin
VLM
33
9
0
05 Apr 2024
Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation
Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation
Elham Amin Mansour
Ozan Unal
Suman Saha
Benjamin Bejar
Luc Van Gool
34
1
0
04 Apr 2024
OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features
  and Rendered Novel Views
OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views
Francis Engelmann
Fabian Manhardt
Michael Niemeyer
Keisuke Tateno
Marc Pollefeys
Federico Tombari
VLM
65
32
1
04 Apr 2024
WorDepth: Variational Language Prior for Monocular Depth Estimation
WorDepth: Variational Language Prior for Monocular Depth Estimation
Ziyao Zeng
Daniel Wang
Fengyu Yang
Hyoungseob Park
Yangchao Wu
Stefano Soatto
Byung-Woo Hong
Dong Lao
Alex Wong
MDE
38
26
0
04 Apr 2024
ASAP: Interpretable Analysis and Summarization of AI-generated Image
  Patterns at Scale
ASAP: Interpretable Analysis and Summarization of AI-generated Image Patterns at Scale
Jinbin Huang
C. L. P. Chen
Aditi Mishra
Bum Chul Kwon
Zhicheng Liu
Chris Bryan
42
4
0
03 Apr 2024
Segment Any 3D Object with Language
Segment Any 3D Object with Language
Seungjun Lee
Yuyang Zhao
Gim Hee Lee
31
1
0
02 Apr 2024
Training-Free Semantic Segmentation via LLM-Supervision
Training-Free Semantic Segmentation via LLM-Supervision
Wenfang Sun
Yingjun Du
Gaowen Liu
Ramana Rao Kompella
Cees G. M. Snoek
VLM
35
2
0
31 Mar 2024
Weak Distribution Detectors Lead to Stronger Generalizability of
  Vision-Language Prompt Tuning
Weak Distribution Detectors Lead to Stronger Generalizability of Vision-Language Prompt Tuning
Kun Ding
Haojian Zhang
Qiang Yu
Ying Wang
Shiming Xiang
Chunhong Pan
OODD
VLM
16
4
0
31 Mar 2024
Transfer CLIP for Generalizable Image Denoising
Transfer CLIP for Generalizable Image Denoising
Junting Cheng
Dong Liang
Shan Tan
VLM
25
12
0
22 Mar 2024
OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic
  Segmentation
OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation
Kwanyoung Kim
Y. Oh
Jong Chul Ye
VLM
41
7
0
21 Mar 2024
Better Call SAL: Towards Learning to Segment Anything in Lidar
Better Call SAL: Towards Learning to Segment Anything in Lidar
Aljovsa Ovsep
Tim Meinhardt
Francesco Ferroni
Neehar Peri
Deva Ramanan
Laura Leal-Taixé
VLM
25
15
0
19 Mar 2024
Adapting Visual-Language Models for Generalizable Anomaly Detection in
  Medical Images
Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images
Chaoqin Huang
Aofan Jiang
Jinghao Feng
Ya-Qin Zhang
Xinchao Wang
Yanfeng Wang
MedIm
28
25
0
19 Mar 2024
Compositional Kronecker Context Optimization for Vision-Language Models
Compositional Kronecker Context Optimization for Vision-Language Models
Kun Ding
Xiaohui Li
Qiang Yu
Ying Wang
Haojian Zhang
Shiming Xiang
VLM
26
0
0
18 Mar 2024
Generative Region-Language Pretraining for Open-Ended Object Detection
Generative Region-Language Pretraining for Open-Ended Object Detection
Chuang Lin
Yi-Xin Jiang
Lizhen Qu
Zehuan Yuan
Jianfei Cai
ObjD
VLM
40
13
0
15 Mar 2024
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Brandon McKinzie
Zhe Gan
J. Fauconnier
Sam Dodge
Bowen Zhang
...
Zirui Wang
Ruoming Pang
Peter Grasch
Alexander Toshev
Yinfei Yang
MLLM
27
185
0
14 Mar 2024
Anomaly Detection by Adapting a pre-trained Vision Language Model
Anomaly Detection by Adapting a pre-trained Vision Language Model
Yuxuan Cai
Xinwei He
Dingkang Liang
Ao Tong
Xiang Bai
VLM
22
1
0
14 Mar 2024
Annotation Free Semantic Segmentation with Vision Foundation Models
Annotation Free Semantic Segmentation with Vision Foundation Models
Soroush Seifi
Daniel Olmeda Reino
Fabien Despinoy
Rahaf Aljundi
VLM
24
1
0
14 Mar 2024
Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation
Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation
Zicheng Zhang
Tong Zhang
Yi Zhu
Jian-zhuo Liu
Xiaodan Liang
QiXiang Ye
Wei Ke
VLM
44
2
0
13 Mar 2024
Open-World Semantic Segmentation Including Class Similarity
Open-World Semantic Segmentation Including Class Similarity
Matteo Sodano
Federico Magistri
Lucas Nunes
Jens Behley
C. Stachniss
VLM
34
8
0
12 Mar 2024
Split to Merge: Unifying Separated Modalities for Unsupervised Domain
  Adaptation
Split to Merge: Unifying Separated Modalities for Unsupervised Domain Adaptation
Xinyao Li
Yuke Li
Zhekai Du
Fengling Li
Ke Lu
Jingjing Li
VLM
39
4
0
11 Mar 2024
CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model
CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model
Pengwei Yin
Guanzhong Zeng
Jingjing Wang
Di Xie
VLM
18
11
0
08 Mar 2024
Domain-Agnostic Mutual Prompting for Unsupervised Domain Adaptation
Domain-Agnostic Mutual Prompting for Unsupervised Domain Adaptation
Zhekai Du
Xinyao Li
Fengling Li
Ke Lu
Lei Zhu
Jingjing Li
38
15
0
05 Mar 2024
PromptKD: Unsupervised Prompt Distillation for Vision-Language Models
PromptKD: Unsupervised Prompt Distillation for Vision-Language Models
Zheng Li
Xiang Li
Xinyi Fu
Xing Zhang
Weiqiang Wang
Shuo Chen
Jian Yang
VLM
27
34
0
05 Mar 2024
Beyond Specialization: Assessing the Capabilities of MLLMs in Age and Gender Estimation
Beyond Specialization: Assessing the Capabilities of MLLMs in Age and Gender Estimation
Maksim Kuprashevich
Grigorii Alekseenko
Irina Tolstykh
ELM
48
4
0
04 Mar 2024
Unveiling Typographic Deceptions: Insights of the Typographic
  Vulnerability in Large Vision-Language Model
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model
Hao-Ran Cheng
Erjia Xiao
Jindong Gu
Le Yang
Jinhao Duan
Jize Zhang
Jiahang Cao
Kaidi Xu
Renjing Xu
29
6
0
29 Feb 2024
FineDiffusion: Scaling up Diffusion Models for Fine-grained Image
  Generation with 10,000 Classes
FineDiffusion: Scaling up Diffusion Models for Fine-grained Image Generation with 10,000 Classes
Ziying Pan
Kun Wang
Gang Li
Feihong He
Yongxuan Lai
22
0
0
28 Feb 2024
Previous
12345678
Next