v1v2 (latest)

PromptDet: Towards Open-vocabulary Detection using Uncurated Images

European Conference on Computer Vision (ECCV), 2022

30 March 2022

Yujie Zhong

Papers citing "PromptDet: Towards Open-vocabulary Detection using Uncurated Images"

50 / 115 papers shown

State and Scene Enhanced Prototypes for Weakly Supervised Open-Vocabulary Object Detection

Jiaying Zhou

Qingchao Chen

161

22 Nov 2025

TOFA: Training-Free One-Shot Federated Adaptation for Vision-Language Models

504

20 Nov 2025

NoisyGRPO: Incentivizing Multimodal CoT Reasoning via Noise Injection and Bayesian Estimation

536

24 Oct 2025

On-the-Fly OVD Adaptation with FLAME: Few-shot Localization via Active Marginal-Samples Exploration

543

20 Oct 2025

Cluster-Aware Prompt Ensemble Learning for Few-Shot Vision-Language Model AdaptationPattern Recognition (Pattern Recogn.), 2025

236

10 Oct 2025

Cross-View Open-Vocabulary Object Detection in Aerial Imagery

276

04 Oct 2025

Adaptive Event Stream Slicing for Open-Vocabulary Event-Based Object Detection via Vision-Language Knowledge Distillation

175

01 Oct 2025

Constrained Prompt Enhancement for Improving Zero-Shot Generalization of Vision-Language Models

221

24 Aug 2025

Towards Open World Detection: A Survey

Andrei-Stefan Bulzan

Cosmin Cernazanu-Glavan

ObjD VLM

265

22 Aug 2025

AME: Aligned Manifold Entropy for Robust Vision-Language Distillation

Guiming Cao

Yuming Ou

AAML VLM

226

12 Aug 2025

Prompt-Guided Relational Reasoning for Social Behavior Understanding with Vision Foundation Models

Thinesh Thiyakesan Ponbagavathi

Chengzheng Yang

Alina Roitberg

VLM

246

11 Aug 2025

ODOV: Towards Open-Domain Open-Vocabulary Object Detection

265

02 Aug 2025

Advancing Visual Large Language Model for Multi-granular Versatile Perception

345

22 Jul 2025

Stereo-based 3D Anomaly Object Detection for Autonomous Driving: A New Dataset and Baseline

258

12 Jul 2025

Open World Object Detection: A Survey

480

01 Jul 2025

EarthGPT-X: A Spatial MLLM for Multi-level Multi-Source Remote Sensing Imagery Understanding with Visual PromptingIEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2025

493

17 Apr 2025

Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation

...

392

13 Apr 2025

GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection

470

26 Mar 2025

Squeeze Out Tokens from Sample for Finer-Grained Data Governance

...

331

18 Mar 2025

Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object DetectionInternational Conference on Learning Representations (ICLR), 2025

1.2K

14 Mar 2025

A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection

375

13 Mar 2025

YOLO-UniOW: Efficient Universal Open-World Object Detection

372

31 Dec 2024

Style-Pro: Style-Guided Prompt Learning for Generalizable Vision-Language ModelsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

Niloufar Alipour Talemi

Hossein Kashiani

Fatemeh Afghah

CLIP VLM

382

25 Nov 2024

Active Prompt Learning with Vision-Language Model Priors

234

23 Nov 2024

Efficient Transfer Learning for Video-language Foundation ModelsComputer Vision and Pattern Recognition (CVPR), 2024

456

18 Nov 2024

Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation

310

04 Nov 2024

OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object TrackingNeural Information Processing Systems (NeurIPS), 2024

Haiji Liang

Ruize Han

VLM

413

23 Oct 2024

SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary DetectionACM Multimedia (MM), 2024

285

08 Oct 2024

Revisiting Prompt Pretraining of Vision-Language Models

Zhaowei Chen

Xiang Li

400

10 Sep 2024

Exploring Conditional Multi-Modal Prompts for Zero-shot HOI DetectionEuropean Conference on Computer Vision (ECCV), 2024

392

05 Aug 2024

A Simple Background Augmentation Method for Object Detection with Diffusion ModelEuropean Conference on Computer Vision (ECCV), 2024

368

01 Aug 2024

MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection

286

31 Jul 2024

EarthMarker: Visual Prompt Learning for Region-level and Point-level Remote Sensing Imagery Comprehension

Wei Zhang

Miaoxin Cai

Tong Zhang

Jun Li

Zhuang Yin

Xuerui Mao

457

18 Jul 2024

Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation

Pengfei Wang

Yuxi Wang

Shuai Li

Zhaoxiang Zhang

Zhen Lei

Lei Zhang

271

18 Jul 2024

CerberusDet: Unified Multi-Task Object Detection

311

17 Jul 2024

LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction

Penghui Du

Yu Wang

Yifan Sun

Luting Wang

Errui Ding

Jingdong Wang

392

16 Jul 2024

Quantized Prompt for Efficient Generalization of Vision-Language Models

Hui Chen

340

15 Jul 2024

Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection

282

12 Jul 2024

Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Cross-Regularization

377

11 Jul 2024

Rethinking Image-to-Video Adaptation: An Object-centric Perspective

Rui Qian

Shuangrui Ding

Dahua Lin

OCL

289

09 Jul 2024

AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation

Limin Wang

381

05 Jul 2024

OVMR: Open-Vocabulary Recognition with Multi-Modal ReferencesComputer Vision and Pattern Recognition (CVPR), 2024

Qi Tian

457

07 Jun 2024

Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection

498

02 Jun 2024

Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection

441

01 Jun 2024

RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection

Hao Chen

260

30 May 2024

Open-Vocabulary SAM3D: Understand Any 3D Scene

Hanchen Tai

Qingdong He

Jiangning Zhang

Yijie Qian

Ying Tai

Xiaobin Hu

Yabiao Wang

Yong Liu

VLM

324

24 May 2024

Open-Vocabulary Spatio-Temporal Action Detection

Tao Wu

Gangshan Wu

280

17 May 2024

SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object DetectionComputer Vision and Pattern Recognition (CVPR), 2024

315

16 May 2024

Can Better Text Semantics in Prompt Tuning Improve VLM Generalization?

Hari Chandana Kuchibhotla

Sai Srinivas Kancheti

Abbavaram Gowtham Reddy

Vineeth N. Balasubramanian

VLM

388

13 May 2024

Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation

Yanhao Zheng

Kai Liu

ObjD

244

12 Apr 2024