Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models

Neural Information Processing Systems (NeurIPS), 2022

14 September 2022

Papers citing "Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models"

25 / 25 papers shown

How Blind and Low-Vision Individuals Prefer Large Vision-Language Model-Generated Scene Descriptions

290

15 Feb 2025

Enhancing Parameter-Efficient Fine-Tuning of Vision Transformers through Frequency-Based Adaptation

S. Ly

Hien Nguyen

330

28 Nov 2024

Attention Prompting on Image for Large Vision-Language Models

European Conference on Computer Vision (ECCV), 2024

Runpeng Yu

Weihao Yu

Xinchao Wang

VLM

396

25 Sep 2024

Fairness and Bias Mitigation in Computer Vision: A Survey

Ruozhen He

Vicente Ordonez

351

05 Aug 2024

Investigating the Effectiveness of Cross-Attention to Unlock Zero-Shot Editing of Text-to-Video Diffusion Models

Saman Motamed

Wouter Van Gansbeke

Luc Van Gool

VGen DiffM

296

08 Apr 2024

NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning

231

02 Mar 2024

LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text Understanding

Yuxuan Wang

Yueqian Wang

Pengfei Wu

Jianxin Liang

Dongyan Zhao

Zilong Zheng

VLM

268

25 Feb 2024

Contrastive Prompts Improve Disentanglement in Text-to-Image Diffusion Models

C. Wu

Fernando de la Torre

DiffM

224

21 Feb 2024

A Note on Bias to Complete

Jia Xu

Mona Diab

280

18 Feb 2024

Text-to-Image Cross-Modal Generation: A Systematic Review

Maciej Żelaszczyk

Jacek Mańdziuk

320

21 Jan 2024

Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models

Neural Information Processing Systems (NeurIPS), 2023

256

15 Nov 2023

Finetuning Text-to-Image Diffusion Models for Fairness

International Conference on Learning Representations (ICLR), 2023

254

11 Nov 2023

TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

216

06 Nov 2023

Adapting Segment Anything Model (SAM) through Prompt-based Learning for Enhanced Protein Identification in Cryo-EM Micrographs

Newgin Sam Ebin Sam Dhas

Rajan Gyawali

Ashwin Dhakal

Jianlin Cheng

Dong Xu

222

04 Nov 2023

Customize StyleGAN with One Hand Sketch

Shaocong Zhang

VLM

280

29 Oct 2023

Toward responsible face datasets: modeling the distribution of a disentangled latent space for sampling face images from demographic groups

354

15 Sep 2023

ITI-GEN: Inclusive Text-to-Image Generation

IEEE International Conference on Computer Vision (ICCV), 2023

251

11 Sep 2023

Inspecting the Geographical Representativeness of Images from Text-to-Image Models

IEEE International Conference on Computer Vision (ICCV), 2023

310

18 May 2023

Visual Tuning

ACM Computing Surveys (ACM Comput. Surv.), 2023

...

438

10 May 2023

TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional Generation

International Conference on Machine Learning (ICML), 2023

Jimmy Ba

292

26 Apr 2023

PATMAT: Person Aware Tuning of Mask-Aware Transformer for Face Inpainting

IEEE International Conference on Computer Vision (ICCV), 2023

279

12 Apr 2023

Controllable Text Generation via Probability Density Estimation in the Latent Space

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

228

16 Dec 2022

Taming Normalizing Flows

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

364

29 Nov 2022

Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance

Chen Henry Wu

Fernando de la Torre

DiffM

376

11 Oct 2022

A Distributional Lens for Multi-Aspect Controllable Text Generation

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

344

06 Oct 2022