ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2209.06970
  4. Cited By
Generative Visual Prompt: Unifying Distributional Control of Pre-Trained
  Generative Models
v1v2 (latest)

Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models

Neural Information Processing Systems (NeurIPS), 2022
14 September 2022
Chen Henry Wu
Saman Motamed
Shaunak Srivastava
Fernando de la Torre
    VLMDiffM
ArXiv (abs)PDFHTMLGithub (121★)

Papers citing "Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models"

25 / 25 papers shown
How Blind and Low-Vision Individuals Prefer Large Vision-Language Model-Generated Scene Descriptions
How Blind and Low-Vision Individuals Prefer Large Vision-Language Model-Generated Scene Descriptions
Na Min An
Eunki Kim
Wan Ju Kang
Sangryul Kim
Hyunjung Shim
Hyunjung Shim
290
2
0
15 Feb 2025
Enhancing Parameter-Efficient Fine-Tuning of Vision Transformers through
  Frequency-Based Adaptation
Enhancing Parameter-Efficient Fine-Tuning of Vision Transformers through Frequency-Based Adaptation
S. Ly
Hien Nguyen
330
4
0
28 Nov 2024
Attention Prompting on Image for Large Vision-Language Models
Attention Prompting on Image for Large Vision-Language ModelsEuropean Conference on Computer Vision (ECCV), 2024
Runpeng Yu
Weihao Yu
Xinchao Wang
VLM
396
28
0
25 Sep 2024
Fairness and Bias Mitigation in Computer Vision: A Survey
Fairness and Bias Mitigation in Computer Vision: A Survey
Sepehr Dehdashtian
Ruozhen He
Yi Li
Guha Balakrishnan
Nuno Vasconcelos
Vicente Ordonez
Vishnu Boddeti
351
13
0
05 Aug 2024
Investigating the Effectiveness of Cross-Attention to Unlock Zero-Shot
  Editing of Text-to-Video Diffusion Models
Investigating the Effectiveness of Cross-Attention to Unlock Zero-Shot Editing of Text-to-Video Diffusion Models
Saman Motamed
Wouter Van Gansbeke
Luc Van Gool
VGenDiffM
296
2
0
08 Apr 2024
NeRF-VPT: Learning Novel View Representations with Neural Radiance
  Fields via View Prompt Tuning
NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning
Linsheng Chen
Guangrun Wang
Liuchun Yuan
Keze Wang
Ken Deng
Juil Sock
231
3
0
02 Mar 2024
LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form
  Video-Text Understanding
LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text Understanding
Yuxuan Wang
Yueqian Wang
Pengfei Wu
Jianxin Liang
Dongyan Zhao
Zilong Zheng
VLM
268
3
0
25 Feb 2024
Contrastive Prompts Improve Disentanglement in Text-to-Image Diffusion
  Models
Contrastive Prompts Improve Disentanglement in Text-to-Image Diffusion Models
C. Wu
Fernando de la Torre
DiffM
224
3
0
21 Feb 2024
A Note on Bias to Complete
A Note on Bias to Complete
Jia Xu
Mona Diab
280
2
0
18 Feb 2024
Text-to-Image Cross-Modal Generation: A Systematic Review
Text-to-Image Cross-Modal Generation: A Systematic Review
Maciej Żelaszczyk
Jacek Mańdziuk
320
6
0
21 Jan 2024
Imagine the Unseen World: A Benchmark for Systematic Generalization in
  Visual World Models
Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World ModelsNeural Information Processing Systems (NeurIPS), 2023
Yeongbin Kim
Gautam Singh
Junyeong Park
Çağlar Gülçehre
Sungjin Ahn
OCLVLM
256
7
0
15 Nov 2023
Finetuning Text-to-Image Diffusion Models for Fairness
Finetuning Text-to-Image Diffusion Models for FairnessInternational Conference on Learning Representations (ICLR), 2023
Xudong Shen
Chao Du
Tianyu Pang
Min Lin
Yongkang Wong
Mohan S. Kankanhalli
254
85
0
11 Nov 2023
TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic
  Scene Understanding
TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene UnderstandingIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Shuo Wang
Jing Li
Zibo Zhao
Dongze Lian
Binbin Huang
Xiaomei Wang
Zhengxin Li
Shenghua Gao
216
13
0
06 Nov 2023
Adapting Segment Anything Model (SAM) through Prompt-based Learning for
  Enhanced Protein Identification in Cryo-EM Micrographs
Adapting Segment Anything Model (SAM) through Prompt-based Learning for Enhanced Protein Identification in Cryo-EM Micrographs
Fei He
Zhiyuan Yang
Mingyue Gao
Biplab Poudel
Newgin Sam Ebin Sam Dhas
Rajan Gyawali
Ashwin Dhakal
Jianlin Cheng
Dong Xu
222
6
0
04 Nov 2023
Customize StyleGAN with One Hand Sketch
Customize StyleGAN with One Hand Sketch
Shaocong Zhang
VLM
280
0
0
29 Oct 2023
Toward responsible face datasets: modeling the distribution of a
  disentangled latent space for sampling face images from demographic groups
Toward responsible face datasets: modeling the distribution of a disentangled latent space for sampling face images from demographic groups
Parsa Rahimi
Christophe Ecabert
S´ebastien Marcel
CVBM
354
7
0
15 Sep 2023
ITI-GEN: Inclusive Text-to-Image Generation
ITI-GEN: Inclusive Text-to-Image GenerationIEEE International Conference on Computer Vision (ICCV), 2023
Cheng Zhang
Xuanbai Chen
Siqi Chai
Chen Henry Wu
Dmitry Lagun
Thabo Beeler
Fernando de la Torre
VLM
251
79
0
11 Sep 2023
Inspecting the Geographical Representativeness of Images from
  Text-to-Image Models
Inspecting the Geographical Representativeness of Images from Text-to-Image ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Aparna Basu
R. Venkatesh Babu
Danish Pruthi
DiffM
310
48
0
18 May 2023
Visual Tuning
Visual TuningACM Computing Surveys (ACM Comput. Surv.), 2023
Bruce X. B. Yu
Jianlong Chang
Haixin Wang
Lin Liu
Shijie Wang
...
Lingxi Xie
Haojie Li
Zhouchen Lin
Qi Tian
Chang Wen Chen
VLM
438
60
0
10 May 2023
TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional
  Generation
TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional GenerationInternational Conference on Machine Learning (ICML), 2023
Zhaoyan Liu
Noël Vouitsis
S. Gorti
Jimmy Ba
Gabriel Loaiza-Ganem
ViT
292
2
0
26 Apr 2023
PATMAT: Person Aware Tuning of Mask-Aware Transformer for Face
  Inpainting
PATMAT: Person Aware Tuning of Mask-Aware Transformer for Face InpaintingIEEE International Conference on Computer Vision (ICCV), 2023
Saman Motamed
Jianjin Xu
Chenhuan Wu
Fernando de la Torre
DiffM
279
4
0
12 Apr 2023
Controllable Text Generation via Probability Density Estimation in the
  Latent Space
Controllable Text Generation via Probability Density Estimation in the Latent SpaceAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Yuxuan Gu
Xiaocheng Feng
Sicheng Ma
Lingyuan Zhang
Heng Gong
Weihong Zhong
Bing Qin
228
28
0
16 Dec 2022
Taming Normalizing Flows
Taming Normalizing FlowsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Shimon Malnick
S. Avidan
Ohad Fried
TPMDiffM
364
1
0
29 Nov 2022
Unifying Diffusion Models' Latent Space, with Applications to
  CycleDiffusion and Guidance
Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance
Chen Henry Wu
Fernando de la Torre
DiffM
376
79
0
11 Oct 2022
A Distributional Lens for Multi-Aspect Controllable Text Generation
A Distributional Lens for Multi-Aspect Controllable Text GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yuxuan Gu
Xiaocheng Feng
Sicheng Ma
Lingyuan Zhang
Heng Gong
Bing Qin
344
46
0
06 Oct 2022
1
Page 1 of 1