Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2408.11706
Cited By
v1
v2 (latest)
FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting
21 August 2024
Liyao Jiang
Negar Hassanpour
Mohammad Salameh
Mohan Sai Singamsetti
Fengyu Sun
Wei Lu
Di Niu
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (7 upvotes)
Papers citing
"FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting"
34 / 34 papers shown
Title
SPF-Portrait: Towards Pure Text-to-Portrait Customization with Semantic Pollution-Free Fine-Tuning
Xiaole Xian
Zhichao Liao
Qingyu Li
Wenyu Qin
Pengfei Wan
Weicheng Xie
Long Zeng
Linlin Shen
Pingfa Feng
DiffM
397
0
0
01 Apr 2025
Improving Text-to-Image Consistency via Automatic Prompt Optimization
Oscar Manas
Pietro Astolfi
Melissa Hall
Candace Ross
Jack Urbanek
Adina Williams
Aishwarya Agrawal
Adriana Romero Soriano
M. Drozdzal
236
60
0
26 Mar 2024
Divide & Bind Your Attention for Improved Generative Semantic Nursing
British Machine Vision Conference (BMVC), 2023
Yumeng Li
Margret Keuper
Dan Zhang
Anna Khoreva
DiffM
253
74
0
20 Jul 2023
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
International Conference on Learning Representations (ICLR), 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
759
3,637
0
04 Jul 2023
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
Xiaoshi Wu
Yiming Hao
Keqiang Sun
Yixiong Chen
Feng Zhu
Rui Zhao
Jiaming Song
236
536
0
15 Jun 2023
Linguistic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment
Neural Information Processing Systems (NeurIPS), 2023
Royi Rassin
Eran Hirsch
Daniel Glickman
Shauli Ravfogel
Yoav Goldberg
Gal Chechik
DiffM
466
146
0
15 Jun 2023
Transferring Visual Attributes from Natural Language to Verified Image Generation
Rodrigo Valerio
João Bordalo
Michal Yarom
Yonattan Bitton
Idan Szpektor
João Magalhães
153
5
0
24 May 2023
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
IEEE International Conference on Computer Vision (ICCV), 2023
Yushi Hu
Benlin Liu
Jungo Kasai
Yizhong Wang
Mari Ostendorf
Ranjay Krishna
Noah A. Smith
EGVM
240
331
0
21 Mar 2023
Scaling up GANs for Text-to-Image Synthesis
Computer Vision and Pattern Recognition (CVPR), 2023
Minguk Kang
Jun-Yan Zhu
Richard Y. Zhang
Jaesik Park
Eli Shechtman
Sylvain Paris
Taesung Park
243
580
0
09 Mar 2023
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models
ACM Transactions on Graphics (TOG), 2023
Hila Chefer
Yuval Alaluf
Yael Vinker
Lior Wolf
Daniel Cohen-Or
DiffM
491
650
0
31 Jan 2023
Muse: Text-To-Image Generation via Masked Generative Transformers
International Conference on Machine Learning (ICML), 2023
Huiwen Chang
Han Zhang
Jarred Barber
AJ Maschinot
José Lezama
...
Kevin Patrick Murphy
William T. Freeman
Michael Rubinstein
Yuanzhen Li
Dilip Krishnan
DiffM
407
684
0
02 Jan 2023
Optimizing Prompts for Text-to-Image Generation
Neural Information Processing Systems (NeurIPS), 2022
Y. Hao
Zewen Chi
Li Dong
Furu Wei
267
215
0
19 Dec 2022
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
International Conference on Learning Representations (ICLR), 2022
Weixi Feng
Xuehai He
Tsu-Jui Fu
Varun Jampani
Arjun Reddy Akula
P. Narayana
Sugato Basu
Xinze Wang
William Yang Wang
CoGe
505
377
0
09 Dec 2022
Investigating Prompt Engineering in Diffusion Models
Sam Witteveen
Martin Andrews
123
78
0
21 Nov 2022
DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Zijie J. Wang
Evan Montoya
David Munechika
Haoyang Yang
Benjamin Hoover
Duen Horng Chau
429
380
0
26 Oct 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
International Conference on Learning Representations (ICLR), 2022
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
667
2,267
0
02 Aug 2022
Classifier-Free Diffusion Guidance
Jonathan Ho
Tim Salimans
FaML
422
5,152
0
26 Jul 2022
Exploring CLIP for Assessing the Look and Feel of Images
AAAI Conference on Artificial Intelligence (AAAI), 2022
Jianyi Wang
Kelvin C. K. Chan
Chen Change Loy
VLM
352
905
0
25 Jul 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Neural Information Processing Systems (NeurIPS), 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
1.1K
7,348
0
23 May 2022
A very preliminary analysis of DALL-E 2
G. Marcus
E. Davis
S. Aaronson
228
159
0
25 Apr 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
963
8,148
0
13 Apr 2022
No Token Left Behind: Explainability-Aided Image Classification and Generation
European Conference on Computer Vision (ECCV), 2022
Roni Paiss
Hila Chefer
Lior Wolf
VLM
186
34
0
11 Apr 2022
Pseudo Numerical Methods for Diffusion Models on Manifolds
International Conference on Learning Representations (ICLR), 2022
Luping Liu
Yi Ren
Zhijie Lin
Zhou Zhao
DiffM
441
781
0
20 Feb 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
International Conference on Machine Learning (ICML), 2022
Junnan Li
Dongxu Li
Caiming Xiong
Guosheng Lin
MLLM
BDL
VLM
CLIP
1.2K
5,585
0
28 Jan 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2021
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
DiffM
1.3K
20,430
0
20 Dec 2021
Design Guidelines for Prompt Engineering Text-to-Image Generative Models
Vivian Liu
Lydia B. Chilton
238
604
0
14 Sep 2021
Diffusion Models Beat GANs on Image Synthesis
Neural Information Processing Systems (NeurIPS), 2021
Prafulla Dhariwal
Alex Nichol
1.4K
10,048
0
11 May 2021
Learning Transferable Visual Models From Natural Language Supervision
International Conference on Machine Learning (ICML), 2021
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
2.0K
39,913
0
26 Feb 2021
Denoising Diffusion Implicit Models
International Conference on Learning Representations (ICLR), 2020
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
1.3K
9,913
0
06 Oct 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
4.2K
24,966
0
19 Jun 2020
Decision-Making with Auto-Encoding Variational Bayes
Neural Information Processing Systems (NeurIPS), 2020
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
1.4K
19,430
0
17 Feb 2020
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
2.9K
87,959
0
18 May 2015
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Jascha Narain Sohl-Dickstein
Eric A. Weiss
Niru Maheswaranathan
Surya Ganguli
SyDa
DiffM
1.3K
8,669
0
12 Mar 2015
Microsoft COCO: Common Objects in Context
European Conference on Computer Vision (ECCV), 2014
Nayeon Lee
Michael Maire
Serge J. Belongie
Lubomir Bourdev
Ross B. Girshick
James Hays
Pietro Perona
Deva Ramanan
C. L. Zitnick
Piotr Dollár
ObjD
11.6K
48,835
0
01 May 2014
1