Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1511.02793
Cited By
Generating Images from Captions with Attention
9 November 2015
Elman Mansimov
Emilio Parisotto
Jimmy Lei Ba
Ruslan Salakhutdinov
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Generating Images from Captions with Attention"
50 / 243 papers shown
Title
JourneyDB: A Benchmark for Generative Image Understanding
Keqiang Sun
Junting Pan
Yuying Ge
Hao Li
Haodong Duan
...
Yi Wang
Jifeng Dai
Yu Qiao
Limin Wang
Hongsheng Li
31
101
0
03 Jul 2023
AI could create a perfect storm of climate misinformation
V. Galaz
Hannah Metzler
Stefan Daume
A. Olsson
B. Lindström
A. Marklund
21
5
0
22 Jun 2023
The Big Data Myth: Using Diffusion Models for Dataset Generation to Train Deep Detection Models
Roy Voetman
Maya Aghaei
K. Dijkstra
DiffM
19
11
0
16 Jun 2023
GBSD: Generative Bokeh with Stage Diffusion
Jieren Deng
Xiaoxia Zhou
Hao Tian
Zhihong Pan
Derek Aguiar
DiffM
19
1
0
14 Jun 2023
Generative Watermarking Against Unauthorized Subject-Driven Image Synthesis
Y. Ma
Zhengyu Zhao
Xinlei He
Zheng Li
Michael Backes
Yang Zhang
AAML
WIGM
9
21
0
13 Jun 2023
Multilingual Conceptual Coverage in Text-to-Image Models
Michael Stephen Saxon
William Yang Wang
EGVM
24
8
0
02 Jun 2023
Differential Diffusion: Giving Each Pixel Its Strength
E. Levin
Ohad Fried
DiffM
37
20
0
01 Jun 2023
Cones 2: Customizable Image Synthesis with Multiple Subjects
Zhiheng Liu
Yifei Zhang
Yujun Shen
Kecheng Zheng
Kai Zhu
Ruili Feng
Yu Liu
Deli Zhao
Jingren Zhou
Yang Cao
DiffM
50
80
0
30 May 2023
Break-A-Scene: Extracting Multiple Concepts from a Single Image
Omri Avrahami
Kfir Aberman
Ohad Fried
Daniel Cohen-Or
Dani Lischinski
VLM
DiffM
30
165
0
25 May 2023
Visual Programming for Text-to-Image Generation and Evaluation
Jaemin Cho
Abhaysinh Zala
Mohit Bansal
MLLM
21
50
0
24 May 2023
If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection
Shyamgopal Karthik
Karsten Roth
Massimiliano Mancini
Zeynep Akata
24
20
0
22 May 2023
SneakyPrompt: Jailbreaking Text-to-image Generative Models
Yuchen Yang
Bo Hui
Haolin Yuan
Neil Gong
Yinzhi Cao
EGVM
31
74
0
20 May 2023
Inspecting the Geographical Representativeness of Images from Text-to-Image Models
Aparna Basu
R. Venkatesh Babu
Danish Pruthi
DiffM
17
39
0
18 May 2023
Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
Wenjing Wang
Huan Yang
Zixi Tuo
Huiguo He
Junchen Zhu
Jianlong Fu
Jiaying Liu
DiffM
VGen
40
114
0
18 May 2023
Image Captioners Sometimes Tell More Than Images They See
Honori Udo
Takafumi Koshinaka
VLM
17
3
0
04 May 2023
Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement
N. Gkanatsios
Ayush Jain
Zhou Xian
Yunchu Zhang
C. Atkeson
Katerina Fragkiadaki
LM&Ro
98
31
0
27 Apr 2023
Promptify: Text-to-Image Generation through Interactive Prompt Exploration with Large Language Models
Stephen Brade
Bryan Wang
Maurício Sousa
Sageev Oore
Tovi Grossman
MLLM
DiffM
16
82
0
18 Apr 2023
Expressive Text-to-Image Generation with Rich Text
Songwei Ge
Taesung Park
Jun-Yan Zhu
Jia-Bin Huang
DiffM
77
79
0
13 Apr 2023
Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation
Jaemin Cho
Linjie Li
Zhengyuan Yang
Zhe Gan
Lijuan Wang
Mohit Bansal
EGVM
11
5
0
13 Apr 2023
One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era
Chaoning Zhang
Chenshuang Zhang
Chenghao Li
Yu Qiao
Sheng Zheng
...
Sung-Ho Bae
Lik-Hang Lee
Pan Hui
In So Kweon
Choong Seon Hong
LM&MA
AI4MH
LRM
ELM
31
130
0
04 Apr 2023
DreamFace: Progressive Generation of Animatable 3D Faces under Text Guidance
Longwen Zhang
Qiwei Qiu
Hongyang Lin
Qixuan Zhang
Cheng Shi
Wei Yang
Ye Shi
Sibei Yang
Lan Xu
Jingyi Yu
3DH
25
74
0
01 Apr 2023
Indonesian Text-to-Image Synthesis with Sentence-BERT and FastGAN
Made Raharja Surya Mahadi
N. P. Utama
30
2
0
25 Mar 2023
Human Preference Score: Better Aligning Text-to-Image Models with Human Preference
Xiaoshi Wu
Keqiang Sun
Feng Zhu
Rui Zhao
Hongsheng Li
6
131
0
25 Mar 2023
Ablating Concepts in Text-to-Image Diffusion Models
Nupur Kumari
Bin Zhang
Sheng-Yu Wang
Eli Shechtman
Richard Y. Zhang
Jun-Yan Zhu
VLM
21
182
0
23 Mar 2023
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
Levon Khachatryan
A. Movsisyan
Vahram Tadevosyan
Roberto Henschel
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
VGen
27
541
0
23 Mar 2023
A Word is Worth a Thousand Pictures: Prompts as AI Design Material
Chinmay Kulkarni
Stefania Druga
Minsuk Chang
Alexander J. Fiannaca
Carrie J. Cai
Michael Terry
3DV
15
30
0
22 Mar 2023
The Prompt Artists
Minsuk Chang
Stefania Druga
Alexander J. Fiannaca
P. Vergani
Chinmay Kulkarni
Carrie J. Cai
Michael Terry
21
59
0
22 Mar 2023
A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
Chaoning Zhang
Chenshuang Zhang
Sheng Zheng
Yu Qiao
Chenghao Li
...
Lik-Hang Lee
Yang Yang
Heng Tao Shen
In So Kweon
Choong Seon Hong
75
159
0
21 Mar 2023
Text-to-image Diffusion Models in Generative AI: A Survey
Chenshuang Zhang
Chaoning Zhang
Mengchun Zhang
In So Kweon
VLM
47
264
0
14 Mar 2023
Generation-Guided Multi-Level Unified Network for Video Grounding
Xingyi Cheng
Xiangyu Wu
Dong Shen
Hezheng Lin
Fan Yang
14
0
0
14 Mar 2023
Scaling up GANs for Text-to-Image Synthesis
Minguk Kang
Jun-Yan Zhu
Richard Y. Zhang
Jaesik Park
Eli Shechtman
Sylvain Paris
Taesung Park
35
439
0
09 Mar 2023
A Prompt Log Analysis of Text-to-Image Generation Systems
Yutong Xie
Zhaoying Pan
Jing Ma
Jie Luo
Qiaozhu Mei
DiffM
117
40
0
08 Mar 2023
TextIR: A Simple Framework for Text-based Editable Image Restoration
Yun-Hao Bai
Cairong Wang
Shuzhao Xie
Chao Dong
Chun Yuan
Zhi Wang
DiffM
24
15
0
28 Feb 2023
"An Adapt-or-Die Type of Situation": Perception, Adoption, and Use of Text-To-Image-Generation AI by Game Industry Professionals
Veera Vimpari
Annakaisa Kultima
Perttu Hämäläinen
Christian Guckelsberger
DiffM
13
45
0
24 Feb 2023
Region-Aware Diffusion for Zero-shot Text-driven Image Editing
Nisha Huang
Fan Tang
Weiming Dong
Tong-Yee Lee
Changsheng Xu
DiffM
32
23
0
23 Feb 2023
Multi-modal Machine Learning in Engineering Design: A Review and Future Directions
Binyang Song
Ruilin Zhou
Faez Ahmed
AI4CE
35
40
0
14 Feb 2023
Glaze: Protecting Artists from Style Mimicry by Text-to-Image Models
Shawn Shan
Jenna Cryan
Emily Wenger
Haitao Zheng
Rana Hanocka
Ben Y. Zhao
WIGM
12
176
0
08 Feb 2023
Multi-Lingual DALL-E Storytime
Noga Mudrik
Adam S. Charles
9
0
0
22 Dec 2022
Text-Guided Mask-free Local Image Retouching
Zerun Liu
Fan Zhang
Jingxuan He
Jin Wang
Zhangye Wang
Lechao Cheng
DiffM
22
5
0
15 Dec 2022
Multi-Concept Customization of Text-to-Image Diffusion
Nupur Kumari
Bin Zhang
Richard Y. Zhang
Eli Shechtman
Jun-Yan Zhu
29
822
0
08 Dec 2022
Testing Human Ability To Detect Deepfake Images of Human Faces
Sergi D. Bray
Shane D. Johnson
Bennett Kleinberg
CVBM
17
44
0
07 Dec 2022
LAION-5B: An open large-scale dataset for training next generation image-text models
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLM
MLLM
CLIP
43
3,247
0
16 Oct 2022
Implementing and Experimenting with Diffusion Models for Text-to-Image Generation
Robin Zbinden
25
3
0
22 Sep 2022
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Wanshu Fan
Yen-Chun Chen
Dongdong Chen
Yu Cheng
Lu Yuan
Yu-Chiang Frank Wang
DiffM
18
90
0
29 Aug 2022
Vision-Language Matching for Text-to-Image Synthesis via Generative Adversarial Networks
Qingrong Cheng
Keyu Wen
X. Gu
VLM
EGVM
24
16
0
20 Aug 2022
Blended Latent Diffusion
Omri Avrahami
Ohad Fried
Dani Lischinski
DiffM
50
373
0
06 Jun 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
55
5,768
0
23 May 2022
AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
Fangzhou Hong
Mingyuan Zhang
Liang Pan
Zhongang Cai
Lei Yang
Ziwei Liu
CLIP
94
78
0
17 May 2022
Synthetic Data -- what, why and how?
James Jordon
Lukasz Szpruch
F. Houssiau
M. Bottarelli
Giovanni Cherubin
Carsten Maple
Samuel N. Cohen
Adrian Weller
32
109
0
06 May 2022
Attention Mechanism in Neural Networks: Where it Comes and Where it Goes
Derya Soydaner
3DV
36
149
0
27 Apr 2022
Previous
1
2
3
4
5
Next