Imagen 3
Imagen-Team-Google
:
Kelvin Chan
Yichang Chen
Christos Kaplanis
Soňa Mokrá
Rui Qian
Ali Razavi
Srivatsan Srinivasan
Su Wang
Hao Xiong
Keyang Xu
Frankie Garcia
Yena Han
Jamie Hayes
Ed Hirst
Xuhui Jia
Christos Kaplanis
Yukun Ma
Tom Murray
Michela Paganini
Tom Le Paine
Ali Razavi
Kaushik Shivakumar
Shuai Tang
Qifei Wang
Yuxiao Wang
Han Zhang
Jiageng Zhang
Shengqi Zhu
Zhenkai Zhu
Anca Dragan
Yeqing Li
Amar Subramanya
Main:28 Pages
20 Figures
Bibliography:3 Pages
5 Tables
Appendix:4 Pages
Abstract
We introduce Imagen 3, a latent diffusion model that generates high quality images from text prompts. We describe our quality and responsibility evaluations. Imagen 3 is preferred over other state-of-the-art (SOTA) models at the time of evaluation. In addition, we discuss issues around safety and representation, as well as methods we used to minimize the potential harm of our models.
View on arXivComments on this paper
