2206.10789
Cited By
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
22 June 2022
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
Zirui Wang
Vijay Vasudevan
Alexander Ku
Yinfei Yang
Burcu Karagol Ayan
Ben Hutchinson
Wei Han
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
Papers citing "Scaling Autoregressive Models for Content-Rich Text-to-Image Generation"
50 / 1,010 papers shown
AIGCIQA2023: A Large-scale Image Quality Assessment Database for AI Generated Images: from the Perspectives of Quality, Authenticity and Correspondence
CAAI International Conference on Artificial Intelligence (ICCAI), 2023
Jiarui Wang
Huiyu Duan
Jing Liu
S. Chen
Xiongkuo Min
Guangtao Zhai
EGVM
337
96
0
01 Jul 2023
DomainStudio: Fine-Tuning Diffusion Models for Domain-Driven Image Generation using Limited Data
International Journal of Computer Vision (IJCV), 2023
Jin Zhu
Huimin Ma
Jiansheng Chen
Jian Yuan
DiffM
469
19
0
25 Jun 2023
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes
International Conference on Learning Representations (ICLR), 2023
Rishabh Agarwal
Nino Vieillard
Yongchao Zhou
Piotr Stańczyk
Sabela Ramos
Matthieu Geist
Olivier Bachem
316
183
0
23 Jun 2023
AudioPaLM: A Large Language Model That Can Speak and Listen
Paul Kishan Rubenstein
Chulayuth Asawaroengchai
D. Nguyen
Ankur Bapna
Zalan Borsos
...
Neil Zeghidour
Yu Zhang
Zhishuai Zhang
Lukás Zilka
Christian Frank
LM&MA
AuLLM
VLM
257
396
0
22 Jun 2023
Align, Adapt and Inject: Sound-guided Unified Image Generation
Yue Yang
Kaipeng Zhang
Yuying Ge
Wenqi Shao
Zeyue Xue
Yu Qiao
Ping Luo
DiffM
309
8
0
20 Jun 2023
MotionGPT: Finetuned LLMs Are General-Purpose Motion Generators
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yaqi Zhang
Di Huang
B. Liu
Weizhen He
Yan Lu
Lu Chen
Mengwei He
Qi Chu
Nenghai Yu
Wanli Ouyang
355
149
0
19 Jun 2023
UniG3D: A Unified 3D Object Generation Dataset
Qinghong Sun
Yangguang Li
Zexia Liu
Xiaoshui Huang
Fenggang Liu
Xihui Liu
Wanli Ouyang
Jing Shao
208
6
0
19 Jun 2023
DreamHuman: Animatable 3D Avatars from Text
Neural Information Processing Systems (NeurIPS), 2023
Nikos Kolotouros
Thiemo Alldieck
Andrei Zanfir
Eduard Gabriel Bazavan
Mihai Fieraru
C. Sminchisescu
240
117
0
15 Jun 2023
Training Multimedia Event Extraction With Generated Images and Captions
ACM Multimedia (ACM MM), 2023
Zilin Du
Yunxin Li
Xu Guo
Yidan Sun
Boyang Albert Li
DiffM
275
15
0
15 Jun 2023
Toward Grounded Commonsense Reasoning
IEEE International Conference on Robotics and Automation (ICRA), 2023
Minae Kwon
Hengyuan Hu
Vivek Myers
Siddharth Karamcheti
Anca Dragan
Dorsa Sadigh
LM&Ro
ReLM
LRM
272
17
0
14 Jun 2023
GBSD: Generative Bokeh with Stage Diffusion
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Jieren Deng
Xiaoxia Zhou
Hao Tian
Zhihong Pan
Derek Aguiar
DiffM
253
1
0
14 Jun 2023
Better Generalization with Semantic IDs: A Case Study in Ranking for Recommendations
ACM Conference on Recommender Systems (RecSys), 2023
Anima Singh
Trung Vu
Nikhil Mehta
Raghunandan H. Keshavan
M. Sathiamoorthy
...
Lukasz Heldt
Li Wei
Devansh Tandon
Ed H. Chi
Xinyang Yi
245
56
0
13 Jun 2023
Dynamically Masked Discriminator for Generative Adversarial Networks
Wentian Zhang
Haozhe Liu
Bing Li
Jinheng Xie
Yawen Huang
Yuexiang Li
Yefeng Zheng
Guohao Li
TTA
322
2
0
13 Jun 2023
Paste, Inpaint and Harmonize via Denoising: Subject-Driven Image Editing with Pre-Trained Diffusion Model
Xinyu Zhang
Jiaxian Guo
Paul D. Yoo
Yutaka Matsuo
Yusuke Iwasawa
DiffM
246
26
0
13 Jun 2023
Controlling Text-to-Image Diffusion by Orthogonal Finetuning
Neural Information Processing Systems (NeurIPS), 2023
Zeju Qiu
Wei-yu Liu
Haiwen Feng
Yuxuan Xue
Yao Feng
Zhen Liu
Dan Zhang
Adrian Weller
Bernhard Schölkopf
DiffM
388
217
0
12 Jun 2023
Fill-Up: Balancing Long-Tailed Data with Generative Models
Joonghyuk Shin
Minguk Kang
Jaesik Park
301
41
0
12 Jun 2023
Face0: Instantaneously Conditioning a Text-to-Image Model on a Face
ACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH Asia), 2023
Dani Valevski
Danny Lumen
Yossi Matias
Yaniv Leviathan
DiffM
VLM
181
97
0
11 Jun 2023
High-Fidelity Audio Compression with Improved RVQGAN
Neural Information Processing Systems (NeurIPS), 2023
Rithesh Kumar
Prem Seetharaman
Alejandro Luebs
I. Kumar
Kundan Kumar
294
561
0
11 Jun 2023
Image Vectorization: a Review
Journal of Mathematical Sciences (J. Math. Sci.), 2023
Maria Dziuba
Ivan Jarsky
Valeria Efimova
Andrey Filchenkov
3DV
DiffM
174
16
0
10 Jun 2023
Grounded Text-to-Image Synthesis with Attention Refocusing
Computer Vision and Pattern Recognition (CVPR), 2023
Quynh Phung
Songwei Ge
Jia-Bin Huang
DiffM
390
157
0
08 Jun 2023
Improving Tuning-Free Real Image Editing with Proximal Guidance
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Ligong Han
Song Wen
Qi Chen
Zhixing Zhang
Kunpeng Song
...
Qilong Zhangli
Jindong Jiang
Zhaoyang Xia
Akash Srivastava
Dimitris N. Metaxas
DiffM
330
83
0
08 Jun 2023
AGIQA-3K: An Open Database for AI-Generated Image Quality Assessment
Chunyi Li
Zicheng Zhang
Haoning Wu
Wei Sun
Xiongkuo Min
Xiaohong Liu
Guangtao Zhai
Weisi Lin
EGVM
258
193
0
07 Jun 2023
A survey of Generative AI Applications
Journal of Computer Science (JCS), 2023
Roberto Gozalo-Brizuela
Eduardo C. Garrido-Merchán
3DV
MedIm
382
135
0
05 Jun 2023
Efficient Text-Guided 3D-Aware Portrait Generation with Score Distillation Sampling on Distribution
Yiji Cheng
Fei Yin
Xiaoke Huang
Xintong Yu
Jiaxiang Liu
Shi Feng
Yujiu Yang
Yansong Tang
DiffM
158
5
0
03 Jun 2023
Probabilistic Adaptation of Text-to-Video Models
Mengjiao Yang
Yilun Du
Bo Dai
Dale Schuurmans
J. Tenenbaum
Pieter Abbeel
VGen
DiffM
269
31
0
02 Jun 2023
KL-Divergence Guided Temperature Sampling
Chung-Ching Chang
David Reitter
Renat Aksitov
Yun-hsuan Sung
HILM
192
10
0
02 Jun 2023
Insights into Closed-form IPM-GAN Discriminator Guidance for Diffusion Modeling
Aadithya Srikanth
Siddarth Asokan
Nishanth Shetty
C. Seelamantula
308
0
0
02 Jun 2023
Diffusion Self-Guidance for Controllable Image Generation
Neural Information Processing Systems (NeurIPS), 2023
Dave Epstein
Allan Jabri
Ben Poole
Alexei A. Efros
Aleksander Holynski
379
345
0
01 Jun 2023
StyleDrop: Text-to-Image Generation in Any Style
Kihyuk Sohn
Nataniel Ruiz
Kimin Lee
Daniel Castro Chin
Irina Blok
...
Yuanzhen Li
Yuan Hao
Irfan Essa
Michael Rubinstein
Dilip Krishnan
242
205
0
01 Jun 2023
StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners
Neural Information Processing Systems (NeurIPS), 2023
Yonglong Tian
Lijie Fan
Phillip Isola
Huiwen Chang
Dilip Krishnan
VLM
DiffM
436
205
0
01 Jun 2023
SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds
Neural Information Processing Systems (NeurIPS), 2023
Yanyu Li
Huan Wang
Qing Jin
Ju Hu
Pavlo Chemerys
Yun Fu
Yanzhi Wang
Sergey Tulyakov
Jian Ren
VLM
343
234
0
01 Jun 2023
ViCo: Plug-and-play Visual Condition for Personalized Text-to-image Generation
Shaozhe Hao
Kai Han
Shihao Zhao
Kwan-Yee K. Wong
220
17
0
01 Jun 2023
The Hidden Language of Diffusion Models
International Conference on Learning Representations (ICLR), 2023
Hila Chefer
Oran Lang
Mor Geva
Volodymyr Polosukhin
Assaf Shocher
Michal Irani
Inbar Mosseri
Lior Wolf
DiffM
345
33
0
01 Jun 2023
T2IAT: Measuring Valence and Stereotypical Biases in Text-to-Image Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Jialu Wang
Xinyue Liu
Zonglin Di
Yongxu Liu
Xin Eric Wang
202
47
0
01 Jun 2023
Learning Disentangled Prompts for Compositional Image Synthesis
Kihyuk Sohn
Albert Eaton Shaw
Yuan Hao
Han Zhang
Luisa F. Polanía
Huiwen Chang
Lu Jiang
Irfan Essa
VLM
198
8
0
01 Jun 2023
Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models
Pablo Pernias
Dominic Rampas
Mats L. Richter
Christopher Pal
Marc Aubreville
DiffM
VLM
232
49
0
01 Jun 2023
Cones 2: Customizable Image Synthesis with Multiple Subjects
Neural Information Processing Systems (NeurIPS), 2023
Zhiheng Liu
Yifei Zhang
Yujun Shen
Kecheng Zheng
Kai Zhu
Ruili Feng
Yu Liu
Deli Zhao
Jingren Zhou
Yang Cao
DiffM
245
109
0
30 May 2023
SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-driven Video Editing
Nazmul Karim
Umar Khalid
M. Joneidi
Chen Chen
Nazanin Rahnavard
DiffM
VGen
154
5
0
30 May 2023
Controllable Text-to-Image Generation with GPT-4
Tianjun Zhang
Yi Zhang
Vibhav Vineet
Neel Joshi
Xin Eric Wang
DiffM
313
61
0
29 May 2023
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
Neural Information Processing Systems (NeurIPS), 2023
Yuchao Gu
Xintao Wang
Jay Zhangjie Wu
Yujun Shi
Yunpeng Chen
...
Shuning Chang
Wei Wu
Yixiao Ge
Ying Shan
Mike Zheng Shou
DiffM
359
253
0
29 May 2023
Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Fu Lee Wang
Wenshuo Chen
Guanglu Song
Han-Jia Ye
Yu Liu
Jiaming Song
VGen
DiffM
252
124
0
29 May 2023
COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models
International Conference on Machine Learning (ICML), 2023
Jinqi Xiao
Miao Yin
Yu Gong
Xiao Zang
Jian Ren
Bo Yuan
VLM
ViT
328
16
0
26 May 2023
Generating Images with Multimodal Language Models
Neural Information Processing Systems (NeurIPS), 2023
Jing Yu Koh
Daniel Fried
Ruslan Salakhutdinov
MLLM
360
328
0
26 May 2023
High-Fidelity Image Compression with Score-based Generative Models
Emiel Hoogeboom
E. Agustsson
Fabian Mentzer
Luca Versari
G. Toderici
Lucas Theis
DiffM
375
56
0
26 May 2023
Improved Visual Story Generation with Adaptive Context Modeling
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zhangyin Feng
Yuchen Ren
Xinmiao Yu
Xiaocheng Feng
Duyu Tang
Shuming Shi
Bing Qin
DiffM
234
23
0
26 May 2023
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Neural Information Processing Systems (NeurIPS), 2023
Shihao Zhao
Dongdong Chen
Yen-Chun Chen
Jianmin Bao
Shaozhe Hao
Lu Yuan
Kwan-Yee K. Wong
424
389
0
25 May 2023
Image as First-Order Norm+Linear Autoregression: Unveiling Mathematical Invariance
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Lu Yuan
Zicheng Liu
Youzuo Lin
231
2
0
25 May 2023
Break-A-Scene: Extracting Multiple Concepts from a Single Image
ACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH Asia), 2023
Omri Avrahami
Kfir Aberman
Ohad Fried
Daniel Cohen-Or
Dani Lischinski
VLM
DiffM
253
241
0
25 May 2023
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2023
Xingqian Xu
Jiayi Guo
Zinan Lin
Gao Huang
Irfan Essa
Humphrey Shi
VLM
DiffM
284
80
0
25 May 2023
GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes
European Conference on Computer Vision (ECCV), 2023
Ibrahim Ethem Hamamci
Sezgin Er
Anjany Sekuboyina
Enis Simsar
A. Tezcan
...
Hadrien Reynaud
Sarthak Pati
Christian Bluethgen
M. K. Özdemir
Bjoern Menze
DiffM
MedIm
384
52
0
25 May 2023