Scaling Autoregressive Models for Content-Rich Text-to-Image Generation

22 June 2022

Papers citing "Scaling Autoregressive Models for Content-Rich Text-to-Image Generation"

50 / 1,010 papers shown

A Neural Space-Time Representation for Text-to-Image Personalization

ACM Transactions on Graphics (TOG), 2023

Daniel Cohen-Or

330

128

24 May 2023

Visual Programming for Text-to-Image Generation and Evaluation

388

24 May 2023

I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Marianna Apidianaki

Smaranda Muresan

DiffM

204

24 May 2023

Vision + Language Applications: A Survey

Yutong Zhou

N. Shimada

VLM

277

24 May 2023

Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence

Neural Information Processing Systems (NeurIPS), 2023

Aleksander Holynski

421

197

23 May 2023

Training Transitive and Commutative Multimodal Transformers with LoReTTa

Neural Information Processing Systems (NeurIPS), 2023

321

23 May 2023

Training Priors Predict Text-To-Image Model Performance

Charles Lovering

Ellie Pavlick

CoGe

214

23 May 2023

Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach

259

23 May 2023

If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection

233

22 May 2023

ControlVideo: Training-free Controllable Text-to-Video Generation

International Conference on Learning Representations (ICLR), 2023

284

330

22 May 2023

Textually Pretrained Speech Language Models

Neural Information Processing Systems (NeurIPS), 2023

...

Yossi Adi

408

22 May 2023

The Waymo Open Sim Agents Challenge

Neural Information Processing Systems (NeurIPS), 2023

...

453

19 May 2023

AI's Regimes of Representation: A Community-centered Study of Text-to-Image Models in South Asia

Conference on Fairness, Accountability and Transparency (FAccT), 2023

266

19 May 2023

Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization

Computer Vision and Pattern Recognition (CVPR), 2023

Mengqi Huang

Zhendong Mao

Zhuowei Chen

Yongdong Zhang

272

19 May 2023

Efficient Cross-Lingual Transfer for Chinese Stable Diffusion with Images as Pivots

Jinyi Hu

Xu Han

Xiaoyuan Yi

Yutong Chen

Wenhao Li

Zhiyuan Liu

Maosong Sun

DiffM

19 May 2023

A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation

Artificial Intelligence Review (AIR), 2023

...

352

146

19 May 2023

Inspecting the Geographical Representativeness of Images from Text-to-Image Models

IEEE International Conference on Computer Vision (ICCV), 2023

308

18 May 2023

X-IQE: eXplainable Image Quality Evaluation for Text-to-Image Generation with Visual Large Language Models

Yixiong Chen

Li Liu

C. Ding

174

18 May 2023

What You See is What You Read? Improving Text-Image Alignment Evaluation

Neural Information Processing Systems (NeurIPS), 2023

568

116

17 May 2023

Sequence-to-Sequence Pre-training with Unified Modality Masking for Visual Document Understanding

117

16 May 2023

DATED: Guidelines for Creating Synthetic Datasets for Engineering Design Applications

Design Automation Conference (DAC), 2023

Cyril Picard

Jürg Schiffmann

Faez Ahmed

167

15 May 2023

Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts

Lanqing Hong

196

15 May 2023

MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis

International Journal of Computer Vision (IJCV), 2023

Zuopeng Yang

149

10 May 2023

Recommender Systems with Generative Retrieval

Neural Information Processing Systems (NeurIPS), 2023

Shashank Rajput

Nikhil Mehta

Anima Singh

Raghunandan H. Keshavan

...

358

180

08 May 2023

ReGeneration Learning of Diffusion Models with Rich Prompts for Zero-Shot Image Translation

138

08 May 2023

Towards Prompt-robust Face Privacy Protection via Adversarial Decoupling Augmentation Framework

183

06 May 2023

Controllable Visual-Tactile Synthesis

IEEE International Conference on Computer Vision (ICCV), 2023

Ruihan Gao

Wenzhen Yuan

Jun-Yan Zhu

DiffM

204

04 May 2023

Shap-E: Generating Conditional 3D Implicit Functions

Heewoo Jun

Alex Nichol

DiffM

625

414

03 May 2023

Nonparametric Generative Modeling with Conditional Sliced-Wasserstein Flows

International Conference on Machine Learning (ICML), 2023

323

03 May 2023

DreamPaint: Few-Shot Inpainting of E-Commerce Items for Virtual Try-On without 3D Modeling

183

02 May 2023

Let the Chart Spark: Embedding Semantic Context into Chart with Text-to-Image Generative Model

IEEE Transactions on Visualization and Computer Graphics (TVCG), 2023

346

28 Apr 2023

IconShop: Text-Guided Vector Icon Synthesis with Autoregressive Transformers

ACM Transactions on Graphics (TOG), 2023

490

27 Apr 2023

Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement

421

27 Apr 2023

TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional Generation

International Conference on Machine Learning (ICML), 2023

Jimmy Ba

292

26 Apr 2023

Seeing is not always believing: Benchmarking Human and Model Perception of AI-Generated Images

Neural Information Processing Systems (NeurIPS), 2023

Zeyu Lu

Di Huang

Wanli Ouyang

259

25 Apr 2023

TextMesh: Generation of Realistic 3D Meshes From Text Prompts

International Conference on 3D Vision (3DV), 2023

Christina Tsalicoglou

192

162

24 Apr 2023

A Cookbook of Self-Supervised Learning

...

Pierre Fernandez

439

362

24 Apr 2023

Evolving Three Dimension (3D) Abstract Art: Fitting Concepts by Language

Yingtao Tian

134

24 Apr 2023

Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models

Computer Vision and Pattern Recognition (CVPR), 2023

Sanja Fidler

610

1,435

18 Apr 2023

Visual Instruction Tuning

Neural Information Processing Systems (NeurIPS), 2023

1.1K

7,496

17 Apr 2023

Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation

290

135

17 Apr 2023

MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing

IEEE International Conference on Computer Vision (ICCV), 2023

Ying Shan

232

673

17 Apr 2023

AutoSplice: A Text-prompt Manipulated Image Dataset for Media Forensics

264

14 Apr 2023

Expressive Text-to-Image Generation with Rich Text

IEEE International Conference on Computer Vision (ICCV), 2023

Jun-Yan Zhu

482

13 Apr 2023

Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation

196

13 Apr 2023

ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation

Neural Information Processing Systems (NeurIPS), 2023

Xiao Liu

Yuxiao Dong

559

736

12 Apr 2023

Gradient-Free Textual Inversion

ACM Multimedia (ACM MM), 2023

Zhengcong Fei

Mingyuan Fan

Junshi Huang

DiffM

260

12 Apr 2023

Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond

Mohammadreza Armandpour

A. Sadeghian

Huangjie Zheng

Amir Sadeghian

Mingyuan Zhou

DiffM

405

148

11 Apr 2023

InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning

Computer Vision and Pattern Recognition (CVPR), 2023

Jing Shi

Wei Xiong

Zhe Lin

H. J. Jung

DiffM

367

366

06 Apr 2023

Training-Free Layout Control with Cross-Attention Guidance

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Minghao Chen

Iro Laina

Andrea Vedaldi

DiffM

440

313

06 Apr 2023