Taming Transformers for High-Resolution Image Synthesis

Computer Vision and Pattern Recognition (CVPR), 2020

17 December 2020

Papers citing "Taming Transformers for High-Resolution Image Synthesis"

50 / 2,404 papers shown

StyleSwin: Transformer-based GAN for High-resolution Image Generation

Computer Vision and Pattern Recognition (CVPR), 2021

Bowen Zhang

Shuyang Gu

Bo Zhang

Jianmin Bao

459

293

20 Dec 2021

High-Resolution Image Synthesis with Latent Diffusion Models

Computer Vision and Pattern Recognition (CVPR), 2021

3.1K

21,434

20 Dec 2021

Solving Inverse Problems with NerfGANs

201

16 Dec 2021

Tackling the Generative Learning Trilemma with Denoising Diffusion GANs

435

678

15 Dec 2021

MAGMA -- Multimodal Augmentation of Generative Models through Adapter-based Finetuning

259

110

09 Dec 2021

Multimodal Conditional Image Synthesis with Product-of-Experts GANs

269

102

09 Dec 2021

Text2Mesh: Text-Driven Neural Stylization for Meshes

Computer Vision and Pattern Recognition (CVPR), 2021

Sagie Benaim

1.3K

423

06 Dec 2021

Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation

174

03 Dec 2021

Zero-Shot Text-Guided Object Generation with Dream Fields

Pieter Abbeel

402

636

02 Dec 2021

Exploration into Translation-Equivariant Image Quantization

Joonseok Lee

216

01 Dec 2021

CLIPstyler: Image Style Transfer with a Single Text Condition

Gihyun Kwon

Jong Chul Ye

VLM CLIP

529

317

01 Dec 2021

Diffusion Autoencoders: Toward a Meaningful and Decodable Representation

Konpat Preechakul

Nattanat Chatthee

Suttisak Wizadwongsa

Supasorn Suwajanakorn

SyDa DiffM

427

538

30 Nov 2021

EdiBERT, a generative model for image editing

342

30 Nov 2021

Vector Quantized Diffusion Model for Text-to-Image Synthesis

Jianmin Bao

Lu Yuan

579

952

29 Nov 2021

Blended Diffusion for Text-driven Editing of Natural Images

521

1,146

29 Nov 2021

SWAT: Spatial Structure Within and Among Tokens

International Joint Conference on Artificial Intelligence (IJCAI), 2021

Kumara Kahatapitiya

Michael S. Ryoo

273

26 Nov 2021

Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene Representations

Computer Vision and Pattern Recognition (CVPR), 2021

...

Alexey Dosovitskiy

429

230

25 Nov 2021

Layered Controllable Video Generation

405

24 Nov 2021

PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers

Jianmin Bao

Lu Yuan

371

272

24 Nov 2021

Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes

298

24 Nov 2021

Octree Transformer: Autoregressive 3D Shape Generation on Hierarchically Structured Sequences

Moritz Ibing

Gregor Kobsik

Leif Kobbelt

217

24 Nov 2021

NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion

Fan Yang

314

344

24 Nov 2021

One to Transfer All: A Universal Transfer Framework for Vision Foundation Model with Few Data

174

24 Nov 2021

L-Verse: Bidirectional Generation Between Image and Text

Computer Vision and Pattern Recognition (CVPR), 2021

1.0K

22 Nov 2021

Discrete Representations Strengthen Vision Transformer Robustness

International Conference on Learning Representations (ICLR), 2021

Carl Vondrick

303

20 Nov 2021

Compositional Transformers for Scene Generation

Drew A. Hudson

C. L. Zitnick

ViT

263

17 Nov 2021

INTERN: A New Learning Paradigm Towards General Vision

Siyu Chen

...

Yu Qiao

237

16 Nov 2021

Losses, Dissonances, and Distortions

Pablo Samuel Castro

08 Nov 2021

Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers

320

05 Nov 2021

LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text Pairs

865

1,722

03 Nov 2021

PatchGame: Learning to Signal Mid-level Patches in Referential Games

Neural Information Processing Systems (NeurIPS), 2021

Anubhav Gupta

188

02 Nov 2021

Projected GANs Converge Faster

Neural Information Processing Systems (NeurIPS), 2021

299

284

01 Nov 2021

Blending Anti-Aliasing into Vision Transformer

Neural Information Processing Systems (NeurIPS), 2021

213

28 Oct 2021

Telling Creative Stories Using Generative Visual Aids

Safinah Ali

Devi Parikh

27 Oct 2021

Towards artificial general intelligence via a multimodal foundation model

...

Xin Gao

233

290

27 Oct 2021

The Nuts and Bolts of Adopting Transformer in GANs

343

25 Oct 2021

Unsupervised Source Separation By Steering Pretrained Music Models

183

25 Oct 2021

Wav2CLIP: Learning Robust Audio Representations From CLIP

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

346

331

21 Oct 2021

3D-RETR: End-to-End Single and Multi-View 3D Reconstruction with Transformers

Roger Wattenhofer

211

17 Oct 2021

Taming Visually Guided Sound Generation

Vladimir E. Iashin

Esa Rahtu

VLM

320

175

17 Oct 2021

AE-StyleGAN: Improved Training of Style-Based Auto-Encoders

162

17 Oct 2021

Multimodal Dialogue Response Generation

Yujing Wang

260

16 Oct 2021

Vector-quantized Image Modeling with Improved VQGAN

International Conference on Learning Representations (ICLR), 2021

498

688

09 Oct 2021

ATISS: Autoregressive Transformers for Indoor Scene Synthesis

Sanja Fidler

374

210

07 Oct 2021

DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation

986

793

06 Oct 2021

CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation

Joseph G. Lambourne

Kamal Rahimi Malekshan

CLIP

370

344

06 Oct 2021

Transformer Assisted Convolutional Network for Cell Instance Segmentation

Deepanshu Pandey

Pradyumna Gupta

Sumit K. Bhattacharya

Aman Sinha

Rohit Agarwal

ViT MedIm

176

05 Oct 2021

AffectGAN: Affect-Based Generative Art Driven by Semantics

Theodoros Galanos

Antonios Liapis

Georgios N. Yannakakis

GAN

188

30 Sep 2021

UFO-ViT: High Performance Linear Vision Transformer without Softmax

Jeonggeun Song

ViT

325

29 Sep 2021

Resolution-robust Large Mask Inpainting with Fourier Convolutions

343

1,170

15 Sep 2021