2012.09841
Cited By
Taming Transformers for High-Resolution Image Synthesis
17 December 2020
Patrick Esser
Robin Rombach
Bjorn Ommer
ViT
Papers citing
"Taming Transformers for High-Resolution Image Synthesis"
50 / 476 papers shown
Title
FontTransformer: Few-shot High-resolution Chinese Glyph Image Synthesis via Stacked Transformers
Yitian Liu
Z. Lian
31
13
0
12 Oct 2022
ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors
Chenjie Cao
Qiaole Dong
Yanwei Fu
28
30
0
12 Oct 2022
Dynamic Latent Separation for Deep Learning
Yi-Lin Tuan
Zih-Yun Chiu
William Yang Wang
14
0
0
07 Oct 2022
Efficient Diffusion Models for Vision: A Survey
Anwaar Ulhaq
Naveed Akhtar
MedIm
32
59
0
07 Oct 2022
Content-Based Search for Deep Generative Models
Daohan Lu
Sheng-Yu Wang
Nupur Kumari
Rohan Agarwal
Mia Tang
David Bau
Jun-Yan Zhu
DiffM
SyDa
30
5
0
06 Oct 2022
Phenaki: Variable Length Video Generation From Open Domain Textual Description
Ruben Villegas
Mohammad Babaeizadeh
Pieter-Jan Kindermans
Hernan Moraldo
Han Zhang
M. Saffar
Santiago Castro
Julius Kunze
D. Erhan
DiffM
VGen
22
370
0
05 Oct 2022
Implicit Warping for Animation with Image Sets
Arun Mallya
Ting-Chun Wang
Ming-Yu Liu
VGen
114
41
0
04 Oct 2022
Visual Prompt Tuning for Generative Transfer Learning
Kihyuk Sohn
Yuan Hao
José Lezama
Luisa F. Polanía
Huiwen Chang
Han Zhang
Irfan Essa
Lu Jiang
VPVLM
VLM
51
81
0
03 Oct 2022
Learning to Learn with Generative Models of Neural Network Checkpoints
William S. Peebles
Ilija Radosavovic
Tim Brooks
Alexei A. Efros
Jitendra Malik
UQCV
73
64
0
26 Sep 2022
Environmental and Social Sustainability of Creative-Ai
A. Holzapfel
Petra Jääskeläinen
Anna-Kaisa Kaila
4
5
0
26 Sep 2022
UniColor: A Unified Framework for Multi-Modal Colorization with Transformer
Zhitong Huang
Nanxuan Zhao
Jing Liao
ViT
13
16
0
22 Sep 2022
MIDMs: Matching Interleaved Diffusion Models for Exemplar-based Image Translation
Junyoung Seo
Gyuseong Lee
Seokju Cho
Jiyoung Lee
Seung Wook Kim
DiffM
21
27
0
22 Sep 2022
Text2Light: Zero-Shot Text-Driven HDR Panorama Generation
Zhaoxi Chen
Guangcong Wang
Ziwei Liu
88
30
0
20 Sep 2022
Can segmentation models be trained with fully synthetically generated data?
Virginia Fernandez
W. H. Pinaya
Pedro Borges
Petru-Daniel Tudosiu
M. Graham
Tom Kamiel Magda Vercauteren
M. Jorge Cardoso
DiffM
MedIm
43
44
0
17 Sep 2022
Semantic Image Synthesis with Semantically Coupled VQ-Model
Stephan Alaniz
Thomas Hummel
Zeynep Akata
21
6
0
06 Sep 2022
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Wanshu Fan
Yen-Chun Chen
Dongdong Chen
Yu Cheng
Lu Yuan
Yu-Chiang Frank Wang
DiffM
15
90
0
29 Aug 2022
Synthetic Data in Human Analysis: A Survey
Indu Joshi
Marcel Grimmer
Christian Rathgeb
Christoph Busch
F. Brémond
A. Dantcheva
20
46
0
19 Aug 2022
G2P-DDM: Generating Sign Pose Sequence from Gloss Sequence with Discrete Diffusion Model
Pan Xie
Qipeng Zhang
Zexian Li
Hao Tang
Yao Du
Xiaohui Hu
DiffM
36
12
0
19 Aug 2022
Enhancing Diffusion-Based Image Synthesis with Robust Classifier Guidance
Bahjat Kawar
Roy Ganz
Michael Elad
DiffM
19
38
0
18 Aug 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
38
1,689
0
02 Aug 2022
Eco2AI: carbon emissions tracking of machine learning models as the first step towards sustainable AI
S. Budennyy
V. Lazarev
N. Zakharenko
A. Korovin
Olga Plosskaya
...
Ivan V. Oseledets
I. Barsola
Ilya M. Egorov
A. Kosterina
L. Zhukov
24
89
0
31 Jul 2022
Depth Field Networks for Generalizable Multi-view Scene Representation
Vitor Campagnolo Guizilini
Igor Vasiljevic
Jiading Fang
Rares Ambrus
G. Shakhnarovich
Matthew R. Walter
Adrien Gaidon
3DV
MDE
22
15
0
28 Jul 2022
Improved Super Resolution of MR Images Using CNNs and Vision Transformers
Dwarikanath Mahapatra
SupR
ViT
MedIm
16
5
0
24 Jul 2022
InfiniteNature-Zero: Learning Perpetual View Generation of Natural Scenes from Single Images
Zhengqi Li
Qianqian Wang
Noah Snavely
Angjoo Kanazawa
VGen
22
59
0
22 Jul 2022
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis
Chenfei Wu
Jian Liang
Xiaowei Hu
Zhe Gan
Jianfeng Wang
Lijuan Wang
Zicheng Liu
Yuejian Fang
Nan Duan
VGen
10
72
0
20 Jul 2022
Global-Local Stepwise Generative Network for Ultra High-Resolution Image Restoration
Xin Feng
Haobo Ji
Wenjie Pei
Fanglin Chen
Guangming Lu
19
4
0
16 Jul 2022
Towards Highly Expressive Machine Learning Models of Non-Melanoma Skin Cancer
S. Thomas
J. Lefevre
Glenn W. Baxter
N. Hamilton
MedIm
8
2
0
09 Jul 2022
Vector Quantisation for Robust Segmentation
Ainkaran Santhirasekaram
Avinash Kori
Mathias Winkler
A. Rockall
Ben Glocker
OOD
14
9
0
05 Jul 2022
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta
Stephen Tian
Yunzhi Zhang
Jiajun Wu
Roberto Martín-Martín
Li Fei-Fei
100
109
0
23 Jun 2022
Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Shangchen Zhou
Kelvin C. K. Chan
Chongyi Li
Chen Change Loy
CVBM
13
220
0
22 Jun 2022
A Fast Text-Driven Approach for Generating Artistic Content
M. Lupascu
Ryan Murdock
Ionut Mironica
Yijun Li
16
1
0
22 Jun 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
85
1,061
0
22 Jun 2022
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Jiasen Lu
Christopher Clark
Rowan Zellers
Roozbeh Mottaghi
Aniruddha Kembhavi
ObjD
VLM
MLLM
45
391
0
17 Jun 2022
Disentangling visual and written concepts in CLIP
Joanna Materzyńska
Antonio Torralba
David Bau
CoGe
12
46
0
15 Jun 2022
Multimodal Learning with Transformers: A Survey
P. Xu
Xiatian Zhu
David A. Clifton
ViT
41
522
0
13 Jun 2022
PILC: Practical Image Lossless Compression with an End-to-end GPU Oriented Neural Framework
Ning Kang
Shanzhao Qiu
Shifeng Zhang
Zhenguo Li
Shutao Xia
18
16
0
10 Jun 2022
Blended Latent Diffusion
Omri Avrahami
Ohad Fried
Dani Lischinski
DiffM
50
374
0
06 Jun 2022
DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder
Jie Shi
Chenfei Wu
Jian Liang
Xiang Liu
Nan Duan
DiffM
4
25
0
01 Jun 2022
VALHALLA: Visual Hallucination for Machine Translation
Yi Li
Rameswar Panda
Yoon Kim
Chun-Fu Chen
Rogerio Feris
David D. Cox
Nuno Vasconcelos
MLLM
36
38
0
31 May 2022
Text2Human: Text-Driven Controllable Human Image Generation
Yuming Jiang
Shuai Yang
Haonan Qiu
Wayne Wu
Chen Change Loy
Ziwei Liu
DiffM
107
45
0
31 May 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
243
564
0
29 May 2022
COFS: Controllable Furniture layout Synthesis
W. Para
Paul Guerrero
Niloy Mitra
Peter Wonka
3DV
34
16
0
29 May 2022
Video2StyleGAN: Disentangling Local and Global Variations in a Video
Rameen Abdal
Peihao Zhu
Niloy J. Mitra
Peter Wonka
VGen
24
7
0
27 May 2022
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN
Siyuan Li
Di Wu
Fang Wu
Lei Shang
Stan Z. Li
25
47
0
27 May 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
55
5,762
0
23 May 2022
StyLandGAN: A StyleGAN based Landscape Image Synthesis using Depth-map
Gun-Hee Lee
Jonghwa Yim
Chanran Kim
Min-Jung Kim
GAN
MDE
31
1
0
13 May 2022
Deep Learning and Synthetic Media
Raphaël Millière
13
18
0
11 May 2022
NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality
Xu Tan
Jiawei Chen
Haohe Liu
Jian Cong
Chen Zhang
...
Lei He
Frank Soong
Tao Qin
Sheng Zhao
Tie-Yan Liu
14
210
0
09 May 2022
Seeding Diversity into AI Art
Marvin Zammit
Antonios Liapis
Georgios N. Yannakakis
22
4
0
02 May 2022
CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers
Ming Ding
Wendi Zheng
Wenyi Hong
Jie Tang
VLM
18
321
0
28 Apr 2022