ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2012.09841
  4. Cited By
Taming Transformers for High-Resolution Image Synthesis
v1v2v3 (latest)

Taming Transformers for High-Resolution Image Synthesis

Computer Vision and Pattern Recognition (CVPR), 2020
17 December 2020
Patrick Esser
Robin Rombach
Bjorn Ommer
    ViT
ArXiv (abs)PDFHTMLGithub (6185★)

Papers citing "Taming Transformers for High-Resolution Image Synthesis"

50 / 2,402 papers shown
What Users Want? WARHOL: A Generative Model for Recommendation
Jules Samaran
Ugo Tanielian
Romain Beaumont
Flavian Vasile
HAI
109
0
0
02 Sep 2021
Controlled GAN-Based Creature Synthesis via a Challenging Game Art
  Dataset -- Addressing the Noise-Latent Trade-Off
Controlled GAN-Based Creature Synthesis via a Challenging Game Art Dataset -- Addressing the Noise-Latent Trade-Off
Vaibhav Vavilala
David A. Forsyth
229
3
0
19 Aug 2021
ImageBART: Bidirectional Context with Multinomial Diffusion for
  Autoregressive Image Synthesis
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
Patrick Esser
Robin Rombach
A. Blattmann
Bjorn Ommer
DiffM
231
177
0
19 Aug 2021
PixelSynth: Generating a 3D-Consistent Experience from a Single Image
PixelSynth: Generating a 3D-Consistent Experience from a Single ImageIEEE International Conference on Computer Vision (ICCV), 2021
C. Rockwell
David Fouhey
Justin Johnson
VGen
251
93
0
12 Aug 2021
Transformer-based deep imitation learning for dual-arm robot manipulation
Transformer-based deep imitation learning for dual-arm robot manipulation
Heecheol Kim
Yoshiyuki Ohmura
Yasuo Kuniyoshi
465
67
0
01 Aug 2021
Data synthesis and adversarial networks: A review and meta-analysis in
  cancer imaging
Data synthesis and adversarial networks: A review and meta-analysis in cancer imaging
Richard Osuala
Kaisar Kushibar
Lidia Garrucho
Akis Linardos
Zuzanna Szafranowska
Stefan Klein
Ben Glocker
Oliver Díaz
Karim Lekadir
MedIm
347
59
0
20 Jul 2021
GenRadar: Self-supervised Probabilistic Camera Synthesis based on Radar
  Frequencies
GenRadar: Self-supervised Probabilistic Camera Synthesis based on Radar FrequenciesIEEE Access (IEEE Access), 2021
Carsten Ditzel
Klaus C. J. Dietmayer
157
3
0
19 Jul 2021
CCVS: Context-aware Controllable Video Synthesis
CCVS: Context-aware Controllable Video SynthesisNeural Information Processing Systems (NeurIPS), 2021
G. L. Moing
Jean Ponce
Cordelia Schmid
258
90
0
16 Jul 2021
FLAT: An Optimized Dataflow for Mitigating Attention Bottlenecks
FLAT: An Optimized Dataflow for Mitigating Attention Bottlenecks
Sheng-Chun Kao
Suvinay Subramanian
Gaurav Agrawal
Amir Yazdanbakhsh
T. Krishna
416
88
0
13 Jul 2021
ViTGAN: Training GANs with Vision Transformers
ViTGAN: Training GANs with Vision TransformersInternational Conference on Learning Representations (ICLR), 2021
Kwonjoon Lee
Huiwen Chang
Lu Jiang
Han Zhang
Zhuowen Tu
Ce Liu
ViT
342
220
0
09 Jul 2021
CLIPDraw: Exploring Text-to-Drawing Synthesis through Language-Image
  Encoders
CLIPDraw: Exploring Text-to-Drawing Synthesis through Language-Image EncodersNeural Information Processing Systems (NeurIPS), 2021
Kevin Frans
Lisa Soros
Olaf Witkowski
CLIP
224
265
0
28 Jun 2021
Alias-Free Generative Adversarial Networks
Alias-Free Generative Adversarial Networks
Tero Karras
M. Aittala
S. Laine
Erik Härkönen
Janne Hellsten
J. Lehtinen
Timo Aila
GAN
940
1,867
0
23 Jun 2021
Improved Transformer for High-Resolution GANs
Improved Transformer for High-Resolution GANsNeural Information Processing Systems (NeurIPS), 2021
Long Zhao
Zizhao Zhang
Ting Chen
Dimitris N. Metaxas
Han Zhang
ViT
350
109
0
14 Jun 2021
Styleformer: Transformer based Generative Adversarial Networks with
  Style Vector
Styleformer: Transformer based Generative Adversarial Networks with Style VectorComputer Vision and Pattern Recognition (CVPR), 2021
Jeeseung Park
Younggeun Kim
ViT
301
59
0
13 Jun 2021
Inverting Adversarially Robust Networks for Image Synthesis
Inverting Adversarially Robust Networks for Image SynthesisAsian Conference on Computer Vision (ACCV), 2021
Renan A. Rojas-Gomez
Raymond A. Yeh
Minh Do
A. Nguyen
205
6
0
13 Jun 2021
PriorGrad: Improving Conditional Denoising Diffusion Models with
  Data-Dependent Adaptive Prior
PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive PriorInternational Conference on Learning Representations (ICLR), 2021
Sang-gil Lee
Heeseung Kim
Chaehun Shin
Xu Tan
Yu Xie
Qi Meng
Tao Qin
Wei Chen
Sung-Hoon Yoon
Tie-Yan Liu
DiffM
275
108
0
11 Jun 2021
ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural
  Language Generation
ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language GenerationFindings (Findings), 2021
Wanrong Zhu
Xinze Wang
An Yan
Miguel P. Eckstein
Wenjie Wang
147
7
0
10 Jun 2021
Score-based Generative Modeling in Latent Space
Score-based Generative Modeling in Latent SpaceNeural Information Processing Systems (NeurIPS), 2021
Arash Vahdat
Karsten Kreis
Jan Kautz
DiffM
433
803
0
10 Jun 2021
The Image Local Autoregressive Transformer
The Image Local Autoregressive TransformerNeural Information Processing Systems (NeurIPS), 2021
Chenjie Cao
Yue Hong
Xiang Li
Chengrong Wang
C. Xu
Xiangyang Xue
Yanwei Fu
169
15
0
04 Jun 2021
X-volution: On the unification of convolution and self-attention
X-volution: On the unification of convolution and self-attention
Xuanhong Chen
Hang Wang
Bingbing Ni
ViT
142
27
0
04 Jun 2021
Barbershop: GAN-based Image Compositing using Segmentation Masks
Barbershop: GAN-based Image Compositing using Segmentation MasksACM Transactions on Graphics (TOG), 2021
Peihao Zhu
Rameen Abdal
John C. Femiani
Peter Wonka
169
32
0
02 Jun 2021
Container: Context Aggregation Network
Container: Context Aggregation NetworkNeural Information Processing Systems (NeurIPS), 2021
Peng Gao
Jiasen Lu
Jiaming Song
Roozbeh Mottaghi
Aniruddha Kembhavi
ViT
271
81
0
02 Jun 2021
M6-UFC: Unifying Multi-Modal Controls for Conditional Image Synthesis
  via Non-Autoregressive Generative Transformers
M6-UFC: Unifying Multi-Modal Controls for Conditional Image Synthesis via Non-Autoregressive Generative Transformers
Zhu Zhang
Jianxin Ma
Chang Zhou
Rui Men
Zhikang Li
Ming Ding
Jie Tang
Jingren Zhou
Hongxia Yang
351
47
0
29 May 2021
CogView: Mastering Text-to-Image Generation via Transformers
CogView: Mastering Text-to-Image Generation via TransformersNeural Information Processing Systems (NeurIPS), 2021
Ming Ding
Zhuoyi Yang
Wenyi Hong
Wendi Zheng
Chang Zhou
...
Junyang Lin
Xu Zou
Zhou Shao
Hongxia Yang
Jie Tang
ViTVLM
415
919
0
26 May 2021
Combining Transformer Generators with Convolutional Discriminators
Combining Transformer Generators with Convolutional DiscriminatorsDeutsche Jahrestagung für Künstliche Intelligenz (KI), 2021
Ricard Durall
Stanislav Frolov
Jörn Hees
Federico Raue
Franz-Josef Pfreundt
Andreas Dengel
J. Keuper
ViT
296
18
0
21 May 2021
Medical Image Segmentation Using Squeeze-and-Expansion Transformers
Medical Image Segmentation Using Squeeze-and-Expansion TransformersInternational Joint Conference on Artificial Intelligence (IJCAI), 2021
Shaohua Li
Xiuchao Sui
Xiangde Luo
Xinxing Xu
Yong Liu
Rick Siow Mong Goh
ViTMedIm
163
188
0
20 May 2021
Do We Really Need to Learn Representations from In-domain Data for
  Outlier Detection?
Do We Really Need to Learn Representations from In-domain Data for Outlier Detection?
Zhisheng Xiao
Qing Yan
Y. Amit
OODUQCV
205
19
0
19 May 2021
High-Resolution Complex Scene Synthesis with Transformers
High-Resolution Complex Scene Synthesis with Transformers
Manuel Jahn
Robin Rombach
Bjorn Ommer
ViT
164
40
0
13 May 2021
Diffusion Models Beat GANs on Image Synthesis
Diffusion Models Beat GANs on Image SynthesisNeural Information Processing Systems (NeurIPS), 2021
Prafulla Dhariwal
Alex Nichol
3.0K
10,306
0
11 May 2021
MuseMorphose: Full-Song and Fine-Grained Piano Music Style Transfer with
  One Transformer VAE
MuseMorphose: Full-Song and Fine-Grained Piano Music Style Transfer with One Transformer VAEIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Shih-Lun Wu
Yi-Hsuan Yang
ViT
375
73
0
10 May 2021
Synthetic Data for Model Selection
Synthetic Data for Model SelectionInternational Conference on Machine Learning (ICML), 2021
Alon Shoshan
Nadav Bhonker
Igor Kviatkovsky
Matan Fintz
Gérard Medioni
152
7
0
03 May 2021
Diverse Image Inpainting with Bidirectional and Autoregressive
  Transformers
Diverse Image Inpainting with Bidirectional and Autoregressive TransformersACM Multimedia (ACM MM), 2021
Yingchen Yu
Fangneng Zhan
Rongliang Wu
Jianxiong Pan
Kaiwen Cui
Shijian Lu
Feiying Ma
Xuansong Xie
Chunyan Miao
ViT
498
168
0
26 Apr 2021
Geometry-Free View Synthesis: Transformers and no 3D Priors
Geometry-Free View Synthesis: Transformers and no 3D PriorsIEEE International Conference on Computer Vision (ICCV), 2021
Robin Rombach
Patrick Esser
Bjorn Ommer
ViT
410
109
0
15 Apr 2021
Aligning Latent and Image Spaces to Connect the Unconnectable
Aligning Latent and Image Spaces to Connect the UnconnectableIEEE International Conference on Computer Vision (ICCV), 2021
Ivan Skorokhodov
Grigorii Sotnikov
Mohamed Elhoseiny
DiffM
164
96
0
14 Apr 2021
InfinityGAN: Towards Infinite-Pixel Image Synthesis
InfinityGAN: Towards Infinite-Pixel Image SynthesisInternational Conference on Learning Representations (ICLR), 2021
C. Lin
Hsin-Ying Lee
Yen-Chi Cheng
Sergey Tulyakov
Ming-Hsuan Yang
243
80
0
08 Apr 2021
Creativity and Machine Learning: A Survey
Creativity and Machine Learning: A SurveyACM Computing Surveys (CSUR), 2021
Giorgio Franceschelli
Mirco Musolesi
VLMAI4CE
539
55
0
06 Apr 2021
Bridging Global Context Interactions for High-Fidelity Image Completion
Bridging Global Context Interactions for High-Fidelity Image CompletionComputer Vision and Pattern Recognition (CVPR), 2021
Chuanxia Zheng
Tat-Jen Cham
Jianfei Cai
Dinh Q. Phung
ViT
165
101
0
02 Apr 2021
Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis
Putting NeRF on a Diet: Semantically Consistent Few-Shot View SynthesisIEEE International Conference on Computer Vision (ICCV), 2021
Ajay Jain
Matthew Tancik
Pieter Abbeel
310
580
0
01 Apr 2021
StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery
StyleCLIP: Text-Driven Manipulation of StyleGAN ImageryIEEE International Conference on Computer Vision (ICCV), 2021
Or Patashnik
Zongze Wu
Eli Shechtman
Daniel Cohen-Or
Dani Lischinski
CLIPVLM
390
1,369
0
31 Mar 2021
Facial Expression Recognition with Visual Transformers and Attentional
  Selective Fusion
Facial Expression Recognition with Visual Transformers and Attentional Selective FusionIEEE Transactions on Affective Computing (TAC), 2021
Fuyan Ma
Bin Sun
Shutao Li
ViT
348
266
0
31 Mar 2021
Deep Generative Modelling: A Comparative Review of VAEs, GANs,
  Normalizing Flows, Energy-Based and Autoregressive Models
Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive ModelsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Sam Bond-Taylor
Adam Leach
Yang Long
Chris G. Willcocks
VLMTPM
721
627
0
08 Mar 2021
Generative Adversarial Transformers
Generative Adversarial TransformersInternational Conference on Machine Learning (ICML), 2021
Drew A. Hudson
C. L. Zitnick
ViT
413
203
0
01 Mar 2021
M6: A Chinese Multimodal Pretrainer
M6: A Chinese Multimodal Pretrainer
Junyang Lin
Rui Men
An Yang
Chan Zhou
Ming Ding
...
Yong Li
Jialin Li
Jingren Zhou
J. Tang
Hongxia Yang
VLMMoE
345
147
0
01 Mar 2021
Countering Malicious DeepFakes: Survey, Battleground, and Horizon
Countering Malicious DeepFakes: Survey, Battleground, and HorizonInternational Journal of Computer Vision (IJCV), 2021
Felix Juefei Xu
Run Wang
Yihao Huang
Qing Guo
Lei Ma
Yang Liu
AAML
502
165
0
27 Feb 2021
Remote Sensing Image Change Detection with Transformers
Remote Sensing Image Change Detection with TransformersIEEE Transactions on Geoscience and Remote Sensing (TGRS), 2021
Hao Chen
Zipeng Qi
Zhenwei Shi
ViT
320
1,330
0
27 Feb 2021
Unsupervised Brain Anomaly Detection and Segmentation with Transformers
Unsupervised Brain Anomaly Detection and Segmentation with TransformersInternational Conference on Medical Imaging with Deep Learning (MIDL), 2021
W. H. Pinaya
Petru-Daniel Tudosiu
Robert J. Gray
G. Rees
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
ViTMedIm
149
68
0
23 Feb 2021
TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can
  Scale Up
TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale UpNeural Information Processing Systems (NeurIPS), 2021
Lezhi Li
Shiyu Chang
Zinan Lin
ViT
600
461
0
14 Feb 2021
Adversarial Text-to-Image Synthesis: A Review
Adversarial Text-to-Image Synthesis: A ReviewNeural Networks (NN), 2021
Stanislav Frolov
Tobias Hinz
Federico Raue
Jörn Hees
Andreas Dengel
EGVM
321
201
0
25 Jan 2021
Transformers in Vision: A Survey
Transformers in Vision: A SurveyACM Computing Surveys (CSUR), 2021
Salman Khan
Muzammal Naseer
Munawar Hayat
Syed Waqas Zamir
Fahad Shahbaz Khan
M. Shah
ViT
924
3,176
0
04 Jan 2021
A Survey on Visual Transformer
A Survey on Visual TransformerIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Kai Han
Yunhe Wang
Hanting Chen
Xinghao Chen
Jianyuan Guo
...
Chunjing Xu
Yixing Xu
Zhaohui Yang
Yiman Zhang
Dacheng Tao
ViT
1.0K
3,095
0
23 Dec 2020
Previous
123...474849
Next