ResearchTrend.AI
Taming Transformers for High-Resolution Image Synthesis


17 December 2020
Patrick Esser
Robin Rombach
Björn Ommer
    ViT
arXiv:2012.09841

Papers citing "Taming Transformers for High-Resolution Image Synthesis"

50 / 492 papers shown
Causal Unsupervised Semantic Segmentation
Junho Kim
Byung-Kwan Lee
Yonghyun Ro
33
18
0
11 Oct 2023
Improving Compositional Text-to-image Generation with Large Vision-Language Models
Song Wen
Guian Fang
Renrui Zhang
Peng Gao
Hao Dong
Dimitris N. Metaxas
21
17
0
10 Oct 2023
Memory-Consistent Neural Networks for Imitation Learning
Kaustubh Sridhar
Souradeep Dutta
Dinesh Jayaraman
James Weimer
Insup Lee
36
8
0
09 Oct 2023
FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Yuren Cong
Mengmeng Xu
Christian Simon
Shoufa Chen
Jiawei Ren
Yanping Xie
Juan-Manuel Perez-Rua
Bodo Rosenhahn
Tao Xiang
Sen He
DiffM
VGen
22
74
0
09 Oct 2023
KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing
Jiarui Yao
Yifan Liu
Simon S. Du
Shifeng Chen
DiffM
16
24
0
28 Sep 2023
Jointly Training Large Autoregressive Multimodal Models
Emanuele Aiello
L. Yu
Yixin Nie
Armen Aghajanyan
Barlas Oğuz
11
29
0
27 Sep 2023
Diffusion-based Holistic Texture Rectification and Synthesis
Guoqing Hao
S. Iizuka
Kensho Hara
E. Simo-Serra
Hirokatsu Kataoka
Kazuhiro Fukui
DiffM
10
5
0
26 Sep 2023
Neural Image Compression Using Masked Sparse Visual Representation
Wei Jiang
Wei Wang
Yuewei Chen
13
7
0
20 Sep 2023
CoNeS: Conditional neural fields with shift modulation for multi-sequence MRI translation
Yunjie Chen
Marius Staring
O. M. Neve
Stephan R. Romeijn
Erik F. Hensen
Berit M. Verbist
J. Wolterink
Qian Tao
DiffM
MedIm
16
3
0
06 Sep 2023
VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation
Xin Li
Wenqing Chu
Ye Wu
Weihang Yuan
Fanglong Liu
Qi Zhang
Fu Li
Haocheng Feng
Errui Ding
Jingdong Wang
VGen
45
51
0
01 Sep 2023
Coarse-to-Fine Amodal Segmentation with Shape Prior
Jianxiong Gao
Xuelin Qian
Yikai Wang
Tianjun Xiao
Tong He
Zheng-Wei Zhang
Yanwei Fu
29
19
0
31 Aug 2023
Terrain Diffusion Network: Climatic-Aware Terrain Generation with Geological Sketch Guidance
Zexin Hu
Kun Hu
Clinton Mo
Lei Pan
Zhiyong Wang
DiffM
21
2
0
31 Aug 2023
Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization
Tao Yang
Rongyuan Wu
Peiran Ren
Xuansong Xie
Lei Zhang
DiffM
34
136
0
28 Aug 2023
StyleDiffusion: Controllable Disentangled Style Transfer via Diffusion Models
Zhizhong Wang
Lei Zhao
Wei Xing
DiffM
27
119
0
15 Aug 2023
MarkovGen: Structured Prediction for Efficient Text-to-Image Generation
Sadeep Jayasumana
Daniel Glasner
Srikumar Ramalingam
Andreas Veit
Ayan Chakrabarti
Surinder Kumar
DiffM
19
0
0
14 Aug 2023
LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
Binbin Yang
Yinzheng Luo
Ziliang Chen
Guangrun Wang
Xiaodan Liang
Liang Lin
DiffM
19
12
0
13 Aug 2023
Controlling Character Motions without Observable Driving Source
Weiyuan Li
Bin Dai
Ziyi Zhou
Qi Yao
Baoyuan Wang
VGen
6
1
0
11 Aug 2023
The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation
Lingdong Kong
Yaru Niu
Shaoyuan Xie
Hanjiang Hu
Lai Xing Ng
...
Zhenyu Li
Runze Chen
Haiyong Luo
Fang Zhao
Jing Yu
26
13
0
27 Jul 2023
PreDiff: Precipitation Nowcasting with Latent Diffusion Models
Zhihan Gao
Xingjian Shi
Boran Han
Hongya Wang
Xiaoyong Jin
Danielle C. Maddix
Yi Zhu
Mu Li
Bernie Wang
BDL
DiffM
23
54
0
19 Jul 2023
Towards Authentic Face Restoration with Iterative Diffusion Models and Beyond
Yang Zhao
Tingbo Hou
Yu-Chuan Su
Xuhui Jia
Yandong Li
Matthias Grundmann
DiffM
30
16
0
18 Jul 2023
Flow Matching in Latent Space
Quan Dao
Hao Phung
Binh Duc Nguyen
Anh Tran
33
59
0
17 Jul 2023
JourneyDB: A Benchmark for Generative Image Understanding
Keqiang Sun
Junting Pan
Yuying Ge
Hao Li
Haodong Duan
...
Yi Wang
Jifeng Dai
Yu Qiao
Limin Wang
Hongsheng Li
31
101
0
03 Jul 2023
Masked Diffusion Models Are Fast Distribution Learners
Jiachen Lei
Qinglong Wang
Pengyu Cheng
Zhongjie Ba
Zhan Qin
Zhibo Wang
Zhenguang Liu
Kui Ren
DiffM
17
2
0
20 Jun 2023
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
Xiaoshi Wu
Yiming Hao
Keqiang Sun
Yixiong Chen
Feng Zhu
Rui Zhao
Hongsheng Li
41
251
0
15 Jun 2023
Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis
Zhe Ye
Ziyue Jiang
Yi Ren
Jinglin Liu
Chen Zhang
Xiang Yin
Zejun Ma
Zhou Zhao
40
4
0
06 Jun 2023
Text-to-image Editing by Image Information Removal
Zhongping Zhang
Jian Zheng
Jacob Zhiyuan Fang
Bryan A. Plummer
DiffM
16
12
0
27 May 2023
Image as First-Order Norm+Linear Autoregression: Unveiling Mathematical Invariance
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Mengchen Liu
Lu Yuan
Zicheng Liu
Youzuo Lin
24
2
0
25 May 2023
Generative Modeling through the Semi-dual Formulation of Unbalanced Optimal Transport
Jaemoo Choi
Jaewoong Choi
Myung-joo Kang
OT
15
19
0
24 May 2023
Generalizable Synthetic Image Detection via Language-guided Contrastive Learning
Haiwei Wu
Jiantao Zhou
Shile Zhang
110
27
0
23 May 2023
Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D Diffusion Probabilistic Models
Byungjun Kim
Patrick Kwon
K. Lee
Myunggi Lee
Sookwan Han
Daesik Kim
Hanbyul Joo
DiffM
34
20
0
19 May 2023
Learning Global-aware Kernel for Image Harmonization
Xintian Shen
Jiangning Zhang
Jun Chen
Shipeng Bai
Yue Han
Yabiao Wang
Chengjie Wang
Yong-Jin Liu
19
7
0
19 May 2023
Incomplete Multi-view Clustering via Diffusion Completion
Sifan Fang
DiffM
16
4
0
19 May 2023
A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation
Xiaowei Huang
Wenjie Ruan
Wei Huang
Gao Jin
Yizhen Dong
...
Sihao Wu
Peipei Xu
Dengyu Wu
André Freitas
Mustafa A. Mustafa
ALM
27
81
0
19 May 2023
Computing high-dimensional optimal transport by flow neural networks
Chen Xu
Xiuyuan Cheng
Yao Xie
OT
35
4
0
19 May 2023
SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models
Ziyi Wu
Jingyu Hu
Wuyue Lu
Igor Gilitschenski
Animesh Garg
DiffM
OCL
28
44
0
18 May 2023
Ray-Patch: An Efficient Querying for Light Field Transformers
T. B. Martins
Javier Civera
ViT
29
0
0
16 May 2023
Exploiting Diffusion Prior for Real-World Image Super-Resolution
Jianyi Wang
Zongsheng Yue
Shangchen Zhou
Kelvin C. K. Chan
Chen Change Loy
32
279
0
11 May 2023
A vector quantized masked autoencoder for audiovisual speech emotion recognition
Samir Sadok
Simon Leglaive
Renaud Séguier
SSL
79
6
0
05 May 2023
StyleGenes: Discrete and Efficient Latent Distributions for GANs
Evangelos Ntavelis
Mohamad Shahbazi
I. Kastanis
Radu Timofte
Martin Danelljan
Luc Van Gool
30
1
0
30 Apr 2023
Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation
Zeyu Lu
Chengyue Wu
Xinyuan Chen
Yaohui Wang
Lei Bai
Yu Qiao
Xihui Liu
DiffM
24
15
0
24 Apr 2023
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
A. Blattmann
Robin Rombach
Huan Ling
Tim Dockhorn
Seung Wook Kim
Sanja Fidler
Karsten Kreis
3DGS
VGen
60
1,010
0
18 Apr 2023
Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation
Jie An
Songyang Zhang
Harry Yang
Sonal Gupta
Jia-Bin Huang
Jiebo Luo
Xiaoyue Yin
DiffM
VGen
27
106
0
17 Apr 2023
Cross Attention Transformers for Multi-modal Unsupervised Whole-Body PET Anomaly Detection
Ashay Patel
Petru-Daniel Tudosiu
W. H. Pinaya
G. Cook
Vicky Goh
Sebastien Ourselin
M. Jorge Cardoso
OOD
ViT
MedIm
20
11
0
14 Apr 2023
Intriguing properties of synthetic images: from generative adversarial networks to diffusion models
Riccardo Corvi
D. Cozzolino
Giovanni Poggi
Koki Nagano
L. Verdoliva
DiffM
27
88
0
13 Apr 2023
ENTL: Embodied Navigation Trajectory Learner
Klemen Kotar
Aaron Walsman
Roozbeh Mottaghi
8
6
0
05 Apr 2023
DiffCollage: Parallel Generation of Large Content with Diffusion Models
Qinsheng Zhang
Jiaming Song
Xun Huang
Yongxin Chen
Ming-Yu Liu
DiffM
27
82
0
30 Mar 2023
Implicit Diffusion Models for Continuous Super-Resolution
Sicheng Gao
Xuhui Liu
Bo-Wen Zeng
Sheng Xu
Yanjing Li
Xiaonan Luo
Jianzhuang Liu
Xiantong Zhen
Baochang Zhang
DiffM
43
213
0
29 Mar 2023
Variational Distribution Learning for Unsupervised Text-to-Image Generation
Minsoo Kang
Doyup Lee
Jiseob Kim
Saehoon Kim
Bohyung Han
DRL
OOD
14
3
0
28 Mar 2023
The Stable Signature: Rooting Watermarks in Latent Diffusion Models
Pierre Fernandez
Guillaume Couairon
Hervé Jégou
Matthijs Douze
Teddy Furon
WIGM
15
174
0
27 Mar 2023
Towards Accurate Post-Training Quantization for Vision Transformer
Yifu Ding
Haotong Qin
Qing-Yu Yan
Z. Chai
Junjie Liu
Xiaolin K. Wei
Xianglong Liu
MQ
54
66
0
25 Mar 2023