High-Resolution Image Synthesis with Latent Diffusion Models

20 December 2021
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Björn Ommer
    3DV

Papers citing "High-Resolution Image Synthesis with Latent Diffusion Models"

50 / 7,956 papers shown
MagicEdit: High-Fidelity and Temporally Coherent Video Editing
Jun Hao Liew
Hanshu Yan
Jianfeng Zhang
Zhongcong Xu
Jiashi Feng
VGen
DiffM
17
52
0
28 Aug 2023
Total Selfie: Generating Full-Body Selfies
B. Chen
Brian L. Curless
Ira Kemelmacher-Shlizerman
S. M. Seitz
DiffM
34
4
0
28 Aug 2023
LatentDR: Improving Model Generalization Through Sample-Aware Latent Degradation and Restoration
Ran Liu
Sahil Khose
Jingyun Xiao
Lakshmi Sathidevi
Keerthan Ramnath
Z. Kira
Eva L. Dyer
19
3
0
28 Aug 2023
Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization
Tao Yang
Rongyuan Wu
Peiran Ren
Xuansong Xie
Lei Zhang
DiffM
34
136
0
28 Aug 2023
ExpCLIP: Bridging Text and Facial Expressions via Semantic Alignment
Yicheng Zhong
Huawei Wei
Pei-Yin Yang
Zhisheng Wang
CLIP
19
6
0
28 Aug 2023
Mobile Foundation Model as Firmware
Jinliang Yuan
Chenchen Yang
Dongqi Cai
Shihe Wang
Xin Yuan
...
Di Zhang
Hanzi Mei
Xianqing Jia
Shangguang Wang
Mengwei Xu
32
19
0
28 Aug 2023
HoloFusion: Towards Photo-realistic 3D Generative Modeling
Animesh Karnewar
Niloy J. Mitra
Andrea Vedaldi
David Novotny
DiffM
26
31
0
28 Aug 2023
Spoken Language Intelligence of Large Language Models for Language Learning
Linkai Peng
Baorian Nuchged
Yingming Gao
ELM
57
4
0
28 Aug 2023
SketchDreamer: Interactive Text-Augmented Creative Sketch Ideation
Zhiyu Qu
Tao Xiang
Yi-Zhe Song
DiffM
34
11
0
27 Aug 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
J. Liu
62
31
0
27 Aug 2023
Sparse3D: Distilling Multiview-Consistent Diffusion for Object Reconstruction from Sparse Views
Zi-Xin Zou
Weihao Cheng
Yan-Pei Cao
Shi-Sheng Huang
Ying Shan
Song-Hai Zhang
DiffM
19
23
0
27 Aug 2023
CHORUS: Learning Canonicalized 3D Human-Object Spatial Relations from Unbounded Synthesized Images
Sookwan Han
Hanbyul Joo
14
14
0
23 Aug 2023
Manipulating Embeddings of Stable Diffusion Prompts
Niklas Deckers
Julia Peters
Martin Potthast
DiffM
32
9
0
23 Aug 2023
Boosting Diffusion Models with an Adaptive Momentum Sampler
Xiyu Wang
Anh-Dung Dinh
Daochang Liu
Chang Xu
13
4
0
23 Aug 2023
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Jiasheng Ye
Zaixiang Zheng
Yu Bao
Lihua Qian
Quanquan Gu
DiffM
52
14
0
23 Aug 2023
Hey That's Mine Imperceptible Watermarks are Preserved in Diffusion Generated Outputs
Luke Ditria
Tom Drummond
WIGM
19
2
0
22 Aug 2023
The DKU-DUKEECE System for the Manipulation Region Location Task of ADD 2023
Zexin Cai
Weiqing Wang
Yikang Wang
Ming Li
17
6
0
20 Aug 2023
Make-It-4D: Synthesizing a Consistent Long-Term Dynamic Scene Video from a Single Image
Liao Shen
Xingyi Li
Huiqiang Sun
Juewen Peng
Ke Xian
Zhiguo Cao
Guo-Shing Lin
DiffM
27
14
0
20 Aug 2023
MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance
Ernie Chu
Tzu-Hua Huang
Shuohao Lin
Jun-Cheng Chen
DiffM
VGen
19
13
0
19 Aug 2023
AltDiffusion: A Multilingual Text-to-Image Diffusion Model
Fulong Ye
Guangyi Liu
Xinya Wu
Ledell Yu Wu
VLM
27
25
0
19 Aug 2023
Learning Representations on Logs for AIOps
Pranjal Gupta
Harshit Kumar
Debanjana Kar
Karan Bhukar
Pooja Aggarwal
P. Mohapatra
20
11
0
18 Aug 2023
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Ouyang Hao
Qiuyu Wang
Yuxi Xiao
Qingyan Bai
Juntao Zhang
Kecheng Zheng
Xiaowei Zhou
Qifeng Chen
Yujun Shen
DiffM
VGen
41
81
0
15 Aug 2023
Link-Context Learning for Multimodal LLMs
Yan Tai
Weichen Fan
Zhao Zhang
Feng Zhu
Rui Zhao
Ziwei Liu
ReLM
LRM
21
17
0
15 Aug 2023
StyleDiffusion: Controllable Disentangled Style Transfer via Diffusion Models
Zhizhong Wang
Lei Zhao
Wei Xing
DiffM
27
119
0
15 Aug 2023
Steering Language Generation: Harnessing Contrastive Expert Guidance and Negative Prompting for Coherent and Diverse Synthetic Data Generation
Charles O'Neill
Y. Ting 丁
I. Ciucă
Jack Miller
Thang Bui
SyDa
29
1
0
15 Aug 2023
MarkovGen: Structured Prediction for Efficient Text-to-Image Generation
Sadeep Jayasumana
Daniel Glasner
Srikumar Ramalingam
Andreas Veit
Ayan Chakrabarti
Surinder Kumar
DiffM
19
0
0
14 Aug 2023
FocusFlow: Boosting Key-Points Optical Flow Estimation for Autonomous Driving
Zhonghua Yi
Haowen Shi
Kailun Yang
Qi Jiang
Yaozu Ye
Ze Wang
Huajian Ni
Kaiwei Wang
3DPC
15
9
0
14 Aug 2023
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer
Xiaofei Wang
Manthan Thakker
Zhuo Chen
Naoyuki Kanda
Sefik Emre Eskimez
Sanyuan Chen
M. Tang
Shujie Liu
Jinyu Li
Takuya Yoshioka
18
79
0
14 Aug 2023
Precipitation nowcasting with generative diffusion models
Andrea Asperti
Fabio Merizzi
Alberto Paparella
G. Pedrazzi
M. Angelinelli
Stefano Colamonaco
DiffM
25
18
0
13 Aug 2023
LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
Binbin Yang
Yinzheng Luo
Ziliang Chen
Guangrun Wang
Xiaodan Liang
Liang Lin
DiffM
19
12
0
13 Aug 2023
Large Language Models and Foundation Models in Smart Agriculture: Basics, Opportunities, and Challenges
Jiajia Li
Mingle Xu
Lirong Xiang
Dong Chen
Weichao Zhuang
Xunyuan Yin
Zhao Li
25
3
0
13 Aug 2023
TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image Super-Resolution
Baolin Liu
Zongyuan Yang
Pengfei Wang
Junjie Zhou
Ziqi Liu
Ziyi Song
Yan Liu
Yongping Xiong
26
7
0
13 Aug 2023
VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use
Yonatan Bitton
Hritik Bansal
Jack Hessel
Rulin Shao
Wanrong Zhu
Anas Awadalla
Josh Gardner
Rohan Taori
L. Schmidt
VLM
29
77
0
12 Aug 2023
White-box Membership Inference Attacks against Diffusion Models
Yan Pang
Tianhao Wang
Xu Kang
Mengdi Huai
Yang Zhang
AAML
DiffM
31
22
0
11 Aug 2023
Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation
Yuki Endo
25
8
0
11 Aug 2023
Illumination and Shadows in Head Rotation: experiments with Denoising Diffusion Models
Andrea Asperti
Gabriele Colasuonno
António Guerra
DiffM
60
1
0
11 Aug 2023
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Haohe Liu
Yiitan Yuan
Xubo Liu
Xinhao Mei
Qiuqiang Kong
Qiao Tian
Yuping Wang
Wenwu Wang
Yuxuan Wang
Mark D. Plumbley
DiffM
17
220
0
10 Aug 2023
JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models
Peike Li
Bo-Yu Chen
Yao Yao
Yikai Wang
Allen Wang
Alex Jinpeng Wang
MGen
VLM
DiffM
57
37
0
09 Aug 2023
Generative Approach for Probabilistic Human Mesh Recovery using Diffusion Models
Hanbyel Cho
Junmo Kim
3DH
14
7
0
05 Aug 2023
A Review of Change of Variable Formulas for Generative Modeling
Ullrich Kothe
11
6
0
04 Aug 2023
VisAlign: Dataset for Measuring the Degree of Alignment between AI and Humans in Visual Perception
Jiyoung Lee
Seung Wook Kim
Seunghyun Won
Joonseok Lee
Marzyeh Ghassemi
James Thorne
Jaeseok Choi
O.-Kil Kwon
E. Choi
18
1
0
03 Aug 2023
Reverse Stable Diffusion: What prompt was used to generate this image?
Florinel-Alin Croitoru
Vlad Hondru
Radu Tudor Ionescu
M. Shah
VLM
DiffM
23
5
0
02 Aug 2023
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation
Yasheng Sun
Yifan Yang
Houwen Peng
Yifei Shen
Yuqing Yang
Hang-Rui Hu
Lili Qiu
Hideki Koike
DiffM
LM&Ro
27
33
0
02 Aug 2023
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Cheng-Yu Hsieh
Sibei Chen
Chun-Liang Li
Yasuhisa Fujii
Alexander Ratner
Chen-Yu Lee
Ranjay Krishna
Tomas Pfister
LLMAG
SyDa
29
41
0
01 Aug 2023
Detecting Cloud Presence in Satellite Images Using the RGB-based CLIP Vision-Language Model
Mikolaj Czerkawski
Robert C. Atkinson
Christos Tachtatzis
VLM
10
2
0
01 Aug 2023
MetaDiff: Meta-Learning with Conditional Diffusion for Few-Shot Learning
Baoquan Zhang
Chuyao Luo
Demin Yu
Huiwei Lin
Xutao Li
Yunming Ye
Bowen Zhang
DiffM
25
42
0
31 Jul 2023
MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text
Junchen Zhu
Huan Yang
Wenjing Wang
Huiguo He
Zixi Tuo
...
Wen-Huang Cheng
Lianli Gao
Jingkuan Song
Jianlong Fu
Jiebo Luo
DiffM
23
6
0
31 Jul 2023
UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models
Sen Fang
Bowen Gao
Yangjian Wu
T. Teoh
DiffM
18
1
0
29 Jul 2023
CLIP Brings Better Features to Visual Aesthetics Learners
Liwu Xu
Jinjin Xu
Yuzhe Yang
Yi-Jie Huang
Yanchun Xie
Yaqian Li
VLM
20
3
0
28 Jul 2023
Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding
Chunyu Qiang
Hao Li
Hao Ni
He Qu
Ruibo Fu
Tao Wang
Longbiao Wang
J. Dang
DiffM
27
8
0
28 Jul 2023