ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11487
  4. Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
ArXivPDFHTML

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 4,309 papers shown
Title
Progressive Prompt Detailing for Improved Alignment in Text-to-Image Generative Models
Progressive Prompt Detailing for Improved Alignment in Text-to-Image Generative Models
Ketan Suhaas Saichandran
Xavier Thomas
Prakhar Kaushik
Deepti Ghadiyaram
DiffM
73
0
0
22 Mar 2025
TDRI: Two-Phase Dialogue Refinement and Co-Adaptation for Interactive Image Generation
TDRI: Two-Phase Dialogue Refinement and Co-Adaptation for Interactive Image Generation
Yuheng Feng
Jianhui Wang
Kun Li
Sida Li
Tianyu Shi
Haoyue Han
Miao Zhang
Xueqian Wang
DiffM
67
0
0
22 Mar 2025
ComfyGPT: A Self-Optimizing Multi-Agent System for Comprehensive ComfyUI Workflow Generation
ComfyGPT: A Self-Optimizing Multi-Agent System for Comprehensive ComfyUI Workflow Generation
Oucheng Huang
Yuhang Ma
Zeng Zhao
Mingrui Wu
Jiayi Ji
Rongsheng Zhang
Z. Hu
Xiaoshuai Sun
Rongrong Ji
41
0
0
22 Mar 2025
RDTF: Resource-efficient Dual-mask Training Framework for Multi-frame Animated Sticker Generation
RDTF: Resource-efficient Dual-mask Training Framework for Multi-frame Animated Sticker Generation
Zhiqiang Yuan
Ting Zhang
Ying Deng
Jiapei Zhang
Yeshuang Zhu
Zexi Jia
Jie Zhou
Jinchao Zhang
VGen
39
0
0
22 Mar 2025
Towards Transformer-Based Aligned Generation with Self-Coherence Guidance
Towards Transformer-Based Aligned Generation with Self-Coherence Guidance
Shulei Wang
Wang Lin
Hai Huang
Hanting Wang
Sihang Cai
...
Tao Jin
Jingyuan Chen
Jiacheng Sun
Jieming Zhu
Zhou Zhao
DiffM
55
2
0
22 Mar 2025
AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process
AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process
J. Hu
Shuyong Gao
Qianyu Guo
Yan Wang
Qishan Wang
Yuang Feng
Wenqiang Zhang
DiffM
VGen
44
0
0
21 Mar 2025
R2LDM: An Efficient 4D Radar Super-Resolution Framework Leveraging Diffusion Model
R2LDM: An Efficient 4D Radar Super-Resolution Framework Leveraging Diffusion Model
Boyuan Zheng
Shouyi Lu
Renbo Huang
Minqing Huang
Fan Lu
Wei Tian
G. Zhuo
Lu Xiong
54
0
0
21 Mar 2025
FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields
FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields
Kwan Yun
Chaelin Kim
Hangyeul Shin
Junyong Noh
CVBM
43
0
0
21 Mar 2025
ARFlow: Human Action-Reaction Flow Matching with Physical Guidance
ARFlow: Human Action-Reaction Flow Matching with Physical Guidance
Wentao Jiang
Jingya Wang
Haotao Lu
Kaiyang Ji
Baoxiong Jia
Siyuan Huang
Ye-ling Shi
39
0
0
21 Mar 2025
What's Producible May Not Be Reachable: Measuring the Steerability of Generative Models
What's Producible May Not Be Reachable: Measuring the Steerability of Generative Models
Keyon Vafa
Sarah Bentley
Jon M. Kleinberg
S. Mullainathan
38
0
0
21 Mar 2025
UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models
UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models
Fanghua Yu
Jinjin Gu
Jinfan Hu
Zheyuan Li
Chao Dong
DiffM
50
0
0
21 Mar 2025
Is there anything left? Measuring semantic residuals of objects removed from 3D Gaussian Splatting
Is there anything left? Measuring semantic residuals of objects removed from 3D Gaussian Splatting
Simona Kocour
Assia Benbihi
Aikaterini Adam
Torsten Sattler
3DPC
41
0
0
21 Mar 2025
Scale-wise Distillation of Diffusion Models
Scale-wise Distillation of Diffusion Models
Nikita Starodubcev
Denis Kuznedelev
Artem Babenko
Dmitry Baranchuk
DiffM
48
0
0
20 Mar 2025
MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance
MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance
Quanhao Li
Zhen Xing
Rui Wang
Hui Zhang
Qi Dai
Zuxuan Wu
VGen
61
0
0
20 Mar 2025
Controllable Segmentation-Based Text-Guided Style Editing
Controllable Segmentation-Based Text-Guided Style Editing
Jingwen Li
Aravind Chandrasekar
Mariana Rocha
Chao Li
Yuqing Chen
48
0
0
20 Mar 2025
Shining Yourself: High-Fidelity Ornaments Virtual Try-on with Diffusion Model
Shining Yourself: High-Fidelity Ornaments Virtual Try-on with Diffusion Model
Yingmao Miao
Zhanpeng Huang
Rui Han
Zibin Wang
Chenhao Lin
Chao Shen
DiffM
47
0
0
20 Mar 2025
FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing
FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing
Tianyi Wei
Yifan Zhou
Dongdong Chen
Xingang Pan
72
0
0
20 Mar 2025
Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts
Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts
Yu Cao
Zengqun Zhao
Ioannis Patras
Shaogang Gong
DiffM
48
0
0
20 Mar 2025
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
Y. Wang
Zhijie Lin
Yao Teng
Yuanzhi Zhu
Shuhuai Ren
Jiashi Feng
Xihui Liu
46
0
0
20 Mar 2025
PoseTraj: Pose-Aware Trajectory Control in Video Diffusion
PoseTraj: Pose-Aware Trajectory Control in Video Diffusion
Longbin Ji
Lei Zhong
Pengfei Wei
Changjian Li
DiffM
VGen
39
0
0
20 Mar 2025
World Knowledge from AI Image Generation for Robot Control
World Knowledge from AI Image Generation for Robot Control
Jonas Krumme
C. Zetzsche
LM&Ro
50
0
0
20 Mar 2025
Bezier Distillation
Bezier Distillation
Ling Feng
SK Yang
39
0
0
20 Mar 2025
Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
Jiaqi Liu
Jichao Zahng
Paolo Rota
N. Sebe
DiffM
45
0
0
19 Mar 2025
Visual Persona: Foundation Model for Full-Body Human Customization
Visual Persona: Foundation Model for Full-Body Human Customization
Jisu Nam
Soowon Son
Zhan Xu
Jing Shi
Difan Liu
Feng Liu
Aashish Misraa
Seungryong Kim
Yang Zhou
DiffM
37
0
0
19 Mar 2025
How to Train Your Dragon: Automatic Diffusion-Based Rigging for Characters with Diverse Topologies
How to Train Your Dragon: Automatic Diffusion-Based Rigging for Characters with Diverse Topologies
Zeqi Gu
Difan Liu
Timothy Langlois
Matthew Fisher
Abe Davis
DiffM
3DH
60
0
0
19 Mar 2025
Decompositional Neural Scene Reconstruction with Generative Diffusion Prior
Decompositional Neural Scene Reconstruction with Generative Diffusion Prior
Junfeng Ni
Yu Liu
Ruijie Lu
Zirui Zhou
Song-Chun Zhu
Yixin Chen
Siyuan Huang
DiffM
59
4
0
19 Mar 2025
Advances in 4D Generation: A Survey
Advances in 4D Generation: A Survey
Qiaowei Miao
Kehan Li
Jinsheng Quan
Zhiyuan Min
Shaojie Ma
Yichao Xu
Yi Yang
Yawei Luo
51
0
0
18 Mar 2025
CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models
CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models
Yuyang Xue
Edward Moroshko
Feng Chen
Steven G. McDonagh
Sotirios A. Tsaftaris
56
1
0
18 Mar 2025
Multi-view Reconstruction via SfM-guided Monocular Depth Estimation
Multi-view Reconstruction via SfM-guided Monocular Depth Estimation
Haoyu Guo
He Zhu
Sida Peng
Haotong Lin
Yunzhi Yan
Tao Xie
Wenguan Wang
Xiaowei Zhou
Hujun Bao
3DV
MDE
78
1
0
18 Mar 2025
Diffusion-based G-buffer generation and rendering
Diffusion-based G-buffer generation and rendering
Bowen Xue
G. C. Guarnera
Shuang Zhao
Zahra Montazeri
DiffM
48
0
0
18 Mar 2025
The Power of Context: How Multimodality Improves Image Super-Resolution
The Power of Context: How Multimodality Improves Image Super-Resolution
Kangfu Mei
Hossein Talebi
Mojtaba Ardakani
Vishal M. Patel
P. Milanfar
M. Delbracio
DiffM
77
1
0
18 Mar 2025
ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing
ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing
Yulin Pan
Xiangteng He
Chaojie Mao
Zhen Han
Zeyinzi Jiang
J. Zhang
Yu Liu
EGVM
VLM
71
1
0
18 Mar 2025
MusicInfuser: Making Video Diffusion Listen and Dance
MusicInfuser: Making Video Diffusion Listen and Dance
Susung Hong
Ira Kemelmacher-Shlizerman
Brian L. Curless
Steven M. Seitz
VGen
45
0
0
18 Mar 2025
A Comprehensive Survey on Visual Concept Mining in Text-to-image Diffusion Models
A Comprehensive Survey on Visual Concept Mining in Text-to-image Diffusion Models
Ziqiang Li
Jun Li
Lizhi Xiong
Zhangjie Fu
Zechao Li
VLM
54
0
0
17 Mar 2025
TextInVision: Text and Prompt Complexity Driven Visual Text Generation Benchmark
TextInVision: Text and Prompt Complexity Driven Visual Text Generation Benchmark
Forouzan Fallah
Maitreya Patel
Agneet Chatterjee
Vlad I. Morariu
Chitta Baral
Yezhou Yang
CoGe
59
0
0
17 Mar 2025
Edit Transfer: Learning Image Editing via Vision In-Context Relations
Edit Transfer: Learning Image Editing via Vision In-Context Relations
Lan Chen
Qi Mao
Yuchao Gu
Mike Zheng Shou
51
1
0
17 Mar 2025
DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Mode
DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Mode
Junjia Huang
Pengxiang Yan
Jinhang Cai
Jiyang Liu
Zhao Wang
Yitong Wang
Xinglong Wu
Guanbin Li
DiffM
70
0
0
17 Mar 2025
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
Dewei Zhou
Mingwei Li
Zongxin Yang
Yi Yang
87
0
0
17 Mar 2025
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation
Daniil Selikhanovych
David Li
Aleksei Leonov
Nikita Gushchin
Sergei Kushneriuk
Alexander N. Filippov
E. Burnaev
Iaroslav Koshelev
Alexander Korotin
DiffM
58
0
0
17 Mar 2025
Diffusion on Graph: Augmentation of Graph Structure for Node Classification
Diffusion on Graph: Augmentation of Graph Structure for Node Classification
Yancheng Wang
Changyu Liu
Yingzhen Yang
DiffM
GNN
68
0
0
16 Mar 2025
Personalize Anything for Free with Diffusion Transformer
Personalize Anything for Free with Diffusion Transformer
Haoran Feng
Zehuan Huang
Lin Li
Hairong Lv
Lu Sheng
DiffM
72
1
0
16 Mar 2025
BalancedDPO: Adaptive Multi-Metric Alignment
BalancedDPO: Adaptive Multi-Metric Alignment
Dipesh Tamboli
Souradip Chakraborty
Aditya Malusare
B. Banerjee
Amrit Singh Bedi
Vaneet Aggarwal
EGVM
65
0
0
16 Mar 2025
FedGAI: Federated Style Learning with Cloud-Edge Collaboration for Generative AI in Fashion Design
FedGAI: Federated Style Learning with Cloud-Edge Collaboration for Generative AI in Fashion Design
Mingzhu Wu
Jianan Jiang
Xinglin Li
Hanhui Deng
Di Wu
FedML
47
0
0
16 Mar 2025
ReBot: Scaling Robot Learning with Real-to-Sim-to-Real Robotic Video Synthesis
ReBot: Scaling Robot Learning with Real-to-Sim-to-Real Robotic Video Synthesis
Yu Fang
Yue Yang
Xinghao Zhu
Kaiyuan Zheng
Gedas Bertasius
D. Szafir
Mingyu Ding
47
2
0
15 Mar 2025
TACO: Taming Diffusion for in-the-wild Video Amodal Completion
TACO: Taming Diffusion for in-the-wild Video Amodal Completion
Ruijie Lu
Yixin Chen
Yu Liu
Jiaxiang Tang
Junfeng Ni
Diwen Wan
Gang Zeng
Siyuan Huang
DiffM
VGen
44
3
0
15 Mar 2025
DiffAD: A Unified Diffusion Modeling Approach for Autonomous Driving
DiffAD: A Unified Diffusion Modeling Approach for Autonomous Driving
Tao Wang
Cong Zhang
Xingguang Qu
Kun Li
W. Liu
C. Huang
56
0
0
15 Mar 2025
DecompDreamer: Advancing Structured 3D Asset Generation with Multi-Object Decomposition and Gaussian Splatting
DecompDreamer: Advancing Structured 3D Asset Generation with Multi-Object Decomposition and Gaussian Splatting
Utkarsh Nath
Rajeev Goel
Rahul Khurana
Kyle Min
Mark Ollila
P. Turaga
Varun Jampani
Tejaswi Gowda
3DGS
42
0
0
15 Mar 2025
Att-Adapter: A Robust and Precise Domain-Specific Multi-Attributes T2I Diffusion Adapter via Conditional Variational Autoencoder
Att-Adapter: A Robust and Precise Domain-Specific Multi-Attributes T2I Diffusion Adapter via Conditional Variational Autoencoder
Wonwoong Cho
Yan-Ying Chen
M. Klenk
David I. Inouye
Yanxia Zhang
DiffM
91
0
0
15 Mar 2025
DiffGAP: A Lightweight Diffusion Module in Contrastive Space for Bridging Cross-Model Gap
DiffGAP: A Lightweight Diffusion Module in Contrastive Space for Bridging Cross-Model Gap
Shentong Mo
Zehua Chen
Fan Bao
Jun-Jie Zhu
DiffM
50
0
0
15 Mar 2025
Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization
Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization
Kyle Sargent
Kyle Hsu
Justin Johnson
L. Fei-Fei
Jiajun Wu
DiffM
MU
53
2
0
14 Mar 2025
Previous
123456...858687
Next