ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11487
  4. Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Neural Information Processing Systems (NeurIPS), 2022
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 5,056 papers shown
Tailored Emotional LLM-Supporter: Enhancing Cultural Sensitivity
Tailored Emotional LLM-Supporter: Enhancing Cultural Sensitivity
Chen Cecilia Liu
Hiba Arnaout
Nils Kovačić
Dana Atzil-Slonim
Iryna Gurevych
184
0
0
11 Aug 2025
Learning User Preferences for Image Generation Model
Learning User Preferences for Image Generation Model
Wenyi Mo
Ying Ba
Tianyu Zhang
Yalong Bai
Biye Li
DiffM
108
2
0
11 Aug 2025
Undress to Redress: A Training-Free Framework for Virtual Try-On
Undress to Redress: A Training-Free Framework for Virtual Try-On
Ruoyao Xiao
Junhao Wu
Yeying Jin
Daiheng Gao
Yun Ji
...
Hao Xu
Kai Chen
Bruce Gu
Nana Wang
Zhaoxin Fan
DiffM
153
0
0
11 Aug 2025
Comparison Reveals Commonality: Customized Image Generation through Contrastive Inversion
Comparison Reveals Commonality: Customized Image Generation through Contrastive Inversion
Minseo Kim
Minchan Kwon
Dongyeun Lee
Yunho Jeon
Junmo Kim
DiffM
111
0
0
11 Aug 2025
S^2VG: 3D Stereoscopic and Spatial Video Generation via Denoising Frame Matrix
S^2VG: 3D Stereoscopic and Spatial Video Generation via Denoising Frame Matrix
Peng Dai
Feitong Tan
Qiangeng Xu
Yihua Huang
David Futschik
Ruofei Du
S. Fanello
Yinda Zhang
Xiaojuan Qi
VGen
173
0
0
11 Aug 2025
Efficient Approximate Posterior Sampling with Annealed Langevin Monte Carlo
Efficient Approximate Posterior Sampling with Annealed Langevin Monte Carlo
Advait Parulekar
Litu Rout
Karthikeyan Shanmugam
Sanjay Shakkottai
223
2
0
11 Aug 2025
Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing
Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing
Joonghyuk Shin
Alchan Hwang
Yujin Kim
Daneul Kim
Jaesik Park
DiffM
209
7
0
11 Aug 2025
Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing
Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing
Weitao Wang
Haoran Xu
Jun Meng
Haoqian Wang
DiffM
113
0
0
11 Aug 2025
Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers
Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers
Xin Ma
Yaohui Wang
Genyun Jia
Xinyuan Chen
Tien-Tsin Wong
C. L. P. Chen
VGen
283
0
0
10 Aug 2025
AR-GRPO: Training Autoregressive Image Generation Models via Reinforcement Learning
AR-GRPO: Training Autoregressive Image Generation Models via Reinforcement Learning
Shihao Yuan
Yahui Liu
Yang Yue
Jingyuan Zhang
Wangmeng Zuo
Qi Wang
Fuzheng Zhang
Guorui Zhou
EGVMVLM
187
15
0
09 Aug 2025
Talk2Image: A Multi-Agent System for Multi-Turn Image Generation and Editing
Talk2Image: A Multi-Agent System for Multi-Turn Image Generation and Editing
Shichao Ma
Yunhe Guo
Jiahao Su
Qihe Huang
Zhengyang Zhou
Yang Wang
DiffM
163
4
0
09 Aug 2025
Towards Effective Prompt Stealing Attack against Text-to-Image Diffusion Models
Towards Effective Prompt Stealing Attack against Text-to-Image Diffusion Models
Shiqian Zhao
Chong Wang
Yiming Li
Yihao Huang
Wenjie Qu
Siew Kei Lam
Yi Xie
Kangjie Chen
Jie Zhang
Tianwei Zhang
317
3
0
09 Aug 2025
UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation
UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation
Wonjun Kang
Byeongkeun Ahn
Minjae Lee
Kevin Galim
Seunghyuk Oh
Hyung Il Koo
N. Cho
DiffM
215
0
0
07 Aug 2025
FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer
FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer
Jian Zhu
Shanyuan Liu
Liuzhuozheng Li
Yue Gong
He Wang
...
Liebucha Wu
Xiaoyu Wu
Dawei Leng
Yuhui Yin
Yang Xu
DiffM
127
0
0
07 Aug 2025
LayerT2V: A Unified Multi-Layer Video Generation Framework
LayerT2V: A Unified Multi-Layer Video Generation Framework
Kangrui Cen
Baixuan Zhao
Yi Xin
Siqi Luo
Guoquan Zheng
Xiaohong Liu
Lei Zhang
Xiaohong Liu
DiffMVGen
188
0
0
06 Aug 2025
Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off
Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off
Seungyong Lee
Jeong-gi Kwak
DiffM
338
7
0
06 Aug 2025
Zero-Residual Concept Erasure via Progressive Alignment in Text-to-Image Model
Zero-Residual Concept Erasure via Progressive Alignment in Text-to-Image Model
Hongxu Chen
Zhen Wang
Taoran Mei
Lin Li
Bowei Zhu
Runshi Li
L. Chen
DiffM
194
0
0
06 Aug 2025
Slice or the Whole Pie? Utility Control for AI Models
Slice or the Whole Pie? Utility Control for AI Models
Ye Tao
AAML
131
0
0
06 Aug 2025
HPSv3: Towards Wide-Spectrum Human Preference Score
HPSv3: Towards Wide-Spectrum Human Preference Score
Yuhang Ma
Xiaoshi Wu
Keqiang Sun
K. Sun
Jiaming Song
188
87
0
05 Aug 2025
Diffusion Models with Adaptive Negative Sampling Without External Resources
Diffusion Models with Adaptive Negative Sampling Without External Resources
Alakh Desai
Nuno Vasconcelos
DiffM
217
0
0
05 Aug 2025
Veila: Panoramic LiDAR Generation from a Monocular RGB Image
Veila: Panoramic LiDAR Generation from a Monocular RGB Image
Youquan Liu
Lingdong Kong
Weidong Yang
Ao Liang
Jianxiong Gao
...
Xiang Xu
Xin Li
Linfeng Li
Runnan Chen
Ben Fei
DiffM
140
2
0
05 Aug 2025
VideoGuard: Protecting Video Content from Unauthorized Editing
VideoGuard: Protecting Video Content from Unauthorized Editing
Junjie Cao
KaiZhou Li
Xinchun Yu
Hongxiang Li
Xiaoping Zhang
DiffMVGen
155
0
0
05 Aug 2025
Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models
Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models
Hyungjin Kim
Seokho Ahn
Young-Duk Seo
DiffM
216
3
0
05 Aug 2025
When Cars Have Stereotypes: Auditing Demographic Bias in Objects from Text-to-Image Models
When Cars Have Stereotypes: Auditing Demographic Bias in Objects from Text-to-Image Models
Dasol Choi Jihwan Lee
Jihwan Lee
Minjae Lee
Minsuk Kahng
EGVM
309
0
0
05 Aug 2025
CoEmoGen: Towards Semantically-Coherent and Scalable Emotional Image Content Generation
CoEmoGen: Towards Semantically-Coherent and Scalable Emotional Image Content Generation
Kaishen Yuan
Yuting Zhang
Shang Gao
Yijie Zhu
Wenshuo Chen
Yutao Yue
DiffM
209
2
0
05 Aug 2025
Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images
Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images
Philipp Wulff
Felix Wimbauer
Dominik Muhle
Daniel Cremers
MDE
188
0
0
04 Aug 2025
Forecasting When to Forecast: Accelerating Diffusion Models with Confidence-Gated Taylor
Forecasting When to Forecast: Accelerating Diffusion Models with Confidence-Gated TaylorKnowledge-Based Systems (KBS), 2025
Xiaoliu Guan
Lielin Jiang
Hanqi Chen
X. Zhang
Jiaxing Yan
Guanzhong Wang
Yi-Hsueh Liu
Zetao Zhang
Yu Wu
405
5
0
04 Aug 2025
Practical, Generalizable and Robust Backdoor Attacks on Text-to-Image Diffusion Models
Practical, Generalizable and Robust Backdoor Attacks on Text-to-Image Diffusion Models
Haoran Dai
Jiawen Wang
Ruo Yang
Manali Sharma
Zhonghao Liao
Yuan Hong
Binghui Wang
AAML
144
2
0
03 Aug 2025
DAG: Unleash the Potential of Diffusion Model for Open-Vocabulary 3D Affordance Grounding
DAG: Unleash the Potential of Diffusion Model for Open-Vocabulary 3D Affordance Grounding
Hanqing Wang
Zhenhao Zhang
Kaiyang Ji
Mingyu Liu
Wenti Yin
Yuchao Chen
Zhirui Liu
Xiangyu Zeng
Tianxiang Gui
Hangxing Zhang
220
5
0
03 Aug 2025
Versatile Transition Generation with Image-to-Video Diffusion
Versatile Transition Generation with Image-to-Video Diffusion
Zuhao Yang
Jiahui Zhang
Yingchen Yu
Shijian Lu
Song Bai
DiffMVGen
328
5
0
03 Aug 2025
Personalized Safety Alignment for Text-to-Image Diffusion Models
Personalized Safety Alignment for Text-to-Image Diffusion Models
Yu Lei
Jinbin Bai
Qingyu Shi
Aosong Feng
Kaidong Yu
Xiao Zhang
Rex Ying
EGVM
277
0
0
02 Aug 2025
Dataset Condensation with Color Compensation
Dataset Condensation with Color Compensation
Huyu Wu
Duo Su
Junjie Hou
Guang Li
DD
508
2
0
02 Aug 2025
ReCoSeg++:Extended Residual-Guided Cross-Modal Diffusion for Brain Tumor Segmentation
ReCoSeg++:Extended Residual-Guided Cross-Modal Diffusion for Brain Tumor Segmentation
Sara Yavari
Rahul Nitin Pandya
Jacob Furst
MedIm
281
3
0
01 Aug 2025
Steering Guidance for Personalized Text-to-Image Diffusion Models
Steering Guidance for Personalized Text-to-Image Diffusion Models
S. Park
Seokeon Choi
Hyoungwoo Park
Sungrack Yun
324
4
0
01 Aug 2025
Trans-Adapter: A Plug-and-Play Framework for Transparent Image Inpainting
Trans-Adapter: A Plug-and-Play Framework for Transparent Image Inpainting
Yuekun Dai
Haitian Li
Shangchen Zhou
Chen Change Loy
221
2
0
01 Aug 2025
Controllable Pedestrian Video Editing for Multi-View Driving Scenarios via Motion Sequence
Controllable Pedestrian Video Editing for Multi-View Driving Scenarios via Motion Sequence
Danzhen Fu
Jiagao Hu
Daiguo Zhou
Fei Wang
Zepeng Wang
Wenhua Liao
VGen
239
0
0
01 Aug 2025
Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution
Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution
Yiwen Wang
Xinning Chai
Yuhong Zhang
Zhengxue Cheng
Jun Zhao
Rong Xie
Li Song
DiffMVGen
185
1
0
01 Aug 2025
UniEmo: Unifying Emotional Understanding and Generation with Learnable Expert Queries
UniEmo: Unifying Emotional Understanding and Generation with Learnable Expert Queries
Yijie Zhu
Lingsen Zhang
Zitong Yu
Rui Shao
Tao Tan
Liqiang Nie
253
5
0
31 Jul 2025
Training-free Geometric Image Editing on Diffusion Models
Training-free Geometric Image Editing on Diffusion Models
Hanshen Zhu
Zhen Zhu
Kaile Zhang
Yiming Gong
Yuliang Liu
Xiang Bai
DiffM
314
1
0
31 Jul 2025
LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing
LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing
Federico Girella
Davide Talon
Ziyue Liu
Zanxi Ruan
Yiming Wang
Marco Cristani
DiffM
281
0
0
30 Jul 2025
X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention
X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent AttentionInternational Conference on Learning Representations (ICLR), 2025
Xiaochen Zhao
Hongyi Xu
Guoxian Song
You Xie
Chenxu Zhang
Xiu Li
Linjie Luo
J. Suo
Yebin Liu
VGen
244
26
0
30 Jul 2025
On the Reliability of Vision-Language Models Under Adversarial Frequency-Domain Perturbations
On the Reliability of Vision-Language Models Under Adversarial Frequency-Domain Perturbations
Jordan Vice
Naveed Akhtar
Yansong Gao
Richard Hartley
Ajmal Mian
AAML
262
2
0
30 Jul 2025
DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion
DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion
Qingcheng Zhao
Xiang Zhang
Haiyang Xu
Z. Chen
Jianwen Xie
Yuan Gao
Zhuowen Tu
DiffMMDE
201
9
0
30 Jul 2025
Trade-offs in Image Generation: How Do Different Dimensions Interact?
Trade-offs in Image Generation: How Do Different Dimensions Interact?
Sicheng Zhang
Binzhu Xie
Zhonghao Yan
Yuli Zhang
Donghao Zhou
Xiaofei Chen
Shi Qiu
Jiaqi Liu
Guoyang Xie
Zhichao Lu
247
3
0
29 Jul 2025
X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again
X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again
Zigang Geng
Y. Wang
Yeyao Ma
Chen Li
Yongming Rao
...
Han Hu
Xiaosong Zhang
Linus
Di Wang
Jie Jiang
218
53
0
29 Jul 2025
GuidPaint: Class-Guided Image Inpainting with Diffusion Models
GuidPaint: Class-Guided Image Inpainting with Diffusion Models
Qimin Wang
Xinda Liu
Guohua Geng
DiffM
304
3
0
29 Jul 2025
Compositional Video Synthesis by Temporal Object-Centric Learning
Compositional Video Synthesis by Temporal Object-Centric Learning
Adil Kaan Akan
Yucel Yemez
DiffMOCL
285
0
0
28 Jul 2025
Multimodal LLMs as Customized Reward Models for Text-to-Image Generation
Multimodal LLMs as Customized Reward Models for Text-to-Image Generation
Shijie Zhou
Ruiyi Zhang
Huaisheng Zhu
Branislav Kveton
Jiuxiang Gu
J. Gu
Jian Chen
Changyou Chen
MLLMVLMLRM
507
7
0
28 Jul 2025
AIComposer: Any Style and Content Image Composition via Feature Integration
AIComposer: Any Style and Content Image Composition via Feature Integration
Haowen Li
Zhenfeng Fan
Zhang Wen
Zhengzhou Zhu
Yunjin Li
DiffM
244
1
0
28 Jul 2025
On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey
On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey
Meishan Zhang
Xin Zhang
X. Zhao
Shouzheng Huang
Baotian Hu
Min Zhang
344
4
0
28 Jul 2025
Previous
123...8910...100101102
Next
Page 9 of 102
Pageof 102