Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Neural Information Processing Systems (NeurIPS), 2022
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 5,046 papers shown
Undress to Redress: A Training-Free Framework for Virtual Try-On
Ruoyao Xiao
Junhao Wu
Yeying Jin
Daiheng Gao
Yun Ji
...
Hao Xu
Kai Chen
Bruce Gu
Nana Wang
Zhaoxin Fan
DiffM
137
0
0
11 Aug 2025
Comparison Reveals Commonality: Customized Image Generation through Contrastive Inversion
Minseo Kim
Minchan Kwon
Dongyeun Lee
Yunho Jeon
Junmo Kim
DiffM
92
0
0
11 Aug 2025
Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers
Xin Ma
Yaohui Wang
Genyun Jia
Xinyuan Chen
Tien-Tsin Wong
C. L. P. Chen
VGen
163
0
0
10 Aug 2025
Towards Effective Prompt Stealing Attack against Text-to-Image Diffusion Models
Shiqian Zhao
Chong Wang
Yiming Li
Yihao Huang
Wenjie Qu
Siew Kei Lam
Yi Xie
Kangjie Chen
Jie Zhang
Tianwei Zhang
205
3
0
09 Aug 2025
AR-GRPO: Training Autoregressive Image Generation Models via Reinforcement Learning
Shihao Yuan
Yahui Liu
Yang Yue
Jingyuan Zhang
Wangmeng Zuo
Qi Wang
Fuzheng Zhang
Guorui Zhou
EGVM, VLM
148
11
0
09 Aug 2025
Talk2Image: A Multi-Agent System for Multi-Turn Image Generation and Editing
Shichao Ma
Yunhe Guo
Jiahao Su
Qihe Huang
Zhengyang Zhou
Yang Wang
DiffM
104
2
0
09 Aug 2025
FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer
Jian Zhu
Shanyuan Liu
Liuzhuozheng Li
Yue Gong
He Wang
...
Liebucha Wu
Xiaoyu Wu
Dawei Leng
Yuhui Yin
Yang Xu
DiffM
87
0
0
07 Aug 2025
UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation
Wonjun Kang
Byeongkeun Ahn
Minjae Lee
Kevin Galim
Seunghyuk Oh
Hyung Il Koo
N. Cho
DiffM
180
0
0
07 Aug 2025
Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off
Seungyong Lee
Jeong-gi Kwak
DiffM
239
2
0
06 Aug 2025
Slice or the Whole Pie? Utility Control for AI Models
Ye Tao
AAML
111
0
0
06 Aug 2025
Zero-Residual Concept Erasure via Progressive Alignment in Text-to-Image Model
Hongxu Chen
Zhen Wang
Taoran Mei
Lin Li
Bowei Zhu
Runshi Li
L. Chen
DiffM
171
0
0
06 Aug 2025
LayerT2V: Interactive Multi-Object Trajectory Layering for Video Generation
Kangrui Cen
Baixuan Zhao
Yi Xin
Siqi Luo
Guoquan Zheng
Xiaohong Liu
DiffM, VGen
150
0
0
06 Aug 2025
VideoGuard: Protecting Video Content from Unauthorized Editing
Junjie Cao
KaiZhou Li
Xinchun Yu
Hongxiang Li
Xiaoping Zhang
DiffM, VGen
127
0
0
05 Aug 2025
Diffusion Models with Adaptive Negative Sampling Without External Resources
Alakh Desai
Nuno Vasconcelos
DiffM
170
0
0
05 Aug 2025
CoEmoGen: Towards Semantically-Coherent and Scalable Emotional Image Content Generation
Kaishen Yuan
Yuting Zhang
Shang Gao
Yijie Zhu
Wenshuo Chen
Yutao Yue
DiffM
138
1
0
05 Aug 2025
HPSv3: Towards Wide-Spectrum Human Preference Score
Yuhang Ma
Xiaoshi Wu
Keqiang Sun
K. Sun
Jiaming Song
158
55
0
05 Aug 2025
Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models
Hyungjin Kim
Seokho Ahn
Young-Duk Seo
DiffM
133
1
0
05 Aug 2025
When Cars Have Stereotypes: Auditing Demographic Bias in Objects from Text-to-Image Models
Dasol Choi
Jihwan Lee
Minjae Lee
Minsuk Kahng
EGVM
206
0
0
05 Aug 2025
Veila: Panoramic LiDAR Generation from a Monocular RGB Image
Youquan Liu
Lingdong Kong
Weidong Yang
Ao Liang
Jianxiong Gao
...
Xiang Xu
Xin Li
Linfeng Li
Runnan Chen
Ben Fei
DiffM
110
2
0
05 Aug 2025
Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images
Philipp Wulff
Felix Wimbauer
Dominik Muhle
Daniel Cremers
MDE
156
0
0
04 Aug 2025
Forecasting When to Forecast: Accelerating Diffusion Models with Confidence-Gated Taylor
Knowledge-Based Systems (KBS), 2025
Xiaoliu Guan
Lielin Jiang
Hanqi Chen
X. Zhang
Jiaxing Yan
Guanzhong Wang
Yi-Hsueh Liu
Zetao Zhang
Yu Wu
305
2
0
04 Aug 2025
DAG: Unleash the Potential of Diffusion Model for Open-Vocabulary 3D Affordance Grounding
Hanqing Wang
Zhenhao Zhang
Kaiyang Ji
Mingyu Liu
Wenti Yin
Yuchao Chen
Zhirui Liu
Xiangyu Zeng
Tianxiang Gui
Hangxing Zhang
162
3
0
03 Aug 2025
Practical, Generalizable and Robust Backdoor Attacks on Text-to-Image Diffusion Models
Haoran Dai
Jiawen Wang
Ruo Yang
Manali Sharma
Zhonghao Liao
Yuan Hong
Binghui Wang
AAML
105
1
0
03 Aug 2025
Versatile Transition Generation with Image-to-Video Diffusion
Zuhao Yang
Jiahui Zhang
Yingchen Yu
Shijian Lu
Song Bai
DiffM, VGen
241
4
0
03 Aug 2025
Dataset Condensation with Color Compensation
Huyu Wu
Duo Su
Junjie Hou
Guang Li
DD
418
1
0
02 Aug 2025
Personalized Safety Alignment for Text-to-Image Diffusion Models
Yu Lei
Jinbin Bai
Qingyu Shi
Aosong Feng
Kaidong Yu
EGVM
207
0
0
02 Aug 2025
ReCoSeg++: Extended Residual-Guided Cross-Modal Diffusion for Brain Tumor Segmentation
Sara Yavari
Rahul Nitin Pandya
Jacob Furst
MedIm
150
3
0
01 Aug 2025
Controllable Pedestrian Video Editing for Multi-View Driving Scenarios via Motion Sequence
Danzhen Fu
Jiagao Hu
Daiguo Zhou
Fei Wang
Zepeng Wang
Wenhua Liao
VGen
208
0
0
01 Aug 2025
Steering Guidance for Personalized Text-to-Image Diffusion Models
S. Park
Seokeon Choi
Hyoungwoo Park
Sungrack Yun
200
1
0
01 Aug 2025
Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution
Yiwen Wang
Xinning Chai
Yuhong Zhang
Zhengxue Cheng
Jun Zhao
Rong Xie
Li Song
DiffM, VGen
126
0
0
01 Aug 2025
Trans-Adapter: A Plug-and-Play Framework for Transparent Image Inpainting
Yuekun Dai
Haitian Li
Shangchen Zhou
Chen Change Loy
173
1
0
01 Aug 2025
Training-free Geometric Image Editing on Diffusion Models
Hanshen Zhu
Zhen Zhu
Kaile Zhang
Yiming Gong
Yuliang Liu
Xiang Bai
DiffM
257
0
0
31 Jul 2025
UniEmo: Unifying Emotional Understanding and Generation with Learnable Expert Queries
Yijie Zhu
Lingsen Zhang
Zitong Yu
Rui Shao
Tao Tan
Liqiang Nie
210
3
0
31 Jul 2025
LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing
Federico Girella
Davide Talon
Ziyue Liu
Zanxi Ruan
Yiming Wang
Marco Cristani
DiffM
180
0
0
30 Jul 2025
DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion
Qingcheng Zhao
Xiang Zhang
Haiyang Xu
Z. Chen
Jianwen Xie
Yuan Gao
Zhuowen Tu
DiffM, MDE
169
8
0
30 Jul 2025
On the Reliability of Vision-Language Models Under Adversarial Frequency-Domain Perturbations
Jordan Vice
Naveed Akhtar
Yansong Gao
Richard Hartley
Ajmal Mian
AAML
214
2
0
30 Jul 2025
X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention
International Conference on Learning Representations (ICLR), 2025
Xiaochen Zhao
Hongyi Xu
Guoxian Song
You Xie
Chenxu Zhang
Xiu Li
Linjie Luo
J. Suo
Yebin Liu
VGen
204
17
0
30 Jul 2025
Trade-offs in Image Generation: How Do Different Dimensions Interact?
Sicheng Zhang
Binzhu Xie
Zhonghao Yan
Yuli Zhang
Donghao Zhou
Xiaofei Chen
Shi Qiu
Jiaqi Liu
Guoyang Xie
Zhichao Lu
164
2
0
29 Jul 2025
GuidPaint: Class-Guided Image Inpainting with Diffusion Models
Qimin Wang
Xinda Liu
Guohua Geng
DiffM
236
1
0
29 Jul 2025
X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again
Zigang Geng
Y. Wang
Yeyao Ma
Chen Li
Yongming Rao
...
Han Hu
Xiaosong Zhang
Linus
Di Wang
Jie Jiang
180
32
0
29 Jul 2025
On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey
Meishan Zhang
Xin Zhang
X. Zhao
Shouzheng Huang
Baotian Hu
Min Zhang
269
3
0
28 Jul 2025
Flow Matching Policy Gradients
David McAllister
Songwei Ge
Brent Yi
Chung Min Kim
Ethan Weber
Hongsuk Choi
Haiwen Feng
Angjoo Kanazawa
278
22
0
28 Jul 2025
Multimodal LLMs as Customized Reward Models for Text-to-Image Generation
Shijie Zhou
Ruiyi Zhang
Huaisheng Zhu
Branislav Kveton
Jiuxiang Gu
J. Gu
Jian Chen
Changyou Chen
MLLM, VLM, LRM
391
6
0
28 Jul 2025
AIComposer: Any Style and Content Image Composition via Feature Integration
Haowen Li
Zhenfeng Fan
Zhang Wen
Zhengzhou Zhu
Yunjin Li
DiffM
173
1
0
28 Jul 2025
Compositional Video Synthesis by Temporal Object-Centric Learning
Adil Kaan Akan
Yucel Yemez
DiffM, OCL
246
0
0
28 Jul 2025
Sem-DPO: Mitigating Semantic Inconsistency in Preference Optimization for Prompt Engineering
Anas Mohamed
A. Khan
Xinran Wang
Ahmad Faraz Khan
Shuwen Ge
Saman Bahzad Khan
Ayaan Ahmad
Ali Anwar
221
0
0
27 Jul 2025
Fine-structure Preserved Real-world Image Super-resolution via Transfer VAE Training
Qiaosi Yi
Shuai Li
Rongyuan Wu
Lingchen Sun
Y. Wu
Lei Zhang
SupR
294
7
0
27 Jul 2025
SCALAR: Scale-wise Controllable Visual Autoregressive Learning
Ryan Xu
Dongyang Jin
Y. Bai
Rui Lan
Xu Duan
Lei Sun
Xiangxiang Chu
305
8
0
26 Jul 2025
A Survey on Generative Model Unlearning: Fundamentals, Taxonomy, Evaluation, and Future Direction
Xiaohua Feng
Jiaming Zhang
Fengyuan Yu
C. Wang
Li Zhang
Kaixiang Li
Yuyuan Li
Chaochao Chen
Jianwei Yin
MU
274
2
0
26 Jul 2025
SeeDiff: Off-the-Shelf Seeded Mask Generation from Diffusion Models
AAAI Conference on Artificial Intelligence (AAAI), 2025
J. Park
Kumju Jo
Sungyong Baik
DiffM
224
0
0
26 Jul 2025