ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11487
  4. Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Neural Information Processing Systems (NeurIPS), 2022
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 5,039 papers shown
NeuroSwift: A Lightweight Cross-Subject Framework for fMRI Visual Reconstruction of Complex Scenes
NeuroSwift: A Lightweight Cross-Subject Framework for fMRI Visual Reconstruction of Complex Scenes
Shiyi Zhang
Dong Liang
Yihang Zhou
161
0
0
02 Oct 2025
Leveraging Prior Knowledge of Diffusion Model for Person Search
Leveraging Prior Knowledge of Diffusion Model for Person Search
Giyeol Kim
Sooyoung Yang
Jihyong Oh
Myungjoo Kang
Chanho Eom
DiffM
104
0
0
02 Oct 2025
DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing
DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing
Zihan Zhou
Shilin Lu
Shuli Leng
Shaocong Zhang
Zhuming Lian
Xinlei Yu
A. Kong
DiffM
296
7
0
02 Oct 2025
PEO: Training-Free Aesthetic Quality Enhancement in Pre-Trained Text-to-Image Diffusion Models with Prompt Embedding Optimization
PEO: Training-Free Aesthetic Quality Enhancement in Pre-Trained Text-to-Image Diffusion Models with Prompt Embedding Optimization
Hovhannes Margaryan
Bo Wan
Tinne Tuytelaars
280
0
0
02 Oct 2025
Towards Better Optimization For Listwise Preference in Diffusion Models
Towards Better Optimization For Listwise Preference in Diffusion Models
Jiamu Bai
Xin Yu
Meilong Xu
Weitao Lu
Xin Pan
Kiwan Maeng
Daniel Kifer
Jian Wang
Yu Wang
EGVM
338
1
0
02 Oct 2025
Toward Safer Diffusion Language Models: Discovery and Mitigation of Priming Vulnerability
Toward Safer Diffusion Language Models: Discovery and Mitigation of Priming Vulnerability
Shojiro Yamabe
Jun Sakuma
AAML
124
0
0
01 Oct 2025
Continuously Augmented Discrete Diffusion model for Categorical Generative Modeling
Continuously Augmented Discrete Diffusion model for Categorical Generative Modeling
Huangjie Zheng
Shansan Gong
Ruixiang Zhang
Tianrong Chen
Jiatao Gu
Mingyuan Zhou
Navdeep Jaitly
Y. Zhang
DiffM
279
6
0
01 Oct 2025
Syntax-Guided Diffusion Language Models with User-Integrated Personalization
Syntax-Guided Diffusion Language Models with User-Integrated Personalization
Ruqian Zhang
Yijiao Zhang
Juan Shen
Zhongyi Zhu
Annie Qu
DiffM
128
0
0
01 Oct 2025
ImageDoctor: Diagnosing Text-to-Image Generation via Grounded Image Reasoning
ImageDoctor: Diagnosing Text-to-Image Generation via Grounded Image Reasoning
Yuxiang Guo
Jiang Liu
Ze Wang
Hao Chen
Ximeng Sun
Yang Zhao
Jialian Wu
Xiaodong Yu
Zicheng Liu
Emad Barsoum
LM&MA
138
0
0
01 Oct 2025
Learn to Guide Your Diffusion Model
Learn to Guide Your Diffusion Model
Alexandre Galashov
Ashwini Pokle
Arnaud Doucet
Arthur Gretton
Mauricio Delbracio
Valentin De Bortoli
DiffM
438
0
0
01 Oct 2025
Erased, But Not Forgotten: Erased Rectified Flow Transformers Still Remain Unsafe Under Concept Attack
Erased, But Not Forgotten: Erased Rectified Flow Transformers Still Remain Unsafe Under Concept Attack
Nanxiang Jiang
Zhaoxin Fan
Enhan Kang
Daiheng Gao
Yun Zhou
Yanxia Chang
Zheng Zhu
Yeying Jin
Wenjun Wu
AAML
184
0
0
01 Oct 2025
JEPA-T: Joint-Embedding Predictive Architecture with Text Fusion for Image Generation
JEPA-T: Joint-Embedding Predictive Architecture with Text Fusion for Image Generation
Siheng Wan
Zhengtao Yao
Zhengdao Li
Junhao Dong
Yanshu Li
...
Haoyan Xu
Yijiang Li
Zhikang Dong
Huacan Wang
Jifeng Shen
DiffM
96
0
0
01 Oct 2025
MetaLogic: Robustness Evaluation of Text-to-Image Models via Logically Equivalent Prompts
MetaLogic: Robustness Evaluation of Text-to-Image Models via Logically Equivalent Prompts
Yifan Shen
Yangyang Shu
Hye-young Paik
Yulei Sui
DiffMEGVM
197
1
0
01 Oct 2025
Secure and Robust Watermarking for AI-generated Images: A Comprehensive Survey
Secure and Robust Watermarking for AI-generated Images: A Comprehensive Survey
Jie Cao
Qi Li
Z. Zhang
Jianbing Ni
184
0
0
30 Sep 2025
EVODiff: Entropy-aware Variance Optimized Diffusion Inference
EVODiff: Entropy-aware Variance Optimized Diffusion Inference
Shigui Li
Wei Chen
Delu Zeng
DiffM
154
2
0
30 Sep 2025
EchoGen: Generating Visual Echoes in Any Scene via Feed-Forward Subject-Driven Auto-Regressive Model
EchoGen: Generating Visual Echoes in Any Scene via Feed-Forward Subject-Driven Auto-Regressive Model
Ruixiao Dong
Z. Wang
Keli Liu
Li Li
Ying Chen
Kai Li
Daowen Li
Houqiang Li
DiffMVGen
142
0
0
30 Sep 2025
Editable Noise Map Inversion: Encoding Target-image into Noise For High-Fidelity Image Manipulation
Editable Noise Map Inversion: Encoding Target-image into Noise For High-Fidelity Image Manipulation
Mingyu Kang
Yong Suk Choi
DiffM
223
0
0
30 Sep 2025
VRWKV-Editor: Reducing quadratic complexity in transformer-based video editing
VRWKV-Editor: Reducing quadratic complexity in transformer-based video editing
Abdelilah Aitrouga
Youssef Hmamouche
Amal El Fallah Seghrouchni
VGen
214
0
0
30 Sep 2025
Stitch: Training-Free Position Control in Multimodal Diffusion Transformers
Stitch: Training-Free Position Control in Multimodal Diffusion Transformers
Jessica Bader
Mateusz Pach
Maria A. Bravo
Serge Belongie
Zeynep Akata
151
1
0
30 Sep 2025
GaussEdit: Adaptive 3D Scene Editing with Text and Image Prompts
GaussEdit: Adaptive 3D Scene Editing with Text and Image PromptsIEEE Transactions on Visualization and Computer Graphics (TVCG), 2025
Zhenyu Shu
Junlong Yu
Kai Chao
Shiqing Xin
Ligang Liu
3DGS
193
3
0
30 Sep 2025
CO3: Contrasting Concepts Compose Better
CO3: Contrasting Concepts Compose Better
Debottam Dutta
Jianchong Chen
Rajalaxmi Rajagopalan
Yu-Lin Wei
Romit Roy Choudhury
DiffM
126
0
0
30 Sep 2025
Advantage Weighted Matching: Aligning RL with Pretraining in Diffusion Models
Advantage Weighted Matching: Aligning RL with Pretraining in Diffusion Models
Shuchen Xue
Chongjian Ge
Shilong Zhang
Yichen Li
Zhi-Ming Ma
139
2
0
29 Sep 2025
U-DiT Policy: U-shaped Diffusion Transformers for Robotic Manipulation
U-DiT Policy: U-shaped Diffusion Transformers for Robotic Manipulation
Linzhi Wu
Aoran Mei
Xiyue Wang
Guo-Niu Zhu
Zhongxue Gan
76
0
0
29 Sep 2025
When Scores Learn Geometry: Rate Separations under the Manifold Hypothesis
When Scores Learn Geometry: Rate Separations under the Manifold Hypothesis
Xiang Li
Zebang Shen
Ya-Ping Hsieh
Niao He
DiffM
1.4K
0
0
29 Sep 2025
DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space
DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space
Wenkun He
Yuchao Gu
Junyu Chen
Dongyun Zou
Yujun Lin
...
Jincheng Yu
Junsong Chen
Enze Xie
Song Han
Han Cai
244
2
0
29 Sep 2025
Instruction Guided Multi Object Image Editing with Quantity and Layout Consistency
Instruction Guided Multi Object Image Editing with Quantity and Layout Consistency
Jiaqi Tan
F. Li
Yang Liu
DiffM
109
0
0
29 Sep 2025
DiffPCN: Latent Diffusion Model Based on Multi-view Depth Images for Point Cloud Completion
DiffPCN: Latent Diffusion Model Based on Multi-view Depth Images for Point Cloud Completion
Z. Li
Hongyu Yan
Shijie Li
Kunming Luo
Li Lu
Xulei Yang
Weisi Lin
DiffM
113
0
0
28 Sep 2025
HieraTok: Multi-Scale Visual Tokenizer Improves Image Reconstruction and Generation
HieraTok: Multi-Scale Visual Tokenizer Improves Image Reconstruction and Generation
Cong Chen
Ziyuan Huang
Cheng Zou
Huanyi Zheng
Kaixiang Ji
Jiajia Liu
Jingdong Chen
Hao Chen
Chunhua Shen
150
3
0
28 Sep 2025
Token Painter: Training-Free Text-Guided Image Inpainting via Mask Autoregressive Models
Token Painter: Training-Free Text-Guided Image Inpainting via Mask Autoregressive Models
Longtao Jiang
Mingfei Han
Lei Chen
Yongqiang Yu
Feng Zhao
Feng Zhao
Xiaojun Chang
Zhihui Li
DiffM
116
0
0
28 Sep 2025
DiffInk: Glyph- and Style-Aware Latent Diffusion Transformer for Text to Online Handwriting Generation
DiffInk: Glyph- and Style-Aware Latent Diffusion Transformer for Text to Online Handwriting Generation
Wei Pan
Huiguo He
Hiuyi Cheng
Yilin Shi
Lianwen Jin
DiffM
122
0
0
28 Sep 2025
Diff-3DCap: Shape Captioning with Diffusion Models
Diff-3DCap: Shape Captioning with Diffusion ModelsIEEE Transactions on Visualization and Computer Graphics (TVCG), 2025
Zhenyu Shu
Jiawei Wen
Shiyang Li
Shiqing Xin
Ligang Liu
DiffM
123
0
0
28 Sep 2025
Griffin: Generative Reference and Layout Guided Image Composition
Griffin: Generative Reference and Layout Guided Image Composition
Aryan Mikaeili
Amirhossein Alimohammadi
Negar Hassanpour
Ali Mahdavi-Amiri
Andrea Tagliasacchi
DiffM
79
0
0
28 Sep 2025
No Concept Left Behind: Test-Time Optimization for Compositional Text-to-Image Generation
No Concept Left Behind: Test-Time Optimization for Compositional Text-to-Image Generation
Mohammad Hossein Sameti
Amir M. Mansourian
Arash Marioriyad
Soheil Fadaee Oshyani
M. Rohban
M. Baghshah
96
0
0
27 Sep 2025
Group Critical-token Policy Optimization for Autoregressive Image Generation
Group Critical-token Policy Optimization for Autoregressive Image Generation
Guohui Zhang
Hu Yu
Xiaoxiao Ma
Jinghao Zhang
Yaning Pan
Mingde Yao
Jie Xiao
Linjiang Huang
Feng Zhao
147
2
0
26 Sep 2025
Mind-the-Glitch: Visual Correspondence for Detecting Inconsistencies in Subject-Driven Generation
Mind-the-Glitch: Visual Correspondence for Detecting Inconsistencies in Subject-Driven Generation
Abdelrahman Eldesokey
Aleksandar Cvejic
Bernard Ghanem
Peter Wonka
120
0
0
26 Sep 2025
UISim: An Interactive Image-Based UI Simulator for Dynamic Mobile Environments
UISim: An Interactive Image-Based UI Simulator for Dynamic Mobile Environments
Jiannan Xiang
Yun Zhu
Lei Shu
Maria Wang
Lijun Yu
Gabriel Barcik
James Lyon
Srinivas Sunkara
Jindong Chen
96
0
0
26 Sep 2025
Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance
Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance
Luc Boudier
Loris Manganelli
Eleftherios Tsonis
Nicolas Dufour
Vicky Kalogeiton
DiffMVLM
107
1
0
26 Sep 2025
HiGS: History-Guided Sampling for Plug-and-Play Enhancement of Diffusion Models
HiGS: History-Guided Sampling for Plug-and-Play Enhancement of Diffusion Models
Seyedmorteza Sadat
Farnood Salehi
Romann M. Weber
DiffM
160
0
0
26 Sep 2025
SemanticControl: A Training-Free Approach for Handling Loosely Aligned Visual Conditions in ControlNet
SemanticControl: A Training-Free Approach for Handling Loosely Aligned Visual Conditions in ControlNet
Woosung Joung
Daewon Chae
Jinkyu Kim
DiffM
90
0
0
26 Sep 2025
High-Quality Sound Separation Across Diverse Categories via Visually-Guided Generative Modeling
High-Quality Sound Separation Across Diverse Categories via Visually-Guided Generative Modeling
Chao Huang
Susan Liang
Yapeng Tian
Anurag Kumar
Chenliang Xu
DiffM
144
0
0
26 Sep 2025
LABELING COPILOT: A Deep Research Agent for Automated Data Curation in Computer Vision
LABELING COPILOT: A Deep Research Agent for Automated Data Curation in Computer Vision
Debargha Ganguly
Sumit Kumar
Ishwar B Balappanawar
Weicong Chen
Shashank Kambhatla
Srinivasan Iyengar
Shivkumar Kalyanaraman
Ponnurangam Kumaraguru
Vipin Chaudhary
VLM
183
0
0
26 Sep 2025
FailureAtlas:Mapping the Failure Landscape of T2I Models via Active Exploration
FailureAtlas:Mapping the Failure Landscape of T2I Models via Active Exploration
Muxi Chen
Zhaohua Zhang
Chenchen Zhao
Mingyang Chen
Wenyu Jiang
...
Jianhuan Zhuo
Yu Tang
Qiuyong Xiao
Jihong Zhang
Qiang Xu
100
1
0
26 Sep 2025
A Unified Framework for Diffusion Model Unlearning with f-Divergence
A Unified Framework for Diffusion Model Unlearning with f-Divergence
Nicola Novello
Federico Fontana
Luigi Cinque
Deniz Gunduz
Andrea M. Tonello
226
0
0
25 Sep 2025
A Single Neuron Works: Precise Concept Erasure in Text-to-Image Diffusion Models
A Single Neuron Works: Precise Concept Erasure in Text-to-Image Diffusion Models
Qinqin He
Jiaqi Weng
Jialing Tao
Hui Xue
96
1
0
25 Sep 2025
Prompt-aware classifier free guidance for diffusion models
Prompt-aware classifier free guidance for diffusion models
Xuanhao Zhang
Chang Li
DiffMVLM
173
0
0
25 Sep 2025
Neptune-X: Active X-to-Maritime Generation for Universal Maritime Object Detection
Neptune-X: Active X-to-Maritime Generation for Universal Maritime Object Detection
Yu Guo
Shengfeng He
Yuxu Lu
Haonan An
Yihang Tao
Huilin Zhu
Jingxian Liu
Yuguang Fang
246
1
0
25 Sep 2025
MotionFlow:Learning Implicit Motion Flow for Complex Camera Trajectory Control in Video Generation
MotionFlow:Learning Implicit Motion Flow for Complex Camera Trajectory Control in Video Generation
Guojun Lei
Chi-Yin Wang
Yikai Wang
Hong Li
Ying Song
W. Xu
DiffMVGen
103
0
0
25 Sep 2025
CusEnhancer: A Zero-Shot Scene and Controllability Enhancement Method for Photo Customization via ResInversion
CusEnhancer: A Zero-Shot Scene and Controllability Enhancement Method for Photo Customization via ResInversion
Maoye Ren
Praneetha Vaddamanu
Jianjin Xu
Fernando De la Torre Frade
DiffM
92
0
0
25 Sep 2025
FreeInsert: Personalized Object Insertion with Geometric and Style Control
FreeInsert: Personalized Object Insertion with Geometric and Style Control
Yuhong Zhang
Han Wang
Yiwen Wang
Rong Xie
Li Song
DiffM
105
1
0
25 Sep 2025
SlimDiff: Training-Free, Activation-Guided Hands-free Slimming of Diffusion Models
SlimDiff: Training-Free, Activation-Guided Hands-free Slimming of Diffusion Models
Arani Roy
Shristi Das Biswas
Kaushik Roy
136
0
0
25 Sep 2025
Previous
123456...99100101
Next