Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Versions: v1, v2, v3, v4 (latest)


3 June 2024
Qilong Zhangli
Jindong Jiang
Di Liu
Licheng Yu
Xiaoliang Dai
Ankit Ramchandani
Guan Pang
Dimitris N. Metaxas
Praveen Krishnan
    DiffM

Papers citing "Layout-Agnostic Scene Text Image Synthesis with Diffusion Models"

11 / 11 papers shown
Large Sign Language Models: Toward 3D American Sign Language Translation
S. Zhang
Xiaoxiao He
Di Liu
Zhaoyang Xia
Mingyu Zhao
Chaowei Tan
Vivian Li
Bo Liu
Dimitris N. Metaxas
Mubbasir Kapadia
SLR
306
1
0
11 Nov 2025
MILD: Multi-Layer Diffusion Strategy for Complex and Precise Multi-IP Aware Human Erasing
Jinghan Yu
Junhao Xiao
Zhiyuan Ma
Yue Ma
Kaiqi Liu
Yuhan Wang
Daizong Liu
Xianghao Meng
Jianjun Li
DiffM
193
0
0
05 Aug 2025
HDGlyph: A Hierarchical Disentangled Glyph-Based Framework for Long-Tail Text Rendering in Diffusion Models
Shuhan Zhuang
Mengqi Huang
Fengyi Fu
Nan Chen
Bohan Lei
Zhendong Mao
DiffM
192
0
0
10 May 2025
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes
Nikai Du
Zhennan Chen
Zheyu Chen
Shan Gao
Xi Chen
Zhengkai Jiang
Jian Yang
Ying Tai
DiffM
649
16
0
30 Mar 2025
Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars
Eric M. Chen
Di Liu
Sizhuo Ma
Michael Vasilkovsky
Bing Zhou
...
Wei Wang
Jiahao Luo
Dimitris N. Metaxas
Vincent Sitzmann
Jian Wang
3DGS
472
1
0
15 Mar 2025
LUCAS: Layered Universal Codec Avatars
Computer Vision and Pattern Recognition (CVPR), 2025
Di Liu
Teng Deng
Giljoo Nam
Yu Rong
Stanislav Pidhorskyi
Junxuan Li
Jason M. Saragih
Dimitris N. Metaxas
Chen Cao
3DGS
396
5
0
27 Feb 2025
Improved Training Technique for Latent Consistency Models
International Conference on Learning Representations (ICLR), 2025
Quan Dao
Khanh Doan
Di Liu
Trung Le
Dimitris N. Metaxas
466
10
0
03 Feb 2025
Type-R: Automatically Retouching Typos for Text-to-Image Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Wataru Shimoda
Naoto Inoue
Daichi Haraguchi
Hayato Mitani
S. Uchida
Kota Yamaguchi
DiffM
566
1
0
27 Nov 2024
Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation
Qilong Zhangli
Di Liu
Abhishek Aich
Dimitris Metaxas
S. Schulter
206
1
0
15 Sep 2024
GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models
Jian Ma
Yonglin Deng
Chen Chen
H. Lu
Zhenyu Yang
VLM
DiffM
598
22
0
02 Jul 2024
SINE: SINgle Image Editing with Text-to-Image Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
Zhixing Zhang
Ligong Han
Arna Ghosh
Dimitris N. Metaxas
Jian Ren
DiffM
452
180
0
08 Dec 2022