ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.18159
  4. Cited By
Type-R: Automatically Retouching Typos for Text-to-Image Generation
v1v2 (latest)

Type-R: Automatically Retouching Typos for Text-to-Image Generation

Computer Vision and Pattern Recognition (CVPR), 2024
27 November 2024
Wataru Shimoda
Naoto Inoue
Daichi Haraguchi
Hayato Mitani
S. Uchida
Kota Yamaguchi
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Type-R: Automatically Retouching Typos for Text-to-Image Generation"

46 / 46 papers shown
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance ControlNeural Information Processing Systems (NeurIPS), 2024
Weichao Zeng
Yan Shu
Zhenhang Li
Dongbao Yang
Can Ma
DiffM
258
25
0
14 Oct 2024
TextMaster: A Unified Framework for Realistic Text Editing via Glyph-Style Dual-Control
TextMaster: A Unified Framework for Realistic Text Editing via Glyph-Style Dual-Control
Zhenyu Yan
Jiangming Wang
Aoqiang Wang
Wenxiang Shang
Ran Lin
Zhao Zhang
DiffM
284
2
0
13 Oct 2024
A Reflection on the Impact of Misspecifying Unidentifiable Causal
  Inference Models in Surrogate Endpoint Evaluation
A Reflection on the Impact of Misspecifying Unidentifiable Causal Inference Models in Surrogate Endpoint Evaluation
Gokce Deliorman
Florian Stijven
Wim Van der Elst
Maria del Carmen Pardo
Ariel Alonso
CML
238
5
0
06 Oct 2024
JoyType: A Robust Design for Multilingual Visual Text Creation
JoyType: A Robust Design for Multilingual Visual Text Creation
Chao Li
Chen Jiang
Xiaolong Liu
Jun Zhao
Guoxin Wang
DiffM
363
7
0
26 Sep 2024
Harmonizing Visual Text Comprehension and Generation
Harmonizing Visual Text Comprehension and Generation
Zhen Zhao
Jingqun Tang
Binghong Wu
Chunhui Lin
Shubo Wei
Hao Liu
Xin Tan
Zhizhong Zhang
Can Huang
Yuan Xie
VLM
328
41
0
23 Jul 2024
Visual Text Generation in the Wild
Visual Text Generation in the Wild
Yuanzhi Zhu
Jiawei Liu
Feiyu Gao
Wenyu Liu
Xinggang Wang
Peng Wang
Fei Huang
Cong Yao
Zhibo Yang
DiffM
247
14
0
19 Jul 2024
Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual
  Visual Text Rendering
Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering
Zeyu Liu
Weicong Liang
Yiming Zhao
Bohan Chen
Lin Liang
Lijuan Wang
Ji Li
Yuhui Yuan
248
36
0
14 Jun 2024
OpenCOLE: Towards Reproducible Automatic Graphic Design Generation
OpenCOLE: Towards Reproducible Automatic Graphic Design Generation
Naoto Inoue
Kento Masui
Wataru Shimoda
Kota Yamaguchi
250
24
0
12 Jun 2024
Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Qilong Zhangli
Jindong Jiang
Di Liu
Licheng Yu
Xiaoliang Dai
Ankit Ramchandani
Guan Pang
Dimitris N. Metaxas
Praveen Krishnan
DiffM
476
17
0
03 Jun 2024
Refining Text-to-Image Generation: Towards Accurate Training-Free
  Glyph-Enhanced Image Generation
Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation
Sanyam Lakhanpal
Shivang Chopra
Vinija Jain
Vasu Sharma
Man Luo
171
16
0
25 Mar 2024
Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering
Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text RenderingEuropean Conference on Computer Vision (ECCV), 2024
Zeyu Liu
Weicong Liang
Zhanhao Liang
Chong Luo
Ji Li
Gao Huang
Yuhui Yuan
DiffM
390
53
0
14 Mar 2024
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
...
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
DiffM
2.6K
2,881
0
05 Mar 2024
Typographic Text Generation with Off-the-Shelf Diffusion Model
Typographic Text Generation with Off-the-Shelf Diffusion Model
KhayTze Peong
Seiichi Uchida
Daichi Haraguchi
DiffM
303
7
0
22 Feb 2024
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text
  Segmentation
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
Maoyuan Ye
Jing Zhang
Juhua Liu
Chenyu Liu
Baocai Yin
Cong Liu
Bo Du
Dacheng Tao
VLM
240
32
0
31 Jan 2024
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and
  Generating with Multimodal LLMs
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMsInternational Conference on Machine Learning (ICML), 2024
Ling Yang
Zhaochen Yu
Chenlin Meng
Minkai Xu
Stefano Ermon
Tengjiao Wang
CoGeDiffM
520
194
0
22 Jan 2024
An Empirical Study of Scaling Law for OCR
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
440
12
0
29 Dec 2023
Parrot Captions Teach CLIP to Spot Text
Parrot Captions Teach CLIP to Spot Text
Yiqi Lin
Conghui He
Alex Jinpeng Wang
Sijin Yu
Weijia Li
Mike Zheng Shou
324
14
0
21 Dec 2023
Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model
Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model
Lingjun Zhang
Xinyuan Chen
Yaohui Wang
Yue Lu
Yu Qiao
DiffM
249
50
0
19 Dec 2023
UDiffText: A Unified Framework for High-quality Text Synthesis in
  Arbitrary Images via Character-aware Diffusion Models
UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models
Yiming Zhao
Zhouhui Lian
261
48
0
08 Dec 2023
TextDiffuser-2: Unleashing the Power of Language Models for Text
  Rendering
TextDiffuser-2: Unleashing the Power of Language Models for Text RenderingEuropean Conference on Computer Vision (ECCV), 2023
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
DiffM
286
104
0
28 Nov 2023
Self-correcting LLM-controlled Diffusion Models
Self-correcting LLM-controlled Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2023
Tsung-Han Wu
Long Lian
Joseph E. Gonzalez
Boyi Li
Trevor Darrell
319
99
0
27 Nov 2023
LayoutPrompter: Awaken the Design Ability of Large Language Models
LayoutPrompter: Awaken the Design Ability of Large Language ModelsNeural Information Processing Systems (NeurIPS), 2023
Jiawei Lin
Jiaqi Guo
Shizhao Sun
Z. Yang
Jian-Guang Lou
Dongmei Zhang
VLM
292
47
0
11 Nov 2023
AnyText: Multilingual Visual Text Generation And Editing
AnyText: Multilingual Visual Text Generation And EditingInternational Conference on Learning Representations (ICLR), 2023
Yuxiang Tuo
Wangmeng Xiang
Jun-Yan He
Yifeng Geng
Xuansong Xie
DiffM
656
122
0
06 Nov 2023
Llama 2: Open Foundation and Fine-Tuned Chat Models
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
...
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MHALM
8.8K
15,551
0
18 Jul 2023
SDXL: Improving Latent Diffusion Models for High-Resolution Image
  Synthesis
SDXL: Improving Latent Diffusion Models for High-Resolution Image SynthesisInternational Conference on Learning Representations (ICLR), 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
1.7K
3,891
0
04 Jul 2023
GlyphControl: Glyph Conditional Control for Visual Text Generation
GlyphControl: Glyph Conditional Control for Visual Text GenerationNeural Information Processing Systems (NeurIPS), 2023
Yukang Yang
Dongnan Gui
Yuhui Yuan
Weicong Liang
Haisong Ding
Hang-Rui Hu
Kai Chen
DiffM
269
118
0
29 May 2023
LayoutGPT: Compositional Visual Planning and Generation with Large
  Language Models
LayoutGPT: Compositional Visual Planning and Generation with Large Language ModelsNeural Information Processing Systems (NeurIPS), 2023
Weixi Feng
Wanrong Zhu
Tsu-Jui Fu
Varun Jampani
Arjun Reddy Akula
Xuehai He
Sugato Basu
Xinze Wang
William Yang Wang
MLLM
512
298
0
24 May 2023
TextDiffuser: Diffusion Models as Text Painters
TextDiffuser: Diffusion Models as Text PaintersNeural Information Processing Systems (NeurIPS), 2023
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
590
184
0
18 May 2023
Improving Diffusion Models for Scene Text Editing with Dual Encoders
Improving Diffusion Models for Scene Text Editing with Dual Encoders
Jiabao Ji
Guanhua Zhang
Zhaowen Wang
Bairu Hou
Zhifei Zhang
Brian L. Price
Shiyu Chang
DiffM
220
46
0
12 Apr 2023
GlyphDraw: Seamlessly Rendering Text with Intricate Spatial Structures
  in Text-to-Image Generation
GlyphDraw: Seamlessly Rendering Text with Intricate Spatial Structures in Text-to-Image Generation
Jiancang Ma
Mingjun Zhao
Chen Chen
Ruichen Wang
Di Niu
H. Lu
Xiaodong Lin
DiffM
346
24
0
31 Mar 2023
A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT
A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT
Jules White
Quchen Fu
Sam Hays
Michael Sandborn
Carlos Olea
Henry Gilbert
Ashraf Elnashar
Jesse Spencer-Smith
Douglas C. Schmidt
LLMAG
403
1,554
0
21 Feb 2023
Character-Aware Models Improve Visual Text Rendering
Character-Aware Models Improve Visual Text RenderingAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Rosanne Liu
Daniel H Garrette
Chitwan Saharia
William Chan
Adam Roberts
Sharan Narang
Irina Blok
R. Mical
Mohammad Norouzi
Noah Constant
VLM
271
87
0
20 Dec 2022
Exploring Stroke-Level Modifications for Scene Text Editing
Exploring Stroke-Level Modifications for Scene Text EditingAAAI Conference on Artificial Intelligence (AAAI), 2022
Yadong Qu
Qingfeng Tan
Hongtao Xie
Jianjun Xu
Yuxin Wang
Yongdong Zhang
DiffM
196
45
0
05 Dec 2022
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text
  Spotting
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text SpottingComputer Vision and Pattern Recognition (CVPR), 2022
Maoyuan Ye
Jing Zhang
Shanshan Zhao
Juhua Liu
Tongliang Liu
Bo Du
Dacheng Tao
385
103
0
19 Nov 2022
The Surprisingly Straightforward Scene Text Removal Method With Gated
  Attention and Region of Interest Generation: A Comprehensive Prominent Model
  Analysis
The Surprisingly Straightforward Scene Text Removal Method With Gated Attention and Region of Interest Generation: A Comprehensive Prominent Model AnalysisEuropean Conference on Computer Vision (ECCV), 2022
Hyeonsu Lee
Chankyu Choi
249
18
0
14 Oct 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding
Photorealistic Text-to-Image Diffusion Models with Deep Language UnderstandingNeural Information Processing Systems (NeurIPS), 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
1.2K
7,613
0
23 May 2022
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2021
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
DiffM
3.1K
21,434
0
20 Dec 2021
TrOCR: Transformer-based Optical Character Recognition with Pre-trained
  Models
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. Florêncio
Cha Zhang
Zhoujun Li
Furu Wei
ViT
542
515
0
21 Sep 2021
Resolution-robust Large Mask Inpainting with Fourier Convolutions
Resolution-robust Large Mask Inpainting with Fourier Convolutions
Roman Suvorov
Elizaveta Logacheva
Anton Mashikhin
Anastasia Remizova
Arsenii Ashukha
Aleksei Silvestrov
Naejin Kong
Harshith Goka
Kiwoong Park
Victor Lempitsky
348
1,170
0
15 Sep 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language SupervisionInternational Conference on Machine Learning (ICML), 2021
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIPVLM
2.0K
42,087
0
26 Feb 2021
Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text
  Spotting
Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text SpottingEuropean Conference on Computer Vision (ECCV), 2020
Minghui Liao
Guan Pang
Jing Huang
Tal Hassner
X. Bai
271
203
0
18 Jul 2020
SwapText: Image Based Texts Transfer in Scenes
SwapText: Image Based Texts Transfer in ScenesComputer Vision and Pattern Recognition (CVPR), 2020
Qiangpeng Yang
Hongsheng Jin
Yanjie Liang
Jialin Li
DiffM
255
80
0
18 Mar 2020
Editing Text in the Wild
Editing Text in the WildACM Multimedia (ACM MM), 2019
Liang Wu
Chengquan Zhang
Jiaming Liu
Junyu Han
Jingtuo Liu
Errui Ding
X. Bai
359
152
0
08 Aug 2019
Character Region Awareness for Text Detection
Character Region Awareness for Text Detection
Youngmin Baek
Bado Lee
Dongyoon Han
Sangdoo Yun
Hwalsuk Lee
226
917
0
03 Apr 2019
What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and
  Model Analysis
What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis
Jeonghun Baek
Geewook Kim
Junyeop Lee
Sungrae Park
Dongyoon Han
Sangdoo Yun
Seong Joon Oh
Hwalsuk Lee
910
534
0
03 Apr 2019
STEFANN: Scene Text Editor using Font Adaptive Neural Network
STEFANN: Scene Text Editor using Font Adaptive Neural NetworkComputer Vision and Pattern Recognition (CVPR), 2019
Prasun Roy
Saumik Bhattacharya
Subhankar Ghosh
Umapada Pal
DiffM
353
69
0
04 Mar 2019
1
Page 1 of 1