ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.03054
  4. Cited By
AnyText: Multilingual Visual Text Generation And Editing

AnyText: Multilingual Visual Text Generation And Editing

6 November 2023
Yuxiang Tuo
Wangmeng Xiang
Jun-Yan He
Yifeng Geng
Xuansong Xie
    DiffM
ArXivPDFHTML

Papers citing "AnyText: Multilingual Visual Text Generation And Editing"

14 / 64 papers shown
Title
ARTIST: Improving the Generation of Text-rich Images by Disentanglement
ARTIST: Improving the Generation of Text-rich Images by Disentanglement
Jianyi Zhang
Yufan Zhou
Jiuxiang Gu
Curtis Wigington
Tong Yu
Yiran Chen
Tong Sun
Ruiyi Zhang
75
0
0
17 Jun 2024
Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual
  Visual Text Rendering
Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering
Zeyu Liu
Weicong Liang
Yiming Zhao
Bohan Chen
Lin Liang
Lijuan Wang
Ji Li
Yuhui Yuan
38
14
0
14 Jun 2024
Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings
Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings
Olivia Wiles
Chuhan Zhang
Isabela Albuquerque
Ivana Kajić
Su Wang
...
Jordi Pont-Tuset
Aida Nematzadeh
Anant Nawalgaria
Jordi Pont-Tuset
Aida Nematzadeh
EGVM
127
14
0
25 Apr 2024
LTOS: Layout-controllable Text-Object Synthesis via Adaptive
  Cross-attention Fusions
LTOS: Layout-controllable Text-Object Synthesis via Adaptive Cross-attention Fusions
Xiaoran Zhao
Tianhao Wu
Yu Lai
Zhiliang Tian
Zhen Huang
Yahui Liu
Zejiang He
Dongsheng Li
DiffM
36
1
0
21 Apr 2024
TextCenGen: Attention-Guided Text-Centric Background Adaptation for Text-to-Image Generation
TextCenGen: Attention-Guided Text-Centric Background Adaptation for Text-to-Image Generation
Tianyi Liang
Jiangqi Liu
Sicheng Song
Shiqi Jiang
Yifei Huang
Changbo Wang
Chenhui Li
42
0
0
18 Apr 2024
PosterLlama: Bridging Design Ability of Langauge Model to Contents-Aware
  Layout Generation
PosterLlama: Bridging Design Ability of Langauge Model to Contents-Aware Layout Generation
Jaejung Seol
Seojun Kim
Jaejun Yoo
3DV
VLM
34
7
0
01 Apr 2024
Refining Text-to-Image Generation: Towards Accurate Training-Free
  Glyph-Enhanced Image Generation
Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation
Sanyam Lakhanpal
Shivang Chopra
Vinija Jain
Aman Chadha
Man Luo
32
9
0
25 Mar 2024
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with
  Auto-Generated Data
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Jialu Li
Jaemin Cho
Yi-Lin Sung
Jaehong Yoon
Mohit Bansal
MoMe
DiffM
39
8
0
11 Mar 2024
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Pu Cao
Feng Zhou
Qing-Huang Song
Lu Yang
72
35
0
07 Mar 2024
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text
  Detection and Spotting
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
Chen Duan
Pei Fu
Shan Guo
Qianyi Jiang
Xiaoming Wei
VLM
46
5
0
01 Mar 2024
InstantBooth: Personalized Text-to-Image Generation without Test-Time
  Finetuning
InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning
Jing Shi
Wei Xiong
Zhe-nan Lin
H. J. Jung
DiffM
124
279
0
06 Apr 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image
  Encoders and Large Language Models
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
270
4,229
0
30 Jan 2023
Muse: Text-To-Image Generation via Masked Generative Transformers
Muse: Text-To-Image Generation via Masked Generative Transformers
Huiwen Chang
Han Zhang
Jarred Barber
AJ Maschinot
José Lezama
...
Kevin Patrick Murphy
William T. Freeman
Michael Rubinstein
Yuanzhen Li
Dilip Krishnan
DiffM
197
519
0
02 Jan 2023
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,777
0
24 Feb 2021
Previous
12