Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.15799
Cited By
StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis
29 March 2022
Zhiheng Li
Martin Renqiang Min
K. Li
Chenliang Xu
EGVM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis"
11 / 11 papers shown
Title
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Ziyang Zhang
Yang Yu
Yucheng Chen
Xulei Yang
S. Yeo
MedIm
48
1
0
02 Mar 2025
Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images
Ali Naseh
Katherine Thai
Mohit Iyyer
Amir Houmansadr
28
5
0
21 Apr 2024
Effective pruning of web-scale datasets based on complexity of concept clusters
Amro Abbas
E. Rusak
Kushal Tirumala
Wieland Brendel
Kamalika Chaudhuri
Ari S. Morcos
VLM
CLIP
21
22
0
09 Jan 2024
SAIR: Learning Semantic-aware Implicit Representation
Canyu Zhang
Xiaoguang Li
Qing-Wu Guo
Song Wang
23
3
0
13 Oct 2023
Are Diffusion Models Vision-And-Language Reasoners?
Benno Krojer
Elinor Poole-Dayan
Vikram S. Voleti
Christopher Pal
Siva Reddy
29
12
0
25 May 2023
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation
Mayu Otani
Riku Togashi
Yu Sawai
Ryosuke Ishigami
Yuta Nakashima
Esa Rahtu
J. Heikkilä
Shiníchi Satoh
17
62
0
04 Apr 2023
Conditional Image-to-Video Generation with Latent Flow Diffusion Models
Haomiao Ni
Changhao Shi
Kaican Li
Sharon X. Huang
Martin Renqiang Min
VGen
DiffM
16
161
0
24 Mar 2023
Recurrent Transformer Variational Autoencoders for Multi-Action Motion Synthesis
Rania Briq
Chuhang Zou
L. Pishchulin
Christopher Broaddus
Juergen Gall
14
1
0
14 Jun 2022
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,735
0
24 Feb 2021
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
262
10,320
0
12 Dec 2018
Learning Deep Representations of Fine-grained Visual Descriptions
Scott E. Reed
Zeynep Akata
Bernt Schiele
Honglak Lee
OCL
VLM
160
841
0
17 May 2016
1