Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.19085
Cited By
Patch-enhanced Mask Encoder Prompt Image Generation
29 May 2024
Shusong Xu
Peiye Liu
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Patch-enhanced Mask Encoder Prompt Image Generation"
3 / 3 papers shown
Title
LayoutDM: Discrete Diffusion Model for Controllable Layout Generation
Naoto Inoue
Kotaro Kikuchi
E. Simo-Serra
Mayu Otani
Kota Yamaguchi
DiffM
52
101
0
14 Mar 2023
Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding
Kenton Lee
Mandar Joshi
Iulia Turc
Hexiang Hu
Fangyu Liu
Julian Martin Eisenschlos
Urvashi Khandelwal
Peter Shaw
Ming-Wei Chang
Kristina Toutanova
CLIP
VLM
158
262
0
07 Oct 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
388
4,110
0
28 Jan 2022
1