Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.01808
Cited By
aMUSEd: An Open MUSE Reproduction
3 January 2024
Suraj Patil
William Berman
Robin Rombach
Patrick von Platen
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"aMUSEd: An Open MUSE Reproduction"
18 / 18 papers shown
Title
MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft
Junliang Guo
Yang Ye
Tianyu He
Haoyu Wu
Yushu Jiang
Tim Pearce
Jiang Bian
VGen
SyDa
49
2
0
11 Apr 2025
Halton Scheduler For Masked Generative Image Transformer
Victor Besnier
Mickael Chen
David Hurych
Eduardo Valle
Matthieu Cord
49
1
0
21 Mar 2025
Fast Autoregressive Video Generation with Diagonal Decoding
Yang Ye
Junliang Guo
Haoyu Wu
Tianyu He
Tim Pearce
Tabish Rashid
Katja Hofmann
Jiang Bian
DiffM
VGen
73
1
0
18 Mar 2025
Training Data Provenance Verification: Did Your Model Use Synthetic Data from My Generative Model for Training?
Yuechen Xie
Jie Song
Huiqiong Wang
Mingli Song
50
0
0
12 Mar 2025
DECOR:Decomposition and Projection of Text Embeddings for Text-to-Image Customization
Geonhui Jang
Jin-Hwa Kim
Yong-Hyun Park
Junho Kim
Gayoung Lee
Yonghyun Jeong
DiffM
77
0
0
12 Dec 2024
HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing
Jinbin Bai
Wei Chow
L. Yang
Xiangtai Li
Juncheng Billy Li
H. Zhang
Shuicheng Yan
101
3
0
05 Dec 2024
Multi-Style Facial Sketch Synthesis through Masked Generative Modeling
Bowen Sun
Guo Lu
Shibao Zheng
CVBM
30
0
0
22 Aug 2024
Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities
Lorenzo Baraldi
Federico Cocchi
Marcella Cornia
Lorenzo Baraldi
Alessandro Nicolosi
Rita Cucchiara
28
7
0
29 Jul 2024
Video In-context Learning: Autoregressive Transformers are Zero-Shot Video Imitators
Wentao Zhang
Junliang Guo
Tianyu He
Li Zhao
Linli Xu
Jiang Bian
34
3
0
10 Jul 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models
Jialong Wu
Shaofeng Yin
Ningya Feng
Xu He
Dong Li
Jianye Hao
Mingsheng Long
VGen
37
23
0
24 May 2024
Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks
João Bordalo
Vasco Ramos
Rodrigo Valerio
Diogo Glória-Silva
Yonatan Bitton
Michal Yarom
Idan Szpektor
João Magalhães
31
7
0
16 May 2024
Controllable Image Generation With Composed Parallel Token Prediction
Jamie Stirling
Noura Al-Moubayed
33
0
0
10 May 2024
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions
S. A. Baumann
Felix Krause
Michael Neumayr
Nick Stracke
Vincent Tao Hu
Bjorn Ommer
Björn Ommer
DiffM
LM&Ro
68
11
0
25 Mar 2024
World Model on Million-Length Video And Language With Blockwise RingAttention
Hao Liu
Wilson Yan
Matei A. Zaharia
Pieter Abbeel
VGen
29
59
0
13 Feb 2024
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
Xiaofeng Wang
Zheng Zhu
Guan Huang
Boyuan Wang
Xinze Chen
Jiwen Lu
VGen
32
32
0
18 Jan 2024
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module
Simian Luo
Yiqin Tan
Suraj Patil
Daniel Gu
Patrick von Platen
Apolinário Passos
Longbo Huang
Jian Li
Hang Zhao
MoMe
108
144
0
09 Nov 2023
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,774
0
24 Feb 2021
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
237
75,770
0
18 May 2015
1