ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.15111
  4. Cited By
Matryoshka Diffusion Models

Matryoshka Diffusion Models

23 October 2023
Jiatao Gu
Shuangfei Zhai
Yizhen Zhang
Joshua M. Susskind
Navdeep Jaitly
    DiffM
ArXivPDFHTML

Papers citing "Matryoshka Diffusion Models"

41 / 41 papers shown
Title
PixelFlow: Pixel-Space Generative Models with Flow
PixelFlow: Pixel-Space Generative Models with Flow
Shoufa Chen
Chongjian Ge
Shilong Zhang
Peize Sun
Ping Luo
VLM
DRL
35
0
0
10 Apr 2025
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models
Jinho Jeong
Sangmin Han
Jinwoo Kim
Seon Joo Kim
34
0
0
24 Mar 2025
Scale-wise Distillation of Diffusion Models
Scale-wise Distillation of Diffusion Models
Nikita Starodubcev
Denis Kuznedelev
Artem Babenko
Dmitry Baranchuk
DiffM
48
0
0
20 Mar 2025
Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization
Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization
Kyle Sargent
Kyle Hsu
Justin Johnson
L. Fei-Fei
Jiajun Wu
DiffM
MU
53
3
0
14 Mar 2025
AugGen: Synthetic Augmentation Can Improve Discriminative Models
Parsa Rahimi
Damien Teney
S´ebastien Marcel
64
0
0
14 Mar 2025
NAMI: Efficient Image Generation via Progressive Rectified Flow Transformers
Yuhang Ma
Bo Cheng
Shanyuan Liu
Ao Ma
Xiaoyu Wu
Liebucha Wu
Dawei Leng
Yuhui Yin
55
0
0
12 Mar 2025
Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation
Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation
Tiansheng Wen
Yifei Wang
Zequn Zeng
Zhong Peng
Yudi Su
Xinyang Liu
Bo Chen
Hongwei Liu
Stefanie Jegelka
Chenyu You
CLL
66
2
0
03 Mar 2025
Personalized and Sequential Text-to-Image Generation
Personalized and Sequential Text-to-Image Generation
Ofir Nabati
Guy Tennenholtz
ChihWei Hsu
Moonkyung Ryu
Deepak Ramachandran
Yinlam Chow
Xiang Li
Craig Boutilier
MLLM
70
0
0
10 Dec 2024
Coordinate In and Value Out: Training Flow Transformers in Ambient Space
Coordinate In and Value Out: Training Flow Transformers in Ambient Space
Yuyang Wang
Anurag Ranjan
J. Susskind
Miguel Angel Bautista
3DPC
68
0
0
05 Dec 2024
Steering Rectified Flow Models in the Vector Field for Controlled Image
  Generation
Steering Rectified Flow Models in the Vector Field for Controlled Image Generation
Maitreya Patel
Song Wen
Dimitris N. Metaxas
Yezhou Yang
DiffM
109
3
0
27 Nov 2024
AsCAN: Asymmetric Convolution-Attention Networks for Efficient
  Recognition and Generation
AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation
Anil Kag
Huseyin Coskun
Jierun Chen
Junli Cao
Willi Menapace
Aliaksandr Siarohin
Sergey Tulyakov
Jian Ren
46
3
0
07 Nov 2024
Public Domain 12M: A Highly Aesthetic Image-Text Dataset with Novel
  Governance Mechanisms
Public Domain 12M: A Highly Aesthetic Image-Text Dataset with Novel Governance Mechanisms
Jordan Meyer
Nick Padgett
Cullen Miller
Laura Exline
29
4
0
30 Oct 2024
Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion
Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion
Emiel Hoogeboom
Thomas Mensink
Jonathan Heek
Kay Lamerigts
Ruiqi Gao
Tim Salimans
81
6
0
25 Oct 2024
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Jiatao Gu
Yuyang Wang
Yizhe Zhang
Qihang Zhang
Dinghuai Zhang
Navdeep Jaitly
Josh Susskind
Shuangfei Zhai
DiffM
31
12
0
10 Oct 2024
MatMamba: A Matryoshka State Space Model
MatMamba: A Matryoshka State Space Model
Abhinav Shukla
Sai H. Vemprala
Aditya Kusupati
Ashish Kapoor
Mamba
28
0
0
09 Oct 2024
Sparse Repellency for Shielded Generation in Text-to-image Diffusion
  Models
Sparse Repellency for Shielded Generation in Text-to-image Diffusion Models
Michael Kirchhof
James Thornton
Pierre Ablin
Louis Béthune
Eugène Ndiaye
Marco Cuturi
36
2
0
08 Oct 2024
$\infty$-Brush: Controllable Large Image Synthesis with Diffusion Models
  in Infinite Dimensions
∞\infty∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions
Minh-Quan Le
Alexandros Graikos
Srikar Yellapragada
Rajarsi R. Gupta
Joel H. Saltz
Dimitris Samaras
27
9
0
20 Jul 2024
RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
Bowen Zhang
Yiji Cheng
Chunyu Wang
Ting Zhang
Jiaolong Yang
Yansong Tang
Feng Zhao
Dong Chen
Baining Guo
DiffM
35
18
0
09 Jul 2024
Alleviating Distortion in Image Generation via Multi-Resolution
  Diffusion Models
Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models
Qihao Liu
Zhanpeng Zeng
Ju He
Qihang Yu
Xiaohui Shen
Liang-Chieh Chen
46
18
0
13 Jun 2024
Hierarchical Patch Diffusion Models for High-Resolution Video Generation
Hierarchical Patch Diffusion Models for High-Resolution Video Generation
Ivan Skorokhodov
Willi Menapace
Aliaksandr Siarohin
Sergey Tulyakov
VGen
40
10
0
12 Jun 2024
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with
  Foundation Models
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models
Athanasios Tragakis
Marco Aversa
Chaitanya Kaul
Roderick Murray-Smith
Daniele Faccio
49
2
0
11 Jun 2024
Learning Image Priors through Patch-based Diffusion Models for Solving
  Inverse Problems
Learning Image Priors through Patch-based Diffusion Models for Solving Inverse Problems
Jason Hu
Bowen Song
Xiaojian Xu
Liyue Shen
Jeffrey A. Fessler
MedIm
DiffM
36
8
0
04 Jun 2024
Kaleido Diffusion: Improving Conditional Diffusion Models with
  Autoregressive Latent Modeling
Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling
Jiatao Gu
Ying Shen
Shuangfei Zhai
Yizhe Zhang
Navdeep Jaitly
J. Susskind
42
10
0
31 May 2024
Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models
Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models
C. N. Vasconcelos
Abdullah Rashwan Austin Waters
Trevor Walker
Keyang Xu
Jimmy Yan
...
Wenlei Zhou
Kevin Swersky
David J. Fleet
Jason Baldridge
Oliver Wang
39
3
0
27 May 2024
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with
  Fine-Grained Chinese Understanding
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Zhimin Li
Jianwei Zhang
Qin Lin
Jiangfeng Xiong
Yanxin Long
...
Wei Liu
Dingyong Wang
Yong Yang
Jie Jiang
Qinglin Lu
ViT
46
91
0
14 May 2024
InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise
  Optimization
InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
Xiefan Guo
Jinlin Liu
Miaomiao Cui
Jiankai Li
Hongyu Yang
Di Huang
23
25
0
06 Apr 2024
Upsample Guidance: Scale Up Diffusion Models without Training
Upsample Guidance: Scale Up Diffusion Models without Training
Juno Hwang
Yong-Hyun Park
Junghyo Jo
27
12
0
02 Apr 2024
DistriFusion: Distributed Parallel Inference for High-Resolution
  Diffusion Models
DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
Muyang Li
Tianle Cai
Jiaxin Cao
Qinsheng Zhang
Han Cai
Junjie Bai
Yangqing Jia
Ming-Yu Liu
Kai Li
Song Han
DiffM
29
41
0
29 Feb 2024
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video
  Synthesis
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Willi Menapace
Aliaksandr Siarohin
Ivan Skorokhodov
Ekaterina Deyneka
Tsai-Shien Chen
...
Yuwei Fang
A. Stoliar
Elisa Ricci
Jian Ren
Sergey Tulyakov
VGen
38
56
0
22 Feb 2024
RealCompo: Balancing Realism and Compositionality Improves Text-to-Image
  Diffusion Models
RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
Xinchen Zhang
Ling Yang
Yaqi Cai
Zhaochen Yu
Kai-Ni Wang
...
Ye Tian
Minkai Xu
Yong Tang
Yujiu Yang
Bin Cui
DiffM
27
5
0
20 Feb 2024
Make a Cheap Scaling: A Self-Cascade Diffusion Model for
  Higher-Resolution Adaptation
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
Lanqing Guo
Yin-Yin He
Haoxin Chen
Menghan Xia
Xiaodong Cun
...
Yong Zhang
Xintao Wang
Qifeng Chen
Ying Shan
Bihan Wen
27
23
0
16 Feb 2024
Lumiere: A Space-Time Diffusion Model for Video Generation
Lumiere: A Space-Time Diffusion Model for Video Generation
Omer Bar-Tal
Hila Chefer
Omer Tov
Charles Herrmann
Roni Paiss
...
T. Michaeli
Oliver Wang
Deqing Sun
Tali Dekel
Inbar Mosseri
VGen
104
215
0
23 Jan 2024
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed
  Diffusion Models
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
Huan Ling
Seung Wook Kim
Antonio Torralba
Sanja Fidler
Karsten Kreis
DiffM
3DGS
32
112
0
21 Dec 2023
RS-Corrector: Correcting the Racial Stereotypes in Latent Diffusion
  Models
RS-Corrector: Correcting the Racial Stereotypes in Latent Diffusion Models
Yue Jiang
Yueming Lyu
Tianxiang Ma
Bo Peng
Jing Dong
43
3
0
08 Dec 2023
Exploiting Diffusion Prior for Generalizable Dense Prediction
Exploiting Diffusion Prior for Generalizable Dense Prediction
Hsin-Ying Lee
Hung-Yu Tseng
Hsin-Ying Lee
Ming-Hsuan Yang
DiffM
MDE
30
18
0
30 Nov 2023
Retargeting Visual Data with Deformation Fields
Retargeting Visual Data with Deformation Fields
Tim Elsner
Julia Berger
Tong Wu
Victor Czech
Lin Gao
Leif Kobbelt
24
2
0
22 Nov 2023
Synthetically Enhanced: Unveiling Synthetic Data's Potential in Medical
  Imaging Research
Synthetically Enhanced: Unveiling Synthetic Data's Potential in Medical Imaging Research
Bardia Khosravi
Frank Li
Theo Dapamede
Pouria Rouzrokh
Cooper Gamble
...
C. Wyles
Andrew B. Sellergren
S. Purkayastha
Bradley J. Erickson
J. Gichoya
MedIm
27
17
0
15 Nov 2023
f-DM: A Multi-stage Diffusion Model via Progressive Signal
  Transformation
f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation
Jiatao Gu
Shuangfei Zhai
Yizhe Zhang
Miguel Angel Bautista
J. Susskind
DiffM
39
26
0
10 Oct 2022
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize
  Long-Tail Visual Concepts
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Soravit Changpinyo
P. Sharma
Nan Ding
Radu Soricut
VLM
273
1,081
0
17 Feb 2021
Pixel Recurrent Neural Networks
Pixel Recurrent Neural Networks
Aaron van den Oord
Nal Kalchbrenner
Koray Kavukcuoglu
SSeg
GAN
225
2,543
0
25 Jan 2016
U-Net: Convolutional Networks for Biomedical Image Segmentation
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
232
75,445
0
18 May 2015
1