ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2408.05477
  4. Cited By
Scene123: One Prompt to 3D Scene Generation via Video-Assisted and
  Consistency-Enhanced MAE

Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE

10 August 2024
Yiying Yang
Fukun Yin
Jiayuan Fan
Xin Chen
Wanzhang Li
Gang Yu
    VGen
ArXivPDFHTML

Papers citing "Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE"

11 / 11 papers shown
Title
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large
  Datasets
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
...
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
VGen
150
985
0
25 Nov 2023
Set-the-Scene: Global-Local Training for Generating Controllable NeRF
  Scenes
Set-the-Scene: Global-Local Training for Generating Controllable NeRF Scenes
Dana Cohen-Bar
Elad Richardson
G. Metzer
Raja Giryes
Daniel Cohen-Or
63
52
0
23 Mar 2023
VideoFusion: Decomposed Diffusion Models for High-Quality Video
  Generation
VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation
Zhengxiong Luo
Dayou Chen
Yingya Zhang
Yan Huang
Liangsheng Wang
Yujun Shen
Deli Zhao
Jinren Zhou
Tien-Ping Tan
DiffM
VGen
132
215
0
15 Mar 2023
RealFusion: 360° Reconstruction of Any Object from a Single Image
RealFusion: 360° Reconstruction of Any Object from a Single Image
Luke Melas-Kyriazi
Christian Rupprecht
Iro Laina
Andrea Vedaldi
85
288
0
21 Feb 2023
NeRFPlayer: A Streamable Dynamic Scene Representation with Decomposed
  Neural Radiance Fields
NeRFPlayer: A Streamable Dynamic Scene Representation with Decomposed Neural Radiance Fields
Liangchen Song
Anpei Chen
Zhong Li
Z. Chen
Lele Chen
Junsong Yuan
Yi Tian Xu
Andreas Geiger
125
87
0
28 Oct 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified
  Vision-Language Understanding and Generation
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
382
4,010
0
28 Jan 2022
Masked Autoencoders Are Scalable Vision Learners
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,337
0
11 Nov 2021
PixelSynth: Generating a 3D-Consistent Experience from a Single Image
PixelSynth: Generating a 3D-Consistent Experience from a Single Image
C. Rockwell
David Fouhey
Justin Johnson
VGen
50
83
0
12 Aug 2021
Learning a Probabilistic Latent Space of Object Shapes via 3D
  Generative-Adversarial Modeling
Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling
Jiajun Wu
Chengkai Zhang
Tianfan Xue
Bill Freeman
J. Tenenbaum
GAN
161
1,926
0
24 Oct 2016
U-Net: Convolutional Networks for Biomedical Image Segmentation
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
229
74,467
0
18 May 2015
ImageNet Large Scale Visual Recognition Challenge
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
279
39,083
0
01 Sep 2014
1