arXiv: 2401.04468

MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation

9 January 2024
Weimin Wang
Jiawei Liu
Zhijie Lin
Jiangqiao Yan
Shuo Chen
Chetwin Low
Tuyen Hoang
Jie Wu
Jun Hao Liew
Hanshu Yan
Daquan Zhou
Jiashi Feng
    VGen
    DiffM
Abstract

The growing demand for high-fidelity video generation from textual descriptions has catalyzed significant research in this field. In this work, we introduce MagicVideo-V2, which integrates a text-to-image model, a video motion generator, a reference image embedding module, and a frame interpolation module into an end-to-end video generation pipeline. Benefiting from these architectural designs, MagicVideo-V2 can generate aesthetically pleasing, high-resolution videos with remarkable fidelity and smoothness. In a large-scale user evaluation, it demonstrates superior performance over leading text-to-video systems such as Runway, Pika 1.0, Morph, Moon Valley, and the Stable Video Diffusion model.
