Papers
arxiv:2401.04468

MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation

Published on Jan 9
· Submitted by akhaliq on Jan 10
#1 Paper of the day
Authors:
,
,
,
,

Abstract

The growing demand for high-fidelity video generation from textual descriptions has catalyzed significant research in this field. In this work, we introduce MagicVideo-V2 that integrates the text-to-image model, video motion generator, reference image embedding module and frame interpolation module into an end-to-end video generation pipeline. Benefiting from these architecture designs, MagicVideo-V2 can generate an aesthetically pleasing, high-resolution video with remarkable fidelity and smoothness. It demonstrates superior performance over leading Text-to-Video systems such as Runway, Pika 1.0, Morph, Moon Valley and Stable Video Diffusion model via user evaluation at large scale.

Community

This comment has been hidden
This comment has been hidden
This comment has been hidden

一个女孩骑着老虎奔驰在峡谷之中

This comment has been hidden

Unveiling MagicVideo-V2: Stunning High-Aesthetic Video Generation from Text Descriptions

Links 🔗:

👉 Subscribe: https://www.youtube.com/@Arxflix
👉 Twitter: https://x.com/arxflix
👉 LMNT (Partner): https://lmnt.com/

By Arxflix
9t4iCUHx_400x400-1.jpg

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2401.04468 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2401.04468 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2401.04468 in a Space README.md to link it from this page.

Collections including this paper 18