🎞 Movie Gen: A Cast of Media Foundation Models
Meta AI researchers have introduced Movie Gen, a suite of foundation models capable of generating high-quality video and audio. Movie Gen models can synthesize videos based on text prompts, personalize videos using a user’s image, precisely edit videos with text instructions, and generate synchronized audio for videos. The research paper details the models' architecture, training procedures, and evaluation results, demonstrating their superior performance compared to existing methods in each of these areas. The paper also explores various aspects of model scaling, including parallelism techniques and data curation strategies used to train these large-scale media generation models.
📎
Link to paper🌐
Read their blog