Thursday, 13 March 2025
26.5 C
Singapore
29.1 C
Thailand
20.7 C
Indonesia
27 C
Philippines

New AI model developed for high-resolution video generation

A Chinese research team has developed an open-source AI model, Pyramid Flow, for cost-effective, high-resolution video generation at 768p.

Researchers from Peking University, Kuaishou Technology, and Beijing University of Posts and Telecommunications have made significant progress in AI video generation. Their new AI model, Pyramid Flow, promises to revolutionise the way high-resolution virtual videos are created.

Unlike many proprietary models that require expensive resources and are often difficult to access, the team behind Pyramid Flow has chosen to make their model open-source. This move allows developers and users worldwide to access the technology freely, allowing a broader audience to experiment with and use it for various purposes.

Pyramid Flowโ€™s cost-effective approach to high-resolution video generation

Pyramid Flow takes an innovative approach by generating videos through low-resolution stages before reaching the final high-resolution output. This multi-stage process helps to significantly reduce the computing power needed to run the model, making it more affordable and practical for users. The team claims that Pyramid Flow can produce a five-second video clip at 384p resolution in just 56 seconds, demonstrating the efficiency of their model.

One of Pyramid Flow’s most notable advantages is its ability to create high-quality, detailed imagery. The model has been shown to generate lifelike visuals, including complex scenes like underwater explosions that produce bubbles and splashing water. This level of realism is an exciting breakthrough for the AI video generation community, especially given its low cost.

Open-source availability and potential concerns

Along with the model, the team has made the source code available under the MIT License. This means that anyone can download, modify, and use the software for personal and commercial purposes without worrying about licensing fees or restrictions. The team has also provided several sample videos showcasing the impressive output quality of the model.

Additionally, the research team has made the datasets used to train Pyramid Flow available to the public. These datasets consist of approximately 10 million short videos, allowing other developers to build upon and improve the model in the future.

However, using open-source datasets in AI video generation has raised some concerns. Critics argue that such practices could infringe on the intellectual property rights of copyright holders. While the team behind Pyramid Flow has yet to address these concerns directly, they have suggested that their model could be a valuable tool for fine-tuning open-source material. This would help reduce reliance on third-party sources, alleviating some copyright concerns.

Pyramid Flow represents a significant leap forward in AI video generation technology. It offers both high-quality output and an open-source approach that could open up new possibilities for developers and creators. The cost-effective nature of the model and the free access to the underlying code and datasets could reshape the way AI-generated videos are used across industries, making high-resolution video creation more accessible than ever.

Hot this week

Tammy Nam takes the helm as CEO of AI-driven ad startup Creatopy

Tammy Nam joins AI-powered ad startup Creatopy as CEO, bringing experience from PicsArt and Viki. The company reports a 400% revenue growth.

Microsoft intensifies AI race to rival OpenAI

Microsoft is increasing its AI efforts, developing its models and testing alternatives to OpenAI technology for products like Copilot.

Musk may still have a chance to stop OpenAIโ€™s profit-driven shift

A U.S. judge denied Muskโ€™s injunction against OpenAIโ€™s profit shift but raised concerns, offering hope to those challenging the AI giantโ€™s plans.

Armis acquires OTORIO to enhance on-premises security and strengthen cyber physical systems protection

Armis acquires OTORIO to expand its on-premises cybersecurity solutions, strengthening OT, ICS, and CPS protection for critical industries.

Trump vows to classify violence against Tesla as domestic terrorism

Trump vows to classify attacks on Tesla dealerships as domestic terrorism, sparking debate over protests, government cuts, and Muskโ€™s influence.

Lego unveils 1,972-piece Mario Kart set with posable arms and head

Lego unveils a 1,972-piece Mario Kart set featuring a posable Mario figure and display stand, which will be available on May 15 for US$249.90.

Trump vows to classify violence against Tesla as domestic terrorism

Trump vows to classify attacks on Tesla dealerships as domestic terrorism, sparking debate over protests, government cuts, and Muskโ€™s influence.

Meta tests in-house AI chip to reduce reliance on Nvidia

Meta is testing an in-house AI chip for training models to cut costs and reduce reliance on Nvidia. The chip is currently in a trial phase.

Pure Storage launches high-performance AI and HPC data storage platform

Pure Storage unveils FlashBlade//EXA, a high-performance AI and HPC storage platform designed to improve scalability and metadata processing efficiency.

Related Articles