Sunday, 20 April 2025

New AI model developed for high-resolution video generation

A Chinese research team has developed an open-source AI model, Pyramid Flow, for cost-effective, high-resolution video generation at 768p.

Researchers from Peking University, Kuaishou Technology, and Beijing University of Posts and Telecommunications have released a new AI video generation model, Pyramid Flow. The model is designed to make high-resolution video generation cheaper and more widely accessible.

Unlike many proprietary models, which require expensive resources and are often difficult to access, Pyramid Flow is open-source. Developers and users worldwide can access the technology freely, so a broader audience can experiment with it and use it for various purposes.

Pyramid Flow’s cost-effective approach to high-resolution video generation

Pyramid Flow takes an innovative approach by generating videos through low-resolution stages before reaching the final high-resolution output. This multi-stage process helps to significantly reduce the computing power needed to run the model, making it more affordable and practical for users. The team claims that Pyramid Flow can produce a five-second video clip at 384p resolution in just 56 seconds, demonstrating the efficiency of their model.
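To give a rough sense of why working at lower resolutions first saves compute, the sketch below compares a coarse-to-fine schedule with one that runs every step at full resolution. It is a toy cost model, not the team's implementation: the function names, the three stages, the halving of each side per stage, the 1280x768 frame size, the step counts, and the assumption that cost per step scales with pixel count are all illustrative choices that do not come from the article.

```python
def pyramid_cost(full_height, full_width, num_stages, steps_per_stage):
    """Relative cost of a coarse-to-fine schedule.

    Assumption (hypothetical): the cost of one generation step is
    proportional to the number of pixels processed in that step.
    """
    total = 0
    for stage in range(num_stages):
        # Stage 0 is the coarsest; each later stage doubles both sides.
        scale = 2 ** (num_stages - 1 - stage)
        height, width = full_height // scale, full_width // scale
        total += height * width * steps_per_stage
    return total


def full_resolution_cost(full_height, full_width, total_steps):
    """Relative cost when every step runs at the final resolution."""
    return full_height * full_width * total_steps


# Illustrative numbers only: 3 stages of 10 steps vs 30 full-resolution steps.
staged = pyramid_cost(768, 1280, num_stages=3, steps_per_stage=10)
baseline = full_resolution_cost(768, 1280, total_steps=30)
print(f"staged / baseline = {staged / baseline:.4f}")  # prints 0.4375
```

Under these assumptions the staged schedule needs less than half the pixel-steps of the full-resolution baseline, because only the last stage pays the full-resolution price. The actual savings of Pyramid Flow depend on its architecture and are not quantified by this sketch.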

One of Pyramid Flow’s most notable advantages is its ability to create high-quality, detailed imagery. The team's sample videos show lifelike visuals, including complex scenes like underwater explosions that produce bubbles and splashing water. This level of realism is notable for the AI video generation community, especially given the model's low cost.

Open-source availability and potential concerns

Along with the model, the team has made the source code available under the MIT License. This means that anyone can download, modify, and use the software for personal and commercial purposes without paying licensing fees, provided the copyright and licence notice are kept with the code. The team has also provided several sample videos showcasing the output quality of the model.

Additionally, the research team has made the datasets used to train Pyramid Flow available to the public. These datasets consist of approximately 10 million short videos, allowing other developers to build upon and improve the model in the future.

However, using open-source datasets in AI video generation has raised some concerns. Critics argue that such practices could infringe on the intellectual property rights of copyright holders. While the team behind Pyramid Flow has yet to address these concerns directly, they have suggested that their model could be a valuable tool for fine-tuning open-source material. This would help reduce reliance on third-party sources, alleviating some copyright concerns.

Pyramid Flow represents a notable step forward in AI video generation technology. It combines high-quality output with an open-source approach that could open up new possibilities for developers and creators. The model's low cost, together with free access to the underlying code and datasets, could make high-resolution video creation more accessible across industries.
