Thursday, 19 December 2024
25.7 C
Singapore

Microsoft’s AI could soon make your photos talk and sing

Explore how Microsoft's new AI tool VASA-1 can bring your photos to life by creating realistic videos of them talking and singing.

Research Asia has just unveiled VASA-1, an experimental AI tool that could transform still images or drawings of people into realistic videos where they appear to talk or sing. Using an existing file, this tool can animate your photos with facial expressions, head movements, and perfectly synced lip movements that match the audio’s speech or song.

On the project’s webpage, you can find numerous examples that showcase how lifelike these animations can be. Although some lip and head movements might still look a bit mechanical and not perfectly in sync, the overall effect is convincing enough that it could easily be mistaken for real footage.

There’s a significant potential for misuse, particularly in the creation of deepfake videos, which is something Microsoft’s researchers are quite aware of. Consequently, they have decided against releasing any public demos, APIs, or additional details about the implementation until they can ensure the tool will be used responsibly and in accordance with stringent regulations. They haven’t mentioned specific safeguards to prevent misuse by malicious actors for harmful purposes like creating deepfake pornography or misinformation campaigns.

Despite these concerns, the technology promises several beneficial applications. It could enhance educational equity and improve accessibility for individuals with communication challenges by giving them access to an avatar that can communicate on their behalf. Additionally, this tool could provide companionship and therapeutic support, especially in programmes that offer interactions with AI-powered characters.

VASA-1 was trained using the VoxCeleb2 dataset, which includes over 1 million spoken expressions from 6,112 celebrities extracted from YouTube videos. Interestingly, it works not just on real faces but also on artistic ones. An amusing example is the animation of the Mona Lisa synced with an audio clip of Anne Hathaway’s viral rendition of Lil Wayne’s “Paparazzi,” which is quite delightful and worth a watch.

Hot this week

YouTube partners with CAA to help creators combat AI copies of their likeness

YouTube collaborates with CAA to develop tools that help creators and celebrities track and remove AI-generated copies of their likenesses.

Intel’s next CEO faces big decision over potential company split

Intel’s interim CEOs highlight tough challenges as the company’s next leader decides to split manufacturing and product divisions.

VisionOS 2.2 introduces Ultrawide Mac Virtual Display for Vision Pro

VisionOS 2.2 brings Ultrawide Mac Virtual Display to Vision Pro, offering incredible multitasking with 32:9 and 21:9 screen options.

Elon Musk and SpaceX face federal scrutiny over foreign meetings

Elon Musk and SpaceX face federal and international scrutiny over undisclosed meetings with foreign leaders and potential security risks.

Microsoft ends Skype credits and phone numbers in favour of subscriptions

Microsoft is discontinuing Skype Credits and Numbers and urging users to adopt subscriptions as it shifts focus from pay-as-you-go features.

Salesforce: How ASEAN businesses will lead the AI-driven future in 2025

Salesforce shares its 2025 predictions for ASEAN, highlighting AI-driven innovations like autonomous agents, robotics, and specialised models reshaping business.

Salesforce announces major hiring spree to boost AI sales

Salesforce plans to hire 2,000 sales reps to meet AI demand, marking growth despite recent layoffs, as it focuses on expanding its AI offerings.

Why human skills remain essential in software development’s AI era

Developers’ critical thinking and creativity remain essential as AI tools like GenAI assist in coding. Learn why human skills still matter in the AI era.

NVIDIA’s new compact generative AI supercomputer is its most affordable yet

NVIDIA unveils its Jetson Orin Nano Super Developer Kit, a compact AI supercomputer with enhanced performance and an affordable US$249 price tag.

Related Articles

Popular Categories