Microsoft AI creates lifelike videos of talking and singing from photos.

Technology

Microsoft AI creates lifelike videos of talking and singing from photos.

Jack K.

April 20, 2024

Microsoft AI creates lifelike videos of talking and singing from photos.

Microsoft Research Asia has unveiled a groundbreaking AI tool called VASA-1 that can create a lifelike talking face in real-time using a still image of a person and an existing audio file. This tool has the capability to generate facial expressions, head movements, and appropriate lip movements to match speech or songs. While the results may look convincing at first glance, there are concerns about the potential misuse of this technology to create deepfake videos of real people.

Despite the risks associated with misuse, the researchers behind VASA-1 see a range of potential benefits for their technology. They believe it can enhance educational equity, improve accessibility for individuals with communication challenges, and provide companionship and therapeutic support for those in need. The tool was trained on the VoxCeleb2 Dataset, which includes over 1 million utterances from 6,112 celebrities, allowing it to work not only on real faces but also on artistic photos like the Mona Lisa.

While the technology behind VASA-1 may raise ethical concerns, the researchers are hopeful that it can be used for positive applications. By providing access to AI characters who can communicate on behalf of individuals with communication challenges and offering companionship to those in need, VASA-1 has the potential to make a meaningful impact in various sectors. As the researchers continue to refine their technology, it will be crucial to ensure that safeguards are in place to prevent misuse and protect individuals’ privacy and security.

Source link

LEAVE A REPLY Cancel reply