Google researchers find novel way of turning a single photo of a human into AI-generated video good enough to make you think ‘this might go badly’

Google researchers have found a way to generate video of a human from just a single still image. This makes it possible to do things like generate a video of someone speaking from input text, or change a person’s mouth movements to match an audio track in a different language from the one originally spoken. It also feels like a slippery slope into identity theft and misinformation, but what’s AI without a hint of frightening consequences?

The tech itself is rather interesting: the Google researchers who published the paper call it Vlogger. In the paper, the authors (Enric Corona et al.) offer up various examples of how the AI takes a single input image of a human (in this case, I believe, mostly AI-generated humans) and, with an audio file, produces both facial and bodily movements to match.

That’s just one of a few potential use cases for the tech. Another is editing video, specifically a video subject’s facial expressions. In an example, the researchers show various versions of the same clip: one has a presenter speaking to camera, another with the presenter’s mouth closed in an eerie fashion, another with their eyes closed. My favourite is the video of the presenter with their eyes artificially held open by the AI, unblinking. Huge serial killer vibes. Thanks, AI.

The most useful feature, in my opinion, is the ability to swap a video’s audio track for a dubbed foreign-language version and have the AI lip-sync the person’s facial movements to the new audio.

It works in two stages: “1) a stochastic human-to-3d-motion diffusion model, and 2) a novel diffusion based architecture that augments text-to-image models with both temporal and spatial controls. This approach enables the generation of high quality videos of variable length, that are easily controllable through high-level representations of human faces and bodies,” the GitHub page says.

Admittedly, the tech isn’t perfect. In the examples given, the mouth movements have the telltale qualities common across AI-generated video content. It’s also pretty creepy at times, as noted by users responding to a thread about the technology by EyeingAI on X. But Vlogger doesn’t need to fool everyone, or even fool anyone at all, to have some use. Then again, if the technology were closer to perfect, it’d be even more worrying to think about how it could be used to create deepfakes, spread misinformation, or steal identities. We’ll get there one day, and I for one hope we have a better handle on how to deal with this stuff by then.
