Google’s new Omni AI tool will let you video clone yourself – I’m intrigued (and concerned)


Google I/O ZDNET

Rene Ramos/ZDNET

Follow ZDNET: Add us as a preferred source on Google.


ZDNET’s key takeaways

  • Google Omni aims to do for video what Nano Banana did for images.
  • Creators can build videos from text, images, audio, or video.
  • AI avatars could help creators, but raise trust concerns.

Today, Google announced a new AI video capability that will either help creatives produce higher-quality videos more easily, or vastly increase the amount of AI slop on YouTube. I’m betting it’ll be a mix of both.

Google announced Gemini Omni, a tool that raises the ability to create video via AI to an entirely new level. The company compared this announcement to the level of AI image generation improvement that came about when it released Nano Banana.

Also: Google I/O 2026 live: Latest updates

Nano Banana raised the bar considerably on what was possible with image generation. Omni purports to do the same with video. Omni will be rolling out starting today, but I didn’t have a chance to play with it prior to the announcement.

Google described Omni as “where Gemini’s ability to reason meets the ability to create.” Interestingly, according to the company, “With Omni, you can combine images, audio, video and text as input and generate high-quality videos grounded in Gemini’s real-world knowledge.”

Although Omni is “starting with video,” Google said the new model can “create anything from any input,” so presumably we’ll see other media types generated by the tool within due time.

Also: 6 Android Auto apps I wish I found sooner, because they make every drive easier

Omni will also be available in model tiers, starting now with Gemini Omni Flash. The capability is coming to the Gemini app, Google Flow, and YouTube Shorts. It’s not clear whether the web version of Gemini will support Omni, or whether you’ll need to use the Flow interface via your browser.

There are some standout features that make this a very interesting offering.

Clone yourself

I honestly can’t decide if this is going to be a standout feature, a very big concern for privacy, or an untethered slop generator. The company said you can create videos “with your own voice by using Avatars, which create a digital version of yourself so you can generate videos that look and sound like you.”

Also: I used Nano Banana 2 to make perfect sketchnotes: 5 lessons learned

As a regular producer of YouTube videos for my channel, I’m intrigued. There have been times when I wanted to put out a video, but was having a bad hair day, a bad voice day, or a bad attitude day, and I just didn’t want that to come across in video.

Could I just feed a script into my digital twin avatar and have RoboDave do the talking? Would my audience notice? Would they care? Would they hate it? Would I? Clearly that’s an area worthy of experimentation, but it’s probably not something I’ll use often.

I do my YouTube channel, in part, to keep my speaking and presentation chops up. Foisting that work on a digital avatar might reduce my workload, but it would also reduce my training and practice.

Google is very careful to say that it’s incorporating its SynthID digital fingerprinting technology in these videos, so they can be verified as having been produced with Omni. Google also said, “Beyond the avatar feature, in terms of editing videos to change audio and speech, we are still working to test this and better understand how we can bring this capability to users responsibly.”

Physics model

Some of you may remember the early days of video games, when characters behaved more like ragdolls than objects in the physical world. As games got better, they began to incorporate physics models, so if something got shot, knocked back, or dropped, it did so in a matter consistent with the physics of the object.

Omni now incorporates physics into the videos it creates. Google said it has “an improved intuitive understanding of forces like gravity, kinetic energy, and fluid dynamics.” It also uses Gemini’s knowledge to “connect language, imagery, and meaning in ways that go far beyond pattern matching.”

Also: OpenAI’s new image watermarks make it easier to spot AI fakes – here’s how

The company said Omni can build detailed videos from short prompts and can generate videos for things like explainers that break down fairly complex ideas. I don’t doubt this. The analysis capabilities of NotebookLM’s audio overview and video overview to be able to create explainers are astonishing. If some of that technology found its way into Omni, things could get interesting quickly.

I actually fed marketing documents and spec sheets into NotebookLM and it produced explainer videos for various features of my security product that were better than anything I could have done by hand, especially in the time it took. The visuals at the time weren’t great, but having complex features explained in a clean video in under 30 minutes was a force-multiplier for my product release schedule.

Input variety

One of Nano Banana’s early standout features was its ability to recontextualize an image. For example, I had it take a picture of me walking in a park and change it so I was wearing something close to an admiral’s uniform on the bridge of an aircraft carrier. While it didn’t get the uniform fruit salad and brass quite right, it did manage to accurately reproduce my body and face.

Also: I turned casual selfies into professional headshots with Gemini

Omni proposes to take that to video, turning image, text, video, or audio into a “cohesive output.” Right now, the only audio it will accept is voice recordings, but the company said it’ll “roll out other types of audio inputs soon.”

The company also said you can create scenes, match styles, describe what you want in natural language, and get character consistency throughout the video.

Conversational editing

One aspect of producing videos I do not enjoy is the editing process. It’s often enormously tedious. But, with Omni, “Gemini Omni gives you an easier way to edit video – with natural language. Every instruction builds on the last. Your characters stay consistent, the physics hold up and the scene remembers what came before.”

Google also said you can change elements in the video. I can see a huge benefit if it’s possible to import a video and have the editor remove obstructions or change objects and backgrounds. It’s not clear how long a clip can be, or exactly how much editing you can do with Omni on a given plan, but those possibilities are exciting.

Also: Are Sora 2 and other AI video tools risky to use? Here’s what a legal scholar says

Two other transformations the company said the new Omni can do are:

  • Change specific things, or change everything. Your video becomes the starting point for something you never could have filmed yourself.
  • Take a video you shot and just ask Omni to change what’s happening. Edit the action, add in new characters or objects, or transform a moment into something unexpected.

Additionally, Google hasn’t yet specified video format or resolution. Will this be a professional tool that can handle 16:9 videos in 4K or 8K resolution, or is it meant to be a tool for the YouTube Shorts generation?

When OpenAI introduced Sora, it was a novelty. While users abused it (we gave Sam Altman blue hair and made him sing ZDNET’s praise), it never managed to be a tool that helped a professional’s workflow.

While producing AI avatar clones and replacing objects might be fun, I’m hoping this capability is extended so that it’s usable either inside Final Cut, Premiere Pro, and DaVinci Resolve, or at least integrated enough that those tools can use edits created by Omni.

It’s possible. Omni’s features will be rolling out to enterprise customers and developers via a Google API.

Also: OpenAI’s new image watermarks make it easier to spot AI fakes – here’s how

I’m also curious if Omni will embed the little diamond watermark in the corner of its videos, like it does with Nano Banana’s generated images. While it’s nice to know a clip was generated by AI, such watermarking gets in the way of using the AI as a professional tool.

Will we see licensing tiers where the watermark can be removed? Or will we see third-party tools crop up that remove the watermark, whether Google wants you to or not? Time will tell.

Would you use Google Omni to create a digital avatar of yourself for videos you didn’t want to record in person? Let us know in the comments below.


You can follow my day-to-day project updates on social media. Be sure to subscribe to my weekly update newsletter, and follow me on Twitter/X at @DavidGewirtz, on Facebook at Facebook.com/DavidGewirtz, on Instagram at Instagram.com/DavidGewirtz, on Bluesky at @DavidGewirtz.com, and on YouTube at YouTube.com/DavidGewirtzTV.





Source link

Leave a Reply

Subscribe to Our Newsletter

Get our latest articles delivered straight to your inbox. No spam, we promise.

Recent Reviews


If Game Two of their first-round playoff series with the Denver Nuggets saved the 2025-26 season for the Minnesota Timberwolves, Game Three showed why it should be saved. 

The Timberwolves were a different beast while decisively thumping the Nuggets, 113-96 Thursday night at Target Center, in a game that wasn’t nearly that close. These Wolves were the mythical creature we’d heard about in preseason lore, purposefully locked and loaded to be both marauding and staunch. They owned both ends of the court, gleefully transferring back and forth from irresistible force to immovable object. 

A quartet of Timberwolves deserve special mention, but it begins with Jaden McDaniels. After his team had toppled Denver to even the series at a game apiece Monday night, McDaniels used the sizable chip on his shoulder to etch some graffiti into the public discourse, casually castigating the most prominent Nuggets players by name as “bad defenders” in a matter-of-fact manner that had the media compelling him to confirm what he had just said. 

Trash talk is fleetingly fungible in the jaundiced social environment of 2026, functioning more like coupons than currency in that it needs to be rapidly leveraged before its expiration date. The common perception naturally was that McDaniels was calling out the Nuggets. But in a more subtle, profound way, he was also putting his teammates on notice. 

All season long the Timberwolves have procrastinated on their full potential, frequently demonstrating that their preseason talk about maturity and commitment was cheap. By contrast, those words uttered by McDaniels were expensive. He had just picked a fight with the opponent, leaving open the question of how many of his teammates would join him in the fray. 

That he would lead the charge was established early, after the Timberwolves’ top two scorers, Anthony Edwards and Julius Randle, had each missed a pair of open looks against Denver’s bad defenders in the game’s first 90 seconds.  

With the game still scoreless, the NBA’s best pick-and-roll combo, Nikola Jokic and Jamal Murray, were clustered around the foul line with Minnesota’s best defenders, McDaniels and Rudy Gobert. As they jammed up Jokic, McDaniels picked the ball loose and started sprint-dribbling the other way. To no one’s surprise, Donte “Ragu” DiVincenzo was also on his horse in transition, receiving a pass from McDaniels and then lobbing it back for a Jaden slam against a hapless Murray and Murray’s late-arriving teammate, Cam Johnson, who committed the foul that allowed McDaniels to finish with the “and-1” free throw. 

On the Timberwolves next offensive possession, McDaniels muscled his way to two offensive rebounds, feeding Ragu off the first one for a missed three-pointer, which he corralled for the second one and executed the putback in traffic. It was McDaniels 5, Nuggets 0, setting the tone for a game in which not only did the Wolves never trail, but never let the lead go under double digits after McDaniels made a consecutive pair of driving layups eight minutes into the game. 

“Spectacular. I thought his activity offensively in the first quarter was outstanding,” said Wolves coach Chris Finch after the game. “He was inspirational.” 

Among the most inspired were McDaniels fellow wing players, Ragu and Ayo Dosunmu. Ragu is exactly the kind of player who will have your back in a squabble, and his galvanized performance seemed borne of satisfaction that someone else had clarified the mission. As usual, the Timberwolves were at their best with him on the court: +20 in the 32:54 he played, -3 in the 15:06 he sat. 

“He makes so many hustle plays, momentum plays, different styles of plays.” Finch raved. “He’ll make a shot, get a transition bucket, he’ll rebound, get a steal, blow something up. So many different plays. He’s just a basketball player.”

Related: How the Timberwolves sparked a season-saving Game 2 comeback over the Nuggets in Denver

Then there was Ayo, whose fearless, blazing, bee-lines for the bucket were quicksilver kryptonite for a Nuggets defense that is neither swift nor rugged. “I’ve been waiting for him to wake up a little bit in this series,” Finch accurately observed. “The downhill mindset that he played with all season for us was back.”

Back with the sort of multipurpose propulsion that leaves witnesses with giddy whiplash. Ayo led the team with 25 points and 9 assists in 32 minutes of time-lapse hoops, the lone blemish being three clanks from long range. Why chuck treys when you can so easily undress players in the paint? Ayo was 10-for-12 on two-pointers and none of those dozen shots came from anywhere but beneath the rim. Five of his nine dimes likewise yielded layups or dunks, which means he personally accounted for 30 of the 68 points in the paint by the Timberwolves on Thursday, doubling up the Nuggets’ 34.

Which brings us to the non-wing in Game 3’s ring of honor, Rudy Gobert. For the third straight game, Gobert blunted the supposed advantage Denver had with the magical playmaker Nikola Jokic at the controls. Suffice to say that in the last five quarters, Jokic has shot 8-for-33 from the floor. If that continues, the Nuggets are toast in this series. 

When I asked Finch after the game if the herculean job Gobert was doing on Jokic made planning his defense simpler and better thus far, he replied, “Rudy is making all of us look good right now with his defense.” 

Amen.

If there is an asterisk on this game, it would be the absence of Denver’s brutishly versatile power forward Aaron Gordon. Nuggets coach David Adelman should be given a lot of credit for his honesty and transparency in dealing with the media during his first full season at the helm, but it came back to bite him and his team during the pregame presser, when he was clearly rattled and dejected by the sudden unavailability of Gordon, whose playing status went to “probable” to “out” in a period of a few hours due to a chronic calf strain. 

Gordon is far and away his team’s best defender, making the timing of his injury especially troublesome in the wake of McDaniels laying down his marker. Rattled is a good way to describe the entire team’s performance in the first quarter, an emotional wounding that needs to heal as fast as Gordon’s body if the Nuggets are going to be competitive in a series that had dramatically been flipped on its head over the past three days. 

That the Timberwolves played with such dominance despite mediocre outings from Ant and Randle would be a good thing for both of those current cornerstones to keep in mind. Ant was beset by foul trouble and Randle had a solid second quarter, but it stood out that neither player fully embraced what so often works on offense when the Wolves are at their best: Push the pace, move the ball, move without the ball, and make quick decisions. Ant and Randle can still be first among equals and blend into that catechism if they stay attuned to the possibilities of a greater good, one that all of sudden doesn’t have to end with them being postseason fodder for the Spurs or the Thunder. 

Not when you’ve got three wings at a collective peak, with a chaser of Rudy semi-clowning the Joker. 



Source link