Google’s Most-Hated Announcement Ever

Linus Tech Tips| 00:18:00|May 20, 2026

Chapters11

The speaker reflects on Google IO’s viewership and engagement, highlighting impressive AI-driven demos (like Gemini and Omni) while questioning whether the excitement masks existential dread. The segment previews Google's strategy of integrating AI across Search, Android, YouTube, and more with personalized experiences.

Linus Tech Tips breaks down Google IO’s bold AI-push, mixed with creeping dystopia and a lot of wow-factor demos.

Summary

Linus from Linus Tech Tips digs into Google IO’s keynote, highlighting how Google’s Gemini-powered future aims to blend search, Maps, YouTube, and assistants into a seamless AI-first ecosystem. He notes demos like AI-assisted search that creates bespoke animations, an intelligent shopping cart that tracks deals across the web, and Omni, Google’s multimodal world model that handles text, images, and video. Spark, Gemini 3.5, and anti-gravity 2.0 were presented as core accelerants, with Doom being run on stage and then fixed by AI. Linus emphasizes the ambitious vision where AI knows your preferences, personalizes every touchpoint, and even can draft apps or browse the web on your behalf. He also critiques potential conflicts of interest in shopping recommendations and questions the trustworthiness of AI-generated results. The talk on synth ID and C2PA brings a glimmer of verification to AI-generated content, while mixed reality glasses and TPU upgrades show Google’s hardware ambitions are intent on scale. By the end, Linus warns that while the tech looks impressive, the real test will be how verifiable and user-friendly these tools remain as they roll out this year and beyond.

Key Takeaways

Gemini-powered search aims for conversational, context-aware results rather than isolated queries, shifting how users interact with information.
Omni enables multimodal inputs (text, images, video) to produce diverse outputs, improving tasks like video editing and 3D scene generation in a single workflow.
Google’s assistant/agent vision—Gemini Spark—integrates across Google apps to automate complex tasks like consolidating a party guest list from multiple emails.
The intelligent shopping cart and proactive deal-finding demonstrate a frictionless shopping future where AI can autonomously optimize purchases.
Ask Maps and Ask YouTube bring natural language interaction to navigation and video discovery, prioritizing context over algorithms.
Vibe coding demos show potential but raise questions about QA, verification, and long-term maintainability of AI-generated software.
Synth ID and C2PA work to increase content authenticity verification in a world full of AI-generated media.

Who Is This For?

Essential viewing for anyone curious about where Google’s AI ambitions are taking consumer tech, especially if you use Google Search, Maps, YouTube, or Android daily. Linus explains what’s coming, what’s believable, and what to watch for in the months ahead.

Notable Quotes

"In just one day, this year's Google IO keynote has accumulated more views than any other previous Google IO keynote ever."

—Opening observation on audience interest and hype versus dread.

"Omni, Google's new super multimodal world model that can ingest basically any input, text, pictures, videos, whatever, and then turn it into basically any output you want according to your prompt."

—Describes the breadth and versatility of Omni.

"Agentic AI represents a major shift from reactive prompt-driven chat bots to proactive systems that possess agency..."

—Definition of the AI agent concept introduced by Gemini.

"I mean, why search when you can just talk to your AI? Why download an app when you can just have your AI create and deploy one for you?"

—Capture of the consumer-facing promise and its implications for daily tasks.

"Synth ID and C2PA... hopefully things will become a little more verifiable."

—Verifying AI-generated content to combat misinformation.

Questions This Video Answers

How will Gemini Spark integrate across Google apps and what tasks can it automate today?
What is Omni and how does it handle multimodal inputs for video and image generation?
Can Google’s AI shopping cart actually replace manual shopping or does it raise privacy concerns?
What are synth ID and C2PA, and how do they improve authenticity of AI-generated media?
What is Vibe coding, and should developers trust AI to write and QA production code?

Google IO GeminiGemini SparkOmniAsk MapsAsk YouTubeAI agent Vibe codingSynth IDC2PA verification','TPU 8/8i','Pathways','JAXs','War Thunder

Full Transcript

In just one day, this year's Google IO keynote has accumulated more views than any other previous Google IO keynote ever. But question is, were millions of us tuning in out of excitement, or was it out of existential dread? See, only 2% of the people who watched smashed that like button. And to put that number in context, this is one of the most hated videos that we've ever uploaded to our channel. It still managed over 27 times the positive engagement. However, none of that means that the tech that was on display wasn't objectively very impressive. Google showed off everything from AI assisted search that can create bespoke animations to help explain complex subjects to a new intelligent shopping cart that can monitor for product availability and deals across the known universe and even go as far as to just buy things for you within the parameters you set. I think the one that blew my mind the most though was when they used anti-gravity 2.0 know backed by their new Gemini 3.5 flash model to break down and tackle the incredibly complex task of creating an operating system that they then used to run Doom live on stage. Oh, and then there's Omni, which is Google's new super multimodal world model that can ingest basically any input, text, pictures, videos, whatever, and then turn it into basically any output you want according to your prompt. Google's vision for the future is that at every touch point, search, Android, even YouTube, where AI search is rolling out right now, they're going to combine their AI's general knowledge and context with all the data they've collected about you personally and your life and then turn pretty much everything they serve you into a personalized experience. I mean, why search when you can just talk to your AI? Why download an app when you can just have your AI create and deploy one for you? Why Segue when the AI can tell you about our sponsor, a man obsessed with computers, yet overlooking the most comprehensive vehicle combat game that can be played with them. Over 2500 tanks, planes, helicopters, ships, and 95 million others playing on consoles and mobile, too. The question that remains is if he or you will join them for free in War Thunder. At its core, Google's business is still built on search. So, why don't we start there? Google says that their new Gemini infused paradigm for search is meant to be less like individual queries and more like a conversation. So instead of searching troubleshooting graphics drivers and then reading through an article about how to do said thing, you might just say, "My GPU doesn't work. Can you help me figure out why?" And then a back and forth ensues involving text instructions, YouTube video links, or maybe even custom visualizations. Not going to lie, that actually sounds pretty cool, at least on the surface. And according to Google, it's contextual awareness of your preferences are only going to grow over time. But like, does this kind of just feel like the AI version of hitting the I'm feeling lucky button where instead of looking through all of the various sources of information to find credible, authoritative voices, we're just kind of trusting whatever the AI throws at us. Google isn't even hiding all of the shopping partnerships they have that create in my mind a pretty big conflict of interest. I mean, can I really trust that the activities that my Gemini powered search recommends for this weekend aren't sponsored? And on the subject of where I'm going, where is Maps going to send me? Because of course, this feature is coming to Maps as well. I think one of the creepiest moments in the entire presentation was when Google CEO Sundar Pachai showed what was apparently a real prompt that somebody put into Google Maps asking, "My kid just fell into the duck pond and the wedding starts in 30 minutes. Where can I walk and buy her a new dress?" Again, objectively impressive technology. But does Sundar just have a folder on his Chromebook with funniest prompts from real users that he can just browse in plain text? Well, if he does, guess what? you can contribute to it today because Ask Maps with more natural language interaction is starting to roll out now. And the same goes for Ask YouTube, which I got to say honestly looks pretty cool. Instead of relying on the almighty algorithm to surface what you might want to watch, Google is overhauling search finally to surface contextaware videos that will focus on exactly what you want to do when you want to do it. The example they gave was teaching a kid to ride a bike with an emphasis on the exact stage that they were at in the training to try to surface the most relevant videos. I mean, I don't know. I managed to teach my kids to ride a bike without AI, but if we're being real, we've probably all seen some parents who could probably be replaced by AI. An AI that's going to be more powerful than ever if Gemini Spark has anything to say about it. Spark was a major undercurrent of the whole keynote with integration across multiple Google products and maybe finally the promise of actually becoming the assistant or agent that Google has been trying to build for years. But before I can talk about what Spark does, let's do a quick primer on what is an AI agent or agentic AI. Now I could have written this but it seemed on theme to just ask Gemini. So let's see how close it got. Agentic AI represents a major shift from reactive prompt-driven chat bots to proactive systems that possess agency, the ability to independently reason, create multi-step plans, and execute complex digital workflows to achieve highlevel goals. Rather than just generating text, these autonomous systems continuously perceive their environment, break massive objectives down into smaller subtasks, interact with the external world using APIs and software tools, and dynamically adjust their strategy when they encounter errors. Whether operating as single agents or collaborating in specialized multi- aent teams, Agentic AI shifts the human role from tedious execution to highlevel supervision, transforming industries by independently handling intricate pipelines like software engineering, customer service resolution, and business operations. Thanks, Gemini. Very cool. I see you've learned to glaze yourself as hard as you glaze the user. That that whole last sentence was probably unnecessary when I asked for a simple definition, but hey, always be closing, right? Also, hey, since we're chatting, I noticed you've gotten a facelift. You're right, Lionus. It's called neural expressive. Sorry, I'm going to stop doing that now. It sure is called neural expressive. And actually, it looks pretty fancy with some new animations and typography, which has already rolled out. But the facelift is not the impressive part. The impressive part is that Gemini Spark is integrated into like everything Google. And if the demos are anything to go by, this is by far the most polished and useful Agentic AI that we've seen that can not only browse the web and look stuff up for you, but check your calendar and consolidate information from your emails and check your photos and your documents and bring all of that together to tackle complex and yet everyday tasks. I think the most useful demo that they probably did for my daily life was having Spark go through all of the separate email chains for a block party and then consolidate the guest list, RSVPs, and even what dish everyone is bringing into a Google sheet. Like, what a timesaver. To be clear, all of this can be done today with tools like OpenClaw. But Google's innovation is to make it easy and to integrate it into the tools that you were already using. That is something that is going to be hard for the other big players to compete with, especially when you consider Google's partnership with Apple. Now, let's talk about the big updates to video and image generation, courtesy of Gemini Omni, not to be confused with Omni Man. Omni boasts better than ever multimodal capabilities and an awareness of the world that should result in improvements to everything from spatial consistency to physics simulations and even gravity. And once again, the demos were powerful. See, this is where I would normally use superlatives like cool and amazing. But by this point in the presentation, the pit in my stomach had gone from solid pit to like sharp pit. Anyway, they showed that just ploning any media into Omni and then asking it to do stuff would result in that stuff done like here. Okay, look at this. This is a single shot. Okay, Omni, give me 12 angles. Also, while you're at it, make it at night. Like, what? Then there's this one. Here's a riff. Can you make it sloppier? Okay, that one wasn't that impressive, but I think the coolest part was how well it handled complex multi-part instructions. So, you could just say, "Here's a video, and I want the background to look kind of like this image, and also, can you clean up the audio?" The hope here is that this means no more writing 10 prompts and then like mashing clips together, allowing you to do everything in one place. and maybe even all from one initial prompt and then with some refinement prompts. Pretty neat. Moving on to Docs Live. We've got another tool that's both useful looking and depressing. Instead of opening up a Google Doc and adding some notes, you could just tell Gemini to summarize your thoughts in one place and then add whatever you want while you're at it. Here, let's try it. Hey Gemini, can you plan all of LT's 2026 content for me? Please include a table of our best performing videos so far this year. And why not add a few presentation slides telling me that I'm the best. Could you do that, pretty, please? Now, I'm joking right now, of course, but in the future, I I paused the video so that I could actually read and look at some of the quality of these outputs that they were doing live on stage. And if they were honest about that, I guarantee you that as we move forward, many of your favorite creators will be doing just that. I mean, it's amazing how much AI they're building into YouTube. This wasn't part of IO today, but YouTube's relatively new catch me up on this video tool spits out shockingly relevant suggestions for like follow-up content, for instance. Moving on from that, though, let's talk about shopping. Google's intelligent shopping cart envisions a future where you can just browse any store, anywhere on the web, and just add an item to like a universal shopping cart. And while they're at it, why bother using a store? Why not just highlight an image of something and ask where to buy it and then throw it in your cart from there? Taking things to another level. Are you tired of bots beating you to those floor level tickets to your favorite events? Well, Google says if you can't beat them, join them because now you could say, "Hey Gemini, can you buy me tickets to the upcoming Anene torine concert as soon as they go on sale?" And it'll scan the web in the background and let you know as soon as it's ready to pull the trigger. They also boast that it can save you time and money by looking for deals proactively or analyzing your cart for incorrect items like this CPU and motherboard and then even placing orders on your behalf. Will I give my credit card to Google when it rolls out then? Absolutely. Moving on. It's time to talk about your uncle Spaceime's wife. Anti-gravity. Okay, maybe that joke didn't land, but it makes sense because there was no gravity. Okay, it doesn't matter. The point is, considering that I've dipped my own toes into Vibe coding not that long ago, I was really impressed by the anti-gravity 2.0 demo where they used it and Gemini 3.5 Flash to code a fully armed and operational operating system. Now, to be clear, we have no idea if it actually just like copied the Linux kernel or if it actually wrote any new kernel level code. But what it did do was well, nothing. when they tried to launch Doom. That is until they asked it to fix the issue. And then a while later, graphics and keyboard drivers had been obtained and the problem was solved and the demons were ready to die. Impressive. Most impressive. But like, if there's anything that I learned from my vibe coding adventure, it's that I'm not actually a developer and I have no idea how to QA code or maintain code in the longer term. So, to put this very lightly, as cool as that was, I am a little worried about how we're going to verify all of this vibe coding that Google has seemingly fully leaned into, even in production applications. I mean, they boasted on stage that they created this Mac application that got 100 features in 100 days. You are not going to convince me you did that without vibe coding. But wait, I know. I've got the solution. We'll get the AI to do the coding and do the QA work for us. Then we can replace both the developers and the QA staff. Then we can use all that money to buy more AI tokens. This is genius. Jokes aside, as someone with aging relatives, there was at least one thing in this 2-hour long slog of a keynote that gave me a warm and fuzzy feeling, and that was the talk about synth ID and C2PA. According to Google, even now people can only detect AI videos about a quarter of the time. That is a terrifying number and that's only going to shrink as it gets better. But now that the synth ID watermark feature is expanding in partnership with OpenAI, 11 Labs, and Cacao, hopefully things will become a little more verifiable. As for C2PA, they're adding verification for those content credentials so that next time your grandmother sends you a video of crabs wearing sunglasses at an underground rave, you can tell her, "Hey, that was either fully made with AI or it was edited with AI." That is, as long as the content was generated using participating tools. On the subject of crabs with glasses, something that I have extremely mixed feelings about is mixed reality glasses. I've been playing around with the display model from Meta and objectively they are very powerful. That word again and also hardcore dystopian. Um coming back to Google IO though it's kind of funny because for Google's new product they didn't really lean into what are some of the more obvious benefits to me like at all. Like what about helping with visual impairment? In theory you could have these on your head. They could be powered on and you could just ask them stuff and get AI powered answers straight into your ear. I mean, I don't know about you, but I think someone who can't see, asking if it's safe to cross the street or if that's orange juice here in the grocery store is infinitely more important than asking to take a picture from the stage and then slopify it live. But, I mean, hey, it's a Google keynote, not an Apple one, so they got to stay on brand, I guess. Oh, another update was the improvements to Google's TPU, or tensor processing unit, or should I say units. They're going with a dual chip approach with this one. The 8 is going to be specialized for training and the other, the 8i, is specialized for watching you everywhere. Sorry, the I is actually for inference. It's good for inference. Both of these chips got some decent generational performance and efficiency improvements, which is neat. But the real jaw-dropper was their talk about seamlessly distributing training across multiple data center sites, helping to overcome the power and performance limitations of individual data centers. This gives Google the largest training cluster in the world using technologies like Jacks and Pathways, bringing us, yes, one small step closer to the singularity. Not for us, of course, just for the machines. And if you weren't scared already, well, I've got good news, everyone. All of this stuff is either already out, launching this summer, or coming later this year. And one more thing, if you're already subscribed to Gemini Pro or Ultra, those are both getting price cuts, so you're getting more AI for less. That is at least for now, because we've all seen how this works. You get the masses hooked, then you dig those claws in, and you keep them in your ecosystem forever, and you jack up your prices, just like how I'm jacking up the Segway to our sponsor. Portrait of a tech YouTuber surrounded by powerful hardware but barereft of entertainment. Regrettably oblivious to his own need for the tactile sense of joyous destruction that comes with the grip of a jet's joystick in his palm. Yet transplant his mind into a simulation like War Thunder, the most comprehensive vehicle combat game, and allow him control over 2500 of humanity's most deadly machines from the past and the present. and he may rapidly become aware of a yearning in his core for destruction. This game charges no price of admission. The only sacrifice it asks of its players, the time they previously spent on more superficial experiences. Simple health bars and hit boxes have no place in War Thunder, where highfidelity graphics and authentic sounds are a gateway to the heat of battle. Where the limitations of play reside just on the other side of a door to be unlocked with a key of his own imagination. One that may imagine playing on PC, consoles, and mobile, and find it to indeed be true, as 95 million others have. A bonus pack of vehicles, boosters, and more await new players and those returning after six months as they await you if you use the link below and join them in War Thunder. If you guys enjoyed this video, why not check out the Android event video from just last week? It covered some of the legitimately cool stuff.