The arrival of ChatGPT last year sent a rare shiver through Google’s spine. For years the company had positioned itself as a leader in the development in artificial intelligence. Suddenly, though, a product from the upstart OpenAI rocketed to tens of millions of monthly users — and observers began asking whether Google had squandered its lead.
Within weeks, leaders at the company declared a “code red” — a signal that the time to begin shipping AI features was now. (It was widely reported that CEO Sundar Pichai declared the code red, but he later told me that it wasn’t the case.)
A handful of products have shipped since — most notably Bard, the company’s ChatGPT analog. But on Wednesday, at the company’s annual developer conference, the floodgates opened. At Google I/O, a torrent of new AI features were announced, touching nearly every part of the company’s product lineup.
For the most part, these products will ship “in the coming weeks,” or “later this year.” Until then, all we really have to go on is the demonstrations we saw in demonstrations and pre-conference press briefings.
But while I imagine the features will vary in quality and usefulness, one thing is becoming clear about the near-term AI future: technology alone is not enough to totally reset the competitive landscape. Incumbents can gain significant ground simply by bringing new features into the products that people are already using — and getting users to switch platforms is proving more difficult some imagined it would be.
Let’s take platform switching first. In February, Microsoft re-launched Bing with generative AI search results powered by ChatGPT. The company hoped it would be a moment that consumers gave Bing a second look — and would perhaps give Microsoft a chance to peel off meaningful market share from its much bigger rival.
Three months later — and on the eve of Google adding generative AI results to its own search engine — that project appears to have stalled. Citing a report from the research firm YipitData, The Information reported Wednesday that Bing’s share of searches on desktops had grown just 0.25 percent in the past three months. Microsoft told the outlet that the growth rate was higher on mobile devices, and perhaps it will grow on desktops as well in the coming months.
But the same story noted that ChatGPT receives more than 65 million visits per day, compared to 40 million for 14-year-old Bing. People who want to use OpenAI’s chatbot are largely going straight to the source — and Microsoft, which is just one of dozens of companies integrating OpenAI’s technology in the hopes that it will open up new revenue streams, is finding that API access is a commodity rather than a growth engine. (I’m sure Microsoft will eventually find plenty of ways to make money from AI, starting with all the infrastructure services it provides OpenAI through its Azure platform. But still.)
The lesson here is that, with the possibly lone exception of ChatGPT, users are mostly not seeking out AI as a destination unto itself. Rather, they’re waiting for it to transform into useful products and services — ideally, products and services that they’re already using.
Last week I wrote about AI’s missing interface, and the challenges presented by a technology whose interface design begins and ends with a text box. One way of thinking about I/O this year is that Google began to fill in the missing pieces of that interface with actual product design — a commitment to nudging users, in all sorts of ways, into using AI productively.
Let’s look at a few of those ways. Until now, Bard has been an island unto itself — a sandbox for testing the limits of Google’s large language model, PaLM. Pretty soon, though, you’ll be able to export Bard’s output into Gmail, Docs, and Sheets — the places you were probably going to copy and paste it to anyway. ChatGPT probably records more copy and paste actions than any other website in the world; Google is abstracting that whole process away into a button.
Ideally, though, you’d never have to visit a dedicated website to use generative AI in the first place. For example, at the moment lots of people are having ChatGPT write their emails and then porting them over into their email client of choice. Google is taking the obvious next step: promising that later this year, you’ll be able to just ask Gmail to write the email for you in the message composer window.
I predict ChatGPT sees fewer copy and paste actions after that.
You could also just stick generative AI boxes into existing productivity tools — the way Google showed yesterday with its “sidekick” feature. In one of the day’s best demos, Google executive Aparna Pappu showed off the sidekick in Docs. As she imagined writing a short story about a missing seashell with her niece, the sidekick chimed in with contextual suggestions. What happened to the seashell, it wanted to know.
Then the sidekick offered some suggestions: maybe it was stolen by a jealous mermaid. Maybe it was taken by a time traveler. Maybe it was eaten by a squid.
If you’re a 10-year-old writing a short story, this is going to be a lot of fun. And it probably doesn’t even come across to the average user as AI per se — instead it just feels like a new creative tool that takes a popular existing product and makes it more useful.
There were a lot more demos like that yesterday. I was struck by one that generated speaker notes from a set of slides — sure to be a godsend for procrastinating workers everywhere — and another that created a list of dishes that people were bringing to some potluck based on an attached Google Sheet.
Viewed one way, some this stuff can feel pretty mundane. But in the near term, this is how AI is going to start working its way into our lives. Soon enough, we probably won’t think of it as AI anymore. (A recurring and somewhat defensive theme of yesterday’s keynote is that Google has already shipped lots of stuff that uses machine-learning but for whatever reason doesn’t meet our ever-shifting definition of what counts as AI. Searching for “dogs” in Google Photos, for example.)
There’s surely another column to be written here about Google’s planned changes to search, which will put a module of generative AI results on top of the standard 10 blue links. But I want to wait until I can actually try it for myself to get a better sense of how disruptive it feels.
For now, with search and everything else, Google has positioned AI not as an all-knowing oracle but as a useful starting point for many tasks. Google’s AI will write the first draft; offer alternate paths to consider; or do a cursory scan of a new subject you’re interested in. This has the benefit of being how people actually use AI in practice today, and it’s smart of Google to lean into that message rather than something more grandiose.
Ultimately, I still believe the AI opportunity will be much bigger than one company. But in a moment when all these large language models are converging to become roughly functionally equivalent, no one is going to win the game on technology alone.
AI is moving from a science problem to a product design and marketing problem, and the latter are things that Google has had a lot of experience with.
A better metaverse
The best thing I saw at Google I/O was Project Starline, an experimental piece of hardware that asks: what if the person on your next Zoom call was a hologram?
The year-long discussion we had about the metaverse from 2021 to 2022 often touched on the idea of “telepresence” — technologies that allow people to feel as if they are physically present with someone even when they are only being represented digitally. Other than Zoom, the best we have been able to do on this front is to strap on ungainly headsets, navigate ourselves into pixelated conference rooms, and talk to legless cartoon versions of our colleagues and loved ones.
Project Starline, which remains early in its development and would need to get radically cheaper to go mainstream, requires only that you sit down in front of the TV-like device and turn it on. There are no headsets, glasses, or headphones to fiddle with — just a person talking to you, in three dimensions and at admirably high resolution.
Andrew Nartker, Starline’s general manager, demonstrated it for me while sitting in a separate booth. When he went to give me a fist bump, his hand appeared to come through the TV screen. Later, he offered me an apple, and the effect was just as realistic. And all the while, Nartker’s voice tracked his movements as he changed positions, enhancing the illusion that he was right there in front of me.
In reality, he was in a booth a few feet away from the one I was sitting in. I’m sure that behind the scenes there were hidden technological enhancements that you might not find in the real world: a rock-solid data pipe linking the devices, for example. And in my conversation with Googlers yesterday, it was clear that the primary obstacle to Starline’s development will be making it much less expensive than it is today. (No one would tell me how expensive it is, but if you told me the whole setup cost a million dollars or more it would not seem excessive, relative to the quality of the experience.)
The good news is that there are signs Starline is coming down the cost curve. Google said this week that it has begun testing the device with partners including Salesforce, T-Mobile and WeWork, as well as at Google itself.
Given the challenges, and all the cost-cutting going on at Google and elsewhere, few would be surprised if Starline ultimately proves to be vaporware. But there’s something profound here that Meta’s metaverse hasn’t come close to achieving: a convenient, comfortable, ergonomic form of video chat that I could easily imagine myself doing for hours.
I’m sure I’ll take my share of meetings in virtual reality over the next few years, if only because of how much cheaper they are than installing Project Starline at my house.
The minute that changes, though, my webcam and headset are going into a drawer.
On the podcast this week: Kevin and I take a ride in one of Cruise’s robot-taxis, which are becoming more widely available in San Francisco. Then, Cruise CEO Kyle Vogt stops by to talk about our self-driving future. PLUS: I report live from I/O.
Just look at all this stuff!
- Google’s new “AI snapshot” feature is a major overhaul to its standard search results page, replacing links to third-party sites with a screen-filling generative response. It’s a radical new approach to search that may have drastic effects on publishers. (David Pierce / The Verge)
- Google dropped its waitlist for the Bard chatbot and made it available to 180 countries with new features like expanding language support, export functions, and visual search. (James Vincent / The Verge)
- Google unveiled its PaLM 2 large language model with a 91-page paper outlining its capabilities and claimed its capable of besting GPT-4 at text generation. (Kyle Wiggers / TechCrunch)
- Google is testing a new “Universal Translator” that redubs video footage in a new language and syncs a speaker’s lips accordingly. The company said it is aware of how the feature could be misused for deepfakes. (Devin Coldewey / TechCrunch)
- Google released MusicLM, an experimental AI tool for turning text descriptions into music, to the public through its AI Test Kitchen program. (Kyle Wiggers / TechCrunch)
- Google teased Project Tailwind, a new AI studying tool that scans your Google Drive files and acts as a digital notebook for retrieving information. (Ben Schoon / 9to5Google)
- Google is adding new labels to Google Image Search to designate when files have been AI-generated. Good! (Sarah Perez / TechCrunch)
- Google will roll out a new search feature called Perspectives to help address when users are specifically looking for human answers on forums like Reddit. (David Pierce / The Verge)
- Google announced Search Labs, a user testing program for “bold” search features incorporating AI and other tech. We will be opting in to all of these, thank you. (Steve Dent / Engadget)
- Android will get new AI features including Magic Compose, which will write replies based on your personal style, and AI-generated wallpaper. (Sean Hollister / The Verge)
- Google Photos will get a new Magic Editor feature that uses generative AI to let you edit specific parts of a photo, fill in missing gaps, and to even reposition the subject. (Sarah Perez / TechCrunch)
- Google Maps’ Immersive View for Routes feature, which creates a digital model of the world using Street View and aerial imagery, is coming to 15 cities in the coming months. (Aisha Malik / TechCrunch)
- Google announced new generative AI features for the Play Store that will help developers fill out app listings and summarize reviews. (Jon Porter / The Verge)
- Google rebranded its collaborative Workspace tools to Duet AI and pledged to bring more generative AI features to Docs, Gmail, Sheets, and Slides. It’s Google’s answer to Microsoft’s Copilot. (James Vincent / The Verge)
- The Pixel Fold is Google’s answer to Samsung’s foldable smartphone line and launches in June for a bracing $1,800. (Sam Rutherford / Engadget)
- Google’s more affordable Pixel 7a punches far above its weight for a flagship-quality smartphone, priced at only $499. (Kyle Bradshaw / 9to5Google)
- Google resurrected its Android tablet line with the Pixel Tablet, which starts shipping in June and starts at $599 with a speaker dock (but no keyboard) included. That included speaker dock is a great touch. (Scott Stein / CNET)
- Google said its Find My Device network will soon notify users of any unwanted Bluetooth trackers moving with them to address safety concerns. (Sarah Perez / TechCrunch)
- Google launched a new security feature to let you know if your email address has been posted to the dark web. Presumably that is not a good sign. (Jess Weatherbed / The Verge)
- Google revealed new in-car features for Android Auto and for vehicles running native Android software, including YouTube support and Waze navigation. (Andrew J. Hawkins / The Verge)
- Google Cloud announced new A3 supercomputer virtual machines used for training machine learning models. (Ron Miller / TechCrunch)
- The EU’s landmark AI Act moved onto its penultimate phase after lawmakers agreed to more stringent restrictions, including a ban on predictive policing and the use of facial recognition in public spaces. The bill is now set to be finalized with the European Commission and individual member states with a vote scheduled for June. (Foo Yun CheeMartin Coulter, and Supantha Mukherjee / Reuters)
- OpenAI CEO Sam Altman will testify to a Senate panel next week regarding AI safety, automation, and data privacy concerns as Congress weighs regulatory measures. (Cristiano Lima / The Washington Post)
- The American Psychological Association issued its first-ever health advisory on children’s social media use this week. Among the tips include restricting access to avoid sleep loss and encouraging kids not to compare their appearance to others — both easier said than done. (Taylor Hatmaker / TechCrunch)
- The EU is expected to green-light Microsoft’s Activision Blizzard deal on May 15th, putting the commission at odds with U.K. regulators. (Foo Yun Chee / Reuters)
- ByteDance delayed the opening of its TikTok shopping platform in the U.S. to later this summer as concerns over a national ban have deterred sellers from joining. (Raffaele Huang / WSJ)
- Hacker PlugwalkJoe, aka James O’Connor, pled guilty to cyberstalking and other crimes related to the high-profile takeover of Twitter accounts belonging to Barack Obama and Joe Biden in 2020. (Emma Roth / The Verge)
- Miami catered to crypto enthusiasts to attract tech money and entrepreneurs, but the city is now rethinking its position following the downfall of FTX and the cryptocurrency crash of 2022. (Deborah Acosta / WSJ)
- Italy’s antitrust regulator opened an investigation into Apple over the company’s treatment of developers, in particular how it imposes more restrictive privacy policies than it requires of its own apps. (Alvise Armellini and Elvira Pollina / Retuers)
- An investigation found that Israel is home to major players in the global surveillance industry that traffic in geolocation, phone cracking, and account hijacking. (Crofton BlackOmer Benjakob / Haaretz)
- Microsoft’s new AI features for Bing have barely made a dent in Google’s search engine market share, and the Windows maker is now considering a deal with Firefox in an effort to capitalize on OpenAI’s tech. (Aaron Holmes / The Information)
- Bengaluru-based software engineer Sukuru Sai Vineet launched a ChatGPT clone called GitaGPT that quotes scripture from the Bhagavad Gita in the voice of the Hindu god Krishna. (Nadia Nooreyezdan / Rest of World)
- Author Ted Chaing makes a convincing argument that modern AI is on track to serve the function of consulting firms like McKinsey, which would disempower workers and accelerate job loss. (Ted Chiang / The New Yorker)
- Meta announced a new AI Sandbox feature to allow advertisers to experiment with generative AI for producing alternative text and image backgrounds and cropping photos. (Ivan Mehta / TechCrunch)
- AI news app Artifact launched writer profiles, and will now let readers follow specific writers to further personalize their feeds. Follow us! (Artifact / Medium)
- Elon Musk said he has hired a replacement for Twitter CEO and will transition to executive chair and chief technology officer. He didn’t name the chosen person, but added that “she starts in six weeks.” Dylan Byers (and later, the Wall Street Journal) say its NBCUniversal advertising chief Linda Yaccarino. Believe it when you see it. (Elon Musk / Twitter)
- Musk has also been openly engaging with right-wing conspiracies about the Allen, Texas shooting, calling the shooter’s reported ties to neo-Nazism “odd” and “very strange.” (Jordan Pearson / Motherboard)
- Twitter launched an early version of its encrypted DMs feature for Twitter Blue subscribers, but with security limitations and no group messaging or support for media files. “Don’t trust it yet,” Musk added helpfully. (Karissa Bell / Engadget)
- Twitter disabled auto-complete in its search bar after users reported the feature returning horrific results for animal abuse, war footage, and gore videos. Another good one for trust and safety teams to throw into their “why you shouldn’t lay us off” slide decks. (Matthew Gault / Motherboard)
- Bluesky grew 606% in April, with 628,000 downloads — but has a long road ahead to match Twitter, which had 14.2 million downloads last month. (MacKenzie Sigalos and Jonathan Vanian / CNBC)
- Roblox continues to lose money, but its recent earnings report showcased record-high DAUs (66 million, up 22%) and engagement hours (14.5 billion, up 23%). (Rohan Goswami / CNBC)
- YouTube appears to be testing a new pop-up warning asking users to either disable their ad blocker or pay for a YouTube Premium subscription before watching videos. (Sergiu Gatlan / Bleeping Computer)
- Disney plans to add Hulu programming to its Disney+ app and said it will raise the price of its ad-free tier later this year. (Lillian Rizzo / CNBC)
- Meta does not publish demographic information about Facebook users, but many signs point to an aging user base as the company struggles to regain relevance among younger people. (Barbara Ortutay / Associated Press)
- The U.S. live shopping industry is trying to make inroads with younger consumers in hopes of achieving some of the scale of Chinese live shopping platforms like Alibaba’s Taobao Live. The Chinese live shopping market is expected to bring in $647 billion this year. (Jordyn Holman and Kalley Huang / The New York Times)
Those good tweets
For more good tweets every day, follow Casey’s Instagram stories.