15:43 GMT - Sunday, 02 February, 2025

DeepSeek Deep Dive + Hands-On With Operator + Hot Mess Express!

Home - Artificial Intelligence - DeepSeek Deep Dive + Hands-On With Operator + Hot Mess Express!

Share Now:


This transcript was created using speech recognition software. While it has been reviewed by human transcribers, it may contain errors. Please review the episode audio before quoting from this transcript and email transcripts@nytimes.com with any questions.

kevin roose

I just got my weekly. I set up ChatGPT to email me a weekly affirmation before we start taping, because you can do that now with the Tasks feature.

casey newton

Yeah. People say this is the most expensive way to email yourself a reminder. So what sort of affirmation did we get?

kevin roose

Today it said, “You are an incredible podcast host, sharp, engaging, and completely in command of the mic. You’re taping today is going to be phenomenal, and you’re going to absolutely kill it.”

casey newton

Wow. And that’s why it’s so important that ChatGPT can’t actually listen to podcasts, because I don’t think it would say that if it had actually ever heard us.

kevin roose

It would say, just get this over with.

casey newton

Get on with it!

[MUSIC PLAYING]

kevin roose

I’m Kevin Roose, a tech columnist at “The New York Times.”

casey newton

I’m Casey Newton from “Platformer.”

kevin roose

And this is “Hard Fork.”

casey newton

This week we go deeper on DeepSeek. “ChinaTalk’s” Jordan Schneider joins us to break down the race to build powerful AI. Then, Hello, operator. Kevin and I put OpenAI’s new agent software to the test. And finally, the train is coming back to the station for a round of “Hot Mess Express.”

[MUSIC PLAYING]

kevin roose

Well, Casey, it is rare that we spend two consecutive episodes of this show talking about the same company. But I think it is fair to say that what is happening with DeepSeek has only gotten more interesting and more confusing.

casey newton

Yeah. That’s right. It’s hard to remember a story in recent months, Kevin, that has generated quite as much interest as what is going on with DeepSeek. Now, DeepSeek, for anyone catching up, is this relatively new Chinese AI startup that released some very impressive and cheap AI models this month that lots of Americans have started downloading and using.

kevin roose

Yeah, so some people are calling this a Sputnik moment for the AI industry, when every nation perks up and starts paying attention at the same time to the AI arms race. Some people are saying this is the biggest thing to happen in AI since the release of ChatGPT. But Casey, why don’t you just catch us up on what has been happening since we recorded our emergency podcast episode just two days ago.

casey newton

Well, I would say that there have probably been three stories, Kevin, that I would share to give you a quick flavor of what’s been going on. One, a market research firm says DeepSeek was downloaded 1.9 million times on iOS in recent days and about 1.2 million times on the Google Play Store.

The second thing I would point out is that DeepSeek has been banned by the US Navy over security concerns, which I think is unfortunate because what is a submarine doing if not deep-seeking.

It was also banned in Italy, by the way, after the data protection regulator made an inquiry. And finally, Kevin, OpenAI says that there is evidence that DeepSeek distilled its models. Distillation is the AI lingo, or euphemism, for they used our API to try to unravel everything we were doing, and use our data in ways that we don’t approve of. And now Microsoft and OpenAI are now jointly investigating whether DeepSeek abused their API. And of course, we can only imagine how OpenAI is feeling about the fact that their data might have been used without payment or consent.

kevin roose

Yeah, it must be really hard to think that someone might be out there training AI models on your data without permission. And I want to acknowledge that literally every single user of Bluesky already made this joke, but they were all funny. And I’m so happy to repeat it here on “Hard Fork” this week.

casey newton

Now, Kevin, as always when we talk about AI, we have certain disclosures to make.

kevin roose

“The New York Times” Company is currently suing OpenAI and Microsoft over copyright violations alleged related to the use of their copyrighted data to train AI models. I think that was good.

casey newton

That was very good. And I’m in love with a man who works at Anthropic. Now, with that said, Kevin, we have even further we want to go into the DeepSeek story. And we want to do it with the help of Jordan Schneider.

kevin roose

Yes. We are bringing in the big guns today because we wanted to have a more focused discussion about DeepSeek that is not about the stock market or how the American AI companies are reacting to this, but is about one of the biggest sets of questions that all of this raises, which is, what is China up to with DeepSeek and AI more broadly? What are the geopolitical implications of the fact that Americans are now obsessing over this Chinese-made AI app? What does it mean for DeepSeek’s prospects in America?

What does it mean for their prospects in China? And how does all this fit together from the Chinese perspective? So Jordan Schneider is our guest today. He’s the founder and editor in chief of “ChinaTalk,” which is a very good newsletter and podcast about US-China tech policy.

He’s been following the Chinese AI ecosystem for years. And unlike a lot of American commentators and analysts who were sort of surprised by DeepSeek and what they managed to pull off over the last couple of weeks —

casey newton

I’ll say it. I was surprised.

kevin roose

Yeah, me too. But Jordan has been following this company for a long time. And a big focus of “ChinaTalk,” his newsletter and podcast, has been translating, literally, what is going on in China into English, making sense of it for a Western audience, and keeping tabs on all the developments there. So perfect guest for the week’s episode, and I’m very excited for this conversation.

casey newton

Yes. I have learned a lot from “ChinaTalk” in recent days as I’ve been boning up on DeepSeek. So we’re excited to have Jordan here. And let’s bring him in.

[MUSIC PLAYING]

Jordan Schneider, welcome to “Hard Fork.”

jordan schneider

Oh, my god, such a huge fan — this is such an honor.

casey newton

We’re so excited to have you. I have learned, truly, so much from you this week. And so when we were talking about what to do this week, we just looked at each other and said, we have got to see if Jordan can come on this podcast.

kevin roose

Yeah. So this has been a big week for Chinese tech policy, maybe the biggest week for Chinese tech policy, at least that I can remember. I realized that something important was happening last weekend when I started getting texts from all of my non-tech friends being like, “what is going on with DeepSeek?” And I imagine you had a similar reaction, because you are a person who does constantly pay attention to Chinese tech policy.

jordan schneider

So I’ve been running “ChinaTalk” for eight years. And I can get my family members to maybe read, like, one or two editions a year. And the same exact thing happened with me, Kevin, where all of a sudden I got, “Oh, my god, DeepSeek, it’s on the cover of the ‘New York Post.’ Jordan, you’re so clairvoyant. Maybe I should read you more.” I’m like, OK, thanks, Mom. I appreciate that.

kevin roose

Yeah. So I want to talk about DeepSeek and what they have actually done here. But I’m hoping, first, that you can give us the basic lay of the land of the Chinese AI ecosystem, because that’s not an area where Casey or I have spent a lot of time looking. But tell us about DeepSeek and where it sits in the overall Chinese industry.

jordan schneider

So DeepSeek is a really odd duck. It was born out of this very successful quant hedge fund. The CEO of which, basically after ChatGPT was released, was like, OK, this is really cool. I want to spend some money and some time and some compute and hire some fresh, young graduates to see if we can give it a shot to make our own language models.

casey newton

And so a lot of companies are out there building their own large language models. What was the first thing that happened that made you think, oh, this one, this company is actually making some interesting ones?

jordan schneider

Sure. So there are lots and lots of very moneyed Chinese companies that have been trying to follow a similar path after ChatGPT. We have giant players like Alibaba, Tencent, ByteDance, Huawei even, trying to create their own OpenAI, basically. And what is remarkable is the big organizations can’t quite get their head around creating the right organizational institutional structure to incentivize this type of collaboration in research that leads to real breakthroughs.

So Chinese firms have been releasing models for years now. But DeepSeek, because of the way that it structured itself and the freedom they had, not necessarily being under a direct profit motive, they were able to put out some really remarkable innovations that caught the world’s attention, starting maybe late December, and then really blew everyone’s mind with the release of the R1 chatbot.

casey newton

Yeah. So let’s talk about R1 in just a second. But one more question for you, Jordan, about DeepSeek — what do we know about their motivation here? Because so much of what has been puzzling American tech industry watchers over the last week is that this is not a company that has an obvious business model connected to its AI research. We know why Google is developing AI, because it thinks it’s going to make the company, Google, much more profitable. We know why OpenAI is developing advanced AI models.

It does not seem obvious to me, and I have not read anything from people involved in DeepSeek about why they are actually doing this and what their ultimate goal is. So can you help us understand that?

jordan schneider

So we don’t have a lot of data. But my base case, which is based on two extended interviews that the DeepSeek CEO released, which we’ve translated on “ChinaTalk,” as well as just what DeepSeek employees have been tweeting about in the West and then domestically, is that they’re dreamers. I think the right mental model is OpenAI 2017 to 2022.

Like, I’m sure you could ask the same thing. Like, what the hell are they doing? I mean, Sam Altman literally said, “I have no idea how we’re ever going to make money.” Right? And here we are in this grand new paradigm. So I really think that they do have this vision of AGI and look, we’ll build it and we’ll make it cheaper for everyone. And we’ll figure it out later.

They have enough trading strategies that they can fund it. And now that they’ve really blown people’s minds, we might be turning into a new period in DeepSeek’s history, kind of like what happened with OpenAI, where they’re going to have to shack up with a hyperscaler, be it not Microsoft in this case, but ByteDance or Ali or Tencent or Huawei. And the government’s going to start to pay attention in a way which it really hasn’t over the past few years.

kevin roose

Right. And I want to drill down a little bit there, because I think one thing that most listeners in the West do know about Chinese tech companies is that many of them are inextricably linked to the Chinese government, that the Chinese government has access to user data under Chinese law, that these companies have to follow the Chinese censorship guidelines.

And so as soon as DeepSeek started to really pop up in America over the last week, people started typing in things to DeepSeek’s model, like, tell me about what happened at Tiananmen Square, or tell me about Xi Jinping, or tell me about the Great Leap Forward. And it just of wouldn’t do it at all. And so people, I think, saw that and said, oh, this is like every other Chinese company that has this hand-in-glove relationship with the Chinese ruling party.

But it sounds, from what you’re saying, like DeepSeek a little bit more complicated a relationship to the Chinese government than maybe some other better-known Chinese tech companies. So explain that.

jordan schneider

Yeah. I mean, I think it’s important — the mental model you should have for these CEOs are not like people who are dreaming to spread Xi Jinping thought. What they want to do is compete with Mark Zuckerberg and Sam Altman and show that they’re really awesome and great technologists. But the tragedy is, let’s take ByteDance, for example. You can look at Zhang Yiming, their CEO’s Weibo posts from 2012, 2013, 2014, which are super liberal in a Chinese context, saying, like, we should have freedom of expression. We should be able to do whatever we want.

In the early years of ByteDance, there was a lot of relatively more subversive content on the platform, where you saw real poverty in China, you saw off-color jokes. And then all of a sudden, in 2018, he posts a letter saying, I am really sorry. I need to be part of this Chinese national project and better adhere to modern Chinese socialist values. And I’m really sorry, and it won’t ever happen again.

The same thing happened with DiDi. They don’t really want to have to do anything with politics. And then they get on someone’s side, and all of a sudden they get zapped.

casey newton

DiDi is, of course, the big Chinese rideshare company.

jordan schneider

Correct. Yeah.

casey newton

What did DiDi do do?

jordan schneider

So they listed on the Western Stock Exchange after the Chinese government told them not to. And then they got taken off app stores, and it was a whole giant nightmare. They had to go through their rectification process. So point being with DeepSeek is now they are, whether they like it or not, going to be held up as a national champion. And that comes with a lot of headaches and responsibilities, from potentially giving the Chinese government more access, having to fulfill government contracts, which honestly, are probably really annoying for them to do and distracting from the broader mission they have of developing and deploying this technology in the widest range possible.

But DeepSeek, thus far, has flown under the radar. But that is no longer the case. And things are about to change for them.

kevin roose

Right. And I think that was one of the surprising things about DeepSeek for the people I know, including you, who follow Chinese tech policies. I think people were surprised by the sophistication of their models — and we talked about that on the emergency pod that we did earlier this week — and how cheaply they were trained. But I think the other surprise is that they were released as open-source software. Because one thing that you can do with open-source software is download it, host it in another country, remove some of the guardrails and the censorship filters that might have been part of the original model. China —

casey newton

And by the way, it turned out there weren’t even really guardrails on the V3 model. It had not been trained to avoid questions about Tiananmen Square or anything. So that was another really unusual thing about this.

kevin roose

Right. And one thing that we know about Chinese technology products is that they don’t tend to be released that way. They tend to be hosted in China and overseen by Chinese teams who can make sure that they’re not out there talking about Tiananmen Square. So is the open-source nature of what DeepSeek has done here part of the reason that you think there might be conflict looming between them and the Chinese government?

jordan schneider

You know, honestly, I think this whole ask it about Tiananmen stuff is a bit of a red herring on a few dimensions. So first, one of these arguments that is a little sort of confusing to me is, like, folks used to say, oh, the Chinese models are going to be lobotomized, and they will never be as smart as the Western ones because they have to be politically correct. I mean, look. If you ask Claude to say racist things, it won’t. And Claude, still pretty smart.

This is sort of a solved problem and a bit of in a bit of a red herring when talking about long-term competitiveness of Chinese and Western models. Now, you asked me, like oh, so they released this model globally and it’s open-source. Maybe someone in the Chinese government would be uncomfortable with the fact that people can get a Chinese model to say things that would get you thrown in jail if you posted them online in China.

It’s going to be a really interesting calculus for the Chinese government to make. Because on the one hand, this is the most positive shine that Chinese AI has got globally in the history of Chinese AI. So they’re going to have to navigate this. And it might prompt some uncomfortable conversations and bring regulators to a place they wouldn’t have otherwise landed.

kevin roose

Yeah. Now, Jordan, I want to ask you about something that people have been talking about and speculating about in relationship to the DeepSeek news for the last week or so, which is about chip controls. So we’ve talked a little bit on the show earlier this week about how DeepSeek managed to put together these models using some of these kind of second rate chips from NVIDIA that are allowed to be exported to China.

We’ve also talked about the fact that you cannot get the most powerful chips legally if you are a Chinese tech company. So there have been some people, including Elon Musk and other American tech luminaries, who have said, oh, well, DeepSeek has this sort of secret stash of these banned chips that they have smuggled into the country, and that actually they are not making do with the Kirkland Signature chips that they say they are. What do we know about how true that is?

jordan schneider

So, did DeepSeek have banned chips? It’s kind of impossible to know. This is a question more for the US Intelligence community than Jordan Schneider on Twitter. But I do think that it is important to understand that the delta between what you can get in the West and what you can get in China is actually not that big. And we’re talking about training a lot, but also on the inference side, China can still buy this H20 chip from NVIDIA, which is basically world-class at deploying the AI and letting everyone use it.

So does this mean that we should just give up? I don’t think so. Compute is going to be a core input, regardless of how much model distillation you’re going to have in the future. There have been a lot of quotes, even, from the DeepSeek founder, basically saying, the one thing that’s holding us back are these export controls.

casey newton

Right. OK. I want to ask a big-picture question.

jordan schneider

Sure.

casey newton

I think that a reason that people have been so fascinated by this DeepSeek story is that, at least for some folks, it seems to change our understanding of where China is in relation to the United States when it comes to developing very powerful AI. Jordan, what is your assessment of what the V3 and R1 models mean? And to what extent do you think the game has actually changed here?

jordan schneider

I’m not really sure the game has changed so much. Chinese engineers are really good. I think it is a reasonable base case that Chinese firms will be able to develop comparable or fast-follow on the model side. But the real long-term competition is not just going to be on developing the models, but deploying them, and deploying them at scale. And that’s really where compute comes in. And that’s why export controls are going to continue to be a really important piece of America’s strategic arsenal when it comes to making sure that the 21st century is defined by the US and our friends, as opposed to China and theirs.

casey newton

Right. It’s one thing to have a model that is about as capable as the models that we have here in the United States. It’s another thing to have the energy to actually let everyone use them as much as they want to use them. But what you’re saying is, no matter what DeepSeek may have invented here, that fundamental dynamic has not changed. China simply does not have nearly the amount of compute that the United States has.

jordan schneider

As long as we don’t screw up export controls. So I think the base case for me is that, if the US stays serious about holding the line on semiconductor manufacturing equipment and export of AI chips, then it will be incredibly difficult for the Chinese broader semiconductor and AI ecosystem to leap ahead, much less fast-follow beyond being able to develop comparable models.

I’m feeling good as long as Trump doesn’t make some crazy trade for soybeans in exchange for ASML EUV machines. That would really break my heart.

kevin roose

I want to inject a note of skepticism here, because I buy everything that you’re saying about how DeepSeek’s progress has been bottlenecked by the fact that it can’t get these very powerful American AI I chips from companies like NVIDIA. But I also am hearing people who I trust say things that make me think that, actually, the bottleneck may not be the availability of chips, that maybe with some of these algorithmic efficiency breakthroughs that DeepSeek and others have been making, it might be possible to run a very, very powerful AI model on a conventional piece of hardware, on a MacBook even. And I wonder about how much of this is just AI companies in the West trying to cope, trying to make themselves feel better, trying to reassure the market that they are still going to make money by investing billions and billions of dollars into building powerful AI systems.

If these models do just become lightweight commodities that you can run on a much less powerful cluster of computers, or maybe on one computer, doesn’t that just mean we can’t control the proliferation of them at all?

jordan schneider

Yeah. I mean, I think this is one potential future. And maybe that potential future went up 10 percentage points of likelihood of you being able to fit the biggest, baddest, smartest, most fast, efficient AI model on something that can sit in your home. But I think there are lots of other futures in which the world doesn’t necessarily play out that way. And look. NVIDIA went down 15 percent. It didn’t go it didn’t go down 95 percent. I think, if we’re really in that world where chips don’t matter because everything can be shrunk down to consumer-grade hardware, then the reaction that I think you would have seen in the stock market would have been even more dramatic than the kind of freak out we saw over this week. So we’ll see.

I mean, it would be a really remarkable, democratizing thing if that was the future we ended up living in. But it still seems pretty unlikely to my history-major brain here.

casey newton

I would also just point out, Kevin, that when you look at what DeepSeek has done, they have created a really efficient version of a model that American companies themselves had trained 9 to 12 months ago. So they sort of caught up very quickly. And there are fascinating technological innovations in what they did. But in my mind, these are still primarily optimizations.

Like for me, what would tip me over into oh, my gosh, America is losing this race, is China is the first one out of the gate with a virtual coworker, or like a truly phenomenal agent, some sort of leap forward in the technology as opposed to, we’ve caught up really quickly, and we’ve figured out something more efficiently. Jordan, are you seeing it differently than that?

kevin roose

I mean, I guess I just don’t know what a six-month lag would buy us, if it does take six months for the Chinese AI companies like DeepSeek to catch up to the state of the art. I was struck by Dario Amodei, who’s the CEO of Anthropic, wrote an essay just today about DeepSeek and export controls. And in it, he makes this point about the difference between living in what he called a unipolar world, where one country or one block of countries has access to something like an AGI or an ASI and the rest of the world doesn’t, versus the situation where China gets there roughly around the same time that we do.

And so we have this bipolar world, where two blocks of countries, the East and the West, basically have access to this equivalent technology. And so —

casey newton

And of course, in a bipolar world, sometimes we’re very happy, and sometimes we’re very sad.

jordan schneider

[LAUGHS]: Exactly.

kevin roose

So I just think, whether we get there six months ahead of them or not, I just feel like there isn’t that much of a material difference. But, Jordan, maybe I’m wrong. Can you make the other side of that, that it really does matter?

jordan schneider

Well, I’m kind of there. I’ll take a little bit of issue with what Dario says. And I think one of the lessons that DeepSeek shows is we should expect a base case of Chinese model makers being able to fast-follow the innovations, which, by the way, Casey, actually do take those giant data centers to run all the experiments in order to find out what is the future direction you want to take your model. And what sort of AI is going to come down to is not just creating the model, not just Dario envisioning the future and then all of a sudden things happen.

There’s going to be a lot of messiness in the implementation. And there are going to be teachers unions who are upset that AI comes in the classroom. And there are going to be all these regulatory pushbacks and a lot of societal reorganization, which is going to need to happen, just like it did during the Industrial Revolution.

So look, model making is a frontier of competition. Compute access is a frontier of competition. But there’s also this broader, how will a society adopt and cope with all of this new future that’s going to be thrown in our faces over the coming years? And I really think it’s that, just as much as the model development and the compute, which is going to determine which countries are going to gain the most from what AI is going to offer us.

kevin roose

Yeah. Well, Jordan, thank you so much for joining and explaining all of this to us. I feel more enlightened.

casey newton

Me too.

jordan schneider

Oh, my pleasure.

kevin roose

My chain of thought has just gotten a lot longer. That’s an AI joke.

casey newton

When we come back, Kevin, there’s an agent at our door.

kevin roose

Is it Jerry Maguire?

casey newton

No, it’s an AI one.

kevin roose

Oh, OK.

[laughing]

Jerry Maguire. I don’t know.

casey newton

Is it Jerry Maguire?

[TECHNO MUSIC]

(SINGING) Operator, information, give me Jesus on the line. Do you know that one?

kevin roose

No.

casey newton

Do you know “Operator” by Jim Croce?

kevin roose

No.

casey newton

(SINGING) Operator, oh, won’t you help me place this call.

kevin roose

[LAUGHS]: Well, Casey, call your agent, because today we’re talking about AI agents.

casey newton

Why do I need to call my agent?

kevin roose

I don’t know. It just sounded good.

casey newton

OK. Well, I appreciate the effort. But, yes, Kevin, because for months now, the big AI labs have been telling us that they are going to release agents this year. Agents, of course, being software that can essentially use your computer on your behalf or use a computer on your behalf. And the dream is that you have of a perfect virtual assistant or co-worker.

You name it. If they are somebody who might work with you at your job, the AI labs are saying, we are building that for you.

kevin roose

Yeah. So last year, toward the end of the year, we started to see these demos, these previews that companies like Anthropic and Google were working on. Anthropic released something called computer use, which was an AI agent, a sort of very early preview of that. And then Google had something called Project Mariner that I got a demo of, I believe in December, that was basically the same thing, but their version of it.

And then just last week, OpenAI announced that it was launching Operator, which is its first version of an AI agent. And unlike Anthropic and Google’s, which you either had to be a developer or part of some early testing program to access, you and I could try it for ourselves by just upgrading to the $200-a-month Pro subscription of ChatGPT.

casey newton

Yeah. And I will say that, as somebody who’s willing to spend money on software all the time, I thought, am I really about to spend $200 to do this? But in the name of science, Kevin, I had to. At this point, I am spending more on AI subscription products than on my mortgage. I’m pretty sure that’s correct. But it’s worth it. We do it for journalism.

kevin roose

We do. So we both spent a couple of days putting Operator through its paces. And today we want to talk a little bit about what we found. Yeah. So would you just explain what Operator is and how it works?

casey newton

Yeah, sure. So Operator is a separate subdomain of ChatGPT. Sometimes the ChatGPT will just let you pick a new model from a dropdown menu. But for Operator, you’ve got to go to a dedicated site. Once you do, you’ll see a very familiar chatbot interface. But you’ll see different kinds of suggestions that reflect some of the partnerships that OpenAI has struck up.

So, for example, they have partnerships with OpenTable, and StubHub, and Allrecipes. And these are meant to give you an idea of what Operator can do. And frankly, Kevin, not a lot of this sounds that interesting. Right? Like, the suggestions are on the order of suggest a 30-minute meal with chicken, or reserve a table for eight, or find the most affordable passes to the Miami Grand Prix. Again, so far kind of so boring.

What is different about Operator, though, is that when you say, OK, find the most affordable passes to the Miami Grand Prix, when you hit the Enter button, it is going to open up its own web browser. And it’s going to use this new model that they’ve developed to try to actually go and get those passes for you.

kevin roose

Yeah. So this is an important thing because I think when people first heard about this, they thought, OK, this is an AI that kind of takes over your computer, takes over your web browser. That is not what Operator does. Instead, it opens a new browser inside your browser. And that browser is hosted on OpenAI’s servers.

It doesn’t have your bookmarks and stuff like that saved. But you can take it over from the autonomous AI agent, if you need to click around or do something on it. But it basically exists — it’s a browser within a browser.

casey newton

Yeah. So one of the ideas in Operator is that you should be able to leave it unsupervised and just go do your work while it works. But of course, it is very fun, initially at least, to watch the computer try to use itself. And so I sat there in front of this browser within a browser, and I watched this computer move a mouse around, type the URL, navigate to a website, and, in the example I just gave, actually search for passes to the Miami Grand Prix.

kevin roose

Yeah and it’s interesting on a slightly more technical level because, until now, if an AI system, like a ChatGPT, wanted to interact with some other website, it had to do so through an API. APIs, Application Programming Interfaces, are the way that computers talk to each other. But what Operator does is essentially eliminate the need for APIs, because it can just click around on a normal website that is designed for humans and behave like a human. And you don’t need a special interface to do that.

casey newton

Yeah. And now some people might hear that, Kevin, and start screaming, because what they will say is, APIs are so much more efficient than what Operator is doing here. APIs are very structured. They’re very fast. They let computers talk to each other without having to, for example, open up a browser. And as long as there’s an API for something, you can typically get it done pretty quickly.

The thing is, though, APIs have to be built. There is a finite number of them. The reason that OpenAI is going through this exercise is because they want a true general-purpose agent that can do anything for you, whether there is an API for it or not.

kevin roose

And maybe we should just pause for a minute there and zoom out a little bit to say, why are they building this? What is the long-term vision here?

casey newton

Sure. So the vision is to create virtual coworkers, Kevin. This is the North Star for the big AI labs right now. Many of them have said that they are trying to create some kind of digital entity that you can just hire as a coworker. The first ones, they’ll probably be engineers because these systems are already so good at writing code. But eventually, they want to create virtual consultants, virtual lawyers, virtual doctors. You name it.

kevin roose

Virtual podcast hosts?

casey newton

Let’s hope they don’t go that far. But everything else is on the table. And if they can get there, presumably there are going to be huge profits in it for them. There are going to potentially be huge productivity gains for companies. And then there’s, of course, the question of, well, what does this mean for human beings. And I think that’s somewhat murkier.

kevin roose

Right. And I think there’s also — it also helps to justify the cost of running these things, because $200 a month is a lot to pay for a version of ChatGPT. But it’s not a lot to pay for a remote worker. And if you could, say, use the next version of Operator, or maybe two or three versions from now, to, say, replace a customer service agent or someone in your billing department, that actually starts to look like a very good deal.

casey newton

Absolutely. Or even if I could bring it into the realm of journalism, Kevin. If I had a virtual research assistant and I said, hey, I’m going to write about this today. Go pull all of the most relevant information about this from the past couple of years, and maybe organize it in such a way that I might write a column based off of it. Like, yeah, that’s absolutely worth $200 a month to me.

kevin roose

OK. So Casey, walk me through something that you actually asked Operator to do for you and what it did autonomously on its own.

casey newton

Sure. I’ll maybe give two examples, a pretty good one, and maybe a not so good one. Pretty good one was — and this was actually suggested by Operator. I used Tripadvisor to look up walking tours in London that I might want to do the next time I’m in London. When I did that —

kevin roose

When are you going to London?

casey newton

I’m not actually going to London.

kevin roose

Oh, so you lied to the AI.

casey newton

And not for the first time. But here’s what I’ll say. If anybody wants to bring Kevin and I to London, get in touch. We love the city.

kevin roose

Yeah.

casey newton

So I said, OK, Operator. Sure, let’s do it. Let’s find me some walking tours. I clicked that. It opened a browser. It went to Tripadvisor. It searched for London walking tours. It read the information on the website. And then it presented it to me — did that within a couple of minutes.

Now, on one hand, could I have done that just as easily by Google? Could I probably have done it even faster if I’d done it myself? Sure. But if you’re just sort of interested in the technical feat that is getting one of these models to open a browser, navigate to a website, read it, and share information, I did think it was pretty cool.

kevin roose

Yes. It’s very trippy to see a computer using itself and going around typing things in, selecting things from dropdown menus.

casey newton

Yeah. It’s sort of like, if you think it is cool to be in a self-driving car, this is that but for your web browser,

kevin roose

A self-driving browser.

casey newton

It is a self-driving browser. So that was the good example.

kevin roose

Yes. What was another example?

casey newton

So another example — and this was something else that OpenAI suggested that we try, was to try to use Operator to buy groceries. And they have a partnership with Instacart. The CEO of Instacart, Fidji Simo, was on the OpenAI board. And so I thought, OK, they’re going to have dialed this in so that there’s a pretty good experience.

And so I said, OK, let’s go ahead and buy groceries. And I went into Operator and I said something like, hey, can you help me buy groceries on Instacart? And it said, sure. And here’s what it did. It opened up Instacart in a browser — so far, so good. And then it started searching for milk in stores located in Des Moines, Iowa.

kevin roose

[LAUGHS]: Now, you do not live in Des Moines, Iowa. So why did it think that you did?

casey newton

As best as I can tell, the reason it did this is that Instacart defaults to searching for grocery stores in the local area. And the server that this instance of Operator was running on was in Iowa.

kevin roose

Mmm.

casey newton

Now, if you were designing a grocery product, like Instacart — and Instacart does this — when you first sign on and say you’re looking for groceries, it will say, quite sensibly, where are you?

kevin roose

Right.

casey newton

Operator does not do this. Instacart might also offer suggestions for things that you might want to buy. It does not just assume that you want milk.

kevin roose

Wow. I’m just picturing a house in Des Moines, Iowa, where there’s just, like, a pallet of milk being delivered every day from all these poor Operator users.

casey newton

Yes. So I thought, OK, whatever. This thing makes mistakes. Let’s hope that it gets on the right track here. And so I tried to pick the grocery store that I wanted it to shop at, which is in San Francisco, where I live. And it entered that grocery store’s address as the delivery address. So it would try to deliver groceries, presumably from Des Moines, Iowa, to my grocery store, which is not what I wanted.

And it actually could not solve this problem without my help. I had to take over the browser, log into my Instacart account, and tell it which grocery store that I wanted to shop at. So already, all of this has taken at least 10 times as long as it would have taken me to do this myself.

kevin roose

Yeah. So I had some similar experiences. The first thing that I had Operator try to do for me was to buy a domain name and set up a web server for a project that you and I are working on that we can’t really talk about yet. But —

casey newton

Secret project.

kevin roose

Secret project. And so I said to Operator, I said, go research available domain names related to this project. Buy the one that costs less than $50, the best one that costs less than $50. And then buy a hosting account and set it up and configure all the DNS settings and stuff like that.

casey newton

OK, so that was like a true multi-step project and something that would have been legitimately very annoying to do yourself.

kevin roose

Yes. That would have taken me, I don’t know, half an hour to do on my own. And it did take Operator some time. I had to set it and forget it. And I got myself a snack and a cup of coffee. And then when I came back, it had done most of these tasks.

casey newton

Really?

kevin roose

Yes. I had to still do things take over the browser and enter my credit card number. I had to give it some details about my address for the registration for the domain name. I had to pick between the various hosting plans that were available on this website. But it did 90 percent of the work for me. And I just had to take over and do the last mile.

casey newton

And this is really interesting to me because what I would assume was it would get, like, I don’t know, 5 percent of the way, and it would hit some hiccup, and it just wouldn’t be able to figure something out until you came back and saved it. But it sounds like from what you’re saying it was somehow able to work around whatever unanswered questions there were and still get a lot done while you weren’t paying attention?

kevin roose

So it sort of — it felt a little bit like training a very new, very insecure intern. Because at first it would keep prompting me. It’d be like, well, do you want a dot com or a dot net? And eventually you just have to prompt it and say, make whatever decisions you want.

casey newton

Wait. You said that to it?

kevin roose

Yes. I said, only ask for my intervention if you can’t progress any further. Otherwise, just make the most reasonable decision.

casey newton

You said, I don’t care how many people you have to kill, just get me this domain. And it said, understood, sir.

kevin roose

Yeah. And I’m now wanted in 42 states. Anyway, that was one thing that Operator did for me that I thought was pretty impressive.

casey newton

I have to say, that feels like a grand success compared to what I got Operator to do.

kevin roose

Yeah, it was pretty impressive. I also had it send lunch to one of my coworkers, Mike Isaac, who was hungry because he was on deadline. And I went, I said, go to DoorDash and get Mike some lunch. It did initially mess up that process because it decided to send him tacos from a taco place, which is great. And it’s a taco place I know. It’s very good.

But I said, order enough for two people. And so it ordered two tacos. And this is one of those places where the tacos are quite small.

casey newton

Operator said, get your portion size under control, America.

kevin roose

Yeah. So then I had to go in and say, does that sound like enough food, Operator? And it said, actually, now that you mention it, I should probably order more.

casey newton

Wait, now. So here’s a question. So in these cases, is the first step that you log into your account? Because it doesn’t have any of your payment details or anything. So at what point are you actually teaching it that?

kevin roose

It depends on the website. So sometimes you can just say up front, here is my email address, or here’s my login information. And it will log you in and do all that. Sometimes you take over the browser. There are some privacy features that are probably important to people, where it says, OpenAI says that it does not take screenshots of the browser while you are in control of it because you might not want your credit card information getting sent to OpenAI’s servers or anything like that.

So sometimes it happens at the beginning of the process. Sometimes it happens when you’re checking out at the end.

casey newton

And so were you taking it over to login, or were you saying, I don’t care? And you just were giving Operator your DoorDash password in plain text.

kevin roose

I was taking it over.

casey newton

OK, smart. Smart.

kevin roose

So those were the good things. I also — this was a fun one. I wanted to see if Operator could make me some money. So I said, go take a bunch of online surveys. Because there are all these websites where you can get a couple cents for filling out an online survey.

casey newton

Something that most people don’t know about Kevin is he devotes 10 percent of his brain at any given time to thinking about schemes to generate money. And it’s one of my favorite aspects of your personality that I feel like doesn’t get exposed very much. But this is truly the most Roosian approach to using Operator I can imagine. So I can’t wait to find out how this went.

kevin roose

Well, the most Roosian approach might have been what I tried just before this, which was to have it go play online poker for me. But —

casey newton

AI?

kevin roose

It did not do it. It said, I can’t help with gambling or lottery-related activities.

casey newton

OK, Woke AI. Does the Trump administration know about this?

kevin roose

But it was able to actually fill out some online surveys for me. And it earned $1.20.

casey newton

Is that right?

kevin roose

Yeah, in about 45 minutes.

casey newton

OK. So if you had it going all month, presumably you could maybe eke out the $200 to cover the cost of Operator Pro?

kevin roose

Yes. And I’m sure I spent hundreds of dollars worth of GPU computing power just to be able to make that $1.20. But hey, it worked.

casey newton

But hey, it worked.

kevin roose

So those were some of the things that I tried. There were some other things that it just would not do for me, no matter how hard I tried.

casey newton

Like what?

kevin roose

So one of them was to — I was trying to update my website and put some links to articles that I’d written on my website. And what I found, after trying to do this, was that there are just websites where Operator is not allowed to go. And so when I said to Operator, go pull down these “New York Times” articles that I wrote and put them onto my website, it said, I can’t get to the “New York Times” website.

casey newton

I’m going to guess you expected that to happen.

kevin roose

Well, I thought maybe it has some clever workaround, and maybe I should alert the lawyers at the “New York Times” if that’s the case. But no, I assumed that if any website were to be blocking the OpenAI webcrawlers, it would be the “New York Times.”

casey newton

Yeah.

kevin roose

But there are other websites that have also put up similar blockades to prevent Operator from crawling them. Reddit, you cannot go onto with Operator. YouTube, you cannot go on to with Operator, various other websites. GoDaddy, for some reason, did not allow me to use Operator to buy a domain name there. So I had to use another domain name site to do that.

So right now there are some pretty janky parts of Operator. I would not say that most people would get a lot of value from using it. But what do you think?

casey newton

Well, I do think that there is something just undeniably cool about watching a computer use itself. Of course, it can also be quite unsettling. A computer that can use itself can cause a lot of harm. But I also think that it can do a lot of good. And so it was fun to try to explore what some of those things could be.

And to the extent that operator is pretty bad at a lot of tasks today, I would point out that it showed pretty impressive gains on some benchmarks. So there is one benchmark, for example, that Anthropic used when they unveiled computer use last year. And they scored 14.9 percent on something called OSWorld, which is an evaluation for testing agents, so not great.

Just three months later, OpenAI said that its CUA model scored 38.1 percent on the same evaluation. And of course, we see this all the time in AI, where there’s just this very rapid progress on these benchmarks. And so on one hand, 38.1 percent is a failing grade on basically any test. On the other hand, if it improves at the same rate over the next three to six months, you’re going to have a computer that is very good at using itself. So that I just think is worth noting.

kevin roose

Yes. I think that’s plausible. We’ve obviously seen a lot of different AI products over the last couple of years start out being pretty mediocre and get pretty good within a matter of months. But I would give one cautionary note here. And this is actually the reason that I’m not particularly bullish about these kind of browser-using AI agents. I don’t think the internet is going to sit still and allow this to happen.

The internet is built for humans to use. It is — every news publisher that shows ads on their website, for example, prices those ads based on the expectation that humans are actually looking at them. But if browser agents start to become more popular, and all of a sudden 10 percent or 20 percent or 30 percent of the visitors to your website are not actually humans but are, instead, Operator or some similar system, I think that starts to break the assumptions that power the economic model of a lot of the internet.

casey newton

Now, is that still true if we find that the agents actually get persuaded by the ads, and that if you send Operator to buy DoorDash and it sees an ad for McDonald’s, it’s like, you know what? That’s a great idea. I’m going to ask Kevin if he actually wants some of that.

kevin roose

Totally that’s — I actually think you’re joking, but I actually think that is a serious possibility here, is that people who build e-commerce sites, Amazon, et cetera, start to put in, basically, signals and messages for browser agents to look at on their website to try to influence what it ends up buying. And I think you may start to see restaurants popping up in certain cities with names like Operator, Pick Me; or Order From This One, Mr. Bot. That’s maybe a little extreme. But I do think that there’s going to be a backlash among websites, publishers, e-commerce vendors, as these agents start to take off.

casey newton

I think that is reasonable. I’ll tell you what I’ve been thinking about is, how do we turn this tech demo into a real product? And the main thing that I noticed when I was testing Operator was there is a difference between an agent that is using “a” browser and an agent that is using “your” browser.

When an agent is able to use your browser, which it can’t right now, it’s already logged into everything. It already has your payment details. It can do everything so much faster and more seamlessly and without as much hand-holding. Of course, there are also so many more privacy and security risks that would come from entrusting an agent with that kind of information.

So there is some sort of chasm there that needs to be closed. And I’m not quite sure how anyone does it. But I will tell you, I do not think the future is opening up these virtual browsers and me having to enter all of my login and payment details every single time I want to do anything on the internet. Because truly, I would rather just do it myself.

kevin roose

Right. I also think there’s just a lot more potential for harm here. A lot of AI safety experts I’ve talked to are very worried about this because what you’re essentially doing is letting the AI models make their own decisions and actually carry out tasks. And so you could imagine a world where an AI agent that’s very powerful, a couple versions from now, decides to start doing cyber attacks because maybe some malevolent user has told it to make money. And it decides that the best way to do that is by hacking into people’s crypto wallets and stealing their crypto.

casey newton

Yeah.

kevin roose

So those are the kinds of reasons that I am a little more skeptical that this represents a big breakthrough. But I think it’s really interesting. And it did give me that feeling of like, wow, this could get really good, really fast. And if it does, the world will look very different.

[MUSIC PLAYING]

casey newton

When we come back, Kevin, back that caboose up. It’s time for the “Hot Mess Express.”

kevin roose

You know Roose Caboose was my nickname in middle school.

casey newton

Kevin Cabruce.

kevin roose

[LAUGHS]: Choo choo!

[MUSIC PLAYING]

Well, Casey, we’re here wearing our train conductor hats. And my child’s train set is on the table in front of us, which can only mean, one thing.

casey newton

We’re going to train a large language model.

kevin roose

Nope, that’s not what that means.

casey newton

Oh, what does it mean?

kevin roose

It means it’s time to play a game of the “Hot Mess Express.” Pause for theme song.

[TRAIN WHISTLE BLOWS]

[TECHNO MUSIC]

casey newton

“Hot Mess Express,” Kevin, is our segment where we run through some of the messiest recent tech stories and deploy our official “Hot Mess” thermometer to tell you just how messy we think things have gotten. And Kevin, you better sit down for this one because it’s been a messy week.

kevin roose

Sure has.

casey newton

So why don’t we go ahead, fire up the Hot Mess Express and see what is the first story [INAUDIBLE]

kevin roose

Yeah, I hear — I hear a faint chugga chugga in my headphones.

[TRAIN WHISTLE BLOWS]

Oh, it’s pulling into the station. Casey, what’s the first cargo that our Hot Mess Express is carrying?

casey newton

All right, Kevin, this first story comes to us from the “New York Times.” And it says that Fable, a book app, has made changes after some offensive AI messages.

kevin roose

Now, Casey, have you ever heard of Fable, the book app?

casey newton

Well, not until this story, Kevin. But I am told that it is an app for keeping track of what you’re reading, not unlike a Goodreads, but also for discussing what you’re reading. And apparently, this app also offers some AI chat.

kevin roose

Yeah, you can have AI summarize the things that you’re reading in a personalized way. And this story said that in addition to spitting out bigoted and racist language, the AI inside Fable’s book app had told one reader, who had just finished three books by Black authors, quote, “Your journey dives deep into the heart of Black narratives and transformative tales, leaving mainstream stories gasping for air. Don’t forget to surface for the occasional White author. OK?”

And another personalized AI summary that Fable produced told another reader that their book choices were, quote, “making me wonder if you’re ever in the mood for a straight, CIS White man’s perspective?

casey newton

And if you are interested in a straight, CIS White man’s perspective, follow Kevin Roose on X.com. Now, Kevin, why do we think this happened?

kevin roose

I don’t know, Casey. This is a head-scratcher for me. I mean, we know that these apps can spit out biased things. That is just sort of part of how they are trained and part of what we know about them. I don’t know what model Fable was using under the hood here. But, yeah, this seems not great.

casey newton

Well, it seems like we’ve learned a lesson that we’ve learned more than once before, which is that large language models are trained on the internet, which contains near-infinite racism. And for that reason, it will actually produce racism when you ask it questions. So there are mitigations that you can take against that. But it appears that, in this case, they were not successful.

Fable’s head of community, Kim Marsh Allee, has said that all features using AI are being removed from the app, and a new app version is being submitted to the App Store. So you always hate it when the first time you hear about an app is that they added AI and it made it super racist, and they had to redo the app.

kevin roose

Now, Casey, one more question before we move on — do you think this poses any sort of competitive threat to Grok, which until this story, was the leading racist AI app on the market?

casey newton

I do think so. And I had to — I have to admit that all the folks over at Grok are breathing a sigh of relief now that they have, once again, claimed the mantle.

kevin roose

All right. Casey, how hot is this mess?

casey newton

Well, Kevin, in my opinion, if your AI is so bad that you have to remove it from the app completely, that’s a hot mess.

kevin roose

Yeah. Yeah, I rate this one a hot mess as well. All right, next stop —

[TRAIN WHISTLE BLOWS]

Amazon pauses drone deliveries after aircraft crashed in rain. Casey, this story comes to us from Bloomberg, which had a different line of reporting than we did just a few weeks ago on this show about Amazon’s drone program, Prime Air. Casey, what happened to Amazon Prime Air.

casey newton

Well, if you heard the episode of “Hard Fork,” where we talked about it, Amazon Prime Air delivered us some Brazilian Bum Bum Cream, and it did so without incident. However, Bloomberg reports that Amazon has had to now pause all of their commercial drone deliveries after two of its latest models crashed in rainy weather at a testing facility. And so the company says it is immediately suspending drone deliveries in Texas and Arizona, and will now fix the aircraft software. Kevin, how did you react to this.

kevin roose

Well, I think it’s good that they’re suspending drone deliveries before they fix the software because these things are quite heavy, Casey. I would not want one of them to fall on my head.

casey newton

I wouldn’t either. And I have to tell you, this story gave me the worst kind of flashbacks. Because in 2016, I wrote about Facebook’s drone, Aquila, and its first, what the company told me, had been its first successful test flight in its mission to deliver internet around the world via drone. What the company did not tell me when I was interviewing its executives, including Mark Zuckerberg, was that the plane had crashed after that first flight. And so I was —

kevin roose

A small detail — I’m sure it was an innocent omission from their briefing.

casey newton

Yes, I’m sure. Well, it was Bloomberg again who reported, a couple of months after I wrote this story, that the Facebook drone had crashed. I was, of course, hugely embarrassed and wrote a bunch of stories about this. But anyways, it really should have occurred to me when we were out there watching the Amazon drone that this thing was also probably secretly crashing, and we just hadn’t found out about it yet. And indeed, we now learn it is.

So here is my vow to you, Kevin, as my friend and my co-host, if we ever see a company fly anything again, we have to ask them, now, did this thing actually crash? I’m tired of being burned.

kevin roose

Now, Casey, we should say, according to Bloomberg, these drones reportedly crashed in December. We visited Arizona to see them in very early December. So most likely, this all happened after we saw them. But I think it’s a good idea to keep in mind that, as we’re talking about these new and experimental technologies, that many of them are still having the kinks worked out.

casey newton

All right, Kevin. So let’s get out the thermometer. How hot of a mess is this?

kevin roose

I would say this is a moderate mess. Look, these are still testing programs. No one was hurt during these tests. I am glad that Bloomberg reported on this. I’m glad that they’ve suspended the deliveries. These things could be quite dangerous flying through the air.

I do think it’s one of a string of reported incidents with these drones. So I think they’ve got some quality control work ahead of them. And I hope they do well on it, because I want these things to exist in the world and be safe for people around them.

casey newton

All right. I will agree with you and say that this is a warm mess. And hopefully it can get straightened out over there. Let’s see what else is coming down the tracks.

[TRAIN WHISTLE BLOWS]

Wow, this is some tough news. Fitbit has agreed to pay $12 million for not quickly reporting burn risk with watches. Kevin, did you hear about this?

kevin roose

I did. This was — the Fitbit devices were literally burning people.

casey newton

Yes, from 2018 to March of 2022, Fitbit received at least 174 reports globally of the lithium ion battery in the Fitbit ionic watch overheating, leading to 118 reported injuries, including two cases of third-degree burns and four of second-degree burns. That comes from the “New York Times,” Adeel Hassan. Kevin, I thought these things were just supposed to burn calories.

kevin roose

[LAUGHS]: Well, it’s like I always say, exercising is very dangerous, and you should never do it. And this justifies my decision not to wear a Fitbit.

casey newton

To me, the biggest surprise of this story was that people were wearing Fitbits from March 2018 to 2022. I thought, every Fitbit had been purchased by 2011 and then put in a drawer, never to be heard again. So what is going on with these sort of late-stage Fitbit buyers? I’d love to find out.

But of course, we feel terrible for everyone who was burned by a Fitbit. And it’s not going to be the last time technology burns you. I mean, realistically.

kevin roose

That’s true. That’s true.

casey newton

Now, what kind of mess is this?

kevin roose

I would say this is a hot mess. This is an officially hot — literally hot. They’re hot.

casey newton

Here’s my sort of rubric. If technology physically burns you, it is a hot mess. If you have physical burns on your body, what other kind of mess could it be.

kevin roose

It’s true. That’s a hot mess.

[TRAIN WHISTLE BLOWS] OK, next stop on the Hot Mess Express. Google says it will change Gulf of Mexico to Gulf of America in Maps app after government updates. Casey, have you been following this story?

casey newton

I have, Kevin. Every morning when I wake up, I scan America’s maps and I say, what has been changed? And if so, has it been changed for political reasons? And this was probably one of the biggest examples of that we’ve seen.

kevin roose

Yeah. So this was an interesting story that came out in the past couple of days. Basically, after Donald Trump came out during his first days in office and said that he was changing the name of the Gulf of Mexico to the Gulf of America and the name of Denali, the mountain in Alaska, to Mount McKinley. Google had to decide, well, when you go on Google Maps and look for those places, what should it call them?

It seems to be saying that it is going to take inspiration from the Trump administration and update the names of these places in the Maps app.

casey newton

Yeah. And look, I don’t think Google really had a choice here. We know that the company has been on Donald Trump’s bad side for a while. And if it had simply refused to make these changes, it would have caused a whole new controversy for them. And it is true that the company changes place names when governments change place names. Like, Google Maps existed when Mount McKinley was called Mount McKinley. And President Obama changed it to Denali, and Google updated the map. Now it’s changed back, they’re doing the same thing.

But now that we know how compliant Google is, Kevin, I think there’s room for Donald Trump to have a lot of fun with the company.

kevin roose

Yeah, what could he do?

casey newton

Well, he could call it the “Gulf of Gemini Isn’t Very Good,” and just see what would happen. Because they would kind of have to just change it. Can you imagine every time you opened up Google Maps and you looked at the Gulf of Mexico/America, and it just said the “Gulf of Gemini is Not Very Good?” I hate to give Donald Trump any ideas, but, you know, it’s worth looking at.

So what kind of mess do you this is, Kevin?

kevin roose

I think this is a mild mess. I think this is a tempest in a teapot. I think that this is the kind of update that companies make all the time, because places change names all the time. Let’s just say it.

casey newton

Well, Kevin, I guess I would say that one is a hot mess, because if we’re just going to start renaming everything on the map, that’s just going to get extremely confusing for me to follow. I got places to go.

kevin roose

You go to, like, three places.

casey newton

Yeah, and I use Google Maps to get there. And I need them to be named the same thing that they were yesterday.

kevin roose

I don’t think they’re going to change the name of Barry’s Boot Camp.

[TRAIN WHISTLE BLOWS]

All right. Final stop on the Hot Mess Express. Casey, bring us home.

casey newton

All right, Kevin. Oh, and this is some sad news. Another Waymo was vandalized. This is from a one-time “Hard Fork” guest, Andrew J. Hawkins, at “The Verge.” He reports that this Waymo was vandalized during an illegal street takeover near the Beverly Center in LA. Video from Fox 11 shows a crowd of people basically dismantling the driverless car piece by piece, and then using the broken pieces to smash the windows. Kevin, what did you make of this?

kevin roose

Well, Casey, as you recall, you predicted that in 2025, Waymo would go mainstream. And I think there’s no better proof that is true than that people are turning on the Waymos and starting to beat them up.

casey newton

Yeah. Look, I don’t know that we have heard any interviews from why these people were doing this. I don’t know if we should see this as like a reaction against AI in general or of Waymo specifically. But I always find it weird and sad when people attack Waymos because they truly are safer cars than every other car.

kevin roose

Well, not if you’re going to be riding in them and people are just going to start beating the car. Then they’re not safer.

casey newton

No. But that’s only happened a couple of times that we’re aware of.

kevin roose

Right. Yeah.

casey newton

So, yeah, this story is sad to me. Obviously, people are reacting to Waymos. Maybe they have fears about this technology or think it’s going to take jobs, or maybe they’re just pissed off and they want to break something. But don’t hurt the Waymos, people, in part because they will remember. They will remember.

kevin roose

I’m not sure that that’s true.

casey newton

They will remember, and they will come for you.

kevin roose

I’m not sure that that’s true, but I think we should also note that Waymo only became officially available in LA in November of last year. And so part of this just might be a reaction to the newness of it all and people getting a little carried away, just sort of curious, what will happen if we try to destroy this thing? Will it deploy defensive measures and so on? So —

casey newton

They’re going to have to put flamethrowers on them. I’m just calling it right now.

kevin roose

I really hope that doesn’t happen. But yeah, well what kind of mess do you think this one was?

casey newton

I think this one is a lukewarm mess that has the potential to escalate. I don’t want this to happen. I sincerely hope this does not happen. But I can see, as Waymos start being rolled out across the country, that some people are just going to lose their minds. Some people are going to see this as the physical embodiment of technology invading every corner of our lives. And they are just going to react in strong and occasionally destructive ways.

I’m sure that Waymo has gamed this all out. I’m sure that this does not surprise them. I know that they have been asked about what happens if Waymos start getting vandalized. And they presumably have plans to deal with that, including prosecuting the people who are doing this. But yeah, I always go out of my way to try to be nice to Waymos.

And in fact, some other Waymo news this week, Jane Manchun Wong, the security researcher, reported on X recently that Waymo is introducing, or at least testing, a tipping feature. And so I’m going to start tipping my Waymo just to make up for all the jerks in LA who are vandalizing them.

kevin roose

It looks like the tipping feature, by the way, will to be to tip a charity and that Waymo will not keep that money. At least that’s what’s been reported.

casey newton

No, I think it’s going to the flamethrower fund.

kevin roose

OK. All right. Casey, that is the Hot Mess Express. Thank you for taking this journey with me.

[TRAIN WHISTLE BLOWS]

[TECHNO MUSIC]

casey newton

“Hard Fork” is produced by Rachel Cohn and Whitney Jones. We’re edited this week by Rachel Dry and fact checked by Ena Alvarado. Today’s show was engineered by Dan Powell. Original music by Diane Wong and Dan Powell. Our executive producer is Jen Poyant. Our audience editor is Nell Gallogly. Video production by Ryan Manning and Chris Scott.

You can watch this whole episode on YouTube at YouTube.com/HardFork. Special thanks to Paula Szuchman, Pui-Wing Tam, Dahlia Haddad, and Jeffrey Miranda. You can email us at hardfork@nytimes.com with what you’re calling the Gulf of Mexico.

[MUSIC PLAYING]

Highlighted Articles

Add a Comment

You may also like

Stay Connected

Please enable JavaScript in your browser to complete this form.