WTF is OpenAI’s GPTBot?

This article is a WTF explainer, in which we break down media and marketing’s most confusing terms.

Publishers have a new tool in their efforts to limit AI’s threat to their businesses. And it’s from the company behind one of the predominant threats.

In August, OpenAI announced that website owners can now block its GPTBot web crawler from accessing their webpages’ content. Since then, 12% of the 1,000 most-visited sites online have done so, according to Originality AI. The list of sites closing themselves off to OpenAI’s web crawler includes publishers such as Bloomberg, CNN and The New York Times.
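For readers who want to see what such a block looks like in practice, here is a minimal sketch. It assumes the block is expressed through a site's robots.txt file, using the `GPTBot` user-agent token with a `Disallow` rule, which is the convention web crawlers generally follow. The robots.txt contents, the `can_crawl` helper, the `SomeOtherBot` name and the example.com URL are all hypothetical and only for illustration. The check uses Python's standard `urllib.robotparser` module.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for an example publisher site (an assumption,
# not taken from any real publisher): GPTBot is shut out of everything,
# while every other crawler is still allowed.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def can_crawl(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt rules let user_agent fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's contents as a list of lines.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "https://example.com/news/story.html"  # placeholder URL
    for agent in ("GPTBot", "SomeOtherBot"):
        verdict = "allowed" if can_crawl(ROBOTS_TXT, agent, url) else "blocked"
        print(f"{agent}: {verdict}")
```

Note that robots.txt is a request rather than an enforcement mechanism: it works only because the crawler's operator chooses to honor it.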

As Digiday has covered, publishers have had a hard time preventing generative AI tools like ChatGPT from sidestepping their paywalls and siphoning their content to inform large language models. OpenAI’s announcement, however, makes that undertaking much easier.

For those unfamiliar with what a web crawler like OpenAI’s GPTBot is, not to mention how websites are able to block its access, check out the explainer video skit below.

