WTF is OpenAI’s GPTBot?

This article is a WTF explainer, in which we break down media and marketing’s most confusing terms. More from the series →

Publishers have a new tool in their efforts to limit AI’s threat to their businesses. And it’s from the company behind one of the predominant threats.

In August, OpenAI announced that website owners can now block its GPTBot web crawler from accessing their webpages’ content. Since then, 12% of the 1000 most-visited sites online have done so, according to Originality AI. The list of sites shutting themselves off to OpenAI’s web crawlers includes publishers such as Bloomberg, CNN and The New York Times.

As Digiday has covered, publishers have had a hard time protecting against generative AI tools like ChatGPT sidestepping their paywalls and siphoning their content to inform the large language models. OpenAI’s announcement, however, makes that undertaking much easier.

For those unfamiliar with what a web crawler like OpenAI’s GPTBot is, not to mention how websites are able block their access, check out the explainer video skit below.

More in Media

How Lipton is using local creators instead of building in-house social teams 

Lipton worked with Billion Dollar Boy to activate local creators across six different markets; a new approach to global marketing

How a German publisher JV is turning LLM visibility into a premium brand buy

Germany’s BCN, the joint-venture commercial arm of three major publishing houses – Hubert Burda Media, Funke and Klambt – is rolling out a commercial product that helps brands get properly surfaced and described inside ChatGPT, Gemini and other AI assistants, not just on traditional search results pages.

AI podcast experiments march on with Forbes’ new daily audio briefing

Forbes bets on AI-generated audio with a five-minute daily news brief. Stories are selected by product, editorial and an internal AI tool.