
This article is a WTF explainer, in which we break down media and marketing’s most confusing terms. More from the series →
Publishers have a new tool in their efforts to limit AI’s threat to their businesses. And it’s from the company behind one of the predominant threats.
In August, OpenAI announced that website owners can now block its GPTBot web crawler from accessing their webpages’ content. Since then, 12% of the 1000 most-visited sites online have done so, according to Originality AI. The list of sites shutting themselves off to OpenAI’s web crawlers includes publishers such as Bloomberg, CNN and The New York Times.
As Digiday has covered, publishers have had a hard time protecting against generative AI tools like ChatGPT sidestepping their paywalls and siphoning their content to inform the large language models. OpenAI’s announcement, however, makes that undertaking much easier.
For those unfamiliar with what a web crawler like OpenAI’s GPTBot is, not to mention how websites are able block their access, check out the explainer video skit below.
More in Media

Creators are ditching Substack over ideological shift in 2025
The writers who left Substack in early 2025 represent a second wave after an initial burst of Substack creators left the platform in January 2024.

How Dotdash Meredith enlisted OpenAI to boost its contextual ad product, with Lindsay Van Kirk
Dotdash Meredith’s gm of D/Cipher discussed the publisher’s OpenAI deal in a live recording of the Digiday Podcast at last week’s Digiday Publishing Summit.

Publishers left guessing how Google’s March 2025 core update will reshape search
Google’s core updates, which happen multiple times a year, change its search algorithms and systems and have the potential to make or break publishers’ traffic.