Block AI Crawlers: a WordPress plugin that tells AI crawlers not to use your content for AI training

The Block AI Crawlers plugin tells artificial intelligence crawlers (such as those OpenAI uses for ChatGPT) not to crawl your website content for AI training. It does this by updating the site's robots.txt to disallow common AI crawlers. A well-behaved AI crawler reads the site's robots.txt before fetching pages to check which paths it has been asked not to crawl. Compliance is voluntary, so the rules work as a request to the crawler rather than an enforced block.
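
As a rough illustration of how a compliant crawler interprets such rules, the sketch below parses a small robots.txt with Python's standard `urllib.robotparser`. The rule text, the `is_allowed` helper, and the example.com URL are assumptions made for this example; they are not taken from the plugin.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content: block one AI crawler, allow everyone else.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("GPTBot", "Mozilla/5.0"):
        print(agent, is_allowed(ROBOTS_TXT, agent, "https://example.com/a-post/"))
```

A crawler that ignores robots.txt is not affected by these rules, which is why the check above only models well-behaved bots.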

It blocks these AI crawlers and bots:

ChatGPT and GPTBot – Crawler and browsing agent used by OpenAI
Google Extended – Crawler token used for AI training on Google Gemini (formerly Google Bard)
FacebookBot – Facebook's crawler for AI training
CommonCrawl – Crawler that compiles datasets used to train AI models
Anthropic AI/Claude – Crawlers used by Anthropic
Omgili – Omgili's crawler for AI training
Bytespider – TikTok's crawler for AI training
PerplexityBot – Used by Perplexity in its AI products
Applebot – Used by Apple to train its AI products
Cohere – Cohere's crawler for AI training
DiffBot – Diffbot's crawler for AI training
Imagesift – Imagesift's crawler for images
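
The robots.txt rules for a list like this can be sketched as one `User-agent` / `Disallow` pair per crawler. The user-agent tokens below are commonly cited ones for these bots and are assumptions here; the exact tokens and layout the plugin writes may differ. `build_robots_rules` is a hypothetical helper, not part of the plugin.

```python
# Assumed user-agent tokens for the crawlers listed above (not taken from the plugin).
AI_USER_AGENTS = [
    "GPTBot",
    "ChatGPT-User",
    "Google-Extended",
    "FacebookBot",
    "CCBot",
    "anthropic-ai",
    "ClaudeBot",
    "Omgilibot",
    "Bytespider",
    "PerplexityBot",
    "Applebot-Extended",
    "cohere-ai",
    "Diffbot",
    "ImagesiftBot",
]


def build_robots_rules(user_agents: list[str]) -> str:
    """Build robots.txt text that disallows the whole site for each agent."""
    blocks = [f"User-agent: {agent}\nDisallow: /" for agent in user_agents]
    return "\n\n".join(blocks) + "\n"


if __name__ == "__main__":
    print(build_robots_rules(AI_USER_AGENTS))
```

Keeping the tokens in a single list makes it easy to add a new crawler when a vendor publishes one.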

Experimental meta tags

The plugin also adds the “noai, noimageai” directives to the site’s meta tags. These directives ask AI bots not to include your text or images in their datasets. They are experimental and not yet standardized, so not every bot will recognize them.
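
To check whether a page carries these directives, a minimal sketch using Python's standard `html.parser` is shown below. It assumes the directives appear in a `<meta name="robots" content="...">` tag, which is the common convention but is not confirmed by this description. `RobotsMetaParser` and `robots_directives` are hypothetical names.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect directives from <meta name="robots" content="..."> tags."""

    def __init__(self) -> None:
        super().__init__()
        self.directives: set[str] = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        if (attr.get("name") or "").lower() != "robots":
            return
        content = attr.get("content") or ""
        # Directives are comma-separated; normalize case and whitespace.
        for directive in content.split(","):
            directive = directive.strip().lower()
            if directive:
                self.directives.add(directive)


def robots_directives(html: str) -> set[str]:
    """Return the set of robots meta directives found in html."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives


if __name__ == "__main__":
    page = '<head><meta name="robots" content="noai, noimageai"></head>'
    found = robots_directives(page)
    print({"noai", "noimageai"} <= found)
```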