Web giant Cloudflare bans AI bots from scraping content by default

Source: Cryptopolitan

Cloudflare, the internet infrastructure company responsible for routing about 20% of global web traffic, has announced it will begin blocking artificial intelligence (AI) crawlers by default.

The policy, effective Tuesday, changes how AI companies are allowed to access content hosted on the web, after publishers pushed for more control over their data and compensation for its use.

The content delivery network (CDN) helps websites cache and serve data closer to users. Under the new policy, owners of any new domain signing up for Cloudflare services will be prompted to decide whether and when AI bots can access their content, or they can choose to block scrapers altogether.

Cloudflare launches tools to control AI access

The change adds to Cloudflare’s earlier initiatives to give publishers more control over their data. Last year, the company introduced a one-click solution to block all known AI bots and a dashboard to monitor crawler activity. Site owners can use the dashboard to distinguish between crawlers scraping data for AI training, search purposes, or other uses.

Tuesday’s announcement formalizes those protections and enforces them by default. “AI crawlers have been scraping content without limits. Our goal is to put the power back in the hands of creators, while still helping AI companies innovate,” said Cloudflare CEO Matthew Prince in a statement released today.

According to Cloudflare, its Pay per Crawl system, the foundation of this initiative, is a marketplace where AI companies and content owners can agree on compensation for each access.

Both parties must have Cloudflare accounts, and once set up, they can negotiate prices and terms for web crawling activities. Cloudflare acts as a broker in the transaction, charging the AI company and passing the earnings to the publisher.

AI developers rue limited website access

Several AI developers, including OpenAI, the Microsoft-backed artificial intelligence firm behind ChatGPT, have declined to participate in the program. In a recent public statement, the company lambasted Cloudflare for inserting a new intermediary between publishers and AI developers. 

OpenAI said it has a history of honoring the robots.txt protocol, a file that allows website operators to control crawler access, and insisted that it respects site preferences.
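
For readers unfamiliar with the protocol, the following is a minimal sketch of how a well-behaved crawler might consult a robots.txt file before fetching a page. It is not OpenAI's or Cloudflare's implementation. It uses Python's standard urllib.robotparser module; the sample rules, the GPTBot user-agent token, and the example.com URL are illustrative assumptions rather than details from either company's statement.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one AI crawler, allow everyone else.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the robots.txt rules permit user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() accepts the file's contents as an iterable of lines.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://example.com/article"  # placeholder URL
    for agent in ("GPTBot", "SomeBrowser"):
        print(agent, "allowed:", is_allowed(ROBOTS_TXT, agent, page))
```

Compliance with robots.txt is voluntary: the file states the site's preferences, and it is up to each crawler to run a check like this and honor the result. Blocking at the network level, as Cloudflare does, does not depend on the crawler's cooperation.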

In a June analysis, Cloudflare said it found a gap between how often crawlers scrape sites and how much traffic they refer back. Google’s crawler, for example, accessed websites 14 times for every visit it sent back. In comparison, OpenAI’s bot scraped sites 17,000 times for every referral.

UK-based technology lawyer Matthew Holman told CNBC that AI crawlers can be intrusive and potentially harmful to user experience. 

“They have been accused of overwhelming websites and significantly impacting user experience,” he said. Holman added that if Cloudflare’s system works as intended, it could hamper the ability of AI chatbots to collect and train on large-scale web data.

Publishers rally behind Cloudflare

Major media companies support Cloudflare’s efforts to reclaim control over digital content. Publishers including TIME, The Associated Press, Conde Nast, The Atlantic, ADWEEK, and Fortune have all agreed to block AI bots by default.

Media outlets have long accepted data scraping by platforms like Google in exchange for traffic and ad revenue. But the current AI-driven ecosystem offers no such reciprocity. For many publishers, AI platforms like ChatGPT and Claude consume content without returning meaningful engagement or revenue to the original sources.

Cloudflare says it will continue working with developers to push AI crawlers seeking access to disclose their identity, purpose, and crawling behavior.

“Original content is what makes the Internet one of the greatest inventions in the last century,” CEO Matthew Prince stated. “We have to come together to protect it.”
