Cloudflare Takes a Stand Against AI Website Scrapers

Cloudflare has released a new free tool that prevents AI companies’ bots from scraping content from its customers’ websites to train large language models. The cloud service provider is making the tool available to its entire customer base, including those on a free plan. “This feature will be automatically updated over time as we see new footprints of offending bots that we identify as broadly crawling the web to train models,” the company said.

In announcing the update, the Cloudflare team also shared some data on how its customers are responding to the rise of bots scraping content to train generative AI models. According to the company’s internal data, 85.2% of customers have chosen to block access to their sites even to AI bots that correctly identify themselves.

Cloudflare also identified the most active bots of the past year. ByteDance-owned Bytespider attempted to access 40% of the websites under Cloudflare’s purview, and OpenAI’s GPTBot attempted to access 35%. Together they make up half of the top four AI crawlers by number of requests on Cloudflare’s network, alongside Amazonbot and ClaudeBot.

It’s proving very difficult to completely and consistently block AI bots from accessing content. The arms race to build models faster has led some companies to skirt or outright break the existing rules meant to keep scrapers off websites that have not given permission. But having a back-end company of Cloudflare’s magnitude seriously working to stop this behavior could lead to some results.
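
The existing rules for keeping scrapers out are most commonly written in a site’s robots.txt file, which names crawlers by user agent and asks them to stay away. As a rough illustration only, the Python sketch below uses the standard library’s robots.txt parser to show how a crawler that identifies itself would be expected to read such a file. The robots.txt contents, the example.com URL and the `allowed` helper are hypothetical, and this is not how Cloudflare’s own blocking feature works; the bot names are the ones cited above.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for illustration; bot names are those named in the article.
ROBOTS_TXT = """\
User-agent: Bytespider
Disallow: /

User-agent: Amazonbot
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: *
Allow: /
"""


def allowed(user_agent: str, url: str = "https://example.com/post") -> bool:
    """Return True if robots.txt permits this user agent to fetch the URL."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


# A well-behaved crawler would run a check like this before requesting a page.
for bot in ("Bytespider", "Amazonbot", "ClaudeBot", "SomeBrowser"):
    print(bot, "allowed" if allowed(bot) else "blocked")
```

The check is voluntary: nothing in robots.txt stops a crawler that ignores the file or misreports its user agent, which is why enforcement at the network level matters.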

“We are concerned that some AI companies looking to circumvent the rules to access content are constantly adapting to evade bot detection,” the company said. “We will continue to monitor and add more bot blocks to our AI Scrapers and Crawlers rule and evolve our machine learning models to help make the internet a place where content creators can thrive and maintain full control over the models their content is used to train or run inference on.”
