The internet is no longer for humans. Bots now account for more than half of all web traffic, a milestone reached a full year ahead of schedule. Cloudflare, the gatekeeper for a significant portion of that traffic, has decided it is time to charge the toll.
Starting September 15, 2026, Cloudflare will block "mixed-use" crawlers from any ad-supported site by default. If a bot scrapes data for both traditional search and AI model training, it will be locked out. Unless a site owner manually changes their settings, the AI giants will find the door slammed shut.
This is a power shift. For years, AI companies have treated the open web as a free buffet. They scraped everything, trained their models, and built multi-billion dollar businesses on the back of publisher content. Cloudflare is now forcing them to negotiate.
The End of the Free Buffet
Cloudflare CEO Matthew Prince is framing this as a move toward a "sustainable ecosystem." The logic is simple: if AI companies want to use publisher content to generate value, they should pay for it.
Under the new policy, the distinction between a search crawler and an AI trainer becomes a financial one. If a company wants to crawl a site, they must prove their intent. If they blend search with training, they get blocked. It is a blunt instrument. It is also effective.
Why This Matters for AI Models
AI models are only as good as their training data. If the largest publishers on the web—many of whom rely on Cloudflare for security—opt into this block, the cost of acquiring high-quality data will skyrocket.
Google, the world’s largest search engine, is the primary target here. While Google provides a tool called Google Extended to opt out of training, Cloudflare argues that the search giant still has a massive advantage. Googlebot crawls for search, but those same crawls feed into AI Overviews and Gemini. Cloudflare’s new policy forces a separation that Google has historically resisted.
From 'Pay Per Crawl' to 'Pay Per Use'
Blocking is only half the strategy. Cloudflare is also expanding its monetization tools. The company is evolving its "Pay Per Crawl" marketplace into a "Pay Per Use" model.
This shift is significant. Instead of charging just for the act of scraping, publishers can now charge when their content actually creates value. If an AI agent uses a specific article to answer a user's query, the publisher gets paid. Cloudflare is already testing this with partners like Ceramic.ai and You.com. It is a direct challenge to the current "scrape-first, ask-later" paradigm.
What This Means for Publishers
For website owners, the decision is no longer binary. You don't have to choose between being invisible or being exploited. You can now set your price.
However, there is a risk. If too many sites block AI crawlers, the quality of AI responses could degrade. If AI companies refuse to pay, they may simply move to lower-quality, public-domain data. The next six months will be a standoff. Publishers have the leverage. Now, they have the tools to use it.
Key Takeaways
- Default Blocking: Starting September 15, 2026, Cloudflare will block mixed-use AI crawlers on ad-supported sites by default.
- Monetization Shift: Cloudflare is moving from "Pay Per Crawl" to "Pay Per Use," allowing publishers to charge AI companies when their content generates value.
- Market Pressure: The policy forces AI companies to separate search-only crawlers from training bots, effectively ending the era of free, indiscriminate scraping.
The Next Move
The deadline is set. By mid-September, the AI industry will have to decide whether to comply with these new transparency requirements or lose access to a massive swath of the internet. Expect the major AI labs to lobby hard against this. They need the data. Cloudflare is betting that publishers need the revenue more.