Skip to content

JOBUZO

  • News
  • Indonesia
  • Toggle search form
Perplexity is allegedly scraping websites it's not supposed to, again

Perplexity is allegedly scraping websites it’s not supposed to, again

Posted on 5 August 2025 By jobuzo

Web crawlers deployed by Perplexity to scrape websites are allegedly skirting restrictions, according to a new report from Cloudflare. Specifically, the report claims that the company’s bots appear to be “stealth crawling” sites by disguising their identity to get around robots.txt files and firewalls.

Robots.txt is a simple file websites host that lets web crawlers know if they can scrape a websites’ content or not. Perplexity’s official web crawling bots are “PerplexityBot” and “Perplexity-User.” In Cloudflare’s tests, Perplexity was still able to display the content of a new, unindexed website, even when those specific bots were blocked by robots.txt. The behavior extended to websites with specific Web Application Firewall (WAF) rules that restricted web crawlers, as well.

Cloudflare

Cloudflare believes that Perplexity is getting around those obstacles by using “a generic browser intended to impersonate Google Chrome on macOS” when robots.txt prohibits its normal bots. In Cloudlfare’s tests, the company’s undeclared crawler could also rotate through IP addresses not listed in Perplexity’s official IP range to get through firewalls. Cloudflare says that Perplexity appears to be doing the same thing with autonomous system numbers (ASNs) — an identifier for IP addresses operated by the same business — writing that it spotted the crawler switching ASNs “across tens of thousands of domains and millions of requests per day.”

ADVERTISEMENT

Advertisement

News :<div>12 weeks' jail for school IT support technician who took upskirt videos of teachers</div>

Engadget has reached out to Perplexity for comment on Cloudflare’s report. We’ll update this article if we hear back.

Up-to-date information from websites is vital to companies training AI models, especially as service’s like Perplexity are used as replacements for search engines. Perplexity has also been caught in the past circumventing the rules to stay up-to-date. Multiple websites reported in 2024 that Perplexity was still accessing their content despite them forbidding it in robots.txt — something the company blamed on the third-party web crawlers it was using at the time. Perplexity later partnered with multiple publishers to share revenue earned from ads displayed alongside their content, seemingly as a make-good for its past behavior.

Stopping companies from scraping content from the web will likely remain a game of whack-a-mole. In the meantime, Cloudflare has removed Perplexity’s bots from its list of verified bots and implemented a way to identify and block Perplexity’s stealth crawler from accessing its customers’ content.

Perplexity is allegedly scraping websites it’s not supposed to, again


News

Post navigation

Previous Post: Jeh Aerospace nets $11M to scale the commercial aircraft supply chain in India
Next Post: Intel, Not AMD, Could Be the Secret to Kickass Next-Gen Handheld PCs

Related Posts

Chinese firms to incur higher costs for deploying AI chips from US: analysts Chinese firms to incur higher costs for deploying AI chips from US: analysts News
Act in 24 hours: Osman Hadi ally warns Muhammad Yunus-led interim govt at Saturday funeral Act in 24 hours: Osman Hadi ally warns Muhammad Yunus-led interim govt at Saturday funeral News
Tricia McLaughlin: Why top spokesperson is resigning amid DHS shutdown over oversight dispute Tricia McLaughlin: Why top spokesperson is resigning amid DHS shutdown over oversight dispute News

Latest

  • China launches space computing hub as SpaceX gears up for historic IPO
  • Kremlin says Zelensky can come to Moscow for talks any time
  • Before the first punch, Trump’s White House UFC event faces blowback
  • Who is Aaron Spencer? 5 things to know about Arkansas father whose murder charge was dropped
  • Anna Nicole Smith’s Daughter Is The Spitting Image Of Her Mother
  • Russian President Vladimir Putin rejects Ukrainian President Volodymyr Zelenskyy’s call to have a face-to-face meeting
  • The US job market is strong but many Americans are still frustrated by prospects and rising prices
  • The Samsung Galaxy S27 Ultra is Finally Real: Here is What We Know
  • Taxi driver in Bangkok returns S$12K cash left in his vehicle by Sri Lankan tourist
  • In public letter, Ukraine’s Zelenskyy calls on Putin for direct negotiations in a neutral country

Copyright © 2025 JOBUZO. Disclaimers | Privacy Policies

Powered by PressBook Masonry Blogs