Muhami Logo

Reddit’s New Front in the AI Data Wars

By Shantanu Mukherjee Published: Dec. 18, 2025 Last Updated: Jan. 21, 2026
Reddit’s New Front in the AI Data Wars

On Oct. 22, 2025, Reddit filed a lawsuit in the Southern District of New York, accusing the defendants Perplexity AI and three scraping intermediaries – Oxylabs UAB, AWMProxy and SerpApi (“intermediaries”) of participating in an “industrial-scale” operation to harvest Reddit comments and use them for commercial purposes.

The complaint seeks damages and an injunction to block further use of Reddit data, due to the defendants’ conduct resulting in unfair competition, unjust enrichment, and copyright infringement.

Reddit’s Claims

According to Reddit, the intermediaries deliberately circumvented Reddit’s anti-scraping and technical controls (e.g. registered user-identification limits, IP-rate limits, captcha bot protection, and anomaly-detection tools), then sold the harvested corpus to Perplexity (which used the material to power an “answer engine,” i.e. their chatbot).

The intermediaries allegedly used tools designed to bypass Reddit’s own anti-scraping measures and to circumvent Google’s controls by scraping Reddit content directly from Google’s search engine results.

Interestingly, Reddit is effectively calling the defendants in this case would-be bank robbers, who, knowing they cannot get into the bank vault, break into the armoured truck carrying the cash instead.

Legal Allegations

In legal terms, Reddit has claimed violations on the following counts:

  1. DMCA – Circumvention of Technological Control Measures (17 U.S.C. § 1201(a)(1)(A))
  2. DMCA – Trafficking of Technology, Product, Service, or Device for Use in Circumventing Technological Measure Controlling Access (17 U.S.C. § 1201(a)(2))
  3. DMCA - Trafficking of Technology, Product, Service, or Device for Use in Circumventing Technological Measure Protecting Right of Copyright Owner (17 U.S.C. § 1201(b))
  4. Unfair Competition
  5. Unjust Enrichment

To support its claims, Reddit relied on Google’s explanation that it uses a system called SearchGuard that blocks automated systems from collecting large amounts of search results or indexed data, while still allowing normal human users to view Google’s search results.

Reddit alleges that the intermediaries masked their identities, accessed content even when direct crawling was blocked, and laundered the data through search results so Perplexity could ingest it without a direct license.

How Reddit Trapped Perplexity

Reddit specifically claims that they caught Perplexity red-handed by using the digital equivalent of marked bills (to use the bank robbery analogy) to track Reddit data and confirm that Perplexity was using Reddit data acquired through the scraping of Google SERPs.

In other words, Reddit posted content that was deliberately only reachable via Google search, then observed that Perplexity’s outputs reproduced that same content within hours. This “trap” is central to Reddit’s account of how scraping shifted from opportunistic crawling to a coordinated, third-party-driven data market.

How This Differs From the Anthropic Case

Reddit’s earlier lawsuit against Anthropic in June 2025 and the Perplexity matter share core themes, but the Perplexity lawsuit is different in the way that it confronts not just an AI company but the lesser-known services the AI industry relies on to acquire online material needed to train AI chatbots.

The Anthropic complaint centres on breach of Reddit’s user agreement, trespass to chattels, unjust enrichment, tortious interference and unfair competition. It mainly argues that Anthropic’s bots repeatedly accessed Reddit after being told not to, and that Anthropic’s conduct therefore violated contractual protections owed to Reddit and its users.

While the Perplexity lawsuit also pleads unfair competition and unjust enrichment, it stresses on the role of intermediaries and makes a stronger allegation of organised “data laundering” (i.e., that third parties systematically bypassed technical blocks and then sold the results). Companies navigating these legal and technical issues increasingly rely on an artificial intelligence (AI) law firm to understand evolving liability risks and compliance boundaries in AI data practices.

Reddit’s Existing Deals

In 2024, Reddit reached licensing arrangements with OpenAI and Google to provide structured access to Reddit content for product and model use. Those deals are evidence Reddit points to when framing Perplexity and Anthropic not as inadvertent users of public web content but as companies that could have sought a commercial license and did not. The existence of large licensing deals also sharpens Reddit’s claim that unauthorised scraping undermines a developing revenue stream tied to content licensing.

Practically speaking, the licensing deals give Reddit both standing to seek monetary relief and a narrative – “we license our data to some AI companies; others take it without paying.”

Perplexity’s Response

Perplexity responded on Reddit the same day that the lawsuit was filed, denying Reddit’s allegations and arguing that Reddit’s suit is less about legal infringement and more about strengthening its negotiating position with partners like Google and OpenAI.

The company emphasised that it does not train AI models on content and therefore doesn’t need or qualify for a data licensing deal. Instead, Perplexity says it merely summarises and cites public Reddit threads, claiming its citation feature drives users back to Reddit and promotes transparency.

Conclusion

Taken together, the Anthropic and Perplexity suits form a pattern: Reddit is asserting ownership and control over the material on its platform, using both litigation and licensing as tools to monetise and police that asset. The Perplexity case, however, adds a new angle: targeting the data-resale chain and trying to limit content use strictly by licensing. Companies facing similar disputes often seek guidance from a top technology law firm to navigate IP protection, data licensing frameworks, and platform rights enforcement.

Any Questions?

Connect with lawyers and seek expert legal advice

All Posts

Share

About the Author

Shantanu Mukherjee

GOT A LEGAL QUESTION?

Connect with lawyers and seek expert legal advice

Find Article by Practice Area

Browse articles by practice area

Related Articles

What the Estée Lauder Settlement Really Teaches Investors About Due Diligence
Knowledge

What the Estée Lauder Settlement Really Teaches I…

In today’s market, investor relations exten…

Shireen Kapoor
27 May 26
If work is a little slow, this is when relationships and commercial focus matter most
Business Insights

If work is a little slow, this is when relationsh…

It is fair to say there have been some disruption…

Christopher Adams
21 Apr 26
Social Media Around the World: What the Numbers Say and How You Can Benefit
Marketing Guides

Social Media Around the World: What the Numbers S…

If you’re a business owner or just someone …

Someli AI
16 Apr 26
VARA Licensing in Dubai: The Legal Gatekeeper of the Crypto Economy
Business Insights

VARA Licensing in Dubai: The Legal Gatekeeper of …

Dubai didn't just open its doors to crypto - …

Shireen Kapoor
14 Apr 26
5 Proven Tactics to Grow Your Social Media Presence in 2026
Marketing Guides

5 Proven Tactics to Grow Your Social Media Presen…

1. Super Short Videos: Capture Attention Quickly …

Someli AI
14 Apr 26
Unlocking New Flexibility for Mainland LLCs - Updates to the UAE Commercial Companies Law
Knowledge

Unlocking New Flexibility for Mainland LLCs - Upd…

The UAE’s corporate framework continues to …

Darren Bradshaw
13 Apr 26