A New Class Action Lawsuit Adds to OpenAI's Growing Legal Troubles

A new class action lawsuit accuses ChatGPT creator OpenAI of illegally scraping data from all over the internet, then using the stolen data to create its popular automated products. The lawsuit, filed this week by the Clarkson Law Firm in a Northern California court, is only the latest in a slew of legal challenges that strike at the very heart of the influential startup’s business model.

Since it pivoted from a humble research organization to a for-profit business in 2019, OpenAI has been on a meteoric ascent to the very top of the tech industry. When it launched ChatGPT last November, the company became a household name.

But as OpenAI attempts to stand up its business and lay the groundwork for future expansion, the controversial nature of the technology that it’s selling may sabotage its own ambitions. Given how new and radical the AI industry is, it only makes sense that legal and regulatory issues would develop. And if legal challenges like the one filed this week succeed, they could undermine the very existence of OpenAI’s most popular products and, in turn, threaten the nascent AI industry that revolves around them.

The Clarkson lawsuit’s allegations, explained

The central claim in the Clarkson lawsuit is that OpenAI’s entire business model is based on theft. The lawsuit specifically accuses the company of creating its products using “stolen private information, including personally identifiable information, from hundreds of millions of internet users, including children of all ages, without their informed consent or knowledge.”

It’s well known that OpenAI’s AI models, which power products like ChatGPT and DALL-E, are trained on massive amounts of data. Much of this data, the startup has openly admitted, was scraped from the open internet. Most web scraping is legal, though there are some wrinkles to that basic formula. While OpenAI has claimed that everything it does is above board, it has also been repeatedly criticized for a lack of transparency regarding the sources of some of its data. According to this week’s lawsuit, the startup’s hoovering practices are blatantly illegal; specifically, the suit accuses the company of violating multiple platforms’ terms of service agreements while also running afoul of various state and federal regulations, including privacy laws.

Despite established protocols for the purchase and use of personal information, Defendants took a different approach: theft. They systematically scraped 300 billion words from the internet, “books, articles, websites and posts – including personal information obtained without consent.” OpenAI did so in secret, and without registering as a data broker as it was required to do under applicable law.

The lawsuit also highlights the fact that, after OpenAI freely exploited everybody’s web content, it then proceeded to use that data to build commercial products that it is now attempting to sell back to the public for exorbitant sums of money:

Without this unprecedented theft of private and copyrighted information belonging to real people, communicated to unique communities, for specific purposes, targeting specific audiences, the [OpenAI] Products would not be the multi-billion-dollar business they are today.

Whether the U.S. justice system ends up agreeing with the lawsuit’s definition of theft is yet to be determined. Gizmodo reached out to OpenAI for comment on the new lawsuit but did not hear back.

OpenAI’s legal troubles are piling up

The Clarkson lawsuit isn’t the only one that OpenAI is currently dealing with. In fact, OpenAI has been subjected to an ever-growing list of legal attacks, many of which make similar arguments.

Just this week, another lawsuit was filed in California on behalf of numerous authors who say their copyrighted works were scraped by OpenAI in its effort to gobble up data to train its algorithms. The suit, again, basically accuses the company of stealing data to fuel its business—and says it created its products by “harvesting mass quantities” of copyrighted works without “consent, without credit, and without compensation.” It goes on to characterize platforms like ChatGPT as being “infringing derivative works”—essentially implying that they wouldn’t exist without the copyrighted material—“made without Plaintiffs’ permission and in violation of their exclusive rights under the Copyright Act.”

At the same time, both the Clarkson suit and the authors’ suit bear some resemblance to another lawsuit that was filed shortly after ChatGPT’s release last November. This one, filed as a class action lawsuit by the offices of Joseph Saveri in San Francisco, accuses OpenAI and its funder and partner Microsoft of having ripped off coders in an effort to train GitHub Copilot, an AI-driven virtual assistant. The lawsuit specifically accuses the companies of failing to adhere to the open source licensing agreements that undergird much of the development world, claiming that they instead lifted and ingested the code without attribution, while also failing to adhere to other legal requirements. In May, a federal judge in California denied OpenAI’s motion to have the case dismissed, allowing the legal challenge to move forward.

In Europe, meanwhile, OpenAI has faced similar legal inquiries from government regulators over its lack of privacy protections for users’ data.
