A massive lawsuit has been filed against Meta and its CEO Mark Zuckerberg, accusing the company of knowingly using illegal copies of books and journal articles to train its artificial intelligence program, Llama. The complaint, filed in the United States District Court for the Southern District of New York, alleges that Zuckerberg personally played a direct role in encouraging the practices. According to the filing, «Zuckerberg himself personally authorized and actively encouraged the infringement.» The case marks one of the most significant legal challenges yet over how major technology companies source data for generative AI systems.
The lawsuit has been brought by a coalition of major publishers and authors, including Elsevier, Cengage Learning, Hachette Book Group, Macmillan Publishing Group, McGraw Hill, Scott Turow, and S.C.R.I.B.E., Inc. The plaintiffs are pursuing a class-action copyright infringement claim, arguing that Meta unlawfully reproduced and distributed copyrighted material. In the filing, they state the case is «for copyright infringement based on Defendants' unauthorized reproduction and distribution of Plaintiffs' copyrighted works through Meta's sourcing of content for, and development and training of, Meta's generative artificial intelligence (“AI”) platform called Llama, as well as for removal of copyright management information (“CMI”).»
«We will fight this lawsuit aggressively.»
-Meta spokesperson, Dave Arnold
The complaint portrays Meta's actions as part of a broader strategy to dominate the rapidly evolving AI sector, accusing the company of prioritizing speed over legality. «In their effort to win the AI “arms race” and build a functional generative AI model, Defendants Meta and Zuckerberg followed their well-known motto: “move fast and break Things.”» the lawsuit alleges. The phrase, long associated with Meta's corporate culture, is presented by plaintiffs as evidence of a willingness to disregard legal boundaries in pursuit of technological advancement.

According to the filing, Meta's data acquisition methods involved large-scale unauthorized downloading and copying of copyrighted works. The lawsuit claims «They first illegally torrented millions of copyrighted books and journal articles from notorious pirate sites and downloaded unauthorized web scrapes of virtually the entire internet.» It further alleges «They then copied those stolen fruits many times over to train Meta's multi-billion-dollar generative AI system called Llama.» The plaintiffs describe the scope of the operation as «one of the most massive infringements of copyrighted materials in history.»

The complaint also emphasizes that these actions were carried out with full awareness of their legal implications. It states that the conduct occurred «without permission, without providing any compensation to authors or publishers, and with full knowledge that their conduct violated copyright law.» The case adds to a growing wave of litigation targeting technology companies over the use of copyrighted material in AI training, an issue that remains largely unsettled in U.S. courts and could have significant implications for the future of the industry.
«In their effort to win the AI “arms race” and build a functional generative AI model, Defendants Meta and Zuckerberg followed their well-known motto: “move fast and break Things.”»
-Lawsuit filed in the United States District Court for the Southern District of New York
Meta has pushed back against the allegations, signaling it intends to contest the case vigorously. In a statement to The New York Times, company spokesperson Dave Arnold said «We will fight this lawsuit aggressively.» He added that «courts have rightly found that training A.I. on copyrighted material can qualify as fair use.» The defense reflects a broader legal argument being advanced by several technology companies, which contend that the use of large datasets, including copyrighted material, is essential to the development of modern AI systems and can fall within existing legal frameworks.

Created by humans, assisted by AI.