Summary
Meta Platforms Inc. is facing a proposed class-action lawsuit from major publishers and authors alleging that its Llama generative AI model was trained using copyrighted works without permission, in what plaintiffs describe as one of the largest copyright infringements in history.
Bloomberg Law
"
Companies including Elsevier Inc., Cengage Learning Inc., and Hachette Book Group Inc. filed the latest in content owner suits over AI training Tuesday in the US District Court for the Southern District of New York.
The complaint alleges Meta made unauthorized copies of millions of books and articles through torrenting, web-scraping, and training—a peer-to-peer file-sharing method—while also illegally removing copyright information from them.
The filing, which also names CEO Mark Zuckerberg as a defendant, attacks Meta’s use of the works at issue at different stages based on a number of theories, reflecting the legal uncertainties surrounding the application of copyright law to generative AI.
It addresses initially torrenting the works from illicit “pirate libraries” like Anna’s Archive, LibGen, and Sci-Hub, an intermediary stage where copies are made during the training process, and the end-game in which Meta is “flooding the market with AI-generated substitutes” for their works...."
This article was originally published in Bloomberg Law.