The Irony of 'You Wouldn't Download a Car' Making a Comeback in AI Debates

FatCat@lemmy.world · 4 months ago

The Irony of 'You Wouldn't Download a Car' Making a Comeback in AI Debates

Dasus@lemmy.world · 2 months ago

First off, this been bothering you for a month? O.o

Secondly, I personally can’t steal a single book, popular or not.

I am arguing in good faith. The training data obviously has copyrighted works in it.

These companies are being treated differently than if you personally used copyrighted works without paying.

https://www.copyright.com/blog/heart-of-the-matter-copyright-ai-training-llms-executive-summary/

Using Copyrighted Works in LLMs LLMs use massive amounts of textual works—many of which are protected by copyright. To do this, LLMs make copies of the works they rely on, which involves copyright in several ways, such as:

Using copyright-protected material in the training datasets of LLMs without permission can result in the creation of unauthorized copies: copies generated during the training process and copies in the form of representations of the training data embedded within the LLM after training. This creates potential copyright liability.

Outputs—the material generated by AI systems like LLMs—may create copyright liability if they are the same or too similar to one of the copyrighted works used as an input unless there is an appropriate copyright exception or limitation.