undefined | Better HN

0 pointsWorkaccount21y ago0 comments

The is a bar of usage built into the law, otherwise everyone who reads this wired article is violating copyright by making a full copy to their computer. Generally making non-lasting copies is fine, otherwise the internet wouldn't work.

AI doesn't need lasting copies to train, however I don't know what the actual implementation is. But if it's ruled that they can only use copyrighted data if it's not stored for more than the time it would take a human to consume, It wouldn't really cripple the models, but perhaps make training more logistically challenging.

It's important to understand that models are not data archives. They are statistical constructs made from getting quizzed, that uses human made content to generate the quiz questions.

0 comments

1 comments · 1 top-level

triceratops1y ago

> otherwise everyone who reads this wired article is violating copyright by making a full copy to their computer

Wired explicitly sent that article to their computer for the purposes of reading it so it's not a copyright violation.

j / k navigate · click thread line to collapse