Do you want to severely limit evolution of models by having them pick (and buy) a tiny subset of all books?
Should every training run put money into a pool that gets paid out to every rights holder of every book that has ever been published?
Should Meta buy a physical or electronic copy of every book they want to use for training? That has zero impact on revenue for individual authors.
Would they be paid by word, by token, by book? This makes little sense. We don’t charge people for the knowledge they acquired while going to the library over 50 years, AI just squeezes this into weeks. Our legal framework simply doesn’t fit.