1
Ask HN: License to Protect Training Data?
I am developing a system that will be used to inspect some data and identify things within it manually. I expect that, in some cases, these identifications will be used to train machine learning models. Is there an existing license that I can apply to the software that would require the end products of these outputs (i.e. the identifications and model weights) to be made public? Something like the GPL, but to democratize access to training data and models created downstream.
The application is in a niche scientific field and I am not worried about a lack of users, and I expect many users will align with the ethos I am proposing. I am simply wondering if a license or arrangement like this has been created already.