I can see this being extremely limiting in training data, as only "compatible" licensed data would be possible to package together to train each model.
That's part of the point.