It's not just about reproducibility (although I do think that's important); it's about analysis of the model. With traditional software you have a pretty well-defined "this code does this", but with machine learning models, one of the only ways to validate that bias or propaganda hasn't been inserted during training is to examine what the model was actually trained on.