Clean room is merely a defence in case you get sued by someone saying that you copied the work. It's not legally necessary.
If the presumption is that LLM training, despite reading all the source code of everything everywhere, ultimately doesn't actually contain that source code (in a compressed form) then that is the significant bit.
If training is truly doing something transformative, maybe even a machine analogy to human learning, then anything produced directly by that LLM without another work in it's context is an entirely new work. That's all that is important.
> I think the practical answer is that clean room as a legal concept was designed for a world where reimplementation was expensive and intentional.
Whether or not it's expensive or intentional is immaterial. It always was and it's still true now. All that matters is that the actual expression, the real source code, is not copied. Clean room is just one way to have evidence that you didn't copy.