Not an assumption. This is well established. They've been doing it for twenty years!
> Without regard for factual details that do matter to copyright law? Such as license??
What license? Google doesn't in general have or need an explicit license to crawl websites and neither does OpenAI.