I think you underestimate how intelligent LLMs are. If the training data only explains a certain concept in French, the LLM will nonetheless be able to tell you about that concept in any other language it has proficiency in. There's clearly a lot more going on under the hood than just a sophisticated markov chain.