Presumably because the engineers designed the system to prevent that. They didn't build the system by looking at example API calls and constructing a system which satisfied the examples, but had random behavior elsewhere. They understood this property as an important invariant. More important than matching the timestamps to a particular ISO format.
I'm not talking about "good" design as "adapting to changing requirements" or adhering to "design principles" or whatever else people say makes a design good.
I'm talking about designing for simplicity so that the behavior of the system can be reliably predicted. This is an objective quality of the system. If you can predict the output, then the system has this quality. If you made it like that on purpose, then you designed it for this quality.
LLMs do not have this simplicity, but a software system you would trust to power a bank does.