I think that makes it a better test. An ideal model would recognize the ambiguity and either tell you what assumption it's making or ask a followup question.
They are just trained to generate a response that looks right, so they are perfectly capable of asking clarifying questions. You can try "What's the population of Springfield?" for an example.