It's counterintuitive, yes, but you can. You can just look at the tests to make sure they're consistent. With the latest models that has always been the case, and it's very rare for the models to try to cheat the tests.
You can convince yourself it can, by all means. But that doesn't make it true. In fact, we even have this rule for apecoding: a developer cannot review their own code.