It would be good for Python to have more rigor in its definition. But the situation here doesn't have an analog in the Python world: there is no "PythonSpec" third-party test suite that is a) more complete than the official suite, b) exposing segfaults in previously working code, and c) going unused by the core developers.