I am struggling to imagine the frame of mind of someone who, when met with all this LLM progress in standardized test scores, infers that the tests are inadequate.
These tests (if not individually, at least in summation) represent some of society’s best gate-keeping measures for real positions of power.