0 points · hammock · 1y ago · 0 comments

In that vein, perhaps the delta between o3 @ 87.5% and Human @ 85% represents a deficit in the ability of text to communicate human reasoning.

In other words, it's possible humans can reason better than o3, but cannot articulate that reasoning as well through text - only in our heads, or through some alternative medium.

3 comments · 2 top-level

85392_school · 1y ago · 1 in thread

I wonder how much the amount of time allowed to answer affects human performance.

yunwal · 1y ago

Yeah, this is sort of meaningless without some idea of cost or consequences of a wrong answer. One of the nice things about working with a competent human is being able to tell them "all of our jobs are on the line" and knowing with certainty that they'll come to a good answer.

unsupp0rted · 1y ago

It's possible humans reason better through text than not through text, so these models, having been trained on text, should be able to out-reason any person who's not currently sitting down to write.
