undefined | Better HN

0 pointsi_think_so2d ago0 comments

Has any enterprising hacker here yet graphed price vs "output" over time since 2023, taking "quality" into account?

That's got to be a very tricky analysis given how subjective quality is. But I'm sure there are people trying to pin it down.

0 comments

coalhouse2d ago

anything that compares proprietary models will be very miscalibrated and may not be indicative, there have been too many model changes in both chat and the api where model providers did not even say the word before it got too noticable

helloplanets2d ago

Quality would be performance against different given benchmarks, I assume?

There's multiple open weight models you can run on a pretty standard computer at home, which match the quality of GPT 4. I guess that would also change the equation.

lukewarm7071d ago

artificial analysis has an intelligence benchmark

j / k navigate · click thread line to collapse