1
I don't particularly care about SWE bench or how much higher the next model scores on it. Perhaps that's not very futurist of me.
How do you feel about this?
There's so much python slop being slung around, but I guess one day these things will write their own bytecode, I don't know.
Update based on comments: Perhaps by using 'LLM' I was too specific to today's non-deterministic LLM systems..I suppose I meant AI systems in general.
At this point I personally feel tired of everybody wanting some number of bucks per month, though I get that it aligns business costs with customer payments. How do you feel about this issue and what would you do in the case of this project?