When I use the planning mode and then code the success rate is much higher. When I ask it to work on specific isolated chunks of code with clear success/failure modes the success rate is again much higher.
Now imagine a world where it recognizes that from my simple throw away non specific prompt. If it was able to fire off 20 different prompts in quick succession it could easily cut my time spent in front of the screen by a third.
The patterns are obvious but they don't do that right now because it's a lot of compute.
We'll be looking at this time where there's a progress bar showing context space the way we look at the Turbo button.
Because the truth is to get the baseline I'm talking about is a finite amount of compute at a certain point.
No comments yet.