With Gemini CLI I blow through Pro requests in < 10 minutes and it switches to Flash. I can't trust either to be autonomous. Pro will write unit tests, get a test to 100% coverage and then delete the test. Flash will get stuck in endless loops where it replaces a string in a file, doesn't realize the string has been replaced, and keep failing to recognize that fact getting stuck in a doom loop.
Glad I didn't add an API key. I've had friends who did and ended up with $xxx in charges because the models can't think or use tools properly.