1. If it exceeds the context, the agent does random stuff that often goes against simplicity and coherent logical structure.
2. An LLM has zero intention, and relies on you to decide what to build and, more importantly, what not to build.
As such, I'm the limit on the number of concurrent agents working for me, because there is still a limit to my output of engineering judgement. I do get better, both at generating and at delivering this judgement. But once I exceed this limit, the output becomes garbage.
As of today, AI does not automate me in any way. I have something that it just flat out doesn't have.