Countless projects huh, CMU found the best of them has only a 30%ish success rate on basic business tasks. Many are below 90% still, but yeah let's just pull magic numbers out of thin air. How much Nvda you own bud?
Zero Nvidia. The CMU benchmark is fun, but tasks <> jobs. They found that agents can autonomously finish about a third of their simulated office tasks, but that can't be mapped to a labor-market forecast.