That's a great idea! I'll try it out; it might save me a lot of time. I haven't really done user testing before. How do you test for things like 'still feels responsive' and 'is easy to use one-handed'? I'm currently running with zero programmed tests, but I do spend 4+ hours just using the app between Claude sessions.
I write my goals, and Claude drafts a plan. I then feed the plan to the Claude Chrome extension for live testing. The extension sets up the console inspector, measures response times, and changes the user agent (UA) if that's part of the plan. I'm sure you could also tell it "ok, make this change on the fly" so that it injects JS through the console, runs A/B tests, and outputs the results in an easy-to-understand report.
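To make "measure response times" and "A/B testing" a bit more concrete, here's a rough sketch of the kind of report step I mean. It is not what the extension actually runs; `summarize`, `compareVariants`, and the 100 ms budget are names and numbers I made up for illustration. It assumes you already have response times in milliseconds for each variant, for example from `performance.now()` deltas collected in the console.

```typescript
// Hypothetical helpers for turning raw response times (ms) into a short A/B report.
// The 100 ms default budget is an assumed threshold, not a measured requirement.

interface LatencySummary {
  count: number;
  medianMs: number;
  p95Ms: number;
}

// Nearest-rank percentile on an array that is already sorted ascending.
function percentile(sorted: number[], p: number): number {
  if (sorted.length === 0) {
    throw new Error("no samples");
  }
  const rank = Math.ceil((p / 100) * sorted.length);
  return sorted[Math.max(0, rank - 1)];
}

// Summarize one variant's samples without mutating the input.
function summarize(samples: number[]): LatencySummary {
  const sorted = [...samples].sort((a, b) => a - b);
  return {
    count: sorted.length,
    medianMs: percentile(sorted, 50),
    p95Ms: percentile(sorted, 95),
  };
}

// Build a plain-text report comparing variant A against variant B.
function compareVariants(a: number[], b: number[], budgetMs: number = 100): string {
  const sa = summarize(a);
  const sb = summarize(b);
  const faster =
    sa.medianMs === sb.medianMs ? "tie" : sa.medianMs < sb.medianMs ? "A" : "B";
  const line = (name: string, s: LatencySummary): string =>
    `${name}: n=${s.count} median=${s.medianMs}ms p95=${s.p95Ms}ms ` +
    (s.p95Ms <= budgetMs ? "(within budget)" : "(over budget)");
  return [line("A", sa), line("B", sb), `faster by median: ${faster}`].join("\n");
}
```

You'd call `compareVariants(timesForA, timesForB)` once per interaction you care about (tap, scroll, page load) and paste the output into the plan. Numbers like this only cover the 'still feels responsive' part; 'easy to use one-handed' still needs a person holding a phone.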