This is something I've been working on exposing to AI labs through my startup LatentEvals[1], and found similar results in other industries from lending to insurance claims.
Happy to share some sample reports if anyone is interested!
1. https://www.latentevals.com/