AB testing on what? AB tests need to produce some results which are then compared. How would releasing different versions in production help with that?
It would make more sense if that was internal and the responses were then graded.
A failed canary release would be more likely, where they released this version to a small amount of people not realising it was bad