>I wouldn’t really be confident in identifying a „stolen face“ online because you’re bound to look similar to someone when creating a face. Does that mean that the look was stolen or is it just coincidence?
Luckily, national ID can easily solve that problem offline. It'd be a more interesting problem if some company searched far and wide to identify someone that has a similar face to [famous person]. And that's part of the issue in that there's no way to verify "is my voiced sampled here?". Opening the black box can help with that, but that's something companies are fighting to the death to not allow.
> What is the likelihood of a random AI voice sounding similar to some real online creator?
Given how much data was scraped, less likely than the fact that their data is in fact in the training set.