I could be wrong, but I’m fairly sure we’re not yet at the point of convincing real-time voice generation, or any kind of decent-quality TV (sadly), although printed text and (non-live) radio are certainly viable right now.
I spent an hour or three over the weekend messing around in Skyrim VR with a mod that does player speech recognition, pipes the transcript out to GPT with identifier tags to give scene context, sends the GPT output to ElevenLabs for voice synthesis (optional), and then integrates the result into Skyrim's mouth rigging, etc.
Yes, there’s an extremely obvious lag before you get a response, but it’s only on the order of seconds, even though this is an early Skyrim mod with a ton of moving parts interacting.
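For anyone curious what that chain looks like in code, here is a rough Python sketch of one conversational turn, with a timer around each stage to show where the lag comes from. To be clear, this is my own guess at the shape of it, not the mod's actual code: `SceneContext`, `build_prompt`, `run_turn`, and the tag format are all made-up names, and the speech-to-text, GPT, and voice stages are passed in as plain functions rather than real API calls.

```python
import time
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple


# Hypothetical container for the scene context the mod attaches to each request.
@dataclass
class SceneContext:
    npc_name: str
    location: str
    tags: List[str] = field(default_factory=list)


def build_prompt(ctx: SceneContext, player_text: str) -> str:
    """Prefix the player's words with bracketed identifier tags (assumed format)."""
    tags = [f"[npc:{ctx.npc_name}]", f"[location:{ctx.location}]"]
    tags += [f"[{t}]" for t in ctx.tags]
    return " ".join(tags) + "\nPlayer: " + player_text


def run_turn(
    audio: bytes,
    ctx: SceneContext,
    transcribe: Callable[[bytes], str],
    generate_reply: Callable[[str], str],
    synthesize: Optional[Callable[[str], bytes]] = None,
) -> Tuple[str, Optional[bytes], Dict[str, float]]:
    """Run one turn: speech -> text -> reply -> (optional) voice, timing each stage."""
    timings: Dict[str, float] = {}

    def timed(name, fn, arg):
        start = time.perf_counter()
        result = fn(arg)
        timings[name] = time.perf_counter() - start
        return result

    player_text = timed("speech_to_text", transcribe, audio)
    reply = timed("language_model", generate_reply, build_prompt(ctx, player_text))
    # The voice stage is optional, as in the mod; skip it if no synthesizer is given.
    speech = timed("text_to_speech", synthesize, reply) if synthesize else None
    return reply, speech, timings
```

The stages run strictly one after another here, so the total delay is just the sum of the per-stage timings, which is consistent with a lag of a few seconds when each stage is a separate network round trip.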
And the result is…astounding. As someone said in a comment on the video linked below: this is the biggest leap for video games since Half-Life 1.