Voice/video is an optional XEP, and if it's not supported, what exactly happens to the client?
"This is false" is a terribly glib statement with literally no backing. It can only be said by someone who either has zero knowledge of what they're talking about or has tied themselves to a single implementation of XMPP everywhere, which essentially amounts to standardising on a bunch of XEPs.