Words are text-oriented, but if you’re learning the word “blue”, text alone isn’t going to help you a whole lot. It’ll help you spell and recognize the word, but it won’t help you understand what the color means nearly as well as a video that shows the word alongside several images of blue things.
I’m not at all convinced that programming is primarily text-oriented in any case. The underlying concepts are mathematical and logical, not textual. People don’t learn through text or video as such. Those are just mediums. They learn through metaphor and relationships.
So the question is which medium gets those metaphors across most efficiently. Consider, for instance, a binary tree. It’s a metaphor. And I’d be willing to bet most people new to the topic would learn that metaphor better (or at least faster) by watching a person draw out a tree and point to the path a program would take to add, remove, or list its nodes than they would by seeing a block of code and having someone explain it in text.
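For comparison, here is roughly what the "block of code" version of that tree might look like. This is only a sketch: I'm assuming a binary search tree (smaller values go left, everything else goes right), and the names `Node`, `insert`, and `in_order` are made up for illustration. The `path` list is the text equivalent of someone pointing at each node along the way.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Node:
    value: int
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def insert(root: Optional[Node], value: int) -> Tuple[Node, List[str]]:
    """Add a value and return (root, path), where path records each turn taken."""
    if root is None:
        # Empty tree: the new node becomes the root, no turns taken.
        return Node(value), []
    path = []
    node = root
    while True:
        if value < node.value:
            path.append(f"{node.value}: go left")
            if node.left is None:
                node.left = Node(value)
                break
            node = node.left
        else:
            path.append(f"{node.value}: go right")
            if node.right is None:
                node.right = Node(value)
                break
            node = node.right
    return root, path


def in_order(node: Optional[Node]) -> List[int]:
    """List the values by visiting left subtree, node, then right subtree."""
    if node is None:
        return []
    return in_order(node.left) + [node.value] + in_order(node.right)


# Build a small tree, then trace where one more value ends up.
root = None
for v in [8, 3, 10, 6]:
    root, _ = insert(root, v)

root, path = insert(root, 4)
print(path)            # the turns taken from the root down to the new node
print(in_order(root))  # values in sorted order
```

Reading `path` tells you the same thing the drawing would, but you have to rebuild the picture in your head to follow it.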