It’s one of great misunderstandings (or more likely - very effective way of getting money) to claim that machine learning is a data problem. We’re so far away from that point, that we have literally no idea what’s needed to make ML a data problem. Algorithms are extremely simple, and it’s all more or less curve fitting.
Media (and surprising amount of tech people as well) tend to claim that ML learning is like human learning - repeat something enough times and you’re done, you know how to do it. ML is no where close to that point.