That's the point, I think. It should be possible to create an intelligence with orders of magnitude less data, and we are far from achieving that (we are also far from achieving AGI in the first place, even with those huge amounts of data).
My point is that it took a very large amount of data for a human to be able to "produce good output". Once it had, though, its performance was of a different stratum.