AGI happens when you DON'T need to scale pretraining + RL.
Link?
There is a mountain of data pre-1905. Certainly enough to train a decent 30B parameter model.
Now, digitizing & OCRing all of that data... THAT is a challenge.