1: How to download the internet.
2: How to parse HTML in a way that gives it a consistent structure and a meaningful interpretation.
3: Combine 1 and 2 and make it searchable.
I’m about 12 years into it and expect I have a couple/few more to go. Hopefully people still use HTML by then.
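
For a sense of scale, here is a toy sketch of steps 2 and 3 using only the Python standard library. It is nothing like a real system: the `PAGES` dict is a made-up stand-in for step 1 (no downloading happens), and `TextExtractor`, `html_to_text`, `build_index` and `search` are hypothetical names chosen for illustration. "Consistent structure" here just means visible text with script and style contents dropped, and "searchable" means a plain inverted index with AND matching.

```python
import re
from collections import defaultdict
from html.parser import HTMLParser

# Hypothetical stand-in for step 1: pretend these pages were already downloaded.
PAGES = {
    "https://example.com/a": (
        "<html><head><title>Cats</title><script>var dogs = 1;</script></head>"
        "<body><p>Cats chase mice.</p></body></html>"
    ),
    "https://example.com/b": (
        "<html><body><h1>Dogs</h1><p>Dogs chase cats.</p></body></html>"
    ),
}


class TextExtractor(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    SKIPPED = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.chunks.append(data)


def html_to_text(html):
    """Step 2 (toy version): reduce a page to its text content."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def tokenize(text):
    """Lowercase the text and split it into runs of ASCII letters and digits."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages):
    """Step 3 (toy version): map each token to the set of URLs containing it."""
    index = defaultdict(set)
    for url, html in pages.items():
        for token in tokenize(html_to_text(html)):
            index[token].add(url)
    return index


def search(index, query):
    """Return the URLs that contain every token in the query."""
    tokens = tokenize(query)
    if not tokens:
        return set()
    results = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        results &= index.get(token, set())
    return results


if __name__ == "__main__":
    index = build_index(PAGES)
    print(sorted(search(index, "chase cats")))
```

Note that a search for "dogs" only matches the second page, because the word appears in the first page only inside a script. Deciding what counts as content is the easy part here; real pages with malformed markup, templates, and text rendered by JavaScript are where the years go.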