> Pretty sure that only Google and Microsoft have the money and resources to crawl the entire internet. Or perhaps the only that can AND are willing to.
Money and resources and a dominant-enough position so that your crawlers are not blocked by websites.
Unfortunately.