There are different thresholds for "trusted". The strictest one that would still be useful to search is just the authoritative go-to sources for a topic, e.g. the official docs for an API or library. But that is perhaps too narrow: including Stack Overflow links about using that API or library would be fine. In other domains, like product reviews, you'd include only sites that have a reputation for actually testing the products.
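To make the threshold idea concrete, here is a rough sketch of filtering search results against a curated per-topic list with trust tiers. Everything in it is an assumption for illustration: the `TRUSTED` table, the tier numbers, the `filter_results` helper, and the example URLs are all made up, and a real list would need something better than exact hostname matching.

```python
from urllib.parse import urlparse

# Hypothetical curated list: topic -> {domain: trust tier}.
# Tier 1 = authoritative source (e.g. official docs),
# tier 2 = generally reliable community source.
TRUSTED = {
    "python": {
        "docs.python.org": 1,
        "stackoverflow.com": 2,
    },
}


def filter_results(urls, topic, max_tier):
    """Keep only URLs whose host is listed for the topic at tier <= max_tier."""
    allowed = TRUSTED.get(topic, {})
    kept = []
    for url in urls:
        host = (urlparse(url).hostname or "").lower()
        # Treat "www.example.com" and "example.com" as the same site.
        if host.startswith("www."):
            host = host[4:]
        tier = allowed.get(host)
        if tier is not None and tier <= max_tier:
            kept.append(url)
    return kept


# Made-up result set: two listed domains and one unlisted one.
results = [
    "https://docs.python.org/3/library/urllib.parse.html",
    "https://stackoverflow.com/questions/tagged/python",
    "https://seo-spam.example/python-tips",
]

strict = filter_results(results, "python", max_tier=1)   # official docs only
relaxed = filter_results(results, "python", max_tier=2)  # docs plus Stack Overflow
```

With `max_tier=1` only the official docs survive; raising it to 2 lets the Stack Overflow link through as well, and the unlisted domain is dropped either way.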
Use-cases:
- Reliably good search results... Google et al are full of SEO spam now and soon will be full of AI slop
- Better grounding for LLMs than current search grounding... even if the LLM doesn't hallucinate it can cite junk on the web
- Better pre-training data... I actually don't understand how LLMs will themselves filter out their own slop from future pre-training runs
I'm not sure what form this should take if it doesn't exist yet. Maybe a GitHub project or wiki curating links per domain (Yahoo Directory reinvented?), or each of us curating our own bookmarks and sharing them (Delicious reinvented?), or something else?