1) It not really good for localized search, it might be if you're local to the US or UK.
2) No !bangs. Coming from Ecosia I frequently just do !w !maps !yt because I know where I want the answer to come from
For English language searches, it completely usable, but not quite as good as Bing or Google. I really wanted to try to use Mojeek as my default for an extended period of time, but the lack of good local search makes it a bit annoying.
What's worse is that it's probably hard to ever get working well without the internet-scale profiling Google has access to.
The European Union, at least, has limited that a bit by preventing Google from linking Google Maps from their SERP.
So now, if you're in the EU, local results will display a map but you can't click on it.
A look at search engines with their own indexes (2021) - https://news.ycombinator.com/item?id=31820149 - June 2022 (114 comments)
I’m not saying having deep per-page indexing of Reddit, for example, isn’t useful. But is there any value in a breadth-focused index that is far cheaper to maintain?
There's an inordinate amount of documents that will never be a good search result for any query. Both in trivial cases that have barely anything to index in them, but also sign-up forms, cookie policies, redundant information (e.g. any given man page exists in dozens if not hundreds of identical copies on the web).
Unless you're specifically searching for other websites' cookie policies (e.g. to understand how they work, or to do research on them, or just to plainly copy them...)
https://git.sr.ht/~seirdy/seirdy.one/log/master/item/content...
Theyre just vecotors of arbitrary.dimension and similarity is calculated by a ndimensional fnction.
I'm wondering because it seems like due to the amount of spam on the web there needs to be more human curation as opposed to algrothims deciding what websites are valuable or not.
Dmoz was also closed, but it seems like there's a "new" Dmoz called Curlie [1], founded by some of the original team members.
It's also hard to find information about it (they really need to write more about it), but it's mentioned in this article: https://thenewstack.io/more-than-an-openai-wrapper-perplexit...