I would say that for almost all of my searches the AI Overview feature contains exactly the answer I was looking for, and I don’t even have to leave Google to get it. It’s been a very positive addition.
i have the opposite experience as i've found it to mess up a lot to the point that I can't trust the AI Overview answer at all. I've seen it be confidently wrong too many times and now the trust factor just isn't there.
When the search results are bad, usually I can at least tell that they're dubious: either they're from obviously unreputable sites, or they conflict with each other, or they just don't quite address my query. But an inaccurate AI overview can look very similar to an accurate one.
Website owners losing traffic to the AI Overview feature is well-documented and has been discussed extensively here. This reality begets a few questions:
-----------------
What happens when it no longer becomes profitable for anyone to share knowledge online?
AI companies have scraped physical books to build their training sets. What happens when it no longer becomes profitable for anyone to share knowledge at all?
Are you okay with a handful of corporations owning the truth?
Are you okay with that same handful telling you _what the truth is?_
-----------------
I'm totally fine with Google telling me where to find truth. I am, personally, NOT okay with for-profit corporations like them telling me what the truth is. Every AI search I do is tacit acceptance of what they're doing, so I try not to use it.
All that said, you're definitely not in the minority outside of this site.
Heaps of people take the AI Overview result as the truth, case closed.
Given its inaccuracies (I used it the other day as a last result to locate these Korean cat treats, and AI Overview said that it was a "health food" for "wellness"), it's a massive problem that we haven't fully realized yet IMO.
Websites accepted Google scraping their content because it gave them a prominent blue link plus excerpt to drive traffic. Now everyone’s content is blended together and maybe, if they’re lucky, their site is chosen amongst the blend to get a tiny citation link.
I don't believe this for a second. It has constantly the worst output of any serious AI I've seen, by far. It's laughably wrong sometimes, usually just wrong. It can usually cope with mundane keyword searches where it's still better to just read the wiki blurb, because even those can be mangled.
I disagree with “sometimes”. But anyway, the gargantuan difference is that with websites you can get a feel for their credibility. As a simple example, documentation on MDN is miles more trustworthy that your average SEO spam blog, and you can see this as soon as you enter the page. Yes, some scammers are craftier than other, but the signal is there.
With LLMs, all answers get the same weight.
And asking for sources is not reliable. They are too often made up or contradict what the page says.
> and I don’t even have to leave Google to get it.
And what will you do when most posts on the web are just junk SEO spam to trick LLMs into telling you what they want? It’s not like that’s hard to do, even.
https://www.anthropic.com/research/small-samples-poison
https://www.bbc.com/future/article/20260218-i-hacked-chatgpt...
I don’t like AI only idea but I think it will work just fine
Yes, but most of these things are results of adtech having so much impact on the web/how we publish/consume/get paid for the content we create. I'm a bit bitter/sad about this.
I often use Perplexity for more complicated questions though.
In both cases, I really like that. I can actually ask a complicated question and get a reasonable answer.
For example, check the answer to "what is the source of vertical straw in the movie The sixth Sense with the flying taxis and all?"
(I said sprawl but the voice to text understood strawz so I left it)
If I search topics I am knowledge about, the overviews are almost always at least slightly wrong.
Not all websites are correct sources of information, but I am generally aware of which websites are trustworthy and can cross check.
For daily things - finding restaurants, looking up my kid's symptoms, etc. - I still use Google
And since now they explicitly aim to never drive any human traffic to any websites, it will only become worse.
This is the way the internet used to be before it was enshittified, you just type in some keywords and the first result was probably the most relevant and readable source of info. But now, no more.
I think if a modern search engine could deliver the same experience but with organic human written content, I’d probably use that. This is probably a new niche now for upcoming search engines, focused on finding human created works.
Then you are extremely ignorant about the topics that you research, because I don't think I've seen a single AI overview that did not have a mistake, and I don't think I've seen less than half that got a critical fact wrong.
Have you actually used it? Because sometimes is doing heavy lifting there and very rarely is flat out lying. It consistently messes up and hallucinates.
As a former blogger: I hate it. But I knew years ago where things were heading and stopped. No point in blogging/ writing etc.
All that's to say: when I paid for Kagi, I thought I was investing in additional search infrastructure, and didn't realize Kagi had no aspirations to build their own general purpose index, and instead primarily aggregate results from other indexes, either adversarily (Google, Bing) or not (Yandex, Mojeek, Brave, Apple, etc.) I understand they do maintain their own small-web index, but I thought their aspirations were higher when I first jumped on that train.
If you're questioning the AI features, know that I am only barely aware they exist. I have never, not even once, accidentally or otherwise, engaged the AI features without going out of my way to do so. I've never seen what their AI is like. I have no idea what it's for or why I'd want it.
It's beautiful. Kagi has AI I suppose, but it's over there and not in my face. I don't think I've ever seen an AI nag in the UI, but their UI itself is also over there and out of my way.
Thank you, Kagi, for staying politely the hell out of my way. I love you.
Slow gradual growth before, large increase in the daily growth rate since.
It'll be interesting keeping an eye on how that growth rate goes over time. :)
You can ban Pinterest links, boost Mozdev, ban listicles, boost whatever.
Kagi gets very good very fast as you customize it and it's easy to keep it updated as sites go up or down in quality.
The community shared boost and ban lists are a great resource too. Making it easy to see and copy what others find useful.
I get a weird feeling when I see people googling things using Google (hehe), the amount of bs is mindboggling.
The feature I missed most from Kagi was domain filtering, so I had Claude write a quick userscript for DDG that lets me boost, pin, and block specific domains. uBlock Origin aside, DDG even lets you turn off ads natively.
Kagi is good, but the redirection felt a bit flaky lately, and I was dealing with an annoying bug where my localization kept defaulting to Groningen for no apparent reason.
I’ll stick with this DIY setup for a bit, though I might well end up back on Kagi once I realize how good I had it.
I end up doing a lot of searching with Mistral Le Chat (also a subscriber).
What I'd like to know is power cost difference between the two (on the server-side). Ie. is Mistral sustainable financially or are they also running on vc / burning money. Although France uses nuclear, so it is a drop in a bucket I suppose.
That's to say, if one day Kagi also forces AI search summary down my throat and hide the search results, I will definitely leave.
Kagi is better for research and knowledge work, Google is still better for quick lookups.
I only use Google to search for reddit posts.
The rest is ChatGPT or Claude.
Do not fund the Kremlin!
I'm working on a self-hosted search service called Hister with the same goal when I started Searx development: reduce dependence on online search engines.
Hister is a full text indexer for websites and local files which automatically saves all the visited pages rendered by your browser. It provides a flexible web (and terminal) search interface & query language to explore saved content with ease or quickly fall back to traditional search engines. This is a fundamentally different approach than what Searx follows and solves most of the weaknesses of metasearch engines. Of course it has its own weaknesses as well, but most of these are not conceptual and can be resolved by improving the software (and datasets)
I've been using it for a few months and as my local index is growing I can avoid relying on external search engines - and even websites listed in results - more and more frequently.
The initial reception is overwhelmingly positive with already more than 30 contributors and hundreds of contributions. Currently it can help with "recall" type searches mainly, but I'm planning to provide pre-indexed thematic datasets and I'm drafting a peer-to-peer index sharing concept. Maybe you can find it useful as well (or at least have some constructive criticism =]).
Links: - https://hister.org/ - https://github.com/asciimoo/hister - Background/motivation/beginnings: https://hister.org/posts/how-i-cut-my-google-search-dependen... - Small read-only demo: https://demo.hister.org/
The only feedback I have is the initial indexing from my large history was rough. There were a lot of domains that kept blocking me for exceeding rate limiting or wouldn't let me index at all. I could see it being useful to import a history file and organize it by domain inside some sort of temporary database to track/distribute attempts and get a more detailed report on complete domain failures.
Regardless though - great work!
Assuming it indexes everything locally and falls back to traditional search engines if none found, how do you feel about adding a shared middle layer? A layer that simply indexes all the canonical data that doesn't have any personal info. This way, the contributors can automatically contribute the pages they index - building a shared search engine over time! The whole thing can work without a crawler of its own (under appropriate license so people can trust it)
Hister sounds like a idea I had years ago but gave up on after running into issues with index size taking up way too much storage.
Long ago I've used Searx and really liked it but after some point didn't see the point as opposed to using Google more directly. But lately in the back of my mind I've thinking about checking in on it again.
- If I wanted to use use my domain list to start hister, to download my preconfigured / like domains?
- Can I make some pages to rank higher in it?
- Can I assign tags to pages (by which I could later on filter?)
My domain index
So if i only use the Firefox extension, all pages Hister will "fetch and store" will have gone through my browsers content blocker (uBlock Origin) before being saved ?
A VPS with without a black listed IP is good. A simple rootless container, update is easy.
Configuration takes little time, not much.
I still hate that I have to double the bang to use the same bang as DDG.
Example: "!!wde Ente" to go to the German wikipedia page about duck instead of "!wde Ente" with DDG.
[1]: https://kagi.com
[2]: https://uruky.com
We also have a no ai version: https://noai.duckduckgo.com/
We live in the dystopia we deserve. We have built it with our own hands and it is here to stay.
I will say off topic that, speaking to an early googler, there is actually documentation of meetings where they discussed what "don't be evil" meant and decided actual business options they should and should not pursue. It was not just a motto or a "code of conduct", but meant as and used to justify consequential actions.
Pre-2010 Google search didn't use https by default, almost no one did besides specific cases, like processing payment. And even then, only the critical part was https, the rest, like images was plain http. So, for a true pre-2010 experience, you want http:// links.
Post-2010 Google played an important role in pushing for https. From boosting https search results, to Chrome being annoying to unencrypted connections, to sponsoring Let's Encrypt, to forcing HSTS on their TLDs.
I kind of miss http, it was a time when the web was a public thing, a place for sharing, not for keeping secrets. But to be fair that's just nostalgia, the modern (commercial) web that generalized encryption enabled is so useful and convenient that I can't imagine going back.
That's a bit unfair. Not all of us who live in it had a hand in building it. In fact, very few of us had the leverage to fight against it.
Normal people are using AI for search more already, Google is just trying to stop their primary business from completely disappearing.
I'd rather just have results without chatbot fluff.
They are transforming google search into chatbot but they already have a chatbot and by dping that they are losing google search product completely.
I miss times when search used to be fulltext search (also in youtube, ebay, etc). Now it is some kind of embedding vector search skewed by marketing and my previous visit history. It often shows completely irelevant and useless results and it is much harder (even impossible sometimes) to find something even though I am sure it exists (and I am able to find it eventually but it takes much more effort).
I think the reason is my searches are almost entirely long-tail searches that Kagi's index just isn't good enough for. I am never searching for something like "best mattress" or anything else that is heavily SEO'd - it's always something very specific - so the result page in Google looks pretty much exactly like the Kagi page, only it nearly always has the result I'm looking for where Kagi's doesn't.
The main issue I've had with Kagi is that using "before:" and "after:" just seems weirder than it does on Google, and will throw in some stuff that's visibly outside the ranges I selected sometimes.
I'd say about 1 in 5 searches for me is apparently "long tail" and I've never been more aware of how dead the internet is. I regularly still compare to DDG and Google, and spend about 300% more time on the others trying to find a match and discovering I have more results only because they made my quotes terms optional or ignored my time restrictions.
any chance you have an example of such searches off the top of your head that you wouldn’t mind sharing?
I still use and recommend Kagi but the results have gotten worse, but I also think that’s just an indicator of the health of the web as a whole.
The UI has gotten a bit clunkier over the years, but it's still good, still more focused than Google's.
As I write this, I give Google Search a quick try and notice that the first thing you see is a full-screen cookie banner!? On my laptop, I even have to scroll to reach the Reject/Accept buttons, and keyboard controls don't work at all. I can't believe people still use this crap.
(What do you search) they stink vs. Google even though Startpage is proxying them.
DDG is _not_ a “good” search engine — please, anybody have a hundred side-by-side screenshots to compare identical searches?
Edit - also admit Kagi’s great, I’m not affiliated; if you have money Kagi should be the pick (ideally purchased via their more private payment options probably)
I can't say I've noticed any changes about google search on desktop recently. Yes; there is an AI overview widget at the top of the page; but it's been there for at least a year.
Has anything changed about Google search results for you?
This is pure observation/anecdotal. I have no measurements to back this up but I think others will share this view.
Let me correct.
Search was good as late as 2010, when they changed the engine to facilitate an "instant search, search-as-you-type" feature. It was decent until around 2015, when a pivot to privileging "brands" poisoned the results. And it's been useless since the pivot to semantic search (in part to facilitate Home/Nest products) and a series of index purges over the past few years.
We had something very good, and we had it for a good while, and it was destroyed by a company that was a blackhole for investment and some of the brightest minds of my generation, sucking up untold amounts of labor and radiating little but "exceptionally deleterious to society" particles.
And at least I know that I am happy to talk about a product that I pay for. Is some of it because I feel like there's a sunk cost - the amount of money that I've paid into it? Yes, of course there is. Is it a good search engine? Yes.
Do I wish there were features added to it that they've promised for a while now? (namely being able to pay for more AI credits, especially if you paid a year in advance?) Yes.
It's a paid-only service, without ads - "you are not the product", that lets you hide results from popular (to the mainstream) sites like Instagram and Pinterest, and to filter out low-quality sources like W3Schools, while raising MDN and the Arch Linux wiki to the top.
I would rather not.
Edit: clarifying that this is not strictly due to ads. I think the article itself is an ad judging by the slug 'six-search-engines-worth-trying-now-that-google....'. Usually such articles include a plug. I am not disabling adblocker to read that plug
Maybe this is a frog in boiling water situation but… it’s just the same search as it’s always been, there’s just now a chunk of Gemini up top, that you take with a grain of salt, same way you’d take the promoted results. If you don’t like it, Adblock it.
When I ask myself honestly, has google search gotten worse over the past 25+ years? My answer is… ehhhhhh… not really?
Other people have always claimed google was ‘getting worse’ and I just don’t see why it should be any more true now than it was then. Isn’t this just the latest round of whinging?
Digging through the crap in a Google search is like finding a recipe in a blog. Sure, it's there, but the experience isn't great. I'd rather pay for a better experience, like buying a cookbook or subscribing to nytc. If you can afford it, why wouldn't you?
I use it through OpenRouter - I love how the pricing is per search and isn't a subscription.
I feel like this is a scene from a movie where a character goes "what are you talking about? Look around us, it's already gone" (idk why i am picturing Mark Whalberg for some reason).
But it's all just reddit, medium, twitter (twitter pages, w/e those blog things are).
The internet as we knew it is gone, man.
No thanks, I like ads blocked.
https://web.archive.org/web/20260523024427/https://techcrunc...
Some of my favorites are:
I also use Grok and Perplexity
I was quite fond of ixquick but it shut down ages ago. These days I like etools.ch especially since it includes results from search engines like marginalia etc that I tend to forget to search directly but like having meshed into my general searches. Plus you can change which engines it uses in settings and it shows which search engine(s) each link came from which is handy.
On a related note I like to check out Serdys list of search engines with their own indexes once if a while. It gets updated here and there and includes a fair amount of search engines I don't tend to see elsewhere.
https://seirdy.one/2021/03/10/search-engines-with-own-indexe...
So, now I have a nice video clip of "The Ballad of the S.S. Forgetful", which it turns out aired near the end of Sesame Street Episode 2130. :)
They intercept a non-trivial portion of the web's traffic and presumably are as equipped as any other company is to build a good index. They're also the only the only company that has an interesting alternative incentive structure for creators.
I've been using Ddg and Brave for general search and Yandex for deep-ocean expeditions (of the jack sparrow variety) and topics that US tech giants censor. I am looking for good Chinese search engine so that I can search things that Yandex censors or when Yandex's bot detection goes crazy (I get blocked with infinite captcha about 25% of the time).
The only thing Google remains good at is local search. If I want to buy something locally, nothing else comes close.
There’s so much content getting buried now.
If you’re looking for anything remotely niche or legally gray, like sports streams or ebooks, you’re often better off using Yandex or you’ll never find it.
The old Google search engine that used to properly index and surface the open web has been gone for a long time.
Google hasn't been "Google" for quite some time.
* I use Kagi and DuckDuckGo before that.
The last use case I had for google was Google Scholar, but it now appears to block anyone who blocks google tracking. But this is where ChatGPT does an excellent job of generating lists of technical papers and reviews and it interprets natural language queries with no problems. The kind of complex logical search queries google used to support (what, 15 years ago?) can be written without strict logical language (! & | ()) and all that. Pubmed isn’t bad for cross-checking and simple searches. And if you put sci-hub into the yandex.com/search box and click on yandex ai it tends to tell you where the current active sci-hub sites are, which is handy.
- Organize the world's information
- Don't be evil
Who was president at this time? Was this while they were denying students the option to code on the computers we bought for them because security?From https://en.wikipedia.org/wiki/Don%27t_be_evil :
> Between 21 April and 4 May 2018, Google removed the motto from the preface, leaving a mention in the final line: "And remember... don't be evil, and if you see something that you think isn't right – speak up!"
That sounds like NY to me.
I changed my default search engine to: https://www.google.com/search?q=%s&udm=14
Author should mention that you can change your browser's default search engine.
It just doesn't seem to surface much reddit content. I know I know reddit, but on niche topics they often have some good discussions/links/experts.
Also an extremely happy Kagi user but can't speak to their API, haven't tried it yet.
The idea is simple: collect search stubs, short documents with routing information about a topic. When you want to search something your agent finds the right stub and looks up where on the web to go find that information. Your information is always fresh, while a search stub changes slowly, over years, because the entry points tend to be more stable.
If we use a 4B local model and another 4GB for search stubs it could be portable enough to download and install. As you use it you also generate your own search stubs on top of the generic package. A stub could contain links to news feeds, high quality hubs, search engines and of course, actual websites. High quality search stubs can be generated with any frontier LLM piggy backing on its agentic search capabilities.
I think the idea of managing a collection of search stubs as a replacement for centralized search engines is important because it would wean us of one of the last centralized points of the internet. Google plays many ranking games on top of users and publishers, serving their own interests first. I want out of that arrangement.
local LLM + local stub index + proxy == anonymity
Google still maintains a web search mode that's free of AI overviews/chat exhortations (as well as ads, if you use an ad blocker). https://www.google.com/search?q=foo&udm=14 is the format of the search URL, and tenbluelinks has instructions on how to use it as your default engine on various platforms.
That said, I've stopped using this as a founder. While I personally like the web search results more (if I wanted synthesis of results, I'd use dedicated agentic-loop-capable tools that are a hotkey away), it's far more important to understand (and empathize with) our users' experiences, good and bad, when they use Google in its full AI extravagance in practice.
The current catalog covers 100+ providers, 1000+ universities, and 250,000+ courses.
Originally a weekend side project that I first shared with the world here on HN itself on Nov 29, 2011 [1]. Currently bootstrapped and profitable.
I'd guess due to compute constraints, AI overview will struggle to reach truly great quality. That said, for now I find adding this section at the top still useful to me. The broader decline in Google's search quality is the bigger drag on me.
I remember the first time I copy-pasted something off a major website, in quotes, because I wanted to follow up on it, and got no search results. Not even from the site I copy-pasted from. "Search" has been in name only for a very long time.
Build directories again.
It incorporates results from both Google and Bing: https://support.startpage.com/hc/en-us/articles/452243553384...
Google and others are racing to integrate AI but - for me - there are two hidden costs. 1. Access to the web becomes mediated and monetised even more. 2. Energy consumption up to 100x normal searches has an environmental impact.
This is going to happen whether Google does it or not. The toothpaste is out of the tube.
I'm afraid of them selling out at this point. If they go, I honestly don't know who will go in their place.
Before, search engines provided links, links provided the info we were looking for. Now AI provides that info without the middle man, links are just a footnote in case you need them.
you can simply set the search as you like and save as a search url [1] that can be set in the browser once and forget.
their bangs are useful to search directly in a specific site (e.g. useful workaround for reddit search results not coming anymore). when i tried kagi, i noticed that they have adopted a similar system, which has to be inspired.
my overall search experience has been poor even before the GPTs, but this is just fine for almost everything.
[1] https://duckduckgo.com/duckduckgo-help-pages/settings/params
The three dominant English search engines with their own indexes are Google, Bing, and Yandex but this list has many spiders and engines that traverse the web.
I think Google is getting its act back together.
Seeing as Apple has reportedly partnered with Google to power their LLM offering, I don’t expect that to change soon. I hope I am wrong.
Different people/bots scrape the net and add it to a distributed database optimized for search.
Each query could cost a crypto micropayment to avoid DDoS. Or maybe a slightly larger payment to download the whole database so you can use it privately or create a competing centralized or decentralized search.
Yes, we hate crypto, but it seems useful here. It's bad if 1 entity can gatekeep both the database and access to it, no matter how non-evil they seem now.
We might even index torrents, use speech-to-text for music, movies, video clips and other things like that. So you'll search for a phrase from a movie and it will be there even though no one mentions it on any website.
A couple of issues I can think of with that decentralized approach:
* copyright - fuck it, it's decentralized, it can index whole books, maybe partnering with Anna's Archive or LibGen. Maybe have a copyright-respecting database and another one that doesn't respect it if you foresee the man coming down on the project. Maybe the results from the DB that doesn't respect copyright is merged at query-time with the one that does. Or maybe, the DB that doesn't respect copyright is just a superset of the copyright-respecting DB. I don't know how easy it would be to simultaneously search more than 1 DB.
* privacy - it could run over Tor or at least allow people to access it via Tor. The privacy of the cryptocurrency also seems doable - we have Monero and other private coins but I'm not sure how easy it would be to implement private micropayments with these.
* spam, intentionally wrong archives/crawls - pay the people who submit sites something so they financial motivation to not lie. Some consensus-based reward mechanism could be used, not sure which one
* moderation, illegal content - we don't care about copyright but likely don't want real CSAM, real animal abuse and other obviously awful content. Rewards should also be able to be used somehow for moderators or for people flagging content. We might even have a decentralized way to flag/tag content for anything at all - "AI generated" or "human generated", "small web", "uses Cloudflare", etc..
* how the distributed database actually works, how searching it works, who connects to whom when making a query and so on. I hope there are smart people with knowledge on such systems (not me lol) who can shed some light on whether it's possible and how.
DuckDuckGo is about to surge!
If you really want the best search engine, ChatGPT with thinking mode enabled is by far the best search engine technology that exists today. There's nothing that comes close.
This one is also stupid:
> But if a search engine were to operate without ads, could it still make money?
> That’s what Kagi is trying to accomplish. For $5 per month — or $10 for unlimited searches — you can access an ad-free search engine without AI overviews.
UUuuh ChatGPT exists for $20 per month and does the best searches (amongst other things) and is also ad free.
----
Edit: getting downvoted
Firstly, it is pretty obvious to me and everyone else reading that this specific concern that the content producers won't make money is largely performative and insincere.
From the article:
> many users see this as yet another example of a tech company squeezing AI agents and chatbots into everything it can, making it impossible to navigate the internet without encountering a chatbot
This is purely ideological. I can say this because Ads, which are the very thing keeping content alive, is the very thing opposed generally by the same people.
Secondly, it is exactly Google, the company that pioneered ads, the thing that people take an issue with, are the ones doing this. Surely such a company knows how to balance ad revenue and long term user growth. If your concern is so valid that content creators won't make money, why do you think Google is doing all of this, especially when they are bound to lose their main source of revenue? It was Google that even made content creation possible by providing revenue.
Thirdly, and I can't prove it but I mean this in an normative and a positive way: AI for search is good for humanity, good for content creators as well. The large second order effects can't be explained but making it quick and easy for users to search and provide results for complicated prompts is a _good_ thing. I generally do click people's blogs and learn more about them and follow them.
In fact, if the concern were actually sincere, we would be seeing the second order effects more lucidly: lower SEO spam and higher quality publications.
I'm already seeing newer forms of content monetisation in the form of substack etc. This is by far a better, more aligned approach than SEO cat and mouse games. I also see advertisements working better because a rich prompt has better CTR which opens up a potentially better content economy. But I predict this very thing would infuriate the same people even more. "How dare I get more relevant ads and make Google richer??"
I'm sure what I typed up would be downvoted because of ideological reasons, but the few that think a bit more deeply might agree and see my point. Performative concern is tiring.