Web Performance Profiling: Google.com

(requestmetrics.com)

117 points · toddgardner · 5y ago · 83 comments

stingraycharles · 5y ago · 8 in thread

Am I the only person here who wasn’t thinking about the frontend, but rather all the things that need to happen in the backend to render the search results?

To me it feels like an oversight when answering the question “how the hell is Google so fast?” and not digging into how Google is able to return the results to your actual search query in a matter of milliseconds. That, to me, is the real miracle.

kempbellt · 5y ago

I find this to be the more fascinating part of Google's response time as well. Sending an optimized html file to a client in a matter of milliseconds is cool, but static pages should all load very quickly, so I don't see much surprise here - they've just optimized their front-end and done a good job of it.

Them being able to take your query, discover the data that answers your query, then optimize that data down to a little html snippet that fast is significantly more impressive to me.

sozy777 · 5y ago

And set up ad bidding, customized to you...

ma2rten · 5y ago

There is a lecture from Jeff Dean about it: https://www.youtube.com/watch?v=modXC5IWTJI

It's from 2010, but the fundamentals probably haven't changed.

brundolf · 5y ago

Sure, but that's a) impossible to dig into from the outside and b) less likely to yield useful information for the average developer who's trying to speed up a website.

tyingq · 5y ago

It is handy, though, that search results are free to be very "eventually consistent" and easily distributed.

Not discounting the other magic, but exposing read-only data, close to the end user, where freshness isn't a huge concern does simplify things.

GuB-42 · 5y ago

Something interesting I noticed is that the further you go down the pages, the slower it gets.

By their own metric, pages 10+ take several times longer to arrive. Most search terms only have a few hundred results at best, even when Google claims billions; you only see the real number when you hit the last page. For example, for me, the word "the" only has 445 actual results (instead of 25 billion), and page 45 takes 2 seconds to complete, compared to 0.7 seconds for page 1.

mhh__ · 5y ago

It is an oversight, but equally, for people like me it is helpful. I know how to optimise C++ (maybe not at Google scale on my own!), I've bullied compilers and measured PMCs, but I haven't got a clue how to optimise frontends (beyond my usual solution of giving up and going to do something pleasurable).

GretchenKlein91 · 5y ago

yeah me too

TekMol · 5y ago · 6 in thread

It feels fast because most other sites are insanely slow.

Just build a normal HTML+CSS+JS site with serverside rendering, route the assets through a CDN and voila! your site will be just as "fast" as Google.

thdrdt · 5y ago

Edit: I was too quick to judge your post. The article is indeed just about serving content, not about why their results are fast.

anthony_r · 5y ago

This reminds me of that joke video about replacing MongoDB with /dev/null. Look at the write speeds! /dev/null is web scale!

https://www.youtube.com/watch?v=b2F-DItXtZs

sushshshsh · 5y ago

Lots of servers and wires and things stored in RAM long before you ever ask for them basically

nevir · 5y ago

By doing all the calculation in memory. Disk is too slow


TekMol · 5y ago

The article is not about how they come up with the results. Only about how they deliver the page.

The article compares the speed to other websites like nike.com which have nothing to do with a search engine.

maple3142 · 5y ago

I think the point here is about how to make a page of about the same size faster.

fatnoah · 5y ago · 5 in thread

When I think of "fast", I always think of a time I was building a product that required subscribing to Twitter's API to receive updates from specific users.

In my testing, I'd set a breakpoint on my server code to see when I'd get the "push" from Twitter's API and would use the Twitter App on my phone to create test tweets. Every single time, I'd create a tweet and my server breakpoint would be hit immediately. Not soon, but immediately. I'd see my breakpoint triggered well before the UI in the app even refreshed after submitting the tweet.

moritonal · 5y ago

I think every dev should try once in their life to write "fast" code. Learn Rust or C++ or whatever and write an echo server and try every trick you can find to make it fast. Run it a billion times for fun and just bask in it.

MacsHeadroom · 5y ago

What's really wild is that the architects who envision and oversee implementation of systems that make these things possible earn a tiny fraction of what executives make.

They make 5x+ the next highest IC, sure. But if they worked half as much and BS'ed 100x as much they'd make 500x.

Personally, that makes me wonder a lot more than a clever combination of memcache and map-reduce.

Answerawake · 5y ago

Technology can be understood and reasoned from first principles. Can you really say the same about human relationships, especially the complex kinds that can result in such a logical mismatch? There is a lot of irrationality in the real world.

dmitriid · 5y ago

I have a bot in Java that re-transmits messages between Slack/XMPP/Telegram.

Sending message in Slack is:

- sent to Slack servers

- bot looked up and data sent to it

- bot (on a server in DO) figures out what to do with the message (working with an MQ server running locally :))) )

- sends message back to a server (slack/xmpp/etc.)

- that message gets processed and pushed to the corresponding client

I could never properly measure the time between the original message and the translated message. It was always way way subsecond.

Everything we have now: networks, servers, code is very fast.

[1] Badly written bot here: https://github.com/dmitriid/tetrad

oarsinsync · 5y ago

> I could never properly measure the time between the original message and the translated message. It was always way way subsecond. Everything we have now: networks, servers, code is very fast.

All of that said, if the time is measured in more than triple digit nanoseconds, relative to the hardware capabilities that we have today, it’s slow.

Please do not take that as a reflection on you personally or your work, but rather a reflection on the layers and layers of abstraction we’ve collectively added, and keep adding. While we’ve made it easier for people (especially developers) to write code, we’ve made everything slower, and just continue to mask that with hardware improvements.


epanchin · 5y ago · 5 in thread

Conversely, why is Windows search so miserably slow?

faeyanpiraat · 5y ago

Just use “voidtools everything”. It allows you to search all files, even with regex patterns, and it returns realtime results.

If you want to search on file contents, use “agent ransack”.

epanchin · 5y ago

Outlook search is even worse. I doubt I would employ anyone with "Outlook Search team" on their CV.

jeffbee · 5y ago

Google search can be fast because thousands of computers complete your request at once, whereas the index of junk on your Windows box only has the one little computer to serve it. Same reason why operations in Gmail are so much faster than a local client like macOS Mail: the instantaneous compute power brought to bear on your request while it is running is thousands of times larger than one computer.
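The "thousands of computers at once" idea is usually called scatter-gather: the index is split into shards, every shard is queried in parallel, and the partial results are merged. A minimal Python sketch of the pattern follows. The shard contents, the `search_shard` and `search` names, and the use of threads in place of network calls are all illustrative assumptions, not a description of Google's actual system.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical sharded index: each shard maps a term to (doc_id, score) pairs.
SHARDS = [
    {"badger": [("doc1", 0.9)], "dog": [("doc2", 0.4)]},
    {"badger": [("doc7", 0.7)]},
    {"dog": [("doc9", 0.8)], "badger": [("doc4", 0.2)]},
]


def search_shard(shard, term):
    """Look up one term in one shard (stands in for a call to one machine)."""
    return shard.get(term, [])


def search(term, shards=SHARDS, top_k=2):
    """Scatter the query to every shard in parallel, then gather and rank hits."""
    with ThreadPoolExecutor(max_workers=len(shards)) as pool:
        partial_results = pool.map(lambda shard: search_shard(shard, term), shards)
    # Flatten the per-shard lists and keep the best-scoring documents.
    hits = [hit for partial in partial_results for hit in partial]
    return sorted(hits, key=lambda hit: hit[1], reverse=True)[:top_k]
```

Each shard only scans its own slice of the data, so the wall-clock time is roughly that of the slowest shard plus the merge, rather than the sum of all shards.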

Miraste · 5y ago

While that's true, Windows 10 search is visibly worse than even Windows 7 search, and much worse than macOS search. And their fuzzy matching algo for the start menu is deranged.

scarmig · 5y ago

I would guess that aggressive caching and indexing are at least as important as having thousands of computers complete the request at once (and I'd be surprised if the number of computers on the request path is nearly that high--if nothing else, thousands of computers mean you're almost guaranteed to hit p999 latency every time).
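The p999 remark can be made concrete under a simplifying assumption (mine, not the commenter's): each of the N machines on the request path independently exceeds its 99.9th-percentile latency with probability 0.001.

```latex
P(\text{at least one machine is slower than its p999})
  = 1 - (1 - 0.001)^{N}
  \approx 1 - e^{-N/1000}
```

For N = 1000 this is about 0.63, and for N = 3000 about 0.95, so with a fan-out in the thousands nearly every request waits on at least one tail-latency machine.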


rsynnott · 5y ago · 5 in thread

I mean... is it? It feels a _lot_ slower than it used to, especially on slower devices.

Possibly that 700kB of random crap is implicated :)

johncena33 · 5y ago

Classic HN. Snarky, condescending and nothing of substance. Reminds me why I visit HN less and less lately.

dang · 5y ago

Please don't sneer, including at the rest of the community.

https://news.ycombinator.com/newsguidelines.html

rsynnott · 5y ago

I mean, it's fast _for an SPA that weighs 700kB compressed_. It is, however, substantially slower than it used to be, for no clear benefit.


Nextgrid · 5y ago

He makes a valid point; why does a page with a text field and links to other pages need to weigh anywhere close to 700kB?


MacsHeadroom · 5y ago

https://lite.duckduckgo.com/lite is ~100x faster.


ouid · 5y ago · 4 in thread

I have a related question. How is instacart so slow? I am usually pretty unbothered by slow load times, but searching for and selecting groceries is a full 10 times slower than any other experience on the internet. Is this a deliberate push to get me to use the mobile app? Some dark pattern thing?

adventured · 5y ago

It is remarkably, painfully slow. Every letter entered into their search box (eg when searching a particular grocery store for a product) appears to be doing a full reload of results. You can feeeeel the horrible lag as you try to type and the site can't keep up with either your typed text or the results.

They should be doing a timed release on that. If you stop entering text for N ms, then go for a result. Otherwise they need to have cached results for a huge number of common combinations of letters, very frequently updated, for every store. It's a resource intensive thing to do well at their scale, for such a seemingly simple feature. If they are in fact caching all of those drop-down search results properly, something is very wrong on serving up the cached content.

And for the full results pages, when you try to load them for a given grocery store, the only answer I can guess is again mediocre caching. There usually isn't any culprit other than that for such simple pages. In an effort to match current inventory, my guess is their caching isn't very good (constantly invalidated or rarely put into cache, so they're doing something that isn't very performant; I'd be astounded if they weren't doing some amount of caching on the results).

ouid · 5y ago

This makes sense technically, but it still doesn't answer the question of why it isn't fixed. Is it really that hard to experiment with caching policies? Is there secretly only one developer at instacart?

Traubenfuchs · 5y ago

> They should be doing a timed release on that. If you stop entering text for N ms, then go for a result.

I believe that is called debouncing and it's something any non-junior frontend developer should have in their toolkit.
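For anyone who has not met the term, here is a minimal debounce sketch in Python (a frontend would normally do the same thing in JavaScript with `setTimeout`). The `Debouncer` class name and the callback standing in for a search request are illustrative assumptions; this is one common way to write it, not Instacart's code.

```python
import threading


class Debouncer:
    """Run `callback` only after `wait_seconds` have passed with no new calls."""

    def __init__(self, wait_seconds, callback):
        self.wait_seconds = wait_seconds
        self.callback = callback
        self._timer = None
        self._lock = threading.Lock()

    def call(self, *args):
        with self._lock:
            # A new keystroke arrived: cancel the pending call, if any.
            if self._timer is not None:
                self._timer.cancel()
            # Restart the quiet-period countdown with the latest arguments.
            self._timer = threading.Timer(self.wait_seconds, self.callback, args)
            self._timer.daemon = True
            self._timer.start()
```

With a wait of, say, 0.2 seconds, typing "m", "mi", "mil", "milk" in quick succession and calling `debouncer.call(text)` on every keystroke results in a single callback for "milk" instead of four separate searches.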

nattaylor · 5y ago

Early on they built a Rails app backed by Postgres [0] to store, these days, 500 million items. Then in this post [1] they say they're rearchitecting the database, then mention Snowflake. It sounds to me like they need a document store like Lucene in order to get fast search performance--but they may be optimizing for other use cases like tracking orders, which is probably easier in an RDBMS.

It also looks like they might first do all the work to retrieve all the results for a query. The first request is to an endpoint `search_v3/{term}?cache_key={key}` then subsequent requests are to `asyncresultset_{n}?cache_key={key}` so it seems like they might have the result set cached from the first query.

The response to the autocomplete endpoint, which is fast, contains an `elastic_id` so perhaps they do partially use a document store.

[0] - https://stackshare.io/posts/the-tech-behind-instacarts-groce... [1] - https://tech.instacart.com/the-story-behind-an-instacart-ord...

saberience · 5y ago · 3 in thread

You can make Google searches plenty slow if you Google something uncached.

I just Googled: the OR google OR a OR badger -the -google -a

it took: 5.66 seconds.

This is probably cached now so don't try it yourself, replace badger with some other weird word. :)

asdfasgasdgasdg · 5y ago

The slowness of this query is more related to the negations than the fact that the query is uncached. Most uncached queries will be handled much more quickly. I'm surprised this one takes so long, because I would have expected the negations to eliminate the corresponding branches of the OR during query simplification, but maybe there is some expansion happening before we get to that point that makes it hard to detect this logical conflict.

Probably also the sheer frequency of the terms in the OR doesn't help. Normally "the" would be treated as a stopword (https://en.wikipedia.org/wiki/Stop_word). But I'm not sure if that logic applies when it appears alone in a sequence of terms (as when it's the child of an OR).
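The simplification described here, where a negated term removes the matching branch of the OR, can be sketched in a few lines of Python. The `simplify` function and its list-based query representation are hypothetical; how Google's query planner actually works is not public.

```python
def simplify(or_terms, negated_terms):
    """Drop OR branches that a negation already rules out.

    (A OR B OR C) AND NOT A AND NOT B is equivalent to C AND NOT A AND NOT B,
    because any document matched through A or B is excluded anyway.
    The negations themselves still have to be applied afterwards.
    """
    negated = set(negated_terms)
    remaining = [term for term in or_terms if term not in negated]
    return remaining, list(negated_terms)
```

Applied to the query above, `simplify(["the", "google", "a", "badger"], ["the", "google", "a"])` leaves only `badger` on the positive side, which is a far rarer term than "the" or "a".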

ramshanker · 5y ago

About 9,33,000 results (5.97 seconds)

Perhaps cache evicted. :D Or probably my PoP is different.

jedberg · 5y ago

5.3 seconds. I think you may have found a pathological case. Or we're just all on different PoPs (because I got .38 seconds when I did it again, so obviously it was cached the second time).

codegladiator · 5y ago · 3 in thread

> How to Be Fast, Like Google

>> Make Less Requests

From 130 requests, that's an odd takeaway.

toddgardner (OP) · 5y ago

I don't think you read the article. Devtools shows 130 requests because it counts inline data URIs as requests. In reality there are lots of inlined images, so the page looks mostly complete with only the original document request.
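For readers unfamiliar with the technique: a data URI embeds a resource's bytes directly in the document, so the browser needs no extra network request to show it. A minimal Python sketch of building one is below; the helper names `to_data_uri` and `inline_image_tag` are hypothetical and not taken from the article.

```python
import base64
import html


def to_data_uri(data: bytes, mime_type: str) -> str:
    """Encode raw bytes as a base64 data URI."""
    encoded = base64.b64encode(data).decode("ascii")
    return f"data:{mime_type};base64,{encoded}"


def inline_image_tag(data: bytes, mime_type: str, alt: str = "") -> str:
    """Build an <img> tag whose source is inlined instead of fetched."""
    return f'<img src="{to_data_uri(data, mime_type)}" alt="{html.escape(alt)}">'
```

The trade-off is size: base64 output is about a third larger than the original bytes, and inlined content cannot be cached separately from the page.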

codegladiator · 5y ago

I did read the article; that's why I picked two sentences from there. I did not pick up from the article that Chrome counts inline data as requests, and I did not know that earlier.

I find it even more odd that inline data is counted as requests.

Edit: I stand corrected, thanks for pointing that out.


tylerhou · 5y ago

Clearly the article means "make fewer render blocking requests."

sushshshsh · 5y ago · 3 in thread

Climb to the top of a mountain in the USA (or live in the countryside of a developing country), where you can only get 1 bar of 3G service.

Then try to load https://cr.yp.to

Ok now try to do a google search for dog food.

Tell me which one executes in less than a second, and which one hangs forever

codegladiator · 5y ago

Are you comparing search with a static page load?

Why not just compare the Google page load, if you want to compare with that?

sushshshsh · 5y ago

I'm just asking you to do the test, because it was forced on me for 6 months of my life :)

Firstly, cr.yp.to is hosted on some really basic consumer grade hardware most likely and has to make an additional hop through Tonga of all places for DNS resolution.

Of course, the page size of cr.yp.to is very small and does not involve any other communication with other servers to deliver a request.

But Google has x million machines, x million miles of fiber, x million sticks of RAM, and the page size on Google also isn't terribly huge. And it's usually serving a cached result; the robot has already scraped it.

But still, because of the number of network pings that serving a single google search takes, it is extremely common on cell phones to lose service for a millisecond and completely destroy the bidirectional connection between you and Google, and your mobile browser will just sit there hanging forever until you force restart it.


james412 · 5y ago

A cold cache Google page load here pushes 725kB just to render a logo and search box. To avoid fingerprinting, my cache is always cold.

Google search is a huge dog


cblconfederate · 5y ago · 2 in thread

What's the deal with the CSS thing? I've noticed that custom fonts cause considerable slowness, but CSS?

anthony_r · 5y ago

Nothing special about it, it's just another round-trip that you'd have to make.

There's also the statistical impact of many round-trips: imagine a network where 1% of connections take 1 second, while 99% take 1 millisecond. If you issue more than 100 requests, then most page loads will be slow.
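The arithmetic behind this is quick to check. A minimal Python sketch, assuming each request independently has a 1% chance of being slow (the independence assumption and the function names are illustrative, not from the comment):

```python
def p_any_slow(requests: int, p_slow: float = 0.01) -> float:
    """Probability that at least one of `requests` round-trips is slow."""
    return 1.0 - (1.0 - p_slow) ** requests


def requests_until_likely_slow(p_slow: float = 0.01, threshold: float = 0.5) -> int:
    """Smallest request count at which a slow page load becomes more likely than not."""
    n = 1
    while p_any_slow(n, p_slow) <= threshold:
        n += 1
    return n
```

Under that assumption about 63% of page loads with 100 requests hit at least one slow round-trip, and the 50% mark is already crossed at 69 requests.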

missblit · 5y ago

It doesn't need to be an extra round-trip in HTTP/2 or HTTP/3 thanks to server push (which is basically link rel preload, but without the roundtrip part)

dimtion · 5y ago · 1 in thread

One factor that is missing from this post is the processing time on the backend, which is also insanely fast. This post only considers front-end optimizations.

On the author's benchmark, a roundtrip seems to take on average 30ms, and the time to first byte for the main content is around 140ms, which means that in less than 110ms Google is able to parse the search query and build the HTTP response.

I'm sure they are heavily relying on caches and other optimizations, and for tail-end requests the result might not be as impressive. Still, in 2020 this kind of speed is unfortunately not the norm among other websites.

arafsheikh · 5y ago

> for tail-end requests the result might not be as impressive

Yep, it is possible to craft search queries that take multiple seconds to process. Example [1]:

    the OR google OR a OR "supercalifragilisticexpialidocious" -the -google -a

[1] https://news.ycombinator.com/item?id=20605589

offsky · 5y ago · 1 in thread

If you inline everything then you can’t take advantage of caching those resources. I wonder if there is a fancy way to inline the resources and then somehow use JS to cache the data, set a cookie, and then on the second page load it’s even faster because you don’t have to resend the inlined stuff.

samsquire · 5y ago

I had this idea too. I proposed it as an alternative to web bundles.

Include a cache attribute to inline resources to mark the inline resource as cacheable. Whenever an empty element with the same attribute is encountered, use the cache. Could use a header with a serialized bloom filter to convey what has been cached to the server.
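A Bloom filter lets the client summarize "which resources I already have" in a few dozen bytes, at the cost of occasional false positives. The Python sketch below shows how such a filter could be serialized into a header value. The 256-bit size, the three hash functions, and the idea of a header such as `X-Cached-Resources` are illustrative assumptions; no such standard header exists.

```python
import base64
import hashlib


class BloomFilter:
    """Tiny Bloom filter backed by a Python int used as a bit array."""

    def __init__(self, size_bits=256, num_hashes=3, bits=0):
        self.size_bits = size_bits
        self.num_hashes = num_hashes
        self.bits = bits

    def _positions(self, item):
        # Derive several bit positions by salting one hash function.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.size_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits |= 1 << pos

    def __contains__(self, item):
        # True means "probably cached"; False means "definitely not cached".
        return all((self.bits >> pos) & 1 for pos in self._positions(item))

    def to_header(self):
        """Serialize the bit array as URL-safe base64 for use in a header."""
        raw = self.bits.to_bytes(self.size_bits // 8, "big")
        return base64.urlsafe_b64encode(raw).decode("ascii")

    @classmethod
    def from_header(cls, value, size_bits=256, num_hashes=3):
        raw = base64.urlsafe_b64decode(value)
        return cls(size_bits, num_hashes, int.from_bytes(raw, "big"))
```

A server that decodes the filter can skip inlining anything it reports as present. A false positive would make the server omit a resource the client lacks, so a real design would need a fallback fetch for that case.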

tanilama · 5y ago

This doesn't come as a surprise to me, honestly. Besides the performance benefit, inlining everything also minimizes your dependencies, and you can be sure that what you send out is what the user will be able to see.

I can also see how this makes integration testing an order of magnitude easier and more effective.

toddgardner (OP) · 5y ago

It's real sad that the title of this was changed from "How the hell is Google so Fast?"

newbie578 · 5y ago

Honestly, Google is one of the modern marvels. Even when I read the article and understand _why_ Google is so fast, I still cannot comprehend it.

To people from 30 years ago, this would be a literal example of magic.

chmod775 · 5y ago

> How is Google so fast?

It is not. At least not in the way the article is talking about. So while yes, requests are fast, it doesn't really feel that way when I can't click anything for a good second after hitting search.

My browser takes roughly half a second crunching JavaScript, doing layout, and drawing the result page AFTER doing a whole boatload of HTTP requests. Annoyingly half that JavaScript runs AFTER the page is rendered (150ms-200ms of JS functions when I can already see results, why?), making the page appear to lag if you immediately try to click/tap something.

If they would serve the page as static html and css instead, load times could easily be below 50ms with another 20ms for my browser to present me with something that is interactive right away.

The impressive part is how fast they generate results.
