My £4 a month server can handle 4.2M requests a day (opens in new tab)

(mark.mcnally.je)

781 pointsmark_mcnally_je4y ago452 comments

452 comments

206 comments · 62 top-level

nostrademons4y ago· 47 in thread

People tend to severely underestimate how fast modern machines are and overestimate how much you need to spend on hardware.

Back in my last startup, I was doing a crypto market intelligence website that subscribed to full trade & order book feeds from the top 10 exchanges. It handled about 3K incoming messages/second (~260M per day), including all of the message parsing, order book update, processing, streaming to websocket connections on any connected client, and archival to PostGres for historical processing. Total hardware required was 1 m4.large + 1 r5.large AWS instances, for a bit under $200/month, and the boxes would regularly run at about 50% CPU.

mynameisash4y ago

What Andy giveth, Bill taketh away.[0]

I'm more than a little annoyed that so much data engineering is still done in Scala Spark or PySpark. Both suffer from pretty high memory overhead, which leads to suboptimal resource utilization. I've worked with a few different systems that compile their queries into C/C++ (which is transparent to the developer). Those tend to be significantly faster or can use fewer nodes to process.

I get that quick & dirty scripts for exploration don't need to be super optimized, and that throwing more hardware at the problem _can_ be cheaper than engineering time, but in my experience, the latter ends up costing my org tens of millions of dollars annually -- just write some code and allocate a ton of resources to make it work in a reasonable amount of time.

I'm hopeful that Ballista[1], for example, will see uptake and improve this.

[0] https://en.wikipedia.org/wiki/Andy_and_Bill%27s_law

[1] https://github.com/apache/arrow-datafusion/tree/master/balli...

Spooky234y ago

I get a kick out of stuff like this - I’m mostly an exec these days, but I recently prototyped a small database system to feed a business process in SQLite on my laptop.

To my amusement, my little SQLite prototype smoked the “enterprise” database. Turns out that a MacBook Pro SSD performs better than the SAN, and the query planner needs more tlc. We ended up running the queries off my laptop for a few days while the DBAs did their thing.

2 more replies

Waterluvian4y ago

What reminded me of this the other day is how MacOS will grow your cursor if you “shake” it to help you find it on a big screen.

I was thinking about how they must have a routine that’s constantly taking mouse input, buffering history, and running some algorithm to determine when user input is a mouse “shake”.

And how many features like this add up to eat up a nontrivial amount of resources.

5 more replies

b9a2cab54y ago

Abstractions almost always end up leaky. Spark SQL, for example, does whole-stage codegen which collapses multiple project/filter stages into a single compiled stage, but your underlying data format still needs to be memory friendly (i.e. linear accesses, low branching, etc.). The codegen is very naive and the JVM JIT can only do so much.

What I've seen is that you need people who deeply understand the system (e.g. Spark) to be able to tune for these edge cases (e.g. see [1] for examples of some of the tradeoffs between different processing schemes). Those people are expensive (think $500k+ annual salaries) and are really only cost effective when your compute spend is in the tens of millions or higher annually. Everyone else is using open source and throwing more compute at the problem or relying on their data scientists/data engineers to figure out what magic knob to turn.

[1]: https://www.vldb.org/pvldb/vol11/p2209-kersten.pdf

1 more reply

SNosTrAnDbLe4y ago

I work in the Analytics space and been mostly on Java and I am so glad that other people feel the same. At this point, people have become afraid of suggesting something other than Spark. I see something written in Rust to be much better at problems like this. I love the JVM but it works well with transactional workloads and starts showing its age when its dealing with analytical loads. The worst thing is then people start doing weak references and weird off the heap processing usually by a senior engineer but really defeats the purpose of the JVM

2 more replies

willvarfar4y ago

Yes, the whole idea of sending “agents” to do processing is poor performing and things like snowflake and Trino, where queries go to already deployed code, run rings around it.

Furthermore, pyspark is by far the most popular and used spark, and it’s also got the absolute world-worst atrocious mechanical sympathy. Why?

Developer velocity trumps compute velocity any day?

(I want the niceness of python and the performance of eg firebolt. Why must I pick?)

(There is a general thing to get spark “off heap” and use generic query compute on the spark sql space, but it is miles behind those who start off there)

iaw4y ago

Could you elaborate on other systems besides Ballista? (which looks great btw, thank you for sharing)

1 more reply

danudey4y ago

A lot of that is due to absolutely lousy code.

We had a system management backend at my last company. Loading the users list was unbearably slow; 10+ seconds on a warm cache. Not too terrible, except that most user management tasks required a page reload, so it was just wildly infuriating.

Eventually I took a look at the code for the page, which queried LDAP for user data and the database for permissions data. It did:

    get list of users
    
    foreach user:
        get list of all permissions
        filter down to the ones assigned directly to the user
    
    foreach user:
        get list of all groups
        foreach group:
            get list of all permissions
            filter down to the ones assigned to the group
        filter down to the ones the user has

I'm no algorithm genius, but I'm pretty sure O(n^2+n^3) is not an efficient one.

I replaced it with

    get list of all users
    get list of all groups
    get list of all permissions

    <filter accordingly>

Suffice to say, it was a lot more responsive.

Also worth noting was that fetching the user list required shelling out to a command (a python script) which shelled out to a command (ldapsearch), and the whole system was a nightmare. There were also dozens of pages where almost no processing was done in the view, but a bunch of objects with lazy-loaded properties were passed into the template and always used, so when benchmarking you'd get 0.01 seconds for the entire function and then 233 seconds for "return render(...)' because for every single row in the database (dozens or hundreds) the template would access a property that would trigger another SQL call to the backend, rather than just doing one giant "SELECT ALL THE THINGS" and hammering it out that way.

Note that we also weren't using Django's foreign keys support, so we couldn't even tell Django to "fetch everything non-lazily" because it had no idea.

If that app were written right it could have run on a Raspberry Pi 2, but instead there was no amount of cores that could have sped it up.

benhoyt4y ago

Yeah, I see this a lot. I think it's especially easy to introduce this kind of "accidentally quadratic" behaviour using magical ORMs like Django's, where an innocent-looking attribute access like user.groups can trigger a database query ... access user.groups inside a loop and things get bad quickly.

In the case of groups and permissions there's probably only a few of each, so fetching all of them is probably fine. But depending on your data -- say you're fetching comments written by a subset of users, you can tweak the above to use IN filtering, something like this Python-ish code:

  users = select('SELECT id, name FROM users WHERE id IN $1', user_ids)
  comments = select('SELECT user_id, text FROM comments WHERE user_id IN $1', user_ids)
  comments_by_user_id = defaultdict(list)
  for c in comments:
    comments_by_user_id[c.user_id].append(c)
  for u in users:
    u.comments = comments_by_user_id[u.id]

Only two queries, and O(users + comments).

For development, we had a ?queries=1 query parameter you could add to the URL to show the number of SQL queries and their total time at the bottom of the page. Very helpful when trying to optimize this stuff. "Why is this page doing 350 queries totalling 5 seconds? Oops, I must have an N+1 query issue!"

4 more replies

tra34y ago

This is an example of N+1 problem [0]. It should be a FizzBuzz for anyone doing any CRUD apps.

[0]: https://stackoverflow.com/questions/97197/what-is-the-n1-sel...

JJMcJ4y ago

Your pattern is quite powerful: get data from several sources and do the rearranging on the client (which might be a web server), instead of multiple interactions for each data item.

For SQL you can also do a stored procedure. Sometimes that works well if you are good at your DBMS's procedure language and the schema is good.

1 more reply

chenxiaolong4y ago

I had to fix a similar thing in our internal password reset email sender last year. The code was doing something like:

    for each user in (get_freeipa_users | grep_attribute uid):
        email = (get_freeipa_users | client_side_find user | grep_attribute email)
        last_change = (get_freeipa_users | client_side_find user | grep_attribute krblastpwdchange)
        expiration = (get_freeipa_users | client_side_find user | grep_attribute krbpasswordexpiration)

        # Some slightly incorrect date math...

        send_email

I changed it to a single LDAP query for every user that requests only the needed attributes. It cut that Jenkins job's runtime from 45 minutes to 0.2 seconds.

Thaxll4y ago

Filter in app is rarely the right solution, your data should be organized in a way that you can get what you need in a single query. Reasons:

- it's memory efficient

- it's atomic

- it's faster

Also doesn't LDAP support filtering in query?

1 more reply

VintageCool4y ago

I did some work to improve performance on a dashboard several years ago. The way the statistics were queried was generally terrible, so I spent some time setting up aggregations and cleaning that up, but then... the performance was still terrible.

It turned out that the dashboard had been built on top of Wordpress. The way that it checked if the user had permission to access the dashboard was to query all users, join the meta table which held the permission as a serialized object, run a full text search to check which users had permission to access this page, and return the list of all users with permission to access the page. Then, it checked if the current user was in that list.

I switched it to only check permissions for the current user, and the page loaded instantaneously.

1 more reply

unclebucknasty4y ago

Not sure what the exact use case was (i.e. the output of the filtering) but—from reading the first algo—seems to be something to do with determining group membership and permissions for a user.

In that case, was there a reason joins couldn't be used? As it still seems pretty wasteful (and less performant) to load all of this data in memory and post-process; whereas a well-indexed database could possibly do it faster and with less-memory usage.

hnbad4y ago

In defense of whoever wrote the original code: it probably would have been reasonably fast if it had been a database query with proper indexes. The filters would have whittled the selection down to only the relevant data, whereas returning basically three entire tables of data to then throw away most of it would have been extremely inefficient.

The mistake of course was not thinking about why this approach is faster in a database query and that it doesn't work that way when you already need to get all the data out of LDAP to do anything with it.

onlyrealcuzzo4y ago

Yeah - you likely want to do this a single simple query - which you can optimize if necessary. an O(N+1) query is bad. An O(N^2) query is something I have rarely seen. Congrats!

codebolt4y ago

I'd also add that groups and permissions are probably constant and can be cached with a long timeout.

afrodc_4y ago

Is there a reason you shelled out with a subprocess versus using a library like ldap3? Just curious

IfOnlyYouKnew4y ago

I believe the parent's point was that code tends to be faster than what people expect, not slower.

1 more reply

blacklion4y ago

Crypto markets are very small :-)

I'm working and company which process "real" exchanges, like NASDAQ, LSE, and, especially, OPRA feed.

We've added 20+ crypto exchanges in our portfolio this year, and all of them are processed on one old server which is unable to process NASDAQ Total View in real-time anymore.

On the other hand, whole OPRA feed (more than 5Gbit/s or 65B/day, yes, it is billions, messages of very optimized binary protocol, not this crappy JSON) is processed by our code on one modern server. Nothing special, two sockets of Intel Xeons (not even Platinums).

Foomf4y ago

I've read your few posts a few times and I'm still not sure why you made your post. You're telling the person that you handle more data than them and thus need more resources than them. Was your goal to smugly belittle them? It's not like they said any problem can be solved on their specific resources.

4 more replies

secondcoming4y ago

It depends on what your server does with each request though; '65B a day' means little. If all it does is write it to a log then I'm surprised you're not using a rPI.

joering24y ago

Could you share some more about that very optimized binary protocol? I know there are ways to be more efficient than JSON but since you call it crappy, your solution must be much much better. Honestly interested to readup more.

5 more replies

ingas4y ago

In 2001 we started a GSM-operator on Compaq Server (it was before they were bought by HP) with whole 1Gb(!) of RAM and 2x10Gb SCSI disks.

It served up to 70K of subscribers, call center with 30-40 employees, payment systems integration, everything.

Next was 8 socket Intel server. We were never able to saturate it's CPUs - 300 Mhz (or was it 400 ?) bus was a stopper. It served 350-400K of subscribers.

And next: we changed architecture and used 2 servers with 2 socket Intel CPUs again but that was time when Ghz frequencies appeared on market. We dreamed about 4xAMD server. We came to ~1 mln of active subscribers.

Nowadays: every phone has more power than it was those servers. Typical react application consumes more resources than billing system. Gigabyte here, gigabyte there - nobody counts them.

/grumpy oldster mode

fhood4y ago

People may underestimate how fast modern machines are, but that is probably in part because, at least in my fairly relevant experience, I have literally never seen a CPU bottleneck under normal circumstances. Memory pressure is nearly always the driving issue.

nine_k4y ago

The CPU is rarely used up to 100% because most code fails to utilize several cores efficiently.

OTOH a service loading the single core with the main thread is a frequent sight :( Interpreted languages like Python can easily spend 30% of time just on the deserialization overhead, converting the data from a DB into a result set, and then into ORM instances.

foobarbazetc4y ago

Yeah. Now that CPUs are insanely powerful and you have NVMe SSDs etc the bottleneck is always memory.

3 more replies

SilverRed4y ago

In my experience it is almost always the database holding things up. If your app does not use a database or it makes very simple use of it, then I'm not surprised it is blazing fast. As soon as you need to start joining tables and applying permissions to queries, it all gets slow.

OneEyedRobot4y ago

I've seen exactly the opposite although you certainly can't ignore memory speed.

Gepsens4y ago

I'm running a crypto trading platform I'm developing on 30$ on DigitalOcean. I coded exclusively in Rust and recently added a dynamic interface to python. Today during the BTC crash it spiked at 20k events/s, and that's only incoming data.

danudey4y ago

> I coded exclusively in Rust

This reminds me of back in 2003, a friend of mine worked for an online casino vendor; basically, if you wanted to run an online casino, you'd buy the software from a company and customize it to fit your theme.

They were often written in Java, ASP.NET, and so on. They were extremely heavyweight. They'd need 8-10 servers for 10k users. They hogged huge amounts of RAM.

My friend wrote the one this company was selling in C. Not even C++, mind you, just C. The game modules were chosen at compile time, so unwanted games didn't exist. The entire binary (as in, 100% of the code) compiled to just over 3 MB when stripped. He could handle 10k concurrent users on one single-core server.

I'm never gonna stop writing things in Python, but it still amazes me what can happen when you get down close to the metal.

2 more replies

Andrew_nenakhov4y ago

Probably, Erlang would be a good fit for your task.

2 more replies

giancarlostoro4y ago

is that $30 for a single droplet or is it spread out between a few different services? I'm kind of curious since I use DO for small projects myself.

3 more replies

lewisjoe4y ago

I'm running two different projects on a single instance of cheap VM too. Both of them runs on a not-so-memory-efficient programming runtime, yet the VM handles the load just fine.

Of course, a lot of it depends on what your app does for each request but most apps are simple enough and can live with being a monolith / single fat binary running on a single instance.

The problem with today's DevOps culture is that they present K8's as answers for everything. Instead of defining a clear line on when to use them and when not to.

fizwhiz4y ago

Would you mind describing your stack in more detail? Did you use gRPC with Go?

nostrademons4y ago

Sure, startup is defunct now and I think arbitrage & data on centralized exchanges is a dead market now. Wall Street HFTs got into the arbitrage game, and the data sites laypeople actually visit are the ones started in 2014.

Codebase was pure server-side Kotlin running on the JVM. Jackson for JSON parsing, when the exchange didn't provide their own client library (I used the native client libraries when they did). Think I used Undertow for exchange websockets, and Jetty for webserving & client websockets. Postgres for DB.

The threading model was actually the biggest bottleneck, and took a few tries to get right. I did JSON parsing and conversion to a common representation on the incoming IO thread. Then everything would get dumped into a big producer/consumer queue, and picked up by a per-CPU threadpool. Main thread handled price normalization (many crypto assets don't trade in USD, so you have to convert through BTC/ETH/USDT to get dollar prices), order book update, volume computations, opportunity detection, and other business logic. It also compared timestamps on incoming messages, and each new second, it'd aggregate the messages for that second (I only cared about historical data on a 1s basis) and hand them off to a separate DB thread. DB would do a big bulk insert every second; this is how I kept database writes below Postgres's QPS limit. Client websocket connections were handled internally within Jetty, which I think uses a threadpool and NIO.

Key architectural principles were 1) do everything in RAM - the RDS machine was the only one that touched disk, and writes to it were strictly throttled 2) throw away data as soon as you're done with it - I had a bunch of OOM issues by trying to put unparsed messages in the main producer/consumer queue rather than parsing and discarding them 3) aggregate & compute early - keep final requirements in mind and don't save raw data you don't need 4) separate blocking and non-blocking activities on different threads, preferring non-blocking whenever possible and 5) limit threads to only those activities that are actively doing work.

2 more replies

milkywaybrain4y ago

True. I can completely relate with this. I developed an open source crypto exchange connector [0] and created a fun twitter bot [1] on top of that. Currently twitter bot processes all the USDT market trades from Binance (around 260 markets with average 30,000 trades per minute) and calculates OHLC metrics every 15 minutes using InfluxDB. All these installations and calculations are done in a free tier 1 VCPU / 1 GB RAM AWS server (less than 10% CPU and less than 40% RAM usage always). [0] : https://github.com/milkywaybrain/cryptogalaxy [1] : https://twitter.com/moon_or_earth

Cthulhu_4y ago

They really do, and because of that they reach for over-engineered infrastructure solutions. I mean I get that you'd like to have some redundancy for your webserver and database, maybe some DDOS mitigation, off-site backup, etc, but you don't need an overblown microservices architecture for a low-traffic CRUD application. That just creates problems, instead of solving problems you WISH you had, and it's slower than starting simple.

Breza4y ago

I totally agree! I run a data science department at a corporation and it's amazing how much of our job is done on our laptops. I have a Dell Precision. When I need more power (a very rare situation), I can spin up a GPU cloud server and complete my big analysis for under $5.

dirkg4y ago

may be OT, but how do you subscribe to these trade feeds, is there a unified service or do you need to do it individually for each source, and how much does it cost approximately ?

I'm guessing if you put all this data into Kinesis or message queues it would end up costing quite a bit more.

nostrademons4y ago

There are probably unified services that let you do it - I was kinda competing in this area but didn't want to deal with enterprise sales, and it's a bit of a hard sell anyway.

If you do it individually, there are public developer docs for each exchange that explain how their API works. It's generally free as long as you're not making a large number of active trades.

meltedcapacitor4y ago

Never heard of a crypto exchange that charges for data feeds, the norm is free and fast. One of the positive of the industry compared to old school finance.

They're rent seeking in other ways though, no worries.

arjie4y ago

How do I get in touch with you? Definitely using more resources than this to process fewer integrations. I’m curious what trade offs you made to enable this.

If you’re anywhere in the US, let me know.

rattray4y ago

What were the most important aspects of the technology stack which enabled that?

aoms4y ago

How much bandwidth did this use daily/monthly?

nostrademons4y ago

I don't remember offhand. I think bandwidth costs averaged about $40-50 out of that ~$200/month.

noduerme4y ago· 19 in thread

When I launched my former bitcoin casino in 2011 (it's gone, but it was a casino where all games, even roulette tables were multiplayer, on a platform built from scratch starting in '08), I handled all web requests through a server in Costa Rica that cost about $6/mo. Where I had a shell corporation for $250/year. Once the front end -- the bullet containing the entire casino code, about 250kb -- loaded, from Costa Rica, and once a user logged in, they were socketed to a server that handled the gaming action in the Isle of Man. Graphics and sounds were still sent from the Costa Rica server. I didn't have a gaming license in the IoM, though - that was around $400k to acquire legally. So I found a former IoM MP who was a lawyer, who wrote a letter to the IoM gov't stating that we didn't perform random calculations on their server, thus weren't a gambling site under IoM law. Technically that meant that no dice roll or card shuffle leading to a gambling win or loss took place on that server. So the IoM server handled the socketed user interactions, chat, hand rotation and tournament stuff. Also the bitcoin daemon and deposits/withdrawals. But to avoid casino licensing, I then set up a VPS in Switzerland that did only one thing: Return random numbers or shuffled decks of cards, with its own RNG. It was a quick backend cUrl call that would return a fresh deck or a dice roll, for any given random interaction on the casino servers. The IoM server would call the Swiss server every time a hand was dealt or a wheel was spun; the user was still looking at a site served from a cheap web host in Costa Rica. And thus... yeah, I guess I handled millions of requests a day over a $6 webserver, if you want to count it that way.

j1elo4y ago

Focusing on a small but important detail that some have already mentioned but with a more aggressive tone... was your "loophole" system tested in an actual litigation at any point?

What I mean is that this:

> The IoM server would call the Swiss server every time a hand was dealt

might seem like a clever loophole around the laws in IoM, but in reality it sounds to me like the kind of technicalities that wouldn't really pass the reasoning of a human judge, who in their duty of interpreting the law and its intended spirit, would probably consider this an invalid trick and thus that the RNG of the system still resided in IoM, even if technically it didn't.

But of course, none of this matters if the casino never had any legal battle to fight where this idea could be tested in court, which is the equivalent of not being "caught".

noduerme4y ago

It was never legally tested. It was what I felt I had to do such that the randomness didn't take place on the island. And no randomness ever did. I was in touch with a lot of officers of the large casinos operating out of there at the time, who were curious but skeptical about Bitcoin. I think by the time they realized it was a potentially valuable thing, I had already shut down operations, because I wasn't willing to chase the market into legally gray areas.

1 more reply

seanalltogether4y ago

Yeah, that's like setting up a casino in one location with a permanent phone line open to switzerland to ask where the ball landed on the roulette wheels. Doesn't seem like it would hold up under scrutiny.

1 more reply

amelius4y ago

> in reality it sounds to me like the kind of technicalities that wouldn't really pass the reasoning of a human judge

Yes, only big companies can successfully "hack" the law based on its letter, see e.g. tax evasion.

mvc4y ago

He got an MP on his side which seems to be all it takes these days in the UK.

3 more replies

amelius4y ago

It would be even cooler if you derived the random numbers from a lava lamp visible through a window of the Swiss embassy.

hdjjhhvvhga4y ago

> we didn't perform random calculations on their server, thus weren't a gambling site under IoM law.

Does it really matter if you get your random number from /dev/urandom or a server in Switzerland?

withinboredom4y ago

I don’t think it’d hold up in court. This is the same as saying: “I bet you ten bucks when I mail my friend in Switzerland, he will mail back a ten.” The random part is being done in another country, but the betting is still being done wherever you and your friend are.

1 more reply

the_absurdist4y ago

Why doesn't it matter?

bcrosby954y ago

Why go through IoM at all? Why not Costa Rica -> Switzerland, rather than Costa Rica -> IoM -> Switzerland?

noduerme4y ago

Good question. The original plan was to take normal payment methods (Visa/MasterCard) but it became apparent after Bush passed the ban on online poker that Costa Rica was going to follow suit (or that visa/mc would soon start holding up payments from CR casinos... in which case we might be stuck with debt we couldn't use to pay winnings). Setting up a CR bank acct as a shell requires you to hand over power of attorney to a CR citizen, and also given how shady the entire corporate structure there was and the legal outfit we hired (who thought we could take $100 deposits as payments through their fake real estate portal) I evaluated other routes. These included landing funds in Cyprus and processing through Israeli banks at a 10% markup, and other shady sounding things. I had begun to give up on my little side code project when Bitcoin showed up. The benefit of the Isle of Man was that all funds could be landed there - in a Bitcoin wallet, on a caged dedicated server - without triggering any other financial issues. the only trouble was randomness and gambling.

1 more reply

oliv__4y ago

I would love to read a blog post about this. Hell I'd read a book: sounds like there might've been much more to this venture than you're letting on

arcticbull4y ago

Very crafty. Nice. I'm legitimately impressed by every part of that.

wiz21c4y ago

Does this article count as "Cost Rica leaks ?"

noduerme4y ago

yeah, I'm blowing my HN cover.

ajnin4y ago

Thanks for sharing that, I hope it does not land you into trouble. So, how many billions did you make with that site ?

Gepsens4y ago

You're a fricking genius.

elcomet4y ago

He's a frickin genius because he found ways to break the law with impunity for his own profit ?

I'm surprised by HN sometimes.

10 more replies

noduerme4y ago

Sarcasm much? I'm a full blown idiot. I was just solving one problem at a time.

1 more reply

fbrchps4y ago· 17 in thread

I understand your excitement for being able to handle a decent amount of requests on such a small server, but just like many other websites that get on the frontpage of HN, your site is taking multiple seconds to load for me, depending on when I refresh.

As you said in your post, adding caching to your site increased your throughput by ~20% (or +10/req/sec). What you and other sites seem to lack is a more distributed caching, a la CloudFlare, S3 CloudFront, Azure CDN, etc. Those last two only really work well for a static site, however as mentioned in your post that's essentially what you're serving.

While I'm all for having a free-as-in-freedom hosting solution and keeping things lean, the internet is a fickle beast, and nothing looks worse for a company who posts on HN when their technology-oriented site can't handle a few thousand requests per minute. (Or in this case, when a blog claims to handle 4.2M requests a day -- 2.9k req/min)

floren4y ago

Looks fine over here, and he doesn't have to route through a fucking Internet gatekeeper like Cloudflare or Amazon... let's enjoy this golden era before Chrome starts flagging any site which isn't fronted by a "reputable" cache like Cloudflare, Amazon, or whatever Google decides to introduce.

fbrchps4y ago

It would also be possible for OP to spin up their own Redis cache, and have multiple POPs near their target audience, and handle DoS type attacks against their site if need be, and easily be able to brush aside bot traffic, and...

Not all the above apply to a hobby-blog style site, but I wasn't referring only to OP's site in my original comment. I understand that not everyone needs to feed into "fucking Internet gatekeeper"s as you described, but the fact that they provide valuable services is undeniable. They make a complex operation -- one that could mean the difference between a company being able to sell their product or not -- simple.

1 more reply

ezfe4y ago

Oh I hate Google as much as the next guy, but that's not something they've shown any interest in doing.

2 more replies

remram4y ago

Google Cloud CDN? https://cloud.google.com/cdn/

1 more reply

blfr4y ago

your site is taking multiple seconds to load for me, depending on when I refresh.

Barely over a second here. Much better than vast majority of "webscale" services.

fbrchps4y ago

For sure, OP's site is handling this much better than most. And like I said, it's not every time that it takes multiple seconds. Some websites featured on HN/Reddit don't load at all when under load. However I was able to get it to take ~30s to load multiple times, over a period of around 10 minutes.

1 more reply

jeroenhd4y ago

Are you sure that's related to the server itself? The page loads instantly for me and the DNS is still resolving to an OVH IP address.

Timing info from Firefox: Blocked: 0ms DNS resolution: 8ms Connecting: 9ms TLS setup: 12ms Sending: 0ms Waiting: 30ms

The very last resource (favicon.ico) loaded after 466ms and that's mostly because of the other files being requested only after the CSS has come in (after about 195ms). All in all the entire site (without the Matomo tracking JS) loaded in half a second.

Maybe the website has switched hosts in the last ten minutes, I guess, but I doubt it. I think this is more likely to be a problem related to distance to origin and saturation of the underlying connection.

mark_mcnally_jeOP4y ago

Nope, website has not changed at all - speaking of the Matomo tracking; that is running on a separate server and has actually crashed!

mark_mcnally_jeOP4y ago

Yeah if I was building a business website I would want distributed caching/a CDN, mainly to support spikes, like what is happening now!

fbrchps4y ago

Working in the space, that's one of the more frustrating things to see on HN/Reddit/etc. It's not a complex or niche thing, and especially for sites that only make profit when people can actually visit them, it's kind of a necessity to stay up as much as possible.

(Obviously the sales thing doesn't apply to OP)

1 more reply

spicybright4y ago

Working extremely fast for me right now. Your post was about 20 minutes ago. I don't know how HN traffic fluctuates, but it seems really solid compared to most sites.

tiew9Vii4y ago

It sounds like they are caching database queries.

> Parts of the blog posts are cached using memcached for 10 mins

That means Django needs to accept the request, route it, pull the data from memcached, render the template.

For such a site I'd just set the `Cache-Control` headers and stick Varnish in-front of it acting as a reverse proxy. That'd likely increase the page load times significantly and make the backend simpler not worrying about manually caching in memcached and just setting the correct `Cache-Control` http header.

As it's budget hosting i'd probably not even bother with Varnish and outsource that to Cloudflares generous free tier, it's cheating as your server (Origin) isn't doing 4.2m requests but the practicality is really convenient.

_fizz_buzz_4y ago

Weird, this site loads really fast for me actually. Much faster than most sites that I visit.

mark_mcnally_jeOP4y ago

Thanks! Just goes to show that it runs quickly even when it hits #1 on HN ;)

1 more reply

umvi4y ago

Loaded instantly for me

crazy_horse4y ago

200 rps is not great.

fridif4y ago

Wikipedia as a whole has 8k rps, and that's with multiple racks in multiple data centers.

I haven't read recently, but they were only doing 200 rps per server.

polote4y ago· 16 in thread

What's the point of this post? OP is serving a file at 50req/sec. There is not even mention of a dB query. How is that able to relate to any kind of normal app?

I guess that the post was written as an answer to the mangadex post [1]. Mangadex was handling 3k req/sec involving dB queries. It was not just a cached Html page.

50req/sec for a Html file is super low which shows that a $4 month server cant do much actually. So yes this is enough for a blog, but a lot of websites are not blogs

[1] https://news.ycombinator.com/item?id=28440742

throwdecro4y ago

> How is that able to relate to any kind of normal app?

There's too much competition involved in writing normal apps, which often attract significant investment that bootstrapped startups struggle to compete with.

It's interesting to see what kind of performance is possible for next to no money, when you throw out basic assumptions like using a database, and then start thinking about what you could build out of it.

quickthrower24y ago

My recent submission of HNBadges was made like this. It's just 3 files (html, css, JS) which I hosted for free on Netlify, but could have been hosted on a setup like OP. I used other services for XHR requests. I imagine it got a tonne of traffic from being on the first page, but I wasn't taking metrics.

Another example of clever use of resources is the https://haveibeenpwned.com/ website. Using a bloom filter (I think) to turn what could have been a back-end lookup into a "front end lookup" by requesting a small file from the server based on the password hash.

The only issue I have with the OP is his assumption that you'd get a nice smooth 60 request/second throughout the day! Most likely will be lumpy, and in the top of the lumpy periods (where most of your visitors visit) performance will be bad.

wyager4y ago

My $5/mo server can handle several thousand requests per second. It’s mostly a question of what server software you use. If you use some node, python, ruby thing, it’s going to be slow as shit and need a reverse proxy in front of it. If you use a fast compiled language with a good framework, you can rip through requests no problem.

I tried a bunch of different stuff and ended up using Haskell - all of its popular web libraries are fast as hell. Go was fast but its standard library leaked sockets or I was not cleaning up connections properly or something, and it would tank whenever something went viral. All the popular interpreted language backend I tried were absurdly slow, like tens of RPS.

Source for my current thing is at http://yager.io/Server.hs. It also does all my RSS stuff, image processing for my photo gallery, etc.

pdimitar4y ago

I'm curious if you evaluated Rust?

1 more reply

vymague4y ago

> What's the point of this post? OP is serving a file at 50req/sec.

I'd guess a response to the mangadex thread? https://news.ycombinator.com/item?id=28440742

SPBS4y ago

No way, OP is essentially serving a static webpage from a database. That's nothing to brag about if comparing with mangadex.

Arch-TK4y ago

>There is not even mention of a dB query.

Did you read the post?

wokwokwok4y ago

> These benchmarks show that a very cheap server can easily handle 50 requests a minute to a "full stack" website.

I did, and all I see is someone spinning some numbers idly, like, hey, if I can lay 1 brick every second, then with 20000 people we can build a house in one second! So good!

a) entirely and totally lacking in experience running a heavy load website.

b) 50 requests a minute is so atrociously bad, it’s not even worth talking about.

c) there isnt any db load going on here, this is a full page single table query. See https://docs.djangoproject.com/en/3.2/ref/contrib/flatpages/

Sure maybe a db exists, but it’s not relevant when you compare this to the complexity of doing write operations.

Ie. this is some hiiiigh level arm chair commentary right here.

Sure, they’re just talking about their website, but anyone going “oh yeah, look at this, those mangadex guys should learn a thing or two and run it on django”. …has no idea what they’re talking about.

2 more replies

latexr4y ago

From the guidelines[1]:

> Please don't comment on whether someone read an article. "Did you even read the article? It mentions that" can be shortened to "The article mentions that."

[1]: https://news.ycombinator.com/newsguidelines.html

great-potential4y ago

Exactly, I dont see the point bragging about this nevertheless posting about it on HN ...

1 more reply

vietvu4y ago

People just want to show their works, it's normal. What I found strange is that not many people seems to be surprised about this and upvote.

gaptoothclan4y ago

Is this just an apache bench mark

napworth4y ago

It's called boasting

sillycube4y ago

Boasting of what? Serving static sites doesn't even need a server.

Use apache to serve Django + wsgi? Just use Django asgi and nginx and you will get a higher number.

TruthWillHurt4y ago

+1. I remember modest VPS/Parrallels serving PHP at 350r/s

quickthrower24y ago

Something like:

    <?php echo("this is a benchmark") ?>

markandrewj4y ago· 8 in thread

Normally benchmarks for things like this are measured in how many concurrent requests can be handled, i.e the C10K problem, not by how many requests you are able to serve in a day. It's also well known that you can serve a large amount of requests on limited hardware.

https://en.wikipedia.org/wiki/C10k_problem

"By the early 2010s millions of connections on a single commodity 1U rackmount server became possible: over 2 million connections (WhatsApp, 24 cores, using Erlang on FreeBSD),[6][7] 10–12 million connections (MigratoryData, 12 cores, using Java on Linux).[5][8]"

Although I do understand the boxes listed above have more resources then the VPS you are using. I am also not criticizing your write up, or results, bench-marking is in general interesting to do. I just wanted to provide some additional information.

spyder4y ago

Yes, handling 50 requests per second doesn't mean the server can handle 4.2 million a day. That can only happen if they are uniformly distributed throughout the day, which isn't the case for most website traffic.

habibur4y ago

Right. I calculated what 5m/day converts into. And it's like 60 req/sec. Considering non even distribution and spikes, I would assume its like 200req/sec.

adtac4y ago

unrelated but 4x increase isn't really a spike

ijidak4y ago

How does a single server run millions of active connections?

Wouldn't you run out of TCP sockets?

What am I missing?

dreyfan4y ago

A connection isn't just a dest_port, it's the unique combination of 4 components: source_ip:source_port:dest_ip:dest_port

1 more reply

jfrunyon4y ago

I would guess they may be muxed over fewer sockets, by their LBs, but that's not strictly necessary.

I'm not sure exactly what you mean by "run out of TCP sockets", but theoretically speaking, the only limitation is how much memory is available to store the necessary info about the socket (like address/protocol info and process info).

In practice, OS's do have a "max socket" or "max FD" limit, but that's usually configurable and (with enough RAM) could easily be set to "millions".

1 more reply

remram4y ago

On my default Ubuntu install, the hard limit for open files of a process is 1048576 (ulimit -Hn). So you you have to run a handful of processes.

arthurcolle4y ago

socket multiplexing

idworks14y ago· 7 in thread

One of my proudest moment in my career is when I lowered our app processing time from ~8hrs to 17 minutes. When I deployed my first update, it reduced it to 2 hours. The sysadmin immediately contacted me that there was something unusual. I confirmed the results but he was skeptical.

Then with my second update, he told me that the app must be broken or that the script must be dying. There is no way it could complete this fast.

What was the issue? We processed terabytes of data. Each and every single line processed created a new connection to the database and left it hanging. A try catch was added when the connections failed and restarted the process. Removing the connection from the for loop and properly handling it reduced the time drastically.

And... why would you loop through millions of records when you can use batches? Also this was a phperlbashton* script. I turned it into a single PHP script and called it a day.

As a consequence, backup time was reduced to 2 hours as opposed to 12 hours (no one was allowed on the website until the back up was done).

Modern machines are incredibly fast.

* PHP/Perl/Bash/Python

globular-toast4y ago

I have a similar story where I reduced the memory used by a script from 1TB (yes, TB) to a few megabytes. The runtime was massively reduced too, from something like 1 day to a few minutes.

This was for a genomics project and they ran it on a supercomputer. When I looked into it, they were reading the entire input into a giant array before doing one pass and dumping the result out to disk. I made a tiny change (it was a Perl script) to make it stream the I/O instead.

This is the most extreme example I've come across of people using computing power just because it's there. Nobody questioned why the script took so long to run because the data really was in the TBs and other stuff also took that long to run. Waiting a day for the results was considered normal. I see the same thing on desktop apps etc., on a much smaller scale, of course. When I run an electron app it takes several hundred milleseconds to do anything at all. But nobody questions whether it should because everything takes several hundred milliseconds.

BatteryMountain4y ago

Mine was when I rewrote some application code to be processed within postgres to send a few thousand sms's (the logic to decide who should get them was the slow part + amount of data involved). It went from about 45 minutes to less than 1 minute. Was an amazing feeling to see the sms table filled up with the correct data - I also thought something was broken since we just accepted that those runs were normally supposed to be slow.

nicbou4y ago

I've done something like that at my last university internship. I wrote about it here: https://nicolasbouliane.com/projects/pratt-whitney-redesign

It's the same story as yours, but with human effort. I was about to cut the human out entirely, and fix a ton of errors in the process.

darylteo4y ago

I had a similar experience as a intern years ago! Part of my daily responsibility was to manually enable/disable API integrators who had increased levels of traffic as it would take the API down for all partners. Pretty bad. Thousands of requests at peak would bring the server to its knees.

Until I worked out during some minor maintenance task that every request was logged to a flat file. Appended. Every request. The file was probably 100gb by the time I found it and every request log would lock the logging file. The server had been running for a couple of years by that time.

Of course I screwed up more than I fixed. :D

devortel4y ago

> no one was allowed on the website until the back up was done

I'm assuming this was an internal website and backups were scheduled for evenings/weekends?

nicbou4y ago

I've done something like that at my last university internship. I wrote about it here: https://nicolasbouliane.com/projects/pratt-whitney-redesign

It's the same story as yours, but with human effort. I was about to cut the human out entirely.

vmception4y ago

and then you were fired because your job was dependent on this taking forever

tristor4y ago· 4 in thread

The comments in this thread surprise me quite a lot. I suppose it shouldn't, but it does. This post + the response calls to the surface how badly basic system operations knowledge is needed in the industry and how much of it is missing from the toolkit of most developers.

planet-and-halo4y ago

Any recommended reading? I've bought a few things in that vein but I'm self-taught and always looking to improve on the ops/perf side.

tristor4y ago

I don't have any one complete book that I can recommend, and I don't even really have a great reading list for this. But I'll make an attempt to share what I think is useful as a starting point.

1. Systems Operations is first and foremost about understanding systems, in all of their complexity, which means understanding the internals of your OS primarily.

2. Performance and networking, in particular, are super important areas to focus on understanding when it comes to learning the topic to help with software development.

3. A lot of it is about understanding concepts in abstract and being able to extrapolate to other situations and apply these concepts, so there's actually quite a lot of useful information that can be learned on one OS and still applied to another OS (or on one game engine and applied to another, et al).

Here's a few books I think are worth reading, not in any particular order of prevalence, but loosely categorized

Databases:

High Performance MySQL: https://www.amazon.com/gp/product/1449314287/

SQL Queries for Mere Mortals: https://www.amazon.com/gp/product/0321992474/

The Art of SQL: https://www.amazon.com/gp/product/0596008945/

Networking:

TCP/IP Illustrated: https://www.amazon.com/exec/obidos/ISBN=0201633469/wrichards... (updates on author's site at http://www.kohala.com/start/tcpipiv1.html)

The TCP/IP Guide: https://www.amazon.com/TCP-Guide-Comprehensive-Illustrated-P...

UNIX Network Programming: https://www.amazon.com/dp/0131411551

Beej's Guide to Network Programming: http://beej.us/guide/bgnet/

Operating Systems:

Operating Systems Concepts: https://www.amazon.com/Operating-System-Concepts-Abraham-Sil... (various editions, I have the 7th edition... I recommend you find the latest)

Modern Operating Systems: https://www.amazon.com/Modern-Operating-Systems-Andrew-Tanen... (the "Tanenbaum Book")

Operating Systems Design and Implementation: https://www.amazon.com/Operating-Systems-Design-Implementat-... (the other one, the "MINIX Book")

Windows Internals:

Part 1: https://www.amazon.com/Windows-Internals-Part-architecture-m...

Part 2: https://www.amazon.com/Windows-Internals-Part-2-7th/dp/01354... (I had the pleasure of being taught from this book by Mark Russinovich and David Solomon at a previous employer, was an amazing class and these books are incredible resources even applied outside of Windows, we used 5th edition, I linked 7th, which has the 2nd part pending publication).

MacOS Internals:

Part 1: https://www.amazon.com/MacOS-iOS-Internals-User-Mode/dp/0991...

Part 2: https://www.amazon.com/MacOS-iOS-Internals-II-Kernel/dp/0991...

Part 3: https://www.amazon.com/MacOS-iOS-Internals-III-Insecurity/dp...

Linux Kernel Programming:

Part 1: https://www.amazon.com/Linux-Kernel-Development-Cookbook-pro...

Part 2: https://www.amazon.com/Linux-Kernel-Programming-Part-Synchro...

The Linux Programming Interface: https://www.amazon.com/Linux-Programming-Interface-System-Ha...

General Systems Administration:

Essential Systems Administration: https://www.amazon.com/gp/product/0596003439/

UNIX and Linux Systems Administration Handbook: https://www.amazon.com/UNIX-Linux-System-Administration-Hand...

The Linux Command Line and Shell Scripting Bible: https://www.amazon.com/Linux-Command-Shell-Scripting-Bible/d...

UNIX Shell Programming: https://www.amazon.com/Unix-Shell-Programming-Stephen-Kochan...

BASH Hackers Wiki: https://wiki.bash-hackers.org/

TLDP Advanced BASH Scripting Guide: https://tldp.org/LDP/abs/html/

The Debian Administrator's Handbook: https://debian-handbook.info/browse/stable/

TLDP Linux System Administrator's Guide: https://tldp.org/LDP/sag/html/index.html

Performance & Benchmarking:

Systems Performance: https://www.amazon.com/Systems-Performance-Brendan-Gregg-dp-... (this is Brendan Gregg's book where you learn about the magic of dtrace)

BPF Performance Tools: https://www.amazon.com/Performance-Tools-Addison-Wesley-Prof... (the newer Brendan Gregg book about BPF, stellar)

The Art of Computer Systems Performance Analysis: https://www.cse.wustl.edu/~jain/books/perfbook.htm (no longer available from Amazon, but is available direct from publisher. This is basically the one book you should read about creating and structuring benchmarks or performance tests)

I guess that's a "reading list", but this is just a small part of what you need to know to excel in systems operations.

I would say for the typical software developer writing web applications, the most important thing to know is how databases work and how networking works, since these are going to be the primary items affecting your application performance. But there's obviously topics not included in this list that are also worth understanding, such as browser/DOM internals, how caching and CDNs work, and web-specific optimizations that can be achievable with HTTP/2 or QUIC.

For the average software developer writing desktop applications, I'd say make sure you /really/ understand OS internals... at the base everything you do on a computer system is based on what the OS provides to you. Even though you are abstracted (possibly many layers) away from this, being able to peel back the layers and understand what's /really/ happening is essential to writing high-quality application code that is performant and secure, as well as making you a champ at debugging issues.

If you're trying to get into systems operations as a field, this is just a brush over the top surface and there's a lot deeper diving required.

1 more reply

vymague4y ago

> basic system operations knowledge

Perhaps you can suggest a book or roadmap to learn it?

tristor4y ago

I replied to your sibling comment with something approaching a book list.

ppeetteerr4y ago· 3 in thread

How did this make it to the number two spot on HackerNews?

dreyfan4y ago

Broadly speaking people on HN have no clue how to setup a performant httpd/app server and are impressed by abysmal performance/cost metrics like this or the MangaDex post. Everything these days is obscured through multiple layers of SaaS offerings and unnecessary bloat like kubernetes.

~10k rps (it was concurrent connections but close enough) was state of the art in 1999. Now 22 years later ~50 rps is somehow impressive.

5 more replies

ok_coo4y ago

IMHO, it's better than a deluge of political posts.

I don't mind reading about politics but I come to HN to read about tech. We can go elsewhere to get whatever politics we desire.

wpietri4y ago

An awful lot of professional programmers work in such heavyweight contexts that they don't have a good idea of how fast modern hardware can be.

I was talking with an architect at a bank whose team was having trouble getting under a 2-second maximum for page views. They blamed it on having to make TCP requests to other services, and said something like "at a couple hundred milliseconds per request, it adds up quickly!" My head nearly exploded at that. I spun up some quick tests in AWS to show exactly how many requests one could make in 2000 ms. I don't have the numbers handy, but the number is very large.

This junky slice of a server handling full page requests in 20 ms is a fine example to counter thinking that's endemic in enterprise spaces.

2 more replies

bob10294y ago· 2 in thread

All these "X requests per unit time" posts are starting to make me want to break out some of my experimental code... I have some services that can process several million events per second. This includes: compressing the event batch, persisting to disk, validation of business logic, execution of all view updates (state tracked server-side), aggregation and distribution of client update events, etc. These implementations are easily capable of saturating NVMe flash.

If you want to see where the theoretical limits lie, check out some of the fringe work around the LMAX Disruptor and .NET/C#:

https://medium.com/@ocoanet/improving-net-disruptor-performa...

You will find the upper bound of serialized processing to be somewhere around 500 million events per second.

Personally, I have not pushed much beyond 7 million per second, but I also use reference types, non-ideal allocation strategies, etc.

For making this a web-friendly thing: The trick I have found is to establish a websocket with your clients, and then pipe all of their events down with DOM updates coming up the other way. These 2 streams are entirely decoupled by way of the ringbuffer and a novel update/event strategy. This is how you can chew through insane numbers of events per unit time. All client events get thrown into a gigantic bucket which gets dumped into the CPU furnace in perfectly-sized chunks. The latency added by this approach is measured in hundreds of microseconds to maybe a millisecond. The more complex the client interactions (i.e. more events per unit time), the better this works. Blazor was the original inspiration for this. I may share my implementation at some point in the near future.

mariushn4y ago

> The trick I have found is to establish a websocket with your clients, and then pipe all of their events down with DOM updates coming up the other way. These 2 streams are entirely decoupled by way of the ringbuffer and a novel update/event strategy.

Could you detail this, please? I don't get it. What is the flow?

1. Browser is sending events to web server via web socket, instantly as the event is occurring (?)

2. ? (what exactly does the server do?)

bob10294y ago

You got 1 correct. Everything that happens gets sent immediately as an event to the server (e.g. KeyDownEvent). These are pushed without blocking for a response to each - The websocket guarantees delivery and ordering.

Upon receiving an event from the client socket, it is immediately inserted into the LMAX ring buffer for processing.

Updates to the client are triggered by events+state determining when a redraw is required and issuing a special "ClientRedraw" event into the same queue. These events are grouped by client so that we can aggregate multiple potential updates in a single actual redraw. These result in view updates being pushed back down to the relevant clients. One performance trick here is that the client redraw is dispatched asynchronously from the server, so there is no blocking on processing the subsequent batches each time.

You can think of an E2E client view update as always requiring 2 events - the client event that triggered the change to domain state, and the actual redraw event(s) that result. For applications where the client should update at a fixed interval (e.g. game), a high performance timer implementation injects periodic redraw events. Because the upper bound of the ring buffer latency is around a millisecond, this allows for incredibly low jitter on real time events. Scheduling client draws as simple domain events is feasible.

1 more reply

johnklos4y ago· 2 in thread

This is a good, simple way to show how much can be done with modest resources.

Sometimes we see people fetishizing bigger and faster, then gatekeeping when people want to do the same work with modest means, whether it a four quid a month hosting service or a first generation Raspberry Pi. Not everyone has the money or desire for bigger & faster, and it's nice to see that here.

mark_mcnally_jeOP4y ago

That's why I made this post, was very happy to see how much it could theoretically handle.

danjac4y ago

I'm not sure it's so much about fetishizing, and more about realistic expectations around software development in larger teams.

If you are the sole developer working on your own site - be it a side project/hobby/labour of love or your source of income - you have complete control up and down the stack and have the leeway to tweak performance wherever needed - whether that's indexing and optimizing queries in the backend, reducing the size of your static assets, caching, whatever. You can even yank whole features if you feel their inherent complexity and load outweighs their usefulness.

In anything including and above a medium sized company, a single developer will rarely have the leeway to do anything beyond tinker with their small slice of the stack. They might spend some hours carefully optimizing a query, but it's for naught because the frontend team have screwed up the webpack settings and the JS load runs into many MB. Or you have both done your jobs but the PM wants a ton of analytics on every page. And the CEO's pet feature is a maintenance and performance nightmare but nobody has the clout to have it removed or even simplified. Nobody wants to waste sprints on paying down tech debt in a feature factory, so it becomes progressively harder to fix performance issues.

At that point, the cheaper and politically easier option is to just fire the money cannon at expensive cloud services and hope the extra spend squeezes out some performance gains.

jonplackett4y ago· 2 in thread

#1 on HN and still up. That speaks for itself.

mark_mcnally_jeOP4y ago

jonplackett4y ago

Are your starting to feel the pressure yet?

1 more reply

stef254y ago· 2 in thread

And here I am building a simple API using Lumen (Laravel's stripped down, hopefully faster cousin) getting response times that are just abysmal.

The raw queries themselves are fast enough, but for some reason running them in a framework, transforming them in to a Resource and dumping it as json takes so long that I'm scared to find out what this super popular framework is even doing under the hood.

Once I learn enough Python I'd like to compare its performance to something like FastAPI. But even that probably won't come near what these recent posts are describing.

(Disclaimer - it's just a side project and I haven't really looked in to making it faster)

Kiro4y ago

That's strange. Laravel should be able to handle thousands of requests a second even on the cheapest hardware.

euoia4y ago

In my experience with Laravel, it is often the transformation to JSON that takes some time for large responses.

yupper324y ago· 2 in thread

Well the front page of HN won't get you 4.2M views today, but it's a pretty good real world test!

mark_mcnally_jeOP4y ago

It is! It's crashed my analytics but the website seems to be doing fine.

jonplackett4y ago

True, but it may very well get you a lot more than 50 a minute.

privacyonsec4y ago· 2 in thread

My 1cpu 2gb ram server: - nginx - slack bot behind Django - OpenVPN - Minecraft server (20 players) - tunneling (reverse proxy for local dev)

whatsthatabout4y ago

You're running an 20 players mc server with that hardware? How? In my experience Minecraft-servers are insanely resource hungry, especially whilst generating new parts of the map or larger red stone contraptions. Back when I played around with hosting servers I used the "most optimized" mc-server fork called Tuinity - and even with that I had to allocate way more cpu and ram to the vms then u use. Would love to hear about your setup.

mktk10014y ago

Press X to doubt.

great-potential4y ago· 2 in thread

Dont mean to be the negative Joe but you dont need a webserver if you're serving a static-able website.

nayuki4y ago

You mean use someone else's web server instead?

1 more reply

bruce3434344y ago

How are you supposed to _serve_ that static _web_site then?

modeless4y ago· 2 in thread

You don't need £4. As long as you can structure your site as a set of static files with interactivity done client side, even if some of the files change every minute or two, you can serve everything for $0 with Cloudflare in front of any free host. I've served 1M pageviews a day for $0 with Cloudflare + App Engine free tier and there's no reason it wouldn't scale to 100M or beyond.

jnieminen4y ago

You could also use Github pages or Cloudflare pages to host.

1 more reply

goodpoint4y ago

No thanks. Cloudflare is very harmful to Tor and to user's freedom and privacy in general.

janmo4y ago· 1 in thread

There is a difference between being able to handle 4.2M requests a day, and handling 4.2M requests per day.

Visitors don't come neatly one after the other. You might only have 1M requests a day but get random spikes with 100 requests at the same time.

bArray4y ago

Very true. It also makes a difference as to which resource is being pulled, whether it is cached, what transport is being requested (SSL, compression, etc).

I really suspect the website would fall long before it hits anything close to 4.2 million requests (which the author also seems to except).

That all said - long live tiny web servers!

Rd6n64y ago· 1 in thread

Re: benchmarking, sometimes the bottleneck is the machine or server that issues the requests, not the receiver that you are testing. To figure out your actual capacity, you sometimes need multiple request servers or a more powerful request server. This was the case for a project I did a few years ago. Not a critique of the blog post, just remembering something out loud

His site, https://peepopoll.com/, took about 10s to load for me. It’s also good to chart other metrics like response times while you benchmark. Requests per second isn’t the same as a low response time

hu34y ago

Indeed. Recently a client needed to bench raw req/s processing power of their application server and I had to ask for a powerful server running on the same DC in order to discard any potential routing issues.

danbrooks4y ago· 1 in thread

I hosted draftsim.com on a $3/month hosting plan for a few years.

We served 500GB of data the first month.

I imagine that the hosting company lost money on us (but they never called to complain).

flobosg4y ago

Just wanted to thank you for draftsim.com! As someone who got into MTG Arena a few months ago it has been very useful to learn some basics.

MangoCoffee4y ago· 1 in thread

this story remind me, the dot com bubble. dotcom companies bought servers from Sun Microsystem. they needs to handle the large traffics that "PC" server can't handle.

anyone remember Cobalt server?

https://en.wikipedia.org/wiki/Cobalt_Networks#/media/File:Co...

sgt4y ago

I remember many stories and discussions on Slashdot about it.

5faulker4y ago· 1 in thread

That's a good spec for 4 bucks. With cloud hosting you might be able to push the cost down a bit with less CPU resource and memory.

mark_mcnally_jeOP4y ago

I am cloud hosting using OVH cloud, I brought the server a couple of years ago now so they probably have some better specs for the price I am paying.

sigg34y ago· 1 in thread

Put Wordpress on it, and do a new battery of TTFB tests ;)

celsoazevedo4y ago

Even WordPress would work fine if we use a plugin like WP Super Cache (no idea why they don't cache things by default). It wouldn't beat a simple static page, but WordPress + Cache plugin + cheap VPS can easily handle #1 on HN.

olingern4y ago· 1 in thread

I might get downvoted for not getting on the 2000s style of development bandwagon, but do you really need a web server to serve static text?

bellyfullofbac4y ago

What do you suggest, Gopher?

4 more replies

ComputerGuru4y ago

Absolutely. Even a terrible Wordpress instance can be beautifully (and transparently!) cached behind either nginx or varnish with ease, in which case you’re just serving static html pages and can probably handle any traffic you are likely to ever get.

louwrentius4y ago

I'm hosting my static blog site on a physical Raspberry Pi 3B+ powered by Solar [1].

That blog post got hugged by HN but it didn't even raise the CPU above 10% on a single core.

And a Raspberry Pi 3B+ is dog slow. And severely limited by bandwidth, unlike the Raspberry Pi 4B+. (But it uses less power so that's why I use a 3B+).

However I have another point to make. Professional rack-mount servers from HP and Dell can be had second hand for dirt cheap and you get a ton of CPU (20+ cores) and an ocean of RAM for next to nothing.

For many applications, an old Gen8 or similar Dell server will perform more than adequately. Even more so if you have a little bit more to spend on Gen9.

They are so cheap that you can like buy four to eight, sprinkle them across two different datacenters and even if one breaks, you won't be in any hurry.

[1]: https://louwrentius.com/this-blog-is-now-running-on-solar-po...

reilly30004y ago

That estimate can be verified with load testing systems like Artillery. My theory is that things would break far sooner than estimated along the following lines:

- Too many WSGI connections if the timeouts aren’t tweaked

- Too many database connections, especially without caching and tuning

- on the Apache side if MaxRequestWorkers isn’t set there will be memory issues with 1GB RAM

- the disk could easily hit IOPS limits, especially if there is a noisy neighbor

It’s not likely all or any of these things will hit IRL, but that all depends on traffic and usage patterns. It matters not, if you were getting 4.2 M requests each day you’d be in the Alexa Top 1000 and could probably shell out for the $8 server :)

seandong4y ago

All these examples show even more why software engineer(not developer) is a discipline where 10x salary difference can be seen between the best and the worst. For the 10x you are paying, you are getting a (n^2 - logN) times performance gain, especially when you are dealing with problems with large amount of data.

However, relying on people themselves is often not the best stable solution. I am wondering if all these N^2 mistakes people made can be prevented by innovative means like language features, framework improvements, tooling and etc. And I'm talking about prevention, not the post mortem perf measure and fix kind

Ice_cream_suit4y ago

In contrast, the Australian 2016 National Census crashed and burned since the system could not cope with over 250 requests a second.

Parliamentary enquiry:(PDF) https://www.aph.gov.au/DocumentStore.ashx?id=0a7f6bd5-8716-4...

https://www.zdnet.com/article/census-2016-among-worst-it-deb...

https://www.theguardian.com/australia-news/2016/aug/10/compu...

jaymzcampbell4y ago

I'm not sure there was ever an argument saying otherwise. The ease of processing X million requests is heavily dependant on what those requests actually do. Trivial use cases shouldn't be a surprise to have high throughput.

erdo4y ago

Working on other companies' mobile apps, about half the performance problems I've discovered have been down to some accidental crazy, like initialising something you only need once, in a loop, in 3 different places (because the code has become so unnecessarily complicated that no one really knows what it's doing). The rest are due to some piece of code accidentally blocking the UI thread.

A well written mobile app doesn't really have any need to be sluggish at all, including smooth animations and fast scrolling lists, it was doable 10 years ago, it's doable now. (*I don't know about games).

But unlike on the server side, the accepted wisdom in most places I've worked at is that the answer to the performance problems is: a new framework.

(I feel like this is a lie that developers tell the business side, and maybe themselves. It avoids having to explain that software is hard, sometimes you don't get it right the first time, and if you don't spend time and effort tending to it, it can turn into an ungodly and expensive mess - and that's got nothing to do with the hardware or the framework)

tambourine_man4y ago

> These benchmarks show that a very cheap server can easily handle 50 requests a minute

I think you mean a second, but yeah, old tech is fast.

I find it funny when I read “raw html” emphatically, as if it was akin to writing assembly.

zxcvbn40384y ago

Combined with a CDN and you can do a lot more, I like to think in terms of origin requests rather than raw req/s. The problem I have is getting people to really understand how caching works at the cdn and browser layers and design their frontends and backend responses around that. There is also lot you can do with edge compute to clean the incoming requests before the CDN evaluates them to increase cache hits. Even if you are trying to give a "real time" view of data, caching it for even a second and allowing stale data to be served while it is updating can reduce origin requests significantly. I've seen people hammer sites hundreds of times a second looking for changes that only happen once every fifteen seconds or once per minute - the best thing you can do for yourself is handle all those requests at the CDN level (eventually you'll do log analysis and see the activity and can take other measures, but in the mean time don't let all of those requests go to the origin). Your CDN is probably giving you better rates for network egress then Amazon or Google anyway - the later are more focused on incentivizing you to use their ecosystem exclusively by penalizing you for sending data "outside". Cheap VPS hosts discourage you exceeding your bandwidth allocation because they are overselling their capacity and heavy usage upsets that - so again you want to shift as much as you can to your CDN.

brokencode4y ago

It’s the database part that gets expensive for web applications. Serving up static web pages is absolutely trivial for modern servers.

The database is also the part that doesn’t easily scale, unless you pick a highly scalable database from the outset, and those have their own complexity and tradeoffs as well.

That’s why I believe every project should start with a bulletproof model of how the database will work first, then fill in the other details from there.

It’s not always as easy as picking Postgres and calling it a day, unfortunately.

fabian2k4y ago

50 rps not that much, though of course easily sufficient for many situations. This is also Django which certainly isn't the fastest choice. I played around with it a long time ago and liked it quite a bit, but you don't choose Django for performance but for the other benefits.

I'm really more surprised that static serving is so slow at 180 rps. This should be able to easily saturate the network, statically serving files is very, very fast. From what I see in the blog I doubt that the files are very large, so there is probably some other bottleneck or I'm missing something here.

Wronnay4y ago

If you host it on GitHub Pages, GitLab Pages or Vercel it's even £4 less :o

ignoramous4y ago

I run a FOSS Pi-Hole esque public DoH resolver in the most expensive way possible (over Cloudflare Workers) and it costs $2 for 4M requests. Granted the CPU slice is limited (50ms) but the IO slice (30s) is plenty for our workloads (stub dns-resolution).

The reason this is cheaper in a sense is because Workers deploys globally and needs zero devops. Per our estimates, this setup (for our workload) gets expensive once the request range goes beyond 1.5 billion a month after which deploying to colos worldwide becomes cheaper even with associated cost of devops.

wyager4y ago

When my stuff has been shared on HN and Reddit, I’ve seen peaks of around 500 rps (according to google analytics), so if you can hit 1k rps you’re almost certainly ready to weather any kind of viral sharing that a blog might experience. Several krps on cheap VPS hardware is easy with a good compiled language backend (Haskell’s Warp is a good one). If you use a node/python/ruby/other interpreted backend, you will need aggressive caching through a reverse proxy.

iambozdar4y ago

You have clearly stated in your footnote that if anything goes wrong then the given numbers can go down. To be honest, that's what happens all the time. Developers do something wrong and number of requests just drop down to the floor.

Only static websites are the one which handle large amount of requests at low cost. Web hosting providers don't make money out of those clients, so they run shared plans.

sam0x174y ago

Impressed that it is front page of HN and it isn't "This page cannot be displayed" with that kind of premise. Props.

bullen4y ago

My Java HTTP app. server manages that on a Raspberry 2 at 2 watts serving this reddit clone: http://talk.binarytask.com

Really we need to compare apples to apples (how many watt)!

Most of you have an external IP address, open port 80 and put it to good use before they put you behind a shared IP!

mkl954y ago

Nginx can handle ~250M requests a day out of the box, and ~600M by tuning a few parameters. *

* https://www.cloudbees.com/blog/tuning-nginx

zhuzhu4y ago

My website https://www.v2ph.com

$6 VPS can handle 500,000 requests daily

On this server, I have PHP-fpm workers, nginx and MariaDB

The average CPU usage about 30%, load average is about 0.5

fimdomeio4y ago

Most of the times the more interesting question is not really how to make a server that can handle 4.2M requests a day, but how to make something so useful that it gets more than 100 pageviews a day.

iradik4y ago

"If we can handle 50 requests a second that means we can handle 4.2 million requests a day."

Not really. Real world traffic won't be uniform over one entire day. 50 QPS would be more accurate.

muzani4y ago

I was skeptical that it would fail under real situations (the calculation says it's 50 requests/second). But it looks like reaching the top of HN didn't crash it.

shultays4y ago

The new metrics for server performance is "unique buzzwords per paragraph". If you don't write a 4 page blog post with at least 4-5 ubpp then it is shit.

baby4y ago

I used to be on the top 500 of Alexa with a shared hosting server running PHP. Never had any issue. Mind you, I wasn't doing anything complicated, but still.

danielsamuels4y ago

Can it?

> Service Unavailable

> The server is temporarily unable to service your request due to maintenance downtime or capacity problems. Please try again later.

ksec4y ago

160 Request /s for a No DB Call, Simple File Serving on a Single Core System.

Why am I the only one not impressed by this.

jka4y ago

There are a few comments in here that predictably suggest that simple static sites can handle large request rates easily.

Sure, that's true - but to try to progress the conversation: how would you measure the complexity of serving web requests, in order to perform more advanced cost comparisons?

(bandwidth wouldn't be quite right.. or at least not sufficient - maybe something like I/O, memory and compute resource used?)

gruellan4y ago

Hey! Someone else from Jersey! Nice one

nicoburns4y ago

So 50req/sec. I'd hope it could handle a lot more than that!

fnord774y ago

apache + wsgi isn't even close to being the most performant webapp server software, either. Bet he'd get 5x the performance out of nginx + lua on the same virtual hardware.

taf24y ago

I wonder what the 99th - 95th percentile response times look like…

anaganisk4y ago

But can it run k8s /s

kjgkjhfkjf4y ago

That's less than 50qps. I can't count that low.

KronisLV4y ago

It feels to me like most websites out there could run on way less hardware, if only people would embrace a few things.

#1 Minimalism. You don't need 400 KB of JS to display some mostly text content to your users with some interactivity sprinkled in.

You don't need to reinvent office software, or very rich text editors in browsers, stop using the web as a universal delivery platform/mechanism, because that's not what it was meant for. When browsers will ship integrated dependencies so that even CDNs don't need to be hit (like versions of jQuery, Bootstrap and numerous JS frameworks as well as WASM code like Blazor which contains a .NET runtime), then you'll be able to do that, but arguably that will never happen.

Use the web as a platform for displaying primarily text content with the occasional images, forms and a little bit of interactivity sprinkled in. Most sites out there simply aren't and shouldn't be like this (that said, when you have exceptional reasons for throwing aside that suggestion, do so): https://geargenerator.com

#2 Static content. You don't need to use Wordpress, Drupal, Joomla or many of the other CMSes out there, since they can get really heavyweight with numerous plugins and are not only a security challenge, but are also problematic from a performance perspective.

Consider using static site generators instead. When reading an article of yours, the DB shouldn't even be hit, since most of the article contents are unlikely to change often, so you should be able to pre-render each of the article versions as a set of static HTML and use the common JS/CSS that you already have for the rest of the articles. Furthermore, it's easy to just jump into CMSes and introduce ungodly amounts of complexity, all of which cause your back end to process bunches of code for each request. Static files don't have that drawback.

#3 Caching. Know when and what to cache, and how. Images, JS files, CSS files and even entire HTML pages should be cache friendly. Know which ones aren't, make exceptions for those and cache everything else.

Not only is it not necessary to hit the DB for many of the pages in your site at all, but also sometimes you shouldn't even hit the back end either. The most popular pages of your site should just live in a cache somewhere, be it within your web servers or a separate solution, so that they can be returned instantly. HTML is good for this, use it.

Furthermore, know what cache policies to use. Sometimes even the cache resources shouldn't be redownloaded, if the user already has these resources loaded from a different page. Use bundle splitting responsibly, extract common functionality in easily cacheable bundles and set the appropriate headers.

And yet, i've seen a surprising amount of ignorance in regards to caching, static site generation and even how large webpages have gotten: https://idlewords.com/talks/website_obesity.htm

I don't claim to know it all, but working towards the goal of efficiently using pages should definitely be viewed as an important one: be it because you want to pay less for your infrastructure, or care about the environment, or even just want to manage fewer nodes.

Instead, nowadays far too many orgs just try to be the first to market and ignore the engineering based approach to ensuring that the solutions are not only functional but also sustainable. That saddens me.

_alex_4y ago

tl;dr: a machine with 2 GB of ram that is doing pulling a blog post out of a DB and passing it through Django can do ~50tps

Shadonototro4y ago

i'm running my SaaS for 10 years already, and still using the same $5 plan a month

HTTPS and certificates? i have no clue how to setup that, i use dns from cloudflare and they have it all automatic for free

if your employees are asking you to pay ton of money for your services, hire someone else

throwaway203714y ago

You can also handle 50 requests per second on a 66MHz 486DX2 with 16MB of RAM and a 10Mbit/s network card. Not with modern "I have infinite resources" software, but we used to handle more than that traffic regularly in the early 90s.

pluc4y ago

> Not taking into account any issues that may occur around CPU/RAM/Disk IO due to sustained levels of traffic as well as bandwidth issues

congrats?

welder4y ago

This one's even cheaper:

My £0 a month server can handle 4.2M requests a day [1]

[1] https://ahamlett.com/blog/post/My-%C2%A30-a-month-server-can...

j / k navigate · click thread line to collapse

452 comments

206 comments · 62 top-level

nostrademons4y ago· 47 in thread

People tend to severely underestimate how fast modern machines are and overestimate how much you need to spend on hardware.

mynameisash4y ago

What Andy giveth, Bill taketh away.[0]

I'm hopeful that Ballista[1], for example, will see uptake and improve this.

[0] https://en.wikipedia.org/wiki/Andy_and_Bill%27s_law

[1] https://github.com/apache/arrow-datafusion/tree/master/balli...

Spooky234y ago

I get a kick out of stuff like this - I’m mostly an exec these days, but I recently prototyped a small database system to feed a business process in SQLite on my laptop.

2 more replies

Waterluvian4y ago

What reminded me of this the other day is how MacOS will grow your cursor if you “shake” it to help you find it on a big screen.

I was thinking about how they must have a routine that’s constantly taking mouse input, buffering history, and running some algorithm to determine when user input is a mouse “shake”.

And how many features like this add up to eat up a nontrivial amount of resources.

5 more replies

b9a2cab54y ago

[1]: https://www.vldb.org/pvldb/vol11/p2209-kersten.pdf

1 more reply

SNosTrAnDbLe4y ago

2 more replies

willvarfar4y ago

Yes, the whole idea of sending “agents” to do processing is poor performing and things like snowflake and Trino, where queries go to already deployed code, run rings around it.

Furthermore, pyspark is by far the most popular and used spark, and it’s also got the absolute world-worst atrocious mechanical sympathy. Why?

Developer velocity trumps compute velocity any day?

(I want the niceness of python and the performance of eg firebolt. Why must I pick?)

(There is a general thing to get spark “off heap” and use generic query compute on the spark sql space, but it is miles behind those who start off there)

iaw4y ago

Could you elaborate on other systems besides Ballista? (which looks great btw, thank you for sharing)

1 more reply

danudey4y ago

A lot of that is due to absolutely lousy code.

Eventually I took a look at the code for the page, which queried LDAP for user data and the database for permissions data. It did:

    get list of users
    
    foreach user:
        get list of all permissions
        filter down to the ones assigned directly to the user
    
    foreach user:
        get list of all groups
        foreach group:
            get list of all permissions
            filter down to the ones assigned to the group
        filter down to the ones the user has

I'm no algorithm genius, but I'm pretty sure O(n^2+n^3) is not an efficient one.

I replaced it with

    get list of all users
    get list of all groups
    get list of all permissions

    <filter accordingly>

Suffice to say, it was a lot more responsive.

Note that we also weren't using Django's foreign keys support, so we couldn't even tell Django to "fetch everything non-lazily" because it had no idea.

If that app were written right it could have run on a Raspberry Pi 2, but instead there was no amount of cores that could have sped it up.

benhoyt4y ago

  users = select('SELECT id, name FROM users WHERE id IN $1', user_ids)
  comments = select('SELECT user_id, text FROM comments WHERE user_id IN $1', user_ids)
  comments_by_user_id = defaultdict(list)
  for c in comments:
    comments_by_user_id[c.user_id].append(c)
  for u in users:
    u.comments = comments_by_user_id[u.id]

Only two queries, and O(users + comments).

4 more replies

tra34y ago

This is an example of N+1 problem [0]. It should be a FizzBuzz for anyone doing any CRUD apps.

[0]: https://stackoverflow.com/questions/97197/what-is-the-n1-sel...

JJMcJ4y ago

Your pattern is quite powerful: get data from several sources and do the rearranging on the client (which might be a web server), instead of multiple interactions for each data item.

For SQL you can also do a stored procedure. Sometimes that works well if you are good at your DBMS's procedure language and the schema is good.

1 more reply

chenxiaolong4y ago

I had to fix a similar thing in our internal password reset email sender last year. The code was doing something like:

    for each user in (get_freeipa_users | grep_attribute uid):
        email = (get_freeipa_users | client_side_find user | grep_attribute email)
        last_change = (get_freeipa_users | client_side_find user | grep_attribute krblastpwdchange)
        expiration = (get_freeipa_users | client_side_find user | grep_attribute krbpasswordexpiration)

        # Some slightly incorrect date math...

        send_email

I changed it to a single LDAP query for every user that requests only the needed attributes. It cut that Jenkins job's runtime from 45 minutes to 0.2 seconds.

Thaxll4y ago

Filter in app is rarely the right solution, your data should be organized in a way that you can get what you need in a single query. Reasons:

- it's memory efficient

- it's atomic

- it's faster

Also doesn't LDAP support filtering in query?

1 more reply

VintageCool4y ago

I switched it to only check permissions for the current user, and the page loaded instantaneously.

1 more reply

unclebucknasty4y ago

Not sure what the exact use case was (i.e. the output of the filtering) but—from reading the first algo—seems to be something to do with determining group membership and permissions for a user.

hnbad4y ago

onlyrealcuzzo4y ago

Yeah - you likely want to do this a single simple query - which you can optimize if necessary. an O(N+1) query is bad. An O(N^2) query is something I have rarely seen. Congrats!

codebolt4y ago

I'd also add that groups and permissions are probably constant and can be cached with a long timeout.

afrodc_4y ago

Is there a reason you shelled out with a subprocess versus using a library like ldap3? Just curious

IfOnlyYouKnew4y ago

I believe the parent's point was that code tends to be faster than what people expect, not slower.

1 more reply

blacklion4y ago

Crypto markets are very small :-)

I'm working and company which process "real" exchanges, like NASDAQ, LSE, and, especially, OPRA feed.

We've added 20+ crypto exchanges in our portfolio this year, and all of them are processed on one old server which is unable to process NASDAQ Total View in real-time anymore.

Foomf4y ago

4 more replies

secondcoming4y ago

It depends on what your server does with each request though; '65B a day' means little. If all it does is write it to a log then I'm surprised you're not using a rPI.

joering24y ago

5 more replies

ingas4y ago

In 2001 we started a GSM-operator on Compaq Server (it was before they were bought by HP) with whole 1Gb(!) of RAM and 2x10Gb SCSI disks.

It served up to 70K of subscribers, call center with 30-40 employees, payment systems integration, everything.

Next was 8 socket Intel server. We were never able to saturate it's CPUs - 300 Mhz (or was it 400 ?) bus was a stopper. It served 350-400K of subscribers.

Nowadays: every phone has more power than it was those servers. Typical react application consumes more resources than billing system. Gigabyte here, gigabyte there - nobody counts them.

/grumpy oldster mode

fhood4y ago

nine_k4y ago

The CPU is rarely used up to 100% because most code fails to utilize several cores efficiently.

foobarbazetc4y ago

Yeah. Now that CPUs are insanely powerful and you have NVMe SSDs etc the bottleneck is always memory.

3 more replies

SilverRed4y ago

OneEyedRobot4y ago

I've seen exactly the opposite although you certainly can't ignore memory speed.

Gepsens4y ago

danudey4y ago

> I coded exclusively in Rust

They were often written in Java, ASP.NET, and so on. They were extremely heavyweight. They'd need 8-10 servers for 10k users. They hogged huge amounts of RAM.

I'm never gonna stop writing things in Python, but it still amazes me what can happen when you get down close to the metal.

2 more replies

Andrew_nenakhov4y ago

Probably, Erlang would be a good fit for your task.

2 more replies

giancarlostoro4y ago

is that $30 for a single droplet or is it spread out between a few different services? I'm kind of curious since I use DO for small projects myself.

3 more replies

lewisjoe4y ago

I'm running two different projects on a single instance of cheap VM too. Both of them runs on a not-so-memory-efficient programming runtime, yet the VM handles the load just fine.

Of course, a lot of it depends on what your app does for each request but most apps are simple enough and can live with being a monolith / single fat binary running on a single instance.

The problem with today's DevOps culture is that they present K8's as answers for everything. Instead of defining a clear line on when to use them and when not to.

fizwhiz4y ago

Would you mind describing your stack in more detail? Did you use gRPC with Go?

nostrademons4y ago

2 more replies

milkywaybrain4y ago

Cthulhu_4y ago

Breza4y ago

dirkg4y ago

may be OT, but how do you subscribe to these trade feeds, is there a unified service or do you need to do it individually for each source, and how much does it cost approximately ?

I'm guessing if you put all this data into Kinesis or message queues it would end up costing quite a bit more.

nostrademons4y ago

There are probably unified services that let you do it - I was kinda competing in this area but didn't want to deal with enterprise sales, and it's a bit of a hard sell anyway.

If you do it individually, there are public developer docs for each exchange that explain how their API works. It's generally free as long as you're not making a large number of active trades.

meltedcapacitor4y ago

Never heard of a crypto exchange that charges for data feeds, the norm is free and fast. One of the positive of the industry compared to old school finance.

They're rent seeking in other ways though, no worries.

arjie4y ago

How do I get in touch with you? Definitely using more resources than this to process fewer integrations. I’m curious what trade offs you made to enable this.

If you’re anywhere in the US, let me know.

rattray4y ago

What were the most important aspects of the technology stack which enabled that?

aoms4y ago

How much bandwidth did this use daily/monthly?

nostrademons4y ago

I don't remember offhand. I think bandwidth costs averaged about $40-50 out of that ~$200/month.

noduerme4y ago· 19 in thread

j1elo4y ago

Focusing on a small but important detail that some have already mentioned but with a more aggressive tone... was your "loophole" system tested in an actual litigation at any point?

What I mean is that this:

> The IoM server would call the Swiss server every time a hand was dealt

But of course, none of this matters if the casino never had any legal battle to fight where this idea could be tested in court, which is the equivalent of not being "caught".

noduerme4y ago

1 more reply

seanalltogether4y ago

1 more reply

amelius4y ago

> in reality it sounds to me like the kind of technicalities that wouldn't really pass the reasoning of a human judge

Yes, only big companies can successfully "hack" the law based on its letter, see e.g. tax evasion.

mvc4y ago

He got an MP on his side which seems to be all it takes these days in the UK.

3 more replies

amelius4y ago

It would be even cooler if you derived the random numbers from a lava lamp visible through a window of the Swiss embassy.

hdjjhhvvhga4y ago

> we didn't perform random calculations on their server, thus weren't a gambling site under IoM law.

Does it really matter if you get your random number from /dev/urandom or a server in Switzerland?

withinboredom4y ago

1 more reply

the_absurdist4y ago

Why doesn't it matter?

bcrosby954y ago

Why go through IoM at all? Why not Costa Rica -> Switzerland, rather than Costa Rica -> IoM -> Switzerland?

noduerme4y ago

1 more reply

oliv__4y ago

I would love to read a blog post about this. Hell I'd read a book: sounds like there might've been much more to this venture than you're letting on

arcticbull4y ago

Very crafty. Nice. I'm legitimately impressed by every part of that.

wiz21c4y ago

Does this article count as "Cost Rica leaks ?"

noduerme4y ago

yeah, I'm blowing my HN cover.

ajnin4y ago

Thanks for sharing that, I hope it does not land you into trouble. So, how many billions did you make with that site ?

Gepsens4y ago

You're a fricking genius.

elcomet4y ago

He's a frickin genius because he found ways to break the law with impunity for his own profit ?

I'm surprised by HN sometimes.

10 more replies

noduerme4y ago

Sarcasm much? I'm a full blown idiot. I was just solving one problem at a time.

1 more reply

fbrchps4y ago· 17 in thread

floren4y ago

fbrchps4y ago

1 more reply

ezfe4y ago

Oh I hate Google as much as the next guy, but that's not something they've shown any interest in doing.

2 more replies

remram4y ago

Google Cloud CDN? https://cloud.google.com/cdn/

1 more reply

blfr4y ago

your site is taking multiple seconds to load for me, depending on when I refresh.

Barely over a second here. Much better than vast majority of "webscale" services.

fbrchps4y ago

1 more reply

jeroenhd4y ago

Are you sure that's related to the server itself? The page loads instantly for me and the DNS is still resolving to an OVH IP address.

Timing info from Firefox: Blocked: 0ms DNS resolution: 8ms Connecting: 9ms TLS setup: 12ms Sending: 0ms Waiting: 30ms

mark_mcnally_jeOP4y ago

Nope, website has not changed at all - speaking of the Matomo tracking; that is running on a separate server and has actually crashed!

mark_mcnally_jeOP4y ago

Yeah if I was building a business website I would want distributed caching/a CDN, mainly to support spikes, like what is happening now!

fbrchps4y ago

(Obviously the sales thing doesn't apply to OP)

1 more reply

spicybright4y ago

Working extremely fast for me right now. Your post was about 20 minutes ago. I don't know how HN traffic fluctuates, but it seems really solid compared to most sites.

tiew9Vii4y ago

It sounds like they are caching database queries.

> Parts of the blog posts are cached using memcached for 10 mins

That means Django needs to accept the request, route it, pull the data from memcached, render the template.

_fizz_buzz_4y ago

Weird, this site loads really fast for me actually. Much faster than most sites that I visit.

mark_mcnally_jeOP4y ago

Thanks! Just goes to show that it runs quickly even when it hits #1 on HN ;)

1 more reply

umvi4y ago

Loaded instantly for me

crazy_horse4y ago

200 rps is not great.

fridif4y ago

Wikipedia as a whole has 8k rps, and that's with multiple racks in multiple data centers.

I haven't read recently, but they were only doing 200 rps per server.

polote4y ago· 16 in thread

What's the point of this post? OP is serving a file at 50req/sec. There is not even mention of a dB query. How is that able to relate to any kind of normal app?

I guess that the post was written as an answer to the mangadex post [1]. Mangadex was handling 3k req/sec involving dB queries. It was not just a cached Html page.

50req/sec for a Html file is super low which shows that a $4 month server cant do much actually. So yes this is enough for a blog, but a lot of websites are not blogs

[1] https://news.ycombinator.com/item?id=28440742

throwdecro4y ago

> How is that able to relate to any kind of normal app?

There's too much competition involved in writing normal apps, which often attract significant investment that bootstrapped startups struggle to compete with.

quickthrower24y ago

wyager4y ago

Source for my current thing is at http://yager.io/Server.hs. It also does all my RSS stuff, image processing for my photo gallery, etc.

pdimitar4y ago

I'm curious if you evaluated Rust?

1 more reply

vymague4y ago

> What's the point of this post? OP is serving a file at 50req/sec.

I'd guess a response to the mangadex thread? https://news.ycombinator.com/item?id=28440742

SPBS4y ago

No way, OP is essentially serving a static webpage from a database. That's nothing to brag about if comparing with mangadex.

Arch-TK4y ago

>There is not even mention of a dB query.

Did you read the post?

wokwokwok4y ago

> These benchmarks show that a very cheap server can easily handle 50 requests a minute to a "full stack" website.

I did, and all I see is someone spinning some numbers idly, like, hey, if I can lay 1 brick every second, then with 20000 people we can build a house in one second! So good!

a) entirely and totally lacking in experience running a heavy load website.

b) 50 requests a minute is so atrociously bad, it’s not even worth talking about.

c) there isnt any db load going on here, this is a full page single table query. See https://docs.djangoproject.com/en/3.2/ref/contrib/flatpages/

Sure maybe a db exists, but it’s not relevant when you compare this to the complexity of doing write operations.

Ie. this is some hiiiigh level arm chair commentary right here.

2 more replies

latexr4y ago

From the guidelines[1]:

> Please don't comment on whether someone read an article. "Did you even read the article? It mentions that" can be shortened to "The article mentions that."

[1]: https://news.ycombinator.com/newsguidelines.html

great-potential4y ago

Exactly, I dont see the point bragging about this nevertheless posting about it on HN ...

1 more reply

vietvu4y ago

People just want to show their works, it's normal. What I found strange is that not many people seems to be surprised about this and upvote.

gaptoothclan4y ago

Is this just an apache bench mark

napworth4y ago

It's called boasting

sillycube4y ago

Boasting of what? Serving static sites doesn't even need a server.

Use apache to serve Django + wsgi? Just use Django asgi and nginx and you will get a higher number.

TruthWillHurt4y ago

+1. I remember modest VPS/Parrallels serving PHP at 350r/s

quickthrower24y ago

Something like:

    <?php echo("this is a benchmark") ?>

markandrewj4y ago· 8 in thread

https://en.wikipedia.org/wiki/C10k_problem

spyder4y ago

habibur4y ago

Right. I calculated what 5m/day converts into. And it's like 60 req/sec. Considering non even distribution and spikes, I would assume its like 200req/sec.

adtac4y ago

unrelated but 4x increase isn't really a spike

ijidak4y ago

How does a single server run millions of active connections?

Wouldn't you run out of TCP sockets?

What am I missing?

dreyfan4y ago

A connection isn't just a dest_port, it's the unique combination of 4 components: source_ip:source_port:dest_ip:dest_port

1 more reply

jfrunyon4y ago

I would guess they may be muxed over fewer sockets, by their LBs, but that's not strictly necessary.

In practice, OS's do have a "max socket" or "max FD" limit, but that's usually configurable and (with enough RAM) could easily be set to "millions".

1 more reply

remram4y ago

On my default Ubuntu install, the hard limit for open files of a process is 1048576 (ulimit -Hn). So you you have to run a handful of processes.

arthurcolle4y ago

socket multiplexing

idworks14y ago· 7 in thread

Then with my second update, he told me that the app must be broken or that the script must be dying. There is no way it could complete this fast.

And... why would you loop through millions of records when you can use batches? Also this was a phperlbashton* script. I turned it into a single PHP script and called it a day.

As a consequence, backup time was reduced to 2 hours as opposed to 12 hours (no one was allowed on the website until the back up was done).

Modern machines are incredibly fast.

* PHP/Perl/Bash/Python

globular-toast4y ago

I have a similar story where I reduced the memory used by a script from 1TB (yes, TB) to a few megabytes. The runtime was massively reduced too, from something like 1 day to a few minutes.

BatteryMountain4y ago

nicbou4y ago

I've done something like that at my last university internship. I wrote about it here: https://nicolasbouliane.com/projects/pratt-whitney-redesign

It's the same story as yours, but with human effort. I was about to cut the human out entirely, and fix a ton of errors in the process.

darylteo4y ago

Of course I screwed up more than I fixed. :D

devortel4y ago

> no one was allowed on the website until the back up was done

I'm assuming this was an internal website and backups were scheduled for evenings/weekends?

nicbou4y ago

I've done something like that at my last university internship. I wrote about it here: https://nicolasbouliane.com/projects/pratt-whitney-redesign

It's the same story as yours, but with human effort. I was about to cut the human out entirely.

vmception4y ago

and then you were fired because your job was dependent on this taking forever

tristor4y ago· 4 in thread

planet-and-halo4y ago

Any recommended reading? I've bought a few things in that vein but I'm self-taught and always looking to improve on the ops/perf side.

tristor4y ago

I don't have any one complete book that I can recommend, and I don't even really have a great reading list for this. But I'll make an attempt to share what I think is useful as a starting point.

1. Systems Operations is first and foremost about understanding systems, in all of their complexity, which means understanding the internals of your OS primarily.

2. Performance and networking, in particular, are super important areas to focus on understanding when it comes to learning the topic to help with software development.

Here's a few books I think are worth reading, not in any particular order of prevalence, but loosely categorized

Databases:

High Performance MySQL: https://www.amazon.com/gp/product/1449314287/

SQL Queries for Mere Mortals: https://www.amazon.com/gp/product/0321992474/

The Art of SQL: https://www.amazon.com/gp/product/0596008945/

Networking:

TCP/IP Illustrated: https://www.amazon.com/exec/obidos/ISBN=0201633469/wrichards... (updates on author's site at http://www.kohala.com/start/tcpipiv1.html)

The TCP/IP Guide: https://www.amazon.com/TCP-Guide-Comprehensive-Illustrated-P...

UNIX Network Programming: https://www.amazon.com/dp/0131411551

Beej's Guide to Network Programming: http://beej.us/guide/bgnet/

Operating Systems:

Operating Systems Concepts: https://www.amazon.com/Operating-System-Concepts-Abraham-Sil... (various editions, I have the 7th edition... I recommend you find the latest)

Modern Operating Systems: https://www.amazon.com/Modern-Operating-Systems-Andrew-Tanen... (the "Tanenbaum Book")

Operating Systems Design and Implementation: https://www.amazon.com/Operating-Systems-Design-Implementat-... (the other one, the "MINIX Book")

Windows Internals:

Part 1: https://www.amazon.com/Windows-Internals-Part-architecture-m...

MacOS Internals:

Part 1: https://www.amazon.com/MacOS-iOS-Internals-User-Mode/dp/0991...

Part 2: https://www.amazon.com/MacOS-iOS-Internals-II-Kernel/dp/0991...

Part 3: https://www.amazon.com/MacOS-iOS-Internals-III-Insecurity/dp...

Linux Kernel Programming:

Part 1: https://www.amazon.com/Linux-Kernel-Development-Cookbook-pro...

Part 2: https://www.amazon.com/Linux-Kernel-Programming-Part-Synchro...

The Linux Programming Interface: https://www.amazon.com/Linux-Programming-Interface-System-Ha...

General Systems Administration:

Essential Systems Administration: https://www.amazon.com/gp/product/0596003439/

UNIX and Linux Systems Administration Handbook: https://www.amazon.com/UNIX-Linux-System-Administration-Hand...

The Linux Command Line and Shell Scripting Bible: https://www.amazon.com/Linux-Command-Shell-Scripting-Bible/d...

UNIX Shell Programming: https://www.amazon.com/Unix-Shell-Programming-Stephen-Kochan...

BASH Hackers Wiki: https://wiki.bash-hackers.org/

TLDP Advanced BASH Scripting Guide: https://tldp.org/LDP/abs/html/

The Debian Administrator's Handbook: https://debian-handbook.info/browse/stable/

TLDP Linux System Administrator's Guide: https://tldp.org/LDP/sag/html/index.html

Performance & Benchmarking:

Systems Performance: https://www.amazon.com/Systems-Performance-Brendan-Gregg-dp-... (this is Brendan Gregg's book where you learn about the magic of dtrace)

BPF Performance Tools: https://www.amazon.com/Performance-Tools-Addison-Wesley-Prof... (the newer Brendan Gregg book about BPF, stellar)

I guess that's a "reading list", but this is just a small part of what you need to know to excel in systems operations.

If you're trying to get into systems operations as a field, this is just a brush over the top surface and there's a lot deeper diving required.

1 more reply

vymague4y ago

> basic system operations knowledge

Perhaps you can suggest a book or roadmap to learn it?

tristor4y ago

I replied to your sibling comment with something approaching a book list.

ppeetteerr4y ago· 3 in thread

How did this make it to the number two spot on HackerNews?

dreyfan4y ago

~10k rps (it was concurrent connections but close enough) was state of the art in 1999. Now 22 years later ~50 rps is somehow impressive.

5 more replies

ok_coo4y ago

IMHO, it's better than a deluge of political posts.

I don't mind reading about politics but I come to HN to read about tech. We can go elsewhere to get whatever politics we desire.

wpietri4y ago

An awful lot of professional programmers work in such heavyweight contexts that they don't have a good idea of how fast modern hardware can be.

This junky slice of a server handling full page requests in 20 ms is a fine example to counter thinking that's endemic in enterprise spaces.

2 more replies

bob10294y ago· 2 in thread

If you want to see where the theoretical limits lie, check out some of the fringe work around the LMAX Disruptor and .NET/C#:

https://medium.com/@ocoanet/improving-net-disruptor-performa...

You will find the upper bound of serialized processing to be somewhere around 500 million events per second.

Personally, I have not pushed much beyond 7 million per second, but I also use reference types, non-ideal allocation strategies, etc.

mariushn4y ago

Could you detail this, please? I don't get it. What is the flow?

1. Browser is sending events to web server via web socket, instantly as the event is occurring (?)

2. ? (what exactly does the server do?)

bob10294y ago

Upon receiving an event from the client socket, it is immediately inserted into the LMAX ring buffer for processing.

1 more reply

johnklos4y ago· 2 in thread

This is a good, simple way to show how much can be done with modest resources.

mark_mcnally_jeOP4y ago

That's why I made this post, was very happy to see how much it could theoretically handle.

danjac4y ago

I'm not sure it's so much about fetishizing, and more about realistic expectations around software development in larger teams.

At that point, the cheaper and politically easier option is to just fire the money cannon at expensive cloud services and hope the extra spend squeezes out some performance gains.

jonplackett4y ago· 2 in thread

#1 on HN and still up. That speaks for itself.

mark_mcnally_jeOP4y ago

jonplackett4y ago

Are your starting to feel the pressure yet?

1 more reply

stef254y ago· 2 in thread

And here I am building a simple API using Lumen (Laravel's stripped down, hopefully faster cousin) getting response times that are just abysmal.

Once I learn enough Python I'd like to compare its performance to something like FastAPI. But even that probably won't come near what these recent posts are describing.

(Disclaimer - it's just a side project and I haven't really looked in to making it faster)

Kiro4y ago

That's strange. Laravel should be able to handle thousands of requests a second even on the cheapest hardware.

euoia4y ago

In my experience with Laravel, it is often the transformation to JSON that takes some time for large responses.

yupper324y ago· 2 in thread

Well the front page of HN won't get you 4.2M views today, but it's a pretty good real world test!

mark_mcnally_jeOP4y ago

It is! It's crashed my analytics but the website seems to be doing fine.

jonplackett4y ago

True, but it may very well get you a lot more than 50 a minute.

privacyonsec4y ago· 2 in thread

My 1cpu 2gb ram server: - nginx - slack bot behind Django - OpenVPN - Minecraft server (20 players) - tunneling (reverse proxy for local dev)

whatsthatabout4y ago

mktk10014y ago

Press X to doubt.

great-potential4y ago· 2 in thread

Dont mean to be the negative Joe but you dont need a webserver if you're serving a static-able website.

nayuki4y ago

You mean use someone else's web server instead?

1 more reply

bruce3434344y ago

How are you supposed to _serve_ that static _web_site then?

modeless4y ago· 2 in thread

jnieminen4y ago

You could also use Github pages or Cloudflare pages to host.

1 more reply

goodpoint4y ago

No thanks. Cloudflare is very harmful to Tor and to user's freedom and privacy in general.

janmo4y ago· 1 in thread

There is a difference between being able to handle 4.2M requests a day, and handling 4.2M requests per day.

Visitors don't come neatly one after the other. You might only have 1M requests a day but get random spikes with 100 requests at the same time.

bArray4y ago

Very true. It also makes a difference as to which resource is being pulled, whether it is cached, what transport is being requested (SSL, compression, etc).

I really suspect the website would fall long before it hits anything close to 4.2 million requests (which the author also seems to except).

That all said - long live tiny web servers!

Rd6n64y ago· 1 in thread

hu34y ago

danbrooks4y ago· 1 in thread

I hosted draftsim.com on a $3/month hosting plan for a few years.

We served 500GB of data the first month.

I imagine that the hosting company lost money on us (but they never called to complain).

flobosg4y ago

Just wanted to thank you for draftsim.com! As someone who got into MTG Arena a few months ago it has been very useful to learn some basics.

MangoCoffee4y ago· 1 in thread

this story remind me, the dot com bubble. dotcom companies bought servers from Sun Microsystem. they needs to handle the large traffics that "PC" server can't handle.

anyone remember Cobalt server?

https://en.wikipedia.org/wiki/Cobalt_Networks#/media/File:Co...

sgt4y ago

I remember many stories and discussions on Slashdot about it.

5faulker4y ago· 1 in thread

That's a good spec for 4 bucks. With cloud hosting you might be able to push the cost down a bit with less CPU resource and memory.

mark_mcnally_jeOP4y ago

I am cloud hosting using OVH cloud, I brought the server a couple of years ago now so they probably have some better specs for the price I am paying.

sigg34y ago· 1 in thread

Put Wordpress on it, and do a new battery of TTFB tests ;)

celsoazevedo4y ago

olingern4y ago· 1 in thread

I might get downvoted for not getting on the 2000s style of development bandwagon, but do you really need a web server to serve static text?

bellyfullofbac4y ago

What do you suggest, Gopher?

4 more replies

ComputerGuru4y ago

louwrentius4y ago

I'm hosting my static blog site on a physical Raspberry Pi 3B+ powered by Solar [1].

That blog post got hugged by HN but it didn't even raise the CPU above 10% on a single core.

And a Raspberry Pi 3B+ is dog slow. And severely limited by bandwidth, unlike the Raspberry Pi 4B+. (But it uses less power so that's why I use a 3B+).

For many applications, an old Gen8 or similar Dell server will perform more than adequately. Even more so if you have a little bit more to spend on Gen9.

They are so cheap that you can like buy four to eight, sprinkle them across two different datacenters and even if one breaks, you won't be in any hurry.

[1]: https://louwrentius.com/this-blog-is-now-running-on-solar-po...

reilly30004y ago

That estimate can be verified with load testing systems like Artillery. My theory is that things would break far sooner than estimated along the following lines:

- Too many WSGI connections if the timeouts aren’t tweaked

- Too many database connections, especially without caching and tuning

- on the Apache side if MaxRequestWorkers isn’t set there will be memory issues with 1GB RAM

- the disk could easily hit IOPS limits, especially if there is a noisy neighbor

seandong4y ago

Ice_cream_suit4y ago

In contrast, the Australian 2016 National Census crashed and burned since the system could not cope with over 250 requests a second.

Parliamentary enquiry:(PDF) https://www.aph.gov.au/DocumentStore.ashx?id=0a7f6bd5-8716-4...

https://www.zdnet.com/article/census-2016-among-worst-it-deb...

https://www.theguardian.com/australia-news/2016/aug/10/compu...

jaymzcampbell4y ago

erdo4y ago

But unlike on the server side, the accepted wisdom in most places I've worked at is that the answer to the performance problems is: a new framework.

tambourine_man4y ago

> These benchmarks show that a very cheap server can easily handle 50 requests a minute

I think you mean a second, but yeah, old tech is fast.

I find it funny when I read “raw html” emphatically, as if it was akin to writing assembly.

zxcvbn40384y ago

brokencode4y ago

It’s the database part that gets expensive for web applications. Serving up static web pages is absolutely trivial for modern servers.

The database is also the part that doesn’t easily scale, unless you pick a highly scalable database from the outset, and those have their own complexity and tradeoffs as well.

That’s why I believe every project should start with a bulletproof model of how the database will work first, then fill in the other details from there.

It’s not always as easy as picking Postgres and calling it a day, unfortunately.

fabian2k4y ago

Wronnay4y ago

If you host it on GitHub Pages, GitLab Pages or Vercel it's even £4 less :o

ignoramous4y ago

wyager4y ago

iambozdar4y ago

Only static websites are the one which handle large amount of requests at low cost. Web hosting providers don't make money out of those clients, so they run shared plans.

sam0x174y ago

Impressed that it is front page of HN and it isn't "This page cannot be displayed" with that kind of premise. Props.

bullen4y ago

My Java HTTP app. server manages that on a Raspberry 2 at 2 watts serving this reddit clone: http://talk.binarytask.com

Really we need to compare apples to apples (how many watt)!

Most of you have an external IP address, open port 80 and put it to good use before they put you behind a shared IP!

mkl954y ago

Nginx can handle ~250M requests a day out of the box, and ~600M by tuning a few parameters. *

* https://www.cloudbees.com/blog/tuning-nginx

zhuzhu4y ago

My website https://www.v2ph.com

$6 VPS can handle 500,000 requests daily

On this server, I have PHP-fpm workers, nginx and MariaDB

The average CPU usage about 30%, load average is about 0.5

fimdomeio4y ago

Most of the times the more interesting question is not really how to make a server that can handle 4.2M requests a day, but how to make something so useful that it gets more than 100 pageviews a day.

iradik4y ago

"If we can handle 50 requests a second that means we can handle 4.2 million requests a day."

Not really. Real world traffic won't be uniform over one entire day. 50 QPS would be more accurate.

muzani4y ago

I was skeptical that it would fail under real situations (the calculation says it's 50 requests/second). But it looks like reaching the top of HN didn't crash it.

shultays4y ago

The new metrics for server performance is "unique buzzwords per paragraph". If you don't write a 4 page blog post with at least 4-5 ubpp then it is shit.

baby4y ago

I used to be on the top 500 of Alexa with a shared hosting server running PHP. Never had any issue. Mind you, I wasn't doing anything complicated, but still.

danielsamuels4y ago

Can it?

> Service Unavailable

> The server is temporarily unable to service your request due to maintenance downtime or capacity problems. Please try again later.

ksec4y ago

160 Request /s for a No DB Call, Simple File Serving on a Single Core System.

Why am I the only one not impressed by this.

jka4y ago

There are a few comments in here that predictably suggest that simple static sites can handle large request rates easily.

Sure, that's true - but to try to progress the conversation: how would you measure the complexity of serving web requests, in order to perform more advanced cost comparisons?

(bandwidth wouldn't be quite right.. or at least not sufficient - maybe something like I/O, memory and compute resource used?)

gruellan4y ago

Hey! Someone else from Jersey! Nice one

nicoburns4y ago

So 50req/sec. I'd hope it could handle a lot more than that!

fnord774y ago

apache + wsgi isn't even close to being the most performant webapp server software, either. Bet he'd get 5x the performance out of nginx + lua on the same virtual hardware.

taf24y ago

I wonder what the 99th - 95th percentile response times look like…

anaganisk4y ago

But can it run k8s /s

kjgkjhfkjf4y ago

That's less than 50qps. I can't count that low.

KronisLV4y ago

It feels to me like most websites out there could run on way less hardware, if only people would embrace a few things.

#1 Minimalism. You don't need 400 KB of JS to display some mostly text content to your users with some interactivity sprinkled in.

And yet, i've seen a surprising amount of ignorance in regards to caching, static site generation and even how large webpages have gotten: https://idlewords.com/talks/website_obesity.htm

_alex_4y ago

tl;dr: a machine with 2 GB of ram that is doing pulling a blog post out of a DB and passing it through Django can do ~50tps

Shadonototro4y ago

i'm running my SaaS for 10 years already, and still using the same $5 plan a month

HTTPS and certificates? i have no clue how to setup that, i use dns from cloudflare and they have it all automatic for free

if your employees are asking you to pay ton of money for your services, hire someone else

throwaway203714y ago

pluc4y ago

> Not taking into account any issues that may occur around CPU/RAM/Disk IO due to sustained levels of traffic as well as bandwidth issues

congrats?

welder4y ago

This one's even cheaper:

My £0 a month server can handle 4.2M requests a day [1]

[1] https://ahamlett.com/blog/post/My-%C2%A30-a-month-server-can...

j / k navigate · click thread line to collapse