Saving cloud costs by writing our own database

(hivekit.io)

211 points · wolframhempel · 2y ago · 160 comments

RHSeeger2y ago· 35 in thread

> we’ve replaced our $10k/month Aurora instances with a $200/month Elastic Block Storage (EBS) volume.

Without any intent to insult what you've done (because the information is interesting and the writeup is well done)... how do the numbers work out when you account for actually implementing and maintaining the database?

- Developer(s) time to initially implement it

- PjM/PM time to organize initial build

- Developer(s) time for maintenance (fix bugs and enhancement requirements)

- PjM/PM time to organize maintenance

The cost of someone to maintain the actual "service" (independent of the development of it) is, I assume, either similar or lower, so there's probably a win there. I'm assuming you have someone on board that was in charge of making sure Aurora was configured / being used correctly; and it would be just as easy, if not easier, to do the same for your custom database.

The cost of $120,000/year for Aurora seems like it would be less than the cost of development/organization time for the custom database.

Note: It's clear you have other reasons for needing your custom database. I get that. I was just curious about the costs.

g9yuayon2y ago

> PjM/PM time to organize initial build

This sounds like what a big company or a disorganized company would need. For an efficient enough company, a project like this needs just one or two dedicated engineers.

In fact, I can't imagine why this project needs a PM at all. The database is used by engineers and is built by engineers. Engineers should be their own PMs. It's like saying we need a PM for a programming language; but no, the compiler writer must be the language designer and must use the language. Those who do not use a product or do not have in-depth knowledge of the domain should not be the PM of the product.

vannevar2y ago

>For an efficient enough company, a project like this needs just one or two dedicated engineers.

Maybe for a research project or a hobby project, but not for a real, high performance database to be used in a business-critical application.

FTA:

"Databases are a nightmare to write, from Atomicity, Consistency, Isolation, and Durability (ACID) requirements to sharding to fault recovery to administration - everything is hard beyond belief."

>Engineers should be their own PMs.

For small projects, sure (your "one or two dedicated engineers"). But once you start tackling projects that require larger teams, or even teams of teams, you need someone to track and prioritize the work remaining and the work in progress (as well as the corresponding budgets for personnel, services, and other resources). Similar to the way a sole proprietor can do their own accounting, but a multi-million dollar business probably should have an accountant.

As an aside, I wonder if this might be a use case for a bitmap db engine like Featurebase (https://www.featurebase.com/).

delusional2y ago

> what we’ve built is just a cursor streaming a binary file feed with a very limited set of functionality - but then again, it’s the exact functionality we need and we didn’t lose any features.

The trick is that they didn't need a database that provides "Atomicity, Consistency, Isolation, and Durability (ACID)". By only implementing what they need they were able to keep the project small.

It's like people are scared of doing anything without making it into some huge multi hundred developer effort. They've written a super simple append only document store. It's not rocket science. It's not a general purpose arbitrary SQL database.

cortesoft2y ago

> a project like this needs just one or two dedicated engineers.

So that is at least 20k a month, for fairly cheap engineers.

yosef1232y ago

I doubt he meant « two engineers full time », I suspect it’s more like a role of maintaining / improving when necessary.

intelVISA2y ago

1-2 days work is 20k a month? This place hiring?? Even Google wanted me to work minimum 3 days a month...

vineyardmike2y ago

> In fact, I can't imagine why this project needs a PM at all. The database is used by engineers and is built by engineers. Engineers should be their own PMs.

What about when two different projects have two different requirements they need supported by the database. Which one is implemented first? What about if there is only engineering capacity to implement one?

I don’t think a database is the place for “just send a PR for adding your required feature and ping the team that owns it” kind of development. It requires research, planning, architecture review, testing, etc. It’s not a hobby project, it’s a critical tool for the business.

delusional2y ago

> Which one is implemented first?

One of them. This is true whether you have a person named "PM" or not. It's just a matter of who picks.

> What about if there is only engineering capacity to implement one?

How does naming some guy "PM" solve the issue? The team just picks one of the features.

preommr2y ago

I feel like the word "database" is throwing people off because they're comparing it with something like MySQL/Postgres, when this seems only slightly more complex than a k/v store written to a file, with some other indexing, where data integrity is a low priority. That shouldn't take too much time and should be fairly isolated on the tech side, so it needs little involvement from product/project managers.

hmottestad2y ago

A k/v store typically is really fast at looking up the value based on the key. So there are usually some pretty advanced indexes involved.

arandomusername2y ago

or a simple b-tree...

ozr2y ago

For a k/v store, a hash table will take you very, very far.
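To make the hash-table point concrete, here is a minimal sketch of a k/v store that appends records to a file and keeps an in-memory `dict` from key to value offset. The class name `LogKV` and the record layout (two little-endian length fields, then key, then value) are assumptions made up for this illustration; nothing here is taken from the article, and torn or partial writes are not handled.

```python
import os
import struct
import tempfile


class LogKV:
    """Append-only file plus an in-memory hash index (key -> value offset)."""

    _HEADER = struct.Struct("<II")  # key length, value length (hypothetical layout)

    def __init__(self, path):
        self._index = {}
        self._file = open(path, "a+b")  # writes always go to the end of the file
        self._rebuild_index()

    def _rebuild_index(self):
        # Replay the whole file once at startup; later records win.
        self._file.seek(0)
        while True:
            header = self._file.read(self._HEADER.size)
            if len(header) < self._HEADER.size:
                break
            klen, vlen = self._HEADER.unpack(header)
            key = self._file.read(klen)
            self._index[key] = (self._file.tell(), vlen)
            self._file.seek(vlen, os.SEEK_CUR)

    def put(self, key: bytes, value: bytes) -> None:
        self._file.seek(0, os.SEEK_END)
        self._file.write(self._HEADER.pack(len(key), len(value)))
        self._file.write(key)
        offset = self._file.tell()  # where the value bytes start
        self._file.write(value)
        self._file.flush()
        self._index[key] = (offset, len(value))

    def get(self, key: bytes):
        entry = self._index.get(key)
        if entry is None:
            return None
        offset, vlen = entry
        self._file.seek(offset)
        return self._file.read(vlen)

    def close(self) -> None:
        self._file.close()


if __name__ == "__main__":
    path = os.path.join(tempfile.mkdtemp(), "kv.log")
    db = LogKV(path)
    db.put(b"device-1", b"52.52,13.40")
    print(db.get(b"device-1"))
    db.close()
```

Lookups cost one dictionary probe plus one seek and read. The trade-off is that every key has to fit in memory and startup replays the whole file.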

paulddraper2y ago

80/20

rdtsc2y ago

> The cost of 120,000/year for Aurora seems like it would be less than the cost of development/organization time for the custom database

Only if they planned on hiring someone just to develop this new database, and if they switched to Aurora they’d let them go immediately. If said developer was already costing them $250k to maintain and develop the application and work on top of Aurora, cutting the Aurora cost seems like a good way to save $100k/year.

organsnyder2y ago

There’s the opportunity cost of whatever else they could have been paying that developer to work on.

rdtsc2y ago

True. Also, to your point, one could argue that if that developer leaves, they'd have an easier time hiring anyone with Aurora experience as opposed to someone to learn and maintain the custom database.

But at the same time, Aurora costs could also scale with usage. It may cost $120k one year, $180k next year, $500k the year after. If the database they have now is well designed after it's already built it may not need active development every year but adding a feature here and there. Also, switching back to Aurora could also be an opportunity cost "we should have written our own thing and could have saved millions ...".

addicted2y ago

Well, considering the cost is lower than Aurora, isn’t the opportunity cost in favor of the home-built solution?

grogenaut2y ago

Agreed, a developer that can pull this off is pretty good, if maybe distracted by shiny objects, what could they do working on the actual product instead of this technological terror?

ReflectedImage2y ago

Well presumably they need only 1/3 of a developer to do this and they intend to scale up 10x in the next 5 years.

$60,000 per year in-house vs $1,200,000 per year aurora. No brainer really.

andai2y ago

Also worth mentioning that it's 150x faster.

cortesoft2y ago

It's $120,000 a year for Aurora, not $1,200,000.

b1122y ago

What has changed though, is that we’ve replaced our $10k/month Aurora instances with a $200/month Elastic Block Storage (EBS) volume.

Note 'instances', i.e. plural, versus a singular EBS volume. There is some ambiguity here; I'm not sure where the 10x came from, but it seems plausible.

Spivak2y ago

I think for this kind of thing their needs are so simple and well-suited to a bespoke implementation that it probably paid for itself in less than 4 months. This doesn't seem like a db implementation that's going to need dedicated maintenance.

They're operationally using a funny spelling of SQLite and I don't imagine anyone arguing that such a thing needs constant attention.

deedasmi2y ago

Don’t forget this is a largely one time cost vs Aurora, which scales cost with usage.

Also they said their current volume is around 13k/second. They’ve built the new platform for 30k/sec per node. This should last them a long time with minimal maintenance.

kalleboo2y ago

Using Aurora is also not free in terms of cost of development. Developers need to be trained on its implementation, features, constraints, performance implications, keeping up with API changes, etc.

mbrumlow2y ago

The problem is all those people you listed would still exist plus the 120k bill to Amazon.

They may or may not be doing other things depending on the company size and state.

You could drop the PM, engineers writing for engineers don’t need a PM.

You will likely hire similarly costed engineers to maintain the database stack anyways.

You basically hit all the talking points big cloud has brainwashed people into thinking are true, but every day we see stories of a handful of engineers doing something we are told can’t be done and saving millions in cloud cost.

It’s so painful to watch. Software engineering became a thing because you could hire an engineer to solve your problem, and big businesses stepped in, told you that your problem was something else, and went out of their way to stifle innovation by setting industry standards on how to do things that only guarantee you use cloud services.

Any company that wants to own its destiny knows to stay away from lock-in.

samatman2y ago

I would imagine, as someone with no special insight into goings-on at Hivekit, that the answer is intended scale.

They mention 13.5k simultaneous connections. The US has 4.2 million tractors alone, just the US, just tractors. If they get 10% of those tractors on the network that's a 30x to their data storage needs. So multiply that across the entire planet, and all the use cases they hope to serve.
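The "30x" figure can be checked with a few lines of arithmetic. This sketch uses only the numbers quoted in the comment and assumes, as the comment implicitly does, that storage needs grow linearly with the number of connected vehicles.

```python
US_TRACTORS = 4_200_000       # figure quoted in the comment
ADOPTION = 0.10               # "10% of those tractors"
CURRENT_CONNECTIONS = 13_500  # "13.5k simultaneous connections"

connected = US_TRACTORS * ADOPTION
growth = connected / CURRENT_CONNECTIONS  # assumes linear scaling of storage

print(f"{connected:,.0f} tractors is about {growth:.1f}x current connections")
```

That works out to roughly 31x, in line with the "30x" estimate.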

Investing time early on so that they can store 50x data-per-dollar is almost certainly time well spent.

kdazzle2y ago

Presumably those tractors wouldn't be connecting directly to the db though. Not sure why they don't just go the standard IoT events route: store data in a data lake and propagate into an analytics db/warehouse from there. Add a layer to make recent events available immediately.

S3 is relatively cheap.

donohoe2y ago

I came here to ask the same question.

If this db requires 1 full-time developer then the cost would immediately be not worth it (assuming salary + benefits > $120k/yr)

As you say, without details it’s hard to know if this was a good idea.

solatic2y ago

I actually disagree with you here. There are costs above and beyond the engineer's effect on the balance sheet. There's the partial salary of management to manage them, plus asking them to document their work and train others so that the database won't have a bus factor of 1. So in well-run engineering departments, there's no such thing as paying for a "single" engineering salary. You have teams; a team maintains the system and it has a pre-existing workload.

A large part of the value of popular platforms is precisely that they are not bespoke. You can hire engineers with MySQL/Postgres experience. You cannot hire engineers who already have experience with your bespoke systems.

bilsbie2y ago

Shouldn’t we up our standard developer cost for inflation?

That barely qualifies for the median mortgage in the US.

sophacles2y ago

They said 'instances' (plural), so that number should be at least $240k

exe342y ago

> PjM/PM

What do you need them for?

thehappypm2y ago

To stop the engineers from spending too much time optimizing the database and to focus on customers

exe342y ago

So now every change takes 10x as long because you need to explain it to somebody whose bonus depends on it not being done? I love how the solution to too much management is always more management.

yau8edq12i2y ago· 17 in thread

Wasn't this already discussed here yesterday? The main criticism of the article is that they didn't write a database, they wrote an append-only log system with limited query capabilities. Which is fine. But it's not a "database" in the sense that someone would understand when reading the title.

throwaway634672y ago

Why isn’t that a database? In my understanding a DB needs to be able to store structured data and retrieve it, so not sure what’s missing here? Many modern DBs are effectively append only logs with compaction and some indexing on top as well as a query engine, so personally I don’t think it’s weird to call this a DB.

didgetmaster2y ago

I agree. Is there an industry accepted definition of what a system must do before it can be called a database?

I also wrote a KV system to keep track of metadata (tags) for an object store I invented. I discovered that it could also be used to create relational tables and perform fast queries against them without needing separate indexes.

I started calling it a database and many people complained that I was misusing the term because it can't yet do everything that Postgres, MySQL, or SQLite can do.

josephg2y ago

Sounds like a database to me.

Databases have a long history that reaches back much further than the modern, full featured SQL databases we have today. What you built sounds like it would fit in well amongst the non-sql databases of the world, like Berkeleydb, indexeddb, mongo, redis, and so on.

yau8edq12i2y ago

Don't be absurd. By your standard, cat, grep and a file form a database. Sure, if you interpret literally what a database is, that fits. But once again, it's not what people have in mind when they read "we cut cloud costs by writing our own database".

com2kid2y ago

File systems are databases. Different file systems choose different trade offs, different indexing strategies, etc.

Git is also a database. I got into this argument with someone when I proposed using Github as a database to store configuration entries. Our requirements included needing the ability to review changes before they went live, and the ability to easily undo changes to the config. If your requirements for a DB include those two things, Github is a damn good database platform! (Azure even has built in support for storing configurations in Github!)

swiftcoder2y ago

cat + grep absolutely constitute a database (and it's probably in use in production somewhere). No need to gatekeep the concept of a database

halayli2y ago

Agreed. Database in this context is expected to be a DBMS or equivalent.

hmottestad2y ago

I don’t know what point you are really trying to make. At uni the DBMS that everyone learns in their database course is an SQL database. The database part is technically just a binary file, but it’s not what people usually mean when they say they need a database for their project. Just like a search engine doesn’t have to be anything more than indexOf and a big text file. It’s just not very useful to think of it like that.

Symbiote2y ago

You're describing a relational database management system, which is a specific type of software implementing a specific type of database.

superq2y ago

It's difficult to be pedantic about an ambiguous term like database without additional qualification or specificity.

There are more types of databases than those that end in "SQL".

A CSV file alone is a database. The rows are, well, rows. So is a DBM file, which is what MySQL was originally built on (might still be). Or an SQLite file.

The client or server API doesn't have to be part of the database itself.

eatonphil2y ago

> they wrote an append-only log system with limited query capabilities.

This sounds like a database to me.

forrestthewoods2y ago

Writing custom code that does exactly what you need and nothing else is underrated. More people should do that! This is a great example.

mamcx2y ago

> But it's not a "database" in the sense that someone would understand when reading the title.

Sure, because it is common for people to mix up a "database" (aka data in some kind of structure) with a paradigm (relational, SQL, document, kv) and with a "database system" (aka an app that manages the database).

intelVISA2y ago

> purpose built, in process storage engine that’s part of the same executable as our core server. It writes a minimal, delta based binary format

Get that engineer a sales gig, that's insane upselling of the reality: git commit -am 'added array to store structs'

Retr0id2y ago

If you described those needs to the average engineer, they'd correctly say "use a database".

hmottestad2y ago

Yeah. They basically defined a binary format. I wouldn’t call it a database either.

xyst2y ago

Sounds like Kafka to me. Except they have to rewrite components like ksqlDB.

mdaniel2y ago· 11 in thread

Anytime I hear "we need to blast in per-second measurements of ..." my mind jumps to "well, have you looked at the bazillions of timeseries databases out there?" Because the fact those payloads happen to be (time, lat, long, device_id) tuples seems immaterial to the timeseries database and can then be rolled up into whatever level of aggregation one wishes for long-term storage

It also seems that just about every open source "datadog / new relic replacement" is built on top of ClickHouse, and even they themselves allege multi-petabyte capabilities <https://news.ycombinator.com/item?id=39905443>

OT1H, I saw the "we did research" part of the post, and I for sure have no horse in your race of NIH, but "we write to EBS, what's the worst that can happen" strikes me as ... be sure you're comfortable with the tradeoffs you've made in order to get a catchy blog post title

speedgoose2y ago

ClickHouse is one of the few databases that can handle most of the time-series use cases.

InfluxDB, the most popular time-series database, is optimised for a very specific kind of workload: many sensors publishing frequently to a single node, and frequent queries that are not going far back in time. It's great for that. But it doesn't support slightly advanced queries such as an average over two sensors. It also doesn't scale and is pretty slow to query far back in time due to its architecture.

TimescaleDB is a bit more advanced, because it's built on top of PostgreSQL, but it's not very fast. It's better than vanilla PostgreSQL for time-series.

The TSM Bench paper has interesting figures, but in short ClickHouse wins or does well in almost all benchmarks.

https://dl.acm.org/doi/abs/10.14778/3611479.3611532

https://imgur.com/a/QmWlxz9

Unfortunately, the paper didn't benchmark DuckDB, Apache IoTDB, and VictoriaMetrics. They also didn't benchmark proprietary databases such as Vertica or BigQuery.

If you deal with time-series data, ClickHouse is likely going to perform very well.

lispisok2y ago

I work on a project that ingests sensor measurements from the field and in our testing found timescaledb was by far the best choice. The performance x all their timeseries specific features like continuous aggregates and `time_bucket` plus access to the postgres ecosystem was killer for us. We get about 90% reduction in storage with compression without much performance hit too
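For readers unfamiliar with the idea behind `time_bucket`, the following plain-Python sketch shows the kind of roll-up it enables: floor each timestamp to the start of a fixed-width bucket, then aggregate per bucket. This is only an illustration of the concept, not TimescaleDB's implementation or API; the helper names and the `(unix_ts, value)` row shape are assumptions.

```python
from collections import defaultdict


def time_bucket(width_s: int, ts: int) -> int:
    """Floor a Unix timestamp (seconds) to the start of its bucket."""
    return ts - (ts % width_s)


def average_per_bucket(rows, width_s: int) -> dict:
    """rows: iterable of (unix_ts, value). Returns {bucket_start: mean value}."""
    acc = defaultdict(lambda: [0.0, 0])  # bucket -> [running sum, count]
    for ts, value in rows:
        slot = acc[time_bucket(width_s, ts)]
        slot[0] += value
        slot[1] += 1
    return {b: total / count for b, (total, count) in sorted(acc.items())}


if __name__ == "__main__":
    readings = [(0, 1.0), (30, 3.0), (60, 10.0), (119, 20.0), (120, 5.0)]
    print(average_per_bucket(readings, 60))
```

A continuous aggregate is, roughly speaking, this same roll-up kept up to date as new rows arrive rather than recomputed per query.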

omeze2y ago

Did you try clickhouse? What were its weak points?

Too2y ago

Apache Parquet as data format on disk seems to be popular these days for similar DIY log/time series applications. It can be appended locally and flushed to S3 for persistence.

robertlagrant2y ago

> but "we write to EBS, what's the worst that can happen" strikes me as ... be sure you're comfortable with the tradeoffs you've made in order to get a catchy blog post title

In what way?

freeone30002y ago

EBS latency is all over the place. The jitter is up to the 100ms scale, even on subsequent IOPS. We’ve also had intermittent failures for fsync(), which is a case that should be handled but is exceptionally rare for traditionally-attached drives.

RHSeeger2y ago

The author does note in the writeup that they are comfortable with some (relatively rare) data loss; like server failure and the like. Given their use cases, it seems like the jitter/loss of EBS wouldn't be too impactful to them.

Spivak2y ago

I mean if you spun up Postgres on EC2 you would be directly writing to EBS so that's not really the part I'm worried about. I'm more worried about the lack of replication, seemingly no way to scale reads or writes beyond a single server, and no way to failover uninterrupted.

I'm guessing it doesn't matter for their use-case which is a good thing. When you realize you only need this teeny subset of db features and none of the hard parts, writing your own starts to get feasible.

drdaeman2y ago

Replication and reads can be scaled with something like Patroni or even a DIY replication setup (if one knows what they’re doing, of course), but writes are difficult.

VirusNewbie2y ago

Right, cassandra/scylla model is really good for time series use cases, i’ve yet to see good arguments against them.

jpgvm2y ago

It's generally good for append-only workloads.

Where C* databases seem to fall down is point updates and, in this case, the requirement to implement your own aggregations.

For these workloads you are much better off (unless you are already running C* somewhere and are super familiar with it) with something like Clickhouse or if you need good slice and dice then Druid or Pinot.

Simon_ORourke2y ago· 7 in thread

I've no doubt this is true, however, anyone I've ever met who exclaimed "let's create our own database" would be viewed as dangerous, unprofessional or downright uneducated in any business meeting. There's just too much that can go badly wrong, for all the sunk cost in getting anything up and running.

mavili2y ago

That is such a problem in today's world. Of course you don't want to re-invent the wheel and all that, but we must be open to the idea of having to do it. Innovation stagnates if people suggesting redoing something are immediately seen as "dangerous, unprofessional or downright uneducated"

MaKey2y ago

I think the issue is that you rarely get to see a neat new solution to a given problem. Usually you'll see some kind of half-baked attempted solution that's worse than the already existing alternatives.

mavili2y ago

Yes, but what I'm describing is the problem of not even listening to the idea of a new attempt.

kikimora2y ago

The real problem is that people who want to write a database have usually never written one and have only a very brief understanding of the domain and its complexities. Another problem is that such a database can become an engineering bottleneck: other teams need new features easily found in a conventional db, but the “core team” is unable to meet their demands. I’ve seen this in practice; the result is a dead product.

akira25012y ago

> would be viewed as dangerous, unprofessional or downright uneducated in any business meeting

Sounds like a great place to work.

> There's just too much can go badly wrong, for all the sunk cost in getting anything up and running.

Engineering is the art of compromise. In many cases the compromises would not be worth it, but that doesn't mean there are zero places where it would be, and eschewing the discussion out of fear of how it would be perceived is the opposite of Engineering.

democracy2y ago

Depends on their meaning of a "database"

Simon_ORourke2y ago

True, if it's some limited in-memory key-value store then that's a lot more readily implemented than something resembling Oracle's latest enterprise offering, and potentially could fall under the umbrella of it being a "database".

icsa2y ago· 5 in thread

How is it possible to save more than 100% ?

jayd162y ago

Move from the cloud to on-prem and then sell extra availability.

wolframhempelOP2y ago

Fair, should be 98%. Can't change the title anymore though

jbverschoor2y ago

Aws credits

aclatuts2y ago

Receive a license fee from someone else for using your software!

olddustytrail2y ago

Isn't it obvious?!

1. Write your own database

2. ???

3. Profit!

jrockway2y ago· 4 in thread

Everyone seems fixated on the word database and the engineering cost of writing one. This is a log file. You write data to the end of it. You flush it to disk whenever you've filled up some unit of storage that is efficient to write to disk. Every query is a full table scan. If you have multiple writers, this works out very nicely when you have one API server per disk; each server writes its own files (with a simple mutex gating the write out of a batch of records), and queries involve opening all the files in parallel and aggregating the result. (Map, shuffle, reduce.)
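A minimal sketch of the log file described above: fixed-size binary records buffered in memory, a mutex gating each batch write, and queries implemented as a full scan. The class name `AppendLog`, the batch size, and the record layout (device id, timestamp in ms, lat, lon) are all assumptions for illustration; the article's actual format is not shown in this thread.

```python
import os
import struct
import tempfile
import threading

# Hypothetical record layout: device_id (u32), unix_ms (u64), lat (f32), lon (f32)
RECORD = struct.Struct("<IQff")


class AppendLog:
    def __init__(self, path, batch_size=1000):
        self._path = path
        self._batch_size = batch_size
        self._buffer = []
        self._lock = threading.Lock()  # gates buffer access and batch writes

    def append(self, device_id, unix_ms, lat, lon):
        with self._lock:
            self._buffer.append(RECORD.pack(device_id, unix_ms, lat, lon))
            if len(self._buffer) >= self._batch_size:
                self._flush_locked()

    def flush(self):
        with self._lock:
            self._flush_locked()

    def _flush_locked(self):
        # Caller must hold the lock. Anything still buffered is lost on a crash.
        if not self._buffer:
            return
        with open(self._path, "ab") as f:
            f.write(b"".join(self._buffer))
            f.flush()
            os.fsync(f.fileno())
        self._buffer.clear()

    def scan(self, predicate=lambda rec: True):
        """Every query is a full scan over the records already on disk."""
        if not os.path.exists(self._path):
            return
        with open(self._path, "rb") as f:
            while True:
                chunk = f.read(RECORD.size)
                if len(chunk) < RECORD.size:  # stop at a truncated tail record
                    break
                rec = RECORD.unpack(chunk)
                if predicate(rec):
                    yield rec


if __name__ == "__main__":
    log = AppendLog(os.path.join(tempfile.mkdtemp(), "loc.log"), batch_size=2)
    log.append(1, 1000, 1.5, 2.25)
    log.append(2, 1001, 3.0, 4.0)  # second append fills the batch and flushes
    print(list(log.scan(lambda r: r[0] == 1)))
```

With one such file per server, a query across servers would open every file, run `scan` on each, and merge the results.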

Atomic: not applicable, as there are no transactions. Consistent: no, as there is no protection against losing the tail end of writes (consider "no space left on device" halfway through a record). Isolated: not applicable, as there are no transactions. Durable: no, the data is buffered in memory before being written to the network (EBS is the network, not a disk).

So with all of this in mind, the engineering cost is not going to be higher than $10,000 a month. It's a print statement.

If it sounds like I'm being negative, I'm not. Log files are one of my favorite types of time series data storage. A for loop that reads every record is one of my favorite query plans. But this is not what things like Postgres or Aurora aim to do, they aim for things like "we need to edit past data several times per second and derive some of those edits from data that is also being edited". Now you have some complexity and a big old binary log file and some for loops isn't really going to get you there. But if you don't need those things, then you don't need those things, and you don't need to pay for them.

The question you always have to ask, though, is have you reasoned about the business impacts of losing data through unhandled transactional conflicts? "read committed" or "non-durable writes" are often big customer service problems. "You deducted this bill payment twice, and now I can't pay the rent!" Does it matter to your end users? If not, you can save a lot of time and money. If it does, well, then the best-effort log file probably isn't going to be good for business.

bradleyjg2y ago

If you only need those things there’s also an off the shelf solution for log files. Time you spend reinventing the wheel is time you aren’t spending finding product-market fit (if you’ve already found it you wouldn’t even consider it because you’d be too busy servicing the flood of customers.)

Unless your company is so far past product-market fit that it hires qualified applicants by the classful, or whatever-it-is is their product, they have no business coding up custom infra bits. The opportunity cost alone is sufficient argument against, though far from the only one.

jrockway2y ago

I think that EBS is the difficult engineering problem that they purchased instead of built from scratch here. Writing binary records to a file and reading them all into memory is not going to be a time sink that prevents you from finding product/market fit. The $120,000/year burn rate on Aurora they had seems alarming; an alarm that strongly implies "we didn't use the right system for this problem".

My guess for "why didn't they use something off the shelf" is that no existing software would be satisfied with the tradeoffs they made here. Nobody else wants this.

happymellon2y ago

It's also nonsense.

If those were your requirements, why on earth are you using Aurora?

Aurora is a multi-region, failover protected, backup managed service.

This isn't. It would have been cheaper and quicker to install an OpenSource logging DB on an EC2. Like Elastic.

jrockway2y ago

I mean, that's basically my default. If I have some problem involving storing data, I pretty much 100% of the time type "pgx.Connect(" and go from there. If that's not good enough, only then do I start thinking about the problem. 99.9% of the time, PostgreSQL is going to get the job done for you.

Aurora is a nice drop-in replacement for Postgres. Kind of a good way of dealing with "oh shit, I built way too much stuff on top of a database that can't scale to this workload." It was good enough to find product/market fit, but now it's too slow. Solution: pay Amazon some money. If that works, that works.

If it doesn't, then you need to take a different approach. At this point, you're like 8 levels of weirdness down the stack, and that's when it's time for innovation. Nobody says "I need to make a CRUD app, so I'm going to build a team to build a database first." No! You just use Postgres!

When that starts failing in a measurable business metric kind of way, then your creative juices start flowing and you start thinking about making your own database.

(An aside: I make a database at work. We use Postgres for all the stuff that doesn't need to scale. "Is this auth token one associated with an authorized user?" That's Postgres. "Are these bytes being written to object storage approximately the same as bytes that have been written before?" That's our thing.)

(Another aside: My all-time favorite database is Spanner. I used that a lot when I worked at Google. I remember someone on my team working on the Spanner integration with our product, which was basically an in-memory database, but we needed durability. I've never seen a piece of software laugh at our 20,000 writes per second workload so hard. I imagined outages, rollbacks, furrowed brows... nope. I thought there was no way a durable replicated storage system could handle our workload. I was wrong. It just accepted all of our data and never caused a problem. So to me, that was the solution to every problem. But after I left Google, my thought was to use Cloud Spanner for everything. I quickly realized I did not have enough money to pay for Spanner! Nobody had enough money. So postgres is what I settled on. I have enough money to afford it ($0 is the starting cost) and it has never done me wrong. So I don't blame OP for starting there. It's what I would have done, anyway. BTW, I think they lowered the price a lot since I last looked, so if it sounds like the kind of thing you need, you should probably just buy it. But never underestimate "beefy computer with Postgres". By the time that fails you, you will probably have a lot of money if people like the thing you're making.)

kroolik2y ago· 4 in thread

I could be missing something, but I can't really wrap my head around "unlimited parallelism".

What they say is that the logic is embedded into their server binary and they write to a local EBS. But what happens when they have two servers? EBS can't be rw mounted in multiple places.

Won't adding the second and more servers cause trouble like migrating data when new server joins the cluster, or a server leaves the cluster?

I understand Aurora was too expensive for them. But I think it is important to note their whole setup is not HA at all (which may be fine, but the header could be misleading).

trebecks2y ago

i got caught up on that statement too. i interpreted it as they can spin up more servers with their own volumes but thats not really 1 ebs volume anymore. maybe at their current load they only need 1? the op mentioned copying stuff to s3 at a certain size so it sounds like the disk isn't very big at any moment. i don't think there would be much to do if another server joins.

> Amazon EBS offers a higher durability volume (io2 Block Express), that is designed to provide 99.999% durability with an annual failure rate (AFR) of 0.001%, where failure refers to a complete or partial loss of the volume.

if they take snapshots often enough to feel comfortable with that low failure rate, it does seem kind of reasonable to me. really low risk of a given volume failing.

klohto2y ago

EBS has supported multi-attach for a long time now, no perf impact

kroolik2y ago

Oh thanks! Always thought it was what EFS was for. They are still limited to the same AZ, so no multi AZ redundancy.

klohto2y ago

yea, multi AZ failover would be an issue but I assume they don’t care that much.

you could spin up a new EBS volume from the backup when the first region fails or keep a warm copy there, but seems like a lot of extra engi work.

MuffinFlavored2y ago· 2 in thread

> We want to be able to handle up to 30k location updates per second per node. They can be buffered before writing, leading to a much lower number of IOPS.

> This storage engine is part of our server binary, so the cost for running it hasn’t changed. What has changed though, is that we’ve replaced our $10k/month Aurora instances with a $200/month Elastic Block Storage (EBS) volume. We are using Provisioned IOPS SSD (io2) with 3000 IOPS and are batching updates to one write per second per node and realm.

I would be curious to hear what that "1 write per second" looks like in terms of throughput/size?
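For a rough picture of what "one write per second" could mean, here is a minimal sketch of time-based batching. It is not the article's implementation: the `BatchingWriter` class, its parameters, and the injectable clock are all hypothetical names introduced for illustration, and the 40-byte record size is the figure quoted elsewhere in this thread.

```python
import io
import time


class BatchingWriter:
    """Buffers small records in memory and issues one write per interval."""

    def __init__(self, sink, interval_s=1.0, clock=time.monotonic):
        self.sink = sink              # any object with a write(bytes) method
        self.interval_s = interval_s
        self.clock = clock            # injectable so the timing can be simulated
        self.pending = bytearray()
        self.last_flush = clock()
        self.writes = 0               # number of write() calls sent to the sink

    def add(self, record: bytes) -> None:
        # Appending to the in-memory buffer costs no disk I/O.
        self.pending += record
        if self.clock() - self.last_flush >= self.interval_s:
            self.flush()

    def flush(self) -> None:
        if self.pending:
            self.sink.write(bytes(self.pending))
            self.writes += 1
            self.pending.clear()
        self.last_flush = self.clock()


# Simulate one second of traffic at 30k updates/s with a fake clock.
fake_now = [0.0]
sink = io.BytesIO()
writer = BatchingWriter(sink, interval_s=1.0, clock=lambda: fake_now[0])
for i in range(30_000):
    fake_now[0] = i / 30_000
    writer.add(b"\x00" * 40)          # ~40 bytes per update (figure from the thread)
fake_now[0] = 1.0
writer.add(b"\x00" * 40)              # first update of the next second triggers the flush
print(writer.writes, len(sink.getvalue()))
```

Under these assumptions a full second of peak traffic ends up as a single write of roughly 1.2 MB, which is why the number of IOPS can stay far below the update rate. The trade-off is that anything still in `pending` is lost if the process dies before the flush.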

zaroth2y ago

Well they said ~40 bytes per update, so 30k * 40 = 1.2MB/sec… so quite trivial.

They also said 30GB per month which works out to 0.7MB/sec if load is perfectly constant.
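A quick sanity check of those two figures, assuming a 30-day month and decimal units (1 MB = 10^6 bytes); the constant names are only for illustration. Under these assumptions 30 GB/month comes out nearer 12 kB/s, or about 0.7 MB per minute, so the average rate is even smaller than the peak figure suggests.

```python
BYTES_PER_UPDATE = 40                   # "~40 bytes per update"
PEAK_UPDATES_PER_S = 30_000             # design target per node
MONTHLY_BYTES = 30 * 10**9              # "30GB per month"
SECONDS_PER_MONTH = 30 * 24 * 60 * 60   # assumes a 30-day month

peak_bytes_per_s = BYTES_PER_UPDATE * PEAK_UPDATES_PER_S
avg_bytes_per_s = MONTHLY_BYTES / SECONDS_PER_MONTH

print(f"peak:    {peak_bytes_per_s / 1e6:.1f} MB/s")
print(f"average: {avg_bytes_per_s / 1e3:.1f} kB/s "
      f"({avg_bytes_per_s * 60 / 1e6:.2f} MB/min)")
```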

MuffinFlavored2y ago

> we’ve replaced our $10k/month Aurora

How does 0.7MB/sec end up costing $10k/mo in a hosted database?

Can you not achieve 1MB/sec of "queued writes" or something clever against SQLite?


xyst2y ago· 2 in thread

This seems like they rewrote Kafka to me.

Even moderately sized Kafka clusters can handle the throughput requirement. Can even optimize for performance over durability.

Some limited query capability with components such as ksqldb.

Maybe offload historical data to blob storage.

Then again, Kafka is kind of complicated to run at these scales. Very easy to fuck up.

jpgvm2y ago

Eh, Kafka isn't easy to fuck up. If anything it's stupidly hard to fuck up assuming you aren't completely incompetent and didn't read up on how it works or the main caveats. I say that because there really isn't much to Kafka in the first place (as long as you aren't including things like Kafka Streams or Kafka Connect etc).

Operationally there are some annoying things in OSS Kafka (hot partitions, controller failover slowness pre-KRaft, etc) but it's overall bog simple and easy to work with if you can accept the things it doesn't do (queue-like behavior).

I don't love Kafka these days but the fear mongering is a bit much.

Sidenote: If you think you want Kafka you should probably check Pulsar first, in most cases you probably want Pulsar or due to changing requirements you would have been better off going Pulsar from the start.

kdazzle2y ago

Plus managed kafka is pretty expensive

fifilura2y ago· 2 in thread

Would building a data lakehouse be an option?

Stream the events to s3 stored as Parquet or Avro files, maybe in Iceberg format.

And then use Trino/Athena to do the long term heavy lifting. Or for on-demand use cases.

Then only push what you actually need live to Aurora.

bsaul2y ago

I had a similar idea (except using kafka): have all the nodes write to a kafka cluster, used for buffering, and let some consumer write those data in batch, into whatever database engine(s) you need for querying, with intermediate pre-processing steps whenever needed. This lets you trade latency for write buffering, while not losing data thanks to kafka durability guarantees.

What would you use for streaming directly to s3 in high volumes?

fifilura2y ago

Yeah kafka would handle it, except in my experience I would like to avoid kafka if possible, since it adds complexity. (Fair enough it depends on how precious your data is, if it is acceptable to lose some of it if a node crashes)

But somehow they are ingesting the data over network. Would writing files to s3 be slower than that? Otherwise you don't need much more than a RAM buffer?

Edit: to be clear, kafka is probably the right choice here, it is just that kafka and me is not a love story.

But it should be cheaper to store long term data in s3 than storing it in kafka, right?

zX41ZdbW2y ago· 1 in thread

Sounds totally redundant to me. You can write all location updates into ClickHouse, and the problem is solved.

As a demo, I've recently implemented a tool to browse 50 billion airplane locations: https://adsb.exposed/

Disclaimer: I'm the author of ClickHouse.

whalesalad2y ago

Heh, my first thought reading this post was “did you try clickhouse”

kumarm2y ago· 1 in thread

I built a similar system in 2002 using JGroups (JavaGroups at the time, before the open source project was acquired by JBoss) while persisting asynchronously to a DB (Oracle at the time). Our scale even in 2002 was much higher than 13,000 vehicles.

The project, I believe, still appears as a success story on the JGroups website after 20+ years. I am surprised people are writing their own databases for location storage in 2024 :). There was no need to invent new technology in 2002 and definitely not in 2024.

_xivi2y ago

> There was no need to invent new technology in 2002 and definitely not in 2024.

Sure we have had roads for hundreds of years but they're not the same ones we have today, even though it's the same concept and function.

You can't just take advantage of the technology we have today and at the same time refuse to acknowledge it and hand-wavingly claim it was just the same in '02. It's easy to say so about anything if we're going to conveniently gloss over the details.

Believe it or not, many technologies that changed the world and we depend on today actually have origin stories similar to the one shared in the article. Many of them started as custom internal tools that I imagine you would've been similarly critical towards them for trying to invent new technology needlessly.

yunohn2y ago· 1 in thread

This is more a bespoke file format than a full blown database. It’s optimized for one table schema and a few specific queries.

Not a negative though, not everything needs a general purpose database. Clearly this satisfies their requirements, which is the most important thing.

Kalanos2y ago

Exactly. There are a hundred questions that come to mind like how does it handle concurrent writes, sharding, views.

https://en.wikipedia.org/wiki/Database#Database_management_s...

I'm sure they learned a lot, but probably a waste in the long run

bawolff2y ago· 1 in thread

Kind of misleading to not include the cost of developing it yourself.

I think everything is cheaper than cloud if you do it yourself when you don't count staffing cost.

benrutter2y ago

Yeah, and for most companies without a huge supply of developers there's the financial risk of having all your stuff blitzed when your home-spun solution fails.

remram2y ago· 1 in thread

They mention all those features of databases, presenting them as important:

> Databases are a nightmare to write, from Atomicity, Consistency, Isolation, and Durability (ACID) requirements to sharding to fault recovery to administration - everything is hard beyond belief.

Then talk about their geospatial requirements, PostGIS etc, making it seem they need geospatial features ("PostGIS for geospatial data storage" -- wtf? you need PostGIS for geospatial query not merely storage...)

In reality, they did not require any of the features they mention throughout the article. What a weird write-up!

I guess the conclusion is "read the F*-ing specs". Don't grab a geospatial DBMS just because you heard the words "longitude" and "database" once.

physicles2y ago

It’s like SpaceX’s development process. Step 1: “make your requirements less dumb”

nikonyrh2y ago· 1 in thread

Very interesting, it must feel great to get to apply CS knowledge at work, rather than writing basic CRUD apis / websites.

hasmanean2y ago

Stick the gps data in a binary file. Store the filename in the database record.

bevekspldnw2y ago· 1 in thread

“We are running a cloud platform that tracks tens of thousands of people and vehicles simultaneously”

…that’s not something to brag about.

sneak2y ago

Why, because you think the surveillance implies that it’s nonconsensual and thus unethical, or the very small scale (<100k clients) means this isn’t actually a very difficult engineering challenge?

time0ut2y ago

Good article.

> EBS has automated backups and recovery built in and high uptime guarantees, so we don’t feel that we’ve missed out on any of the reliability guarantees that Aurora offered.

It may not matter for their use case, but I don't believe this is accurate in a general sense. EBS volumes are local to an availability zone while Aurora's storage is replicated across a quorum of AZs [0]. If a region loses an AZ, the database instance can be failed over to a healthy one with little downtime. This has only happened to me a couple times over the past three years, but it was pretty seamless and things were back on track pretty fast.

I didn't see anything in the article about addressing availability if there is an AZ outage. It may simply not matter or maybe they have solved for it. Could be a good topic for a follow up article.

[0] https://aws.amazon.com/blogs/database/introducing-the-aurora...

afro882y ago

These two sentences don't work together:

> [We need to cater for] Delivery companies that want to be able to replay the exact seconds leading up to an accident.

> We are ok with losing some data. We buffer about 1 second worth of updates before we write to disk

Impressive engineering effort on its own though!

the_duke2y ago

I don't know what geospatial features are needed, but otherwise time series databases are great for this use case.

I especially like Clickhouse, it's generic but also a powerhouse that handles most things you throw at it, handles huge write volumes (with sufficient batching), supports horizontal scaling, and offloading long-term storage to S3 for much smaller disk requirements. The geo features in clickhouse are pretty basic, but it does have some builtin geo datatypes and functions for eg calculating the distance.

kaladin_12y ago

I love the attitude, we didn't see a good fit so we rolled ours.

Sure it won't cover the bazillion cases the DBs out there do but that's not what you need. The source code is small enough for any team member to jump in and debug while pushing performance in any direction you want.

Kudos!

CapeTheory2y ago

It's amazing what can happen when software companies start doing something approximating real engineering, rather than just sitting a UI on top of some managed services.

diziet2y ago

As others had mentioned, probably hosting your own clickhouse instance could yield major savings while allowing for much more flexibility in the future for querying data. If your use case can be served by what clickhouse offers, gosh is it an incredibly fast and reliable open source solution that you can host yourself.

rstuart41332y ago

A lot of people here are making very confident sounding assertions, yet some are saying it's just an append-only log file and some imply it's sharded. Something everyone does agree on is they are very vague about what geospatial features they need.

The one thing they do say is "no ACID". That implies no b-trees, because an unexpected stop means a corrupted b-tree. Perhaps they use a hash instead, but it would have to be a damned clever hash tree implementation to avoid the same problem. Or perhaps they just rebuild the index after a crash.

Even an append-only log file has to be handled carefully without ACID. An uncontrolled shutdown on most file systems will at least leave blocks of nulls in the file and half-written blocks if they cross disk block boundaries.
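One common way to cope with that (a sketch of a general technique, not something the article describes) is to frame every record with its length and a checksum, then on startup keep only the prefix of the file that validates. The frame layout and the names `HEADER`, `append_record` and `recover` are assumptions made up for this example.

```python
import struct
import zlib

# Hypothetical frame layout: 4-byte payload length, 4-byte CRC32, then the payload.
HEADER = struct.Struct("<II")


def append_record(f, payload: bytes) -> None:
    """Append one framed record to a file opened in binary write/append mode."""
    if not payload:
        # A zero length is reserved so that a block of nulls never looks valid.
        raise ValueError("empty payloads are not allowed")
    f.write(HEADER.pack(len(payload), zlib.crc32(payload)) + payload)


def recover(path: str) -> list[bytes]:
    """Return all intact records and truncate the file after the last one."""
    records = []
    good_end = 0
    with open(path, "r+b") as f:
        while True:
            header = f.read(HEADER.size)
            if len(header) < HEADER.size:
                break                     # torn or missing header
            length, crc = HEADER.unpack(header)
            if length == 0:
                break                     # zero-filled block left by a crash
            payload = f.read(length)
            if len(payload) < length or zlib.crc32(payload) != crc:
                break                     # torn or corrupted payload
            records.append(payload)
            good_end = f.tell()
        f.truncate(good_end)              # drop everything after the last good record
    return records
```

Running `recover` once after an unclean shutdown discards the damaged tail, which matches a design that already accepts losing the last second or so of data. It does not detect corruption in the middle of the file beyond stopping at it.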

It's a tantalising headline, but after reading the 1,200 words I'm none the wiser on what they built or whether it meets their own specs. A bit of a disappointment.

INTPenis2y ago

That is such an insane headline.

You might as well say "we saved 100% of cloud costs by writing our own cloud".

endisneigh2y ago

It would be interesting to see a database built from the ground up for being trivial to maintain.

I use managed databases, but is there really that much to do for maintaining a database? The host requires some level of maintenance - changing disks, updating the host operating system, failover during downtime for machine repair, etc. if you use a database built for failover I imagine much of this doesn’t actually affect the operations that much assuming you slightly over provision.

For a database alone I think the work needed to maintain is greatly exaggerated. That being said I still think it’s more than using a managed database, which is why my company still does so.

In this case though, an append log seems pretty simple imo. Better to self host.

rvba2y ago

> So - given that we don’t know upfront what level of granularity each customer will need, we store every single location update.

Maybe I'm cynical but interesting that "the business" didn't start to check it to cut costs. I know that customers love this feature. Cynically I can see it costing more, so some customers would drop in.

Also it looks they rewrote a log / timeseries "database" / key value store? As others mention sounds like reinventing the wheel to get a cool blog post and boost career solving "problems".

rad_gruchalski2y ago

> we’ve replaced our $10k/month Aurora instances with a $200/month Elastic Block Storage (EBS) volume

Reminds me how I implemented mssql active-active log replication over dropbox shares back in 2010 to synchronise two databases in the US and in the UK. Worked perfectly fine except for that one hurricane that took them out for longer than 14 days. This was more than the preconfigured log retention period.

pheatherlite2y ago

How fast can reads be though? Even if skipping along a fixed offset, reading 4-byte identifiers to filter out location updates for vehicles, that's still a sequential scan of a massive file. Wouldn't this read issue become a choking point to a degree that would make growth a curse? Then you get into weird architectures that exist solely to facilitate predigested reads?
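To make the scan being described concrete, here is a minimal sketch that assumes fixed-size records. The 24-byte layout (`vehicle_id`, timestamp, latitude, longitude) is invented for illustration; the article describes a delta-based binary format, which would not allow stepping by a constant offset like this.

```python
import struct

# Hypothetical fixed-size record: u32 vehicle id, u32 unix time, f64 lat, f64 lon.
RECORD = struct.Struct("<IIdd")   # 24 bytes, little-endian, no padding


def scan(buf: bytes, wanted_id: int):
    """Yield every record for wanted_id by stepping through buf one record at a time."""
    for offset in range(0, len(buf) - RECORD.size + 1, RECORD.size):
        # Read only the 4-byte id first; decode the rest of the record on a match.
        (vehicle_id,) = struct.unpack_from("<I", buf, offset)
        if vehicle_id == wanted_id:
            yield RECORD.unpack_from(buf, offset)


log = (
    RECORD.pack(7, 100, 52.5, 13.25)
    + RECORD.pack(9, 101, 48.0, 11.5)
    + RECORD.pack(7, 102, 52.75, 13.5)
)
for record in scan(log, wanted_id=7):
    print(record)
```

The work is still linear in file size, which is the concern raised here; whether that matters depends on how much data a query has to cover, for example if files are split by time range so that only a small window is ever scanned.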

trebecks2y ago

if i'm reading the op right, they kind of use ebs as a buffer for fresh data until it ages out to s3. they use a "local" disk to hold the stuff used by the queries that people actually make and the queries run quick. they let the old stuff rot in s3 where its almost never used. that sounds like a good idea to save money plus the stuff that's done often is fast.

the ebs slas look reasonable to a non expert like me and you can take snapshots. it sounds like you need to be careful when snapshotting to avoid inconsistencies if stuff is only partially flushed to disk. so you'd need to pause io while it snapshots if those inconsistencies matter. that sounds bad and would encourage you to take less frequent snapshots...? you also pay for the snapshot storage but i guess you wouldn't need to keep many. i like that aws defines "SnapshotAPIUnits" to describe how you get charged for the api calls.

with aurora, it looks like you can synchronously replicate to a secondary (or multiple secondaries) across azs in a single region. it sounds nice to have a sync copy of stuff that people are using. op says they're ok with a few seconds of data loss so i'm wondering how painful losing a volume right before taking a snapshot would be.

i wonder if anything off the shelf does something similar. it sounds like people are suggesting clickhouse. i saw buffer table in their docs and it sounds similar https://clickhouse.com/docs/en/engines/table-engines/special.... it looks like it has stuff to use s3 as cold storage too. i even see geo types and functions in the docs. i've never used clickhouse so i don't know if i'm understanding what i read, but it sounds like you could do something similar to whats described in the post with clickhouse if the existing geo types + functions work and you are too lazy to roll something yourself.

loftsy2y ago

Apache Cassandra could be a good fit here. Highly parallel frequent writes with some consistency loss allowed.

exabrial2y ago

Why is everyone dead set on “must use aws” these days? One can cut their cloud costs by 100x with colo.

And if you write your own db as they did here, it can 100% take advantage of your setup.

zinodaur2y ago

Very cool! When I started reading the article I thought it was going to end up using an LSM tree/RocksDB but y'all went even more custom than that

mavili2y ago

That's called engineering; you had a problem, you came up with a solution THAT WORKS for your needs. Nicely done and thanks for sharing.

avidphantasm2y ago

Seems like DuckDB or TileDB backed by S3 may meet your needs and be a lot cheaper than Aurora.

awinter-py2y ago

we have invented write concern = 0

halayli2y ago

They talk about what they store but zero mention of their retrieval requirements.

tshanmu2y ago

"Of course, that’s an unfair comparison, after all, Postgres is a general purpose database with an expressive query language and what we’ve built is just a cursor streaming a binary file feed with a very limited set of functionality - but then again, it’s the exact functionality we need and we didn’t lose any features."

brianhama2y ago

Honestly, this doesn’t seem like that high of requirements. There are tens of thousands of companies that are doing more spatial data processing and are using standard cloud databases just fine.

SmellTheGlove2y ago

I'm surprised to see the (mostly) critical posts. My reaction before coming to the comments was:

- This is core to their platform, makes sense to fit it closely to their use cases

- They didn't need most of what a full database offers - they're "just" logging

- They know the tradeoffs and designed appropriately to accept those to keep costs down

I'm a big believer in building on top of the solved problems in the world, but it's also completely okay to build shit. That used to be what this industry did, and now it seems to have shifted in the direction of like 5-10% of large players invent shit and open source it, and the other 90-95% are just stitching together things they didn't build in infrastructure that they don't own or operate, to produce the latest CRUD app. And hell, that's not bad either, it's pretty much my job. But it's also occasionally nice to see someone build to their spec and save a few dollars. It's a good reminder that costs matter, particularly when money isn't free and incinerating endless piles of it chasing a (successful) public exit is no longer the norm.

I get the arguments that developer time isn't free, but neither is running AWS managed services, despite the name. And they didn't really build a general purpose database, they built a much simpler logger for their use case to replace a database. I'd be surprised if they hired someone additional to build this, and if they did, I'd guess (knowing absolutely nothing) that the added dev spends 80% of their time doing other things. It's not like they launched a datacenter. They just built the software and run it on cheaper AWS services versus paying AWS extra for the more complex product.

vannevar2y ago

>For an efficient enough company, a project like this needs just one or two dedicated engineers.

Maybe for a research project or a hobby project, but not for a real, high performance database to be used in a business-critical application.

FTA:

"Databases are a nightmare to write, from Atomicity, Consistency, Isolation, and Durability (ACID) requirements to sharding to fault recovery to administration - everything is hard beyond belief."

>Engineers should be their own PMs.

As an aside, I wonder if this might be a use case for a bitmap db engine like Featurebase (https://www.featurebase.com/).

delusional2y ago

> what we’ve built is just a cursor streaming a binary file feed with a very limited set of functionality - but then again, it’s the exact functionality we need and we didn’t lose any features.

The trick is that they didn't need a database that provides "Atomicity, Consistency, Isolation, and Durability (ACID)". By only implementing what they need they were able to keep the project small.

2 more replies

cortesoft2y ago

> a project like this needs just one or two dedicated engineers.

So that is at least 20k a month, for fairly cheap engineers.

yosef1232y ago

I doubt he meant « two engineers full time », I suspect it’s more like a role of maintaining / improving when necessary.

intelVISA2y ago

1-2 days work is 20k a month? This place hiring?? Even Google wanted me to work minimum 3 days a month...

vineyardmike2y ago

> In fact, I can't imagine why this project needs a PM at all. The database is used by engineers and is built by engineers. Engineers should be their own PMs.

delusional2y ago

> Which one is implemented first?

One of them. This is true whether you have a person named "PM" or not. It's just a matter of who picks.

> What about if there is only engineering capacity to implement one?

How does naming some guy "PM" solve the issue? The team just picks one of the features.

preommr2y ago

hmottestad2y ago

A k/v store typically is really fast at looking up the value based on the key. So there are usually some pretty advanced indexes involved.

arandomusername2y ago

or a simple b-tree...

1 more reply

ozr2y ago

For a k/v store, a hash table will take you very, very far.

paulddraper2y ago

80/20

rdtsc2y ago

> The cost of 120,000/year for Aurora seems like it would be less than the cost of development/organization time for the custom database

organsnyder2y ago

There’s the opportunity cost of whatever else they could have been paying that developer to work on.

rdtsc2y ago

addicted2y ago

Well considering the cost is lower than Aurora isn’t the opportunity cost in favor of the home built situation.

1 more reply

grogenaut2y ago

Agreed, a developer that can pull this off is pretty good, if maybe distracted by shiny objects, what could they do working on the actual product instead of this technological terror?

ReflectedImage2y ago

Well presumably they need only 1/3 of a developer to do this and they intend to scale up 10x in the next 5 years.

$60,000 per year in-house vs $1,200,000 per year aurora. No brainer really.

andai2y ago

Also worth mentioning that it's 150x faster.

cortesoft2y ago

It's $120,000 a year for aurora, not $1,200,000.

b1122y ago

What has changed though, is that we’ve replaced our $10k/month Aurora instances with a $200/month Elastic Block Storage (EBS) volume.

Note 'instances', i.e. plural, versus a singular EBS volume. There is some ambiguity here, I'm not sure where the 10x came from, but it seems plausible.

1 more reply

Spivak2y ago

They're operationally using a funny spelling of SQLite and I don't imagine anyone arguing that such a thing needs constant attention.

deedasmi2y ago

Don’t forget this is a largely one time cost vs Aurora, which scales cost with usage.

Also they said their current volume is around 13k/second. They’ve built the new platform for 30k/sec per node. This should last them a long time with minimal maintenance.

kalleboo2y ago

Using Aurora is also not free in terms of cost of development. Developers need to be trained on its implementation, features, constraints, performance implications, keeping up with API changes, etc

mbrumlow2y ago

The problem is all those people you listed would still exist plus the 120k bill to Amazon.

They may or may not be doing other things depending on the company size and state.

You could drop the PM, engineers writing for engineers don’t need a PM.

You will likely hire similarly costed engineers to maintain the database stack anyways.

Any company that wants to own its destiny knows to stay away from lock-in.

samatman2y ago

I would imagine, as someone with no special insight into goings-on at Hivekit, that the answer is intended scale.

Investing time early on so that they can store 50x data-per-dollar is almost certainly time well spent.

kdazzle2y ago

S3 is relatively cheap.

donohoe2y ago

I came here to ask the same question.

If this db requires 1 full-time developer then the cost would immediately be not worth it (assuming salary + benefits > $120k/yr)

As you say, without details it’s hard to know if this was a good idea.

solatic2y ago

bilsbie2y ago

Shouldn’t we up our standard developer cost for inflation?

That barely qualifies for the median mortgage in the US.

5 more replies

sophacles2y ago

They said 'instances' (plural), so that number should be at least $240k

exe342y ago

> PjM/PM

What do you need them for?

thehappypm2y ago

To stop the engineers from spending too much time optimizing the database and to focus on customers

exe342y ago

So now every change takes 10x as long because you need to explain it to somebody whose bonus depends on it not being done? I love how the solution to too much management is always more management.

yau8edq12i2y ago· 17 in thread

throwaway634672y ago

didgetmaster2y ago

I agree. Is there an industry accepted definition of what a system must do before it can be called a database?

I started calling it a database and many people complained that I was misusing the term because it can't yet do everything that Postgres, MySQL, or SQLite can do.

josephg2y ago

Sounds like a database to me.

yau8edq12i2y ago

com2kid2y ago

File systems are databases. Different file systems choose different trade offs, different indexing strategies, etc.

1 more reply

swiftcoder2y ago

cat + grep absolutely constitute a database (and it's probably in use in production somewhere). No need to gatekeep the concept of a database

halayli2y ago

Agreed. Database in this context is expected to be a DBMS or equivalent.

hmottestad2y ago

Symbiote2y ago

You're describing a relational database management system, which is a specific type of software implementing a specific type of database.

superq2y ago

It's difficult to be pedantic about an ambiguous term like database without additional qualification or specificity.

There are more types of databases than those that end in "SQL".

A CSV file alone is a database. The rows are, well, rows. So is a DBM file, which is what MySQL was originally built on (might still be). Or an SQLite file.

The client or server API doesn't have to be part of the database itself.

eatonphil2y ago

> they wrote an append-only log system with limited query capabilities.

This sounds like a database to me.

forrestthewoods2y ago

Writing custom code that does exactly what you need and nothing else is underrated. More people should do that! This is a great example.

mamcx2y ago

> But it's not a "database" in the sense that someone would understand when reading the title.

intelVISA2y ago

> purpose built, in process storage engine that’s part of the same executable as our core server. It writes a minimal, delta based binary format

Get that engineer a sales gig, that's insane upselling of the reality: git commit -am 'added array to store structs'

Retr0id2y ago

If you described those needs to the average engineer, they'd correctly say "use a database".

hmottestad2y ago

Yeah. They basically defined a binary format. I wouldn’t call it a database either.

xyst2y ago

Sounds like Kafka to me. Except have to rewrite components like ksqldb

mdaniel2y ago· 11 in thread

speedgoose2y ago

ClickHouse is one of the few databases that can handle most of the time-series use cases.

TimescaleDB is a bit more advanced, because it's built on top of PostgreSQL, but it's not very fast. It's better than vanilla PostgreSQL for time-series.

The TSM Bench paper has interesting figures, but in short ClickHouse wins and manages well in almost all benchmarks.

https://dl.acm.org/doi/abs/10.14778/3611479.3611532

https://imgur.com/a/QmWlxz9

Unfortunately, the paper didn't benchmark DuckDB, Apache IoTDB, and VictoriaMetrics. They also didn't benchmark proprietary databases such as Vertica or BigQuery.

If you deal with time-series data, ClickHouse is likely going to perform very well.

lispisok2y ago

omeze2y ago

Did you try clickhouse? What were its weak points?

1 more reply

Too2y ago

Apache Parquet as data format on disk seems to be popular these days for similar DIY log/time series applications. It can be appended locally and flushed to S3 for persistence.

robertlagrant2y ago

> but "we write to EBS, what's the worst that can happen" strikes me as ... be sure you're comfortable with the tradeoffs you've made in order to get a catchy blog post title

In what way?

freeone30002y ago

RHSeeger2y ago

1 more reply

Spivak2y ago

drdaeman2y ago

Replication and reads can be scaled with something like Patroni or even a DIY replication setup (if one knows what they’re doing, of course), but writes are difficult.

VirusNewbie2y ago

Right, cassandra/scylla model is really good for time series use cases, i’ve yet to see good arguments against them.

jpgvm2y ago

It's generally good for append-only workloads.

Where C* databases seem to fall down is point updates and, in this case, the requirement to implement your own aggregations.

1 more reply

Simon_ORourke2y ago· 7 in thread

mavili2y ago

MaKey2y ago

mavili2y ago

Yes, but what I'm describing is the problem of not even listening to the idea of a new attempt.

kikimora2y ago

akira25012y ago

> would be viewed as dangerous, unprofessional or downright uneducated in any business meeting

Sounds like a great place to work.

> There's just too much can go badly wrong, for all the sunk cost in getting anything up and running.

democracy2y ago

Depends on their meaning of a "database"

Simon_ORourke2y ago

icsa2y ago· 5 in thread

How is it possible to save more than 100% ?

jayd162y ago

Move from the cloud to on-prem and then sell extra availability.

wolframhempelOP2y ago

Fair, should be 98%. Can't change the title anymore though

jbverschoor2y ago

Aws credits

aclatuts2y ago

Receive a license fee from someone else for using your software!

olddustytrail2y ago

Isn't it obvious?!

1. Write your own database

2. ???

3. Profit!

jrockway2y ago· 4 in thread

So with all of this in mind, the engineering cost is not going to be higher than $10,000 a month. It's a print statement.

bradleyjg2y ago

jrockway2y ago

My guess for "why didn't they use something off the shelf" is that no existing software would be satisfied with the tradeoffs they made here. Nobody else wants this.

happymellon2y ago

It's also nonsense.

If those were your requirements, why on earth are you using Aurora?

Aurora is a multi-region, failover protected, backup managed service.

This isn't. It would have been cheaper and quicker to install an open source logging DB on an EC2 instance. Like Elastic.

jrockway2y ago

When that starts failing in a measurable business metric kind of way, then your creative juices start flowing and you start thinking about making your own database.

kroolik2y ago· 4 in thread

I could be missing something, but I can't really wrap my head around "unlimited parallelism".

What they say is that the logic is embedded into their server binary and they write to a local EBS. But what happens when they have two servers? EBS can't be rw mounted in multiple places.

Won't adding the second and more servers cause trouble like migrating data when new server joins the cluster, or a server leaves the cluster?

I understand Aurora was too expensive for them. But I think it is important to note their whole setup is not HA at all (which may be fine, but the header could be misleading).

trebecks2y ago

if they take snapshots often enough to feel comfortable with that low failure rate, it does seem kind of reasonable to me. really low risk of a given volume failing.

klohto2y ago

EBS has supported multi-attach for a long time now, no perf impact

kroolik2y ago

Oh thanks! Always thought it was what EFS was for. They are still limited to the same AZ, so no multi AZ redundancy.

klohto2y ago

yea, multi AZ failover would be an issue but I assume they don’t care that much.

you could spin up a new EBS volume from the backup when the first region fails or keep a warm copy there, but seems like a lot of extra engi work.

MuffinFlavored2y ago· 2 in thread

> We want to be able to handle up to 30k location updates per second per node. They can be buffered before writing, leading to a much lower number of IOPS.

I would be curious to hear what that "1 write per second" looks like in terms of throughput/size?

zaroth2y ago

Well they said ~40 bytes per update, so 30k * 40 = 1.2MB/sec… so quite trivial.

They also said 30GB per month, which works out to about 0.7MB/min (roughly 12KB/sec) if load is perfectly constant.
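
A quick sanity check of both figures, as a rough sketch. The decimal units (1 MB = 10^6 bytes) and the 30-day month are my assumptions, not something the article states. With those, the peak rate comes out at 1.2MB/sec and 30GB spread evenly over a month at about 11.6KB/sec, i.e. roughly 0.7MB per minute.

```python
# Back-of-the-envelope check of the two throughput figures.
# Assumptions (not from the article): decimal units (1 MB = 10**6 bytes)
# and a 30-day month.

BYTES_PER_UPDATE = 40                 # "~40 bytes per update"
PEAK_UPDATES_PER_SEC = 30_000         # "up to 30k location updates per second per node"
STORED_BYTES_PER_MONTH = 30 * 10**9   # "30GB per month"
SECONDS_PER_MONTH = 30 * 24 * 60 * 60

# Peak write rate if every update is 40 bytes.
peak_bytes_per_sec = PEAK_UPDATES_PER_SEC * BYTES_PER_UPDATE

# Average write rate if the monthly volume arrives at a constant pace.
avg_bytes_per_sec = STORED_BYTES_PER_MONTH / SECONDS_PER_MONTH

print(f"peak:    {peak_bytes_per_sec / 1e6:.2f} MB/sec")
print(f"average: {avg_bytes_per_sec / 1e3:.1f} KB/sec "
      f"({avg_bytes_per_sec * 60 / 1e6:.2f} MB/min)")
```

Either way the volume is small; the peak and the average just differ by about two orders of magnitude.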

MuffinFlavored2y ago

> we’ve replaced our $10k/month Aurora

How does well under 1MB/sec end up costing $10k/mo in a hosted database?

Can you not achieve 1MB/sec of "queued writes" or something clever against SQLite?
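
To make the question concrete, here is roughly what I mean by "queued writes": collect updates in memory and insert each batch in a single transaction. This is only a sketch using Python's built-in `sqlite3` module. The table layout, the column names and the `flush_batch` helper are made up for illustration, and it says nothing about whether SQLite would hold up under their real workload.

```python
import sqlite3

def open_db(path=":memory:"):
    """Open a database with a hypothetical location_updates table."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS location_updates ("
        " object_id INTEGER NOT NULL,"
        " ts_ms INTEGER NOT NULL,"
        " lat REAL NOT NULL,"
        " lon REAL NOT NULL)"
    )
    return conn

def flush_batch(conn, batch):
    """Insert every queued update in one transaction, then empty the queue."""
    with conn:  # commits on success, rolls back if an exception is raised
        conn.executemany(
            "INSERT INTO location_updates VALUES (?, ?, ?, ?)", batch
        )
    written = len(batch)
    batch.clear()
    return written

conn = open_db()
# One second's worth of updates at the quoted peak rate (synthetic data).
queue = [(i % 100, 1_700_000_000_000 + i, 52.5, 13.4) for i in range(30_000)]
written = flush_batch(conn, queue)
total = conn.execute("SELECT COUNT(*) FROM location_updates").fetchone()[0]
print(written, total)
```

The point of the single transaction is that the database commits once per batch instead of once per row.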

xyst2y ago· 2 in thread

This seems like they rewrote Kafka to me.

Even moderately sized Kafka clusters can handle the throughput requirement. Can even optimize for performance over durability.

Some limited query capability with components such as ksqldb.

Maybe offload historical data to blob storage.

Then again, Kafka is kind of complicated to run at these scales. Very easy to fuck up.

jpgvm2y ago

I don't love Kafka these days but the fear mongering is a bit much.

kdazzle2y ago

Plus managed kafka is pretty expensive

fifilura2y ago· 2 in thread

Would building a data lakehouse be an option?

Stream the events to S3 stored as Parquet or Avro files, maybe in Iceberg format.

And then use Trino/Athena to do the long term heavy lifting. Or for on-demand use cases.

Then only push what you actually need live to Aurora.

bsaul2y ago

What would you use for streaming directly to S3 in high volumes?

fifilura2y ago

But somehow they are ingesting the data over network. Would writing files to s3 be slower than that? Otherwise you don't need much more than a RAM buffer?

Edit: to be clear, kafka is probably the right choice here, it is just that kafka and me is not a love story.

But it should be cheaper to store long term data in s3 than storing it in kafka, right?

zX41ZdbW2y ago· 1 in thread

Sounds totally redundant to me. You can write all location updates into ClickHouse, and the problem is solved.

As a demo, I've recently implemented a tool to browse 50 billion airplane locations: https://adsb.exposed/

Disclaimer: I'm the author of ClickHouse.

whalesalad2y ago

Heh, my first thought reading this post was “did you try clickhouse”

kumarm2y ago· 1 in thread

_xivi2y ago

> There was no need to invent new technology in 2002 and definitely not in 2024.

Sure we have had roads for hundreds of years but they're not the same ones we have today, even though it's the same concept and function.

yunohn2y ago· 1 in thread

This is more a bespoke file format than a full blown database. It’s optimized for one table schema and a few specific queries.

Not a negative though, not everything needs a general purpose database. Clearly this satisfies their requirements, which is the most important thing.
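
As an illustration of what "one schema, a few specific queries" can look like, here is a minimal sketch: fixed-width records appended in time order, with a time-range query done by binary search over record offsets. The record layout and function names are my own assumptions, not the article's actual format.

```python
import io
import struct

# Hypothetical fixed-width record: timestamp in ms (int64), lat, lon (float64).
RECORD = struct.Struct("<qdd")

def read_ts(f, index):
    """Read only the timestamp of the record at the given index."""
    f.seek(index * RECORD.size)
    return RECORD.unpack(f.read(RECORD.size))[0]

def first_index_at_or_after(f, count, ts_ms):
    """Binary search over fixed-width records sorted by timestamp."""
    lo, hi = 0, count
    while lo < hi:
        mid = (lo + hi) // 2
        if read_ts(f, mid) < ts_ms:
            lo = mid + 1
        else:
            hi = mid
    return lo

def query_range(f, start_ms, end_ms):
    """Return all records with start_ms <= timestamp < end_ms."""
    count = f.seek(0, io.SEEK_END) // RECORD.size
    lo = first_index_at_or_after(f, count, start_ms)
    hi = first_index_at_or_after(f, count, end_ms)
    f.seek(lo * RECORD.size)
    data = f.read((hi - lo) * RECORD.size)
    return [RECORD.unpack_from(data, off) for off in range(0, len(data), RECORD.size)]

# An in-memory stand-in for the data file: one record per second.
f = io.BytesIO()
for i in range(1000):
    f.write(RECORD.pack(i * 1000, 52.5, 13.4))

result = query_range(f, 10_000, 13_000)
print([r[0] for r in result])
```

Because every record has the same size, the offset of record `n` is just `n * RECORD.size`, which is what makes the search possible without a separate index.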

Kalanos2y ago

Exactly. There are a hundred questions that come to mind like how does it handle concurrent writes, sharding, views.

https://en.wikipedia.org/wiki/Database#Database_management_s...

I'm sure they learned a lot, but probably a waste in the long run

bawolff2y ago· 1 in thread

Kind of misleading to not include the cost of developing it yourself.

I think everything is cheaper than cloud if you do it yourself when you don't count staffing cost.

benrutter2y ago

Yeah, and for most companies without a huge supply of developers there's the financial risk of having all your stuff blitzed when your homespun solution fails.

remram2y ago· 1 in thread

They mention all those features of databases, presenting them as important:

> Databases are a nightmare to write, from Atomicity, Consistency, Isolation, and Durability (ACID) requirements to sharding to fault recovery to administration - everything is hard beyond belief.

In reality, they did not require any of the features they mention throughout the article. What a weird write-up!

I guess the conclusion is "read the F*-ing specs". Don't grab a geospatial DBMS just because you heard the words "longitude" and "database" once.

physicles2y ago

It’s like SpaceX’s development process. Step 1: “make your requirements less dumb”

nikonyrh2y ago· 1 in thread

Very interesting, it must feel great to get to apply CS knowledge at work, rather than writing basic CRUD APIs / websites.

hasmanean2y ago

Stick the GPS data in a binary file. Store the filename in the database record.
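
Something along these lines, as a rough sketch. The `tracks` table, the point layout and the helper names are all hypothetical, and SQLite is only standing in for whatever database you already run.

```python
import os
import sqlite3
import struct
import tempfile

# Hypothetical fixed-width point: timestamp in ms (int64), lat, lon (float64).
POINT = struct.Struct("<qdd")

def save_track(conn, directory, track_id, points):
    """Write the points to a binary file and record its filename in the database."""
    filename = os.path.join(directory, f"track_{track_id}.bin")
    with open(filename, "wb") as f:
        for p in points:
            f.write(POINT.pack(*p))
    with conn:  # one transaction for the metadata row
        conn.execute(
            "INSERT INTO tracks (track_id, filename, point_count) VALUES (?, ?, ?)",
            (track_id, filename, len(points)),
        )

def load_track(conn, track_id):
    """Look up the filename in the database, then decode the binary file."""
    row = conn.execute(
        "SELECT filename FROM tracks WHERE track_id = ?", (track_id,)
    ).fetchone()
    with open(row[0], "rb") as f:
        data = f.read()
    return [POINT.unpack_from(data, off) for off in range(0, len(data), POINT.size)]

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE tracks (track_id INTEGER PRIMARY KEY, filename TEXT, point_count INTEGER)"
)
points = [(1_700_000_000_000 + i * 1000, 52.5 + i * 0.001, 13.4) for i in range(5)]
with tempfile.TemporaryDirectory() as tmp:
    save_track(conn, tmp, 1, points)
    loaded = load_track(conn, 1)
print(loaded == points)
```

The database row stays tiny no matter how many points a track has; the bulk data never goes through the database at all.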

bevekspldnw2y ago· 1 in thread

“We are running a cloud platform that tracks tens of thousands of people and vehicles simultaneously”

…that’s not something to brag about.

sneak2y ago

time0ut2y ago

Good article.

> EBS has automated backups and recovery built in and high uptime guarantees, so we don’t feel that we’ve missed out on any of the reliability guarantees that Aurora offered.

I didn't see anything in the article about addressing availability if there is an AZ outage. It may simply not matter or maybe they have solved for it. Could be a good topic for a follow up article.

[0] https://aws.amazon.com/blogs/database/introducing-the-aurora...

afro882y ago

These two sentences don't work together:

> [We need to cater for] Delivery companies that want to be able to replay the exact seconds leading up to an accident.

> We are ok with losing some data. We buffer about 1 second worth of updates before we write to disk

Impressive engineering effort on its own though!
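
To show the tension between those two quotes, here is a minimal sketch of a one-second write buffer. Anything still sitting in memory when the process dies never reaches disk, and those are exactly the last moments before the crash. The record layout, the class name and the simulated clock are my own assumptions, not the article's implementation.

```python
import io
import struct

# Hypothetical record: object id (uint32), timestamp in ms (int64), lat, lon (float64).
RECORD = struct.Struct("<Iqdd")

class BufferedWriter:
    """Collects updates in memory and writes them out at most once per interval."""

    def __init__(self, sink, flush_interval_ms=1000):
        self.sink = sink  # any binary file-like object
        self.flush_interval_ms = flush_interval_ms
        self.pending = []
        self.last_flush_ms = None

    def add(self, now_ms, object_id, ts_ms, lat, lon):
        if self.last_flush_ms is None:
            self.last_flush_ms = now_ms
        self.pending.append(RECORD.pack(object_id, ts_ms, lat, lon))
        if now_ms - self.last_flush_ms >= self.flush_interval_ms:
            self.flush(now_ms)

    def flush(self, now_ms):
        # One write for the whole batch; a real file would also need os.fsync
        # before the data can be considered durable.
        self.sink.write(b"".join(self.pending))
        self.sink.flush()
        self.pending.clear()
        self.last_flush_ms = now_ms

sink = io.BytesIO()
writer = BufferedWriter(sink)

# Simulated clock: 10 updates per second for 2.5 seconds, then a "crash".
for i in range(25):
    writer.add(now_ms=i * 100, object_id=7, ts_ms=i * 100, lat=52.5, lon=13.4)

durable = len(sink.getvalue()) // RECORD.size
lost_on_crash = len(writer.pending)
print(durable, lost_on_crash)
```

In this run 21 updates made it to the sink and the 4 most recent ones were still in the buffer, so a replay of the seconds before an incident would be missing its final moments.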

the_duke2y ago

I don't know what geospatial features are needed, but otherwise time series databases are great for this use case.

kaladin_12y ago

I love the attitude, we didn't see a good fit so we rolled ours.

Kudos!

CapeTheory2y ago

It's amazing what can happen when software companies start doing something approximating real engineering, rather than just sitting a UI on top of some managed services.

diziet2y ago

rstuart41332y ago

It's a tantalising headline, but after reading the 1,200 words I'm none the wiser on what they built or whether it meets their own specs. A bit of a disappointment.

INTPenis2y ago

That is such an insane headline.

You might as well say "we saved 100% of cloud costs by writing our own cloud".

endisneigh2y ago

It would be interesting to see a database built from the ground up for being trivial to maintain.

For a database alone I think the work needed to maintain is greatly exaggerated. That being said I still think it’s more than using a managed database, which is why my company still does so.

In this case though, an append log seems pretty simple imo. Better to self host.

rvba2y ago

> So - given that we don’t know upfront what level of granularity each customer will need, we store every single location update.

Also it looks like they rewrote a log / timeseries "database" / key value store? As others mention, it sounds like reinventing the wheel to get a cool blog post and boost a career by solving "problems".

rad_gruchalski2y ago

> we’ve replaced our $10k/month Aurora instances with a $200/month Elastic Block Storage (EBS) volume

pheatherlite2y ago

trebecks2y ago

loftsy2y ago

Apache Cassandra could be a good fit here. Highly parallel frequent writes with some consistency loss allowed.

exabrial2y ago

Why is everyone dead set on “must use aws” these days? One can cut their cloud costs by 100x with colo.

And if you write your own db as they did here, it can 100% take advantage of your setup.

zinodaur2y ago

Very cool! When I started reading the article I thought it was going to end up using an LSM tree/RocksDB but y'all went even more custom than that

mavili2y ago

That's called engineering; you had a problem, you came up with a solution THAT WORKS for your needs. Nicely done and thanks for sharing.

avidphantasm2y ago

Seems like DuckDB or TileDB backed by S3 may meet your needs and be a lot cheaper than Aurora.

awinter-py2y ago

we have invented write concern = 0

halayli2y ago

They talk about what they store but make zero mention of their retrieval requirements.

tshanmu2y ago

brianhama2y ago

Honestly, these don't seem like very demanding requirements. There are tens of thousands of companies that are doing more spatial data processing and are using standard cloud databases just fine.

SmellTheGlove2y ago

I'm surprised to see the (mostly) critical posts. My reaction before coming to the comments was:

- This is core to their platform, makes sense to fit it closely to their use cases

- They didn't need most of what a full database offers - they're "just" logging

- They know the tradeoffs and designed appropriately to accept those to keep costs down
