When an SQL Database Makes a Great Pub/Sub (opens in new tab)

(threedots.tech)

188 pointsm1106y ago67 comments

67 comments

49 comments · 15 top-level

inopinatus6y ago· 13 in thread

There’s a perspective that the transaction log of a typical RDBMS is the canonical form and the rows & tables merely the event-sourced projection. After all, if you replay the former, you should always get exactly the same in the latter.

It’s curious that over those projections, we then build event stores for CQRS/ES systems with their own projections mediated by application code.

Let’s also mention the journaled filesystem on which the database logs reside. And the log structure that your SSD is using internally to balance writes.

It’s been a long time since we wrote an application event stream linearly straight to media, and although I appreciate the separate concerns that each of these layers addresses, I’d probably struggle to justify them from first principles to even a slightly more Socratic version of myself.

migueloller6y ago

It is curious indeed. The first time I noticed this curiosity is when I saw Martin Kleppman’s Turning the Database Inside Out [1]. It’s a great watch (or read) and I really recommend it!

[1] https://www.confluent.io/blog/turning-the-database-inside-ou...

etaioinshrdlu6y ago

Maybe all this results in a really durable and foolproof system. I don't see how this is a bad thing. It looks like defense in depth against errors and corruption.

Also, to my knowledge, the logs in a DB are not kept forever. Instead they are trimmed as soon as reasonable. It starts to smell a little bit like a https://en.wikipedia.org/wiki/Log-structured_merge-tree

notduncansmith6y ago

It also smells like an example of the https://en.wikipedia.org/wiki/Inner-platform_effect

sradman6y ago

Perhaps the title of the article doesn't capture its underlying value. PubSubOnSQL Layers rarely make sense compared to dedicated Pub/Sub systems but the trade-off is acceptable in the three use cases described: 1. When your dedicated Pub/Sub is ephemeral and you need message durability, 2. When you need distributed transactions across SQL and your Pub/Sub system, and 3. When you need a poor man's heterogeneous SQL replication system.

The academic/enterprise database space has been discussing and tackling the types of questions you raise for decades. I don't think that is a useful lens to evaluate this article which is effectively a "tips on when to use our GoLang SQL Pub/Sub layer".

jerf6y ago

I'd add a #4, when you have such low volume that it just isn't worth it to put up a full system, with the accompanying need for deployment, resources, monitoring, and additional knowledge and skills added to the minimum set of knowledge and skills your team must possess.

One must be careful not to use this as an excuse, of course, and keep an eye on the scaling concerns and certain other details. SQL-as-pubsub has certain well-known issues and anyone using it this way ought to be aware of them. But it's a thought worth having.

I've got a system I'm managing where Cassandra is the backend. I've got about 5 "documents" (in the MongoDB sense, let's say) I want to store in the system. I don't put up a whole "document DB" for them, I just have a table in Cassandra. I have in my entire system, one distributed lock I'd like to have per certain resource, of which I expect there to be single digit numbers of that resource over the lifetime of the system. Cassandra is not a great distributed locker, but it does work (and as far as I can tell, done properly, is also correct), so rather than install an entire distributed lock server, I use Cassandra.

Should this ever turn out to become a mistake, all code that uses either of these functions is cleanly isolated and I can easily swap them out later. I am aware of the possibility that could happen in the future and have prepared for it. In the meantime, I've avoided two entire systems being poorly deployed and understood in favor of the one system that is well-deployed and understood by the team.

2 more replies

marco_craveiro6y ago

Indeed, I've also puzzled over this a fair bit. Its almost as if we are lacking a lower-level interface to the transaction log that enables one to push events without going via the higher-level representation of tables etc. However, the implementation details are somewhat beyond me :-) Postgres' Bottled Water [1] was what made me think about this. I mean, why bother with exporting to Kafka at all, and instead just use the Postgres transaction log directly?

[1] https://github.com/confluentinc/bottledwater-pg

wenc6y ago

Just a build: Bottled Water was the original concept, but Debezium[1] actually provides a production-ready product.

I would update all references of the former to the latter.

[1] https://debezium.io/

ryeguy6y ago

It's much easier to scale kafka than a relational db. There's also an advantage to offloading the read load to another system instead of hitting the db for that, too.

1 more reply

chrisseaton6y ago

> the transaction log of a typical RDBMS is the canonical form

Do databases keep the whole transaction log forever? It seems like that could keep growing forever even when the tables stay at constant size.

MSM6y ago

Usually you'll keep the transaction logs for a certain period, say for two weeks. That gives you point in time recovery for any moment in that two weeks (assuming you've set up everything correctly!). Logs past that are usually archived or just dumped, but you'll have full backups that go back further.

For your question about the logs growing forever, there are usually points in the process where a transaction log is saved off somewhere else and then can be overwritten, but on a transactional system the logs over a period of a week or two can sometimes be many times larger than the actual data stored at any given time, yes.

dtech6y ago

> It’s curious that over those projections, we then build event stores for CQRS/ES systems with their own projections mediated by application code.

That's really logical. From the view of the application there is no transaction log, only a table. It's an implementation detail of the database.

The application wants similar guarantees a log can provide, so they build their own.

corebit6y ago

Yep yep yep. Amazing how we keep reinventing the same thing over and over and over again.

adambyrtek6y ago

I wouldn't say this is "reinventing the wheel", it's more like applying the same design/architectural pattern to solve a common problem.

rmetzler6y ago· 10 in thread

Really, I wouldn’t teach junior developers that it’s ok to use a database table when a queue is needed. Sure, you can get away with this and there are cases when it’s all you need. But I’ve been one of those juniors who forgot to limit the query, who didn’t have enough indices, who tried to order all records by date and had full table scans everywhere, who implemented the worker with a cron job and didn’t synchronize this with a lock.

It might work, but it’s not the general case and you might spend more time to debug your table then to write the code to use a real queue.

And I’ve also seen people build their own queueing engine for a few hundred tasks per day. Why don’t they just choose one of the very good open source solutions?

blowski6y ago

Like many design patterns when implemented badly, queuing will usually result in a lot of problems. But if your team is competent, you’re using an off-the-shelf library, and you don’t have crazy demands, then re-using infrastructure can be a good idea as it’s fewer things to manage.

tonetheman6y ago

Fewer things to manage is the key!

rumanator6y ago

What's the rationale to teach databases as message queues when it requires special querying and updates when there are so many message queues services already available, easy to use, and standard compliant?

to11mtm6y ago

> What's the rationale to teach databases as message queues when it requires special querying and updates when there are so many message queues services already available, easy to use, and standard compliant?

If you already need a database for something else, using the DB as a Queue means you don't need to list {mqFlavorOfChoice} as a requirement for new hires. You also don't have to manage that extra infrastructure. Of course, you are putting additional load on the DB.

Mind you, I'm speaking of a pub-sub type queue and not a FIFO here. You can do FIFO queues in DB as well of course, it's just not as compelling of a story nowadays.

Also way easier to look at and 'poke' a Database queue if you need to. The queries are also not really difficult to write for a general purpose use case.

SahAssar6y ago

I agree that using a traditional DB as a queue has it's pitfalls, but it also has great benefits with regards to consistency, reliability and simplicity.

If I was building a new system that required a queue I'd definitely put in the same postgres db as the rest of the data until I had a good reason not to.

andrewstuart6y ago

"Because someone might add bugs to the code, or do it wrong" isn't a reason not to take a particular approach.

mzz806y ago

Of course it is. When one approach is far more likely to introduce bugs, complicated interactions, and be more difficult to maintain, it is absolutely a reason to not take a particular approach. It’s one of factors you need to consider in everyday software engineering.

If you don’t take this into consideration then you’re detracting from the business to satisfy another need.

james_s_tayler6y ago

Sure it is. If one approach is easier to mess up than the other then take the approach with less footguns.

jf226y ago

All the issues you mention are fairly easy to fix.

rmetzler6y ago

After you found them. And you find them after you become aware of problems.

And how hard it is to clean up the data at this point in time depends entirely on the kind of system you're working with.

rmrfchik6y ago· 5 in thread

Seems like they fall into the same pit as many does: using primary keys with autoincrement as offset. This leads to skipping messages because there is no guarantees that primary keys will be available in monotonic order. Because, you know, transactions.

Fire-Dragon-DoL6y ago

Can you expand a bit on this? From my understanding, autoincrement keys ca mn have gaps, but are always increasing. Sometimes a message might arrive "late", so you get a 3,then a 2. This problem cannot really be solved without giant locks that are not ideal. As far as I'm aware, all messaging systems are subject to this problem.

Messages will never arrive, arrive out of order and I don't remember the third one right now (messages will arrive late?)

nicois6y ago

Databases such as postgresql will effectively issue a buffer of keys to each connection, meaning in some circumstances the sequences will not be monotonic with respect to time. Also that usually long running transactions will use the timestamp the transaction was opened, regardless of how many seconds have passed between then and when the statement is executed.

1 more reply

rmrfchik6y ago

Yes, you described the problem exactly as it is. The problem is not in arrive order to subscriber, the problem is "selecting next messages with offset > last_offset". And in this case you simply miss late messages.

1 more reply

abhishekjha6y ago

Off topic but does this not effect the Pagination functionality of databases as well? Using primary keys to skip first N pages and then limit the count of results seems to be the suggested way for getting items for the Nth page. If primary key is not monotonic then this is going to give jumbled results thus messing up results in the Nth page.

EDIT: More context for the above process[1]

[1]https://www.eversql.com/faster-pagination-in-mysql-why-order...

siscia6y ago

Hummm, not sure I follow but most likely no.

What parent mean is that there may be holes in the sequence of primary keys. What you do with pagination is that you first sort the sequence, then thrown away the first N results, and finally select only the next M results.

It will work just fine.

3 more replies

zzzeek6y ago· 2 in thread

The "database as message queue" pattern is quite common and often considered to be an antipattern, which I tend to agree with but I don't have that strong of a position on it myself. I've certainly used this pattern for expediency, but that was before we had all the messaging solutions we do today. http://mikehadlow.blogspot.com/2012/04/database-as-queue-ant... has some good points.

bradstewart6y ago

A lot has changed in the 7 years since that was written.

Polling isn't a huge issue to begin with, and is mitigated with LISTEN/NOTIFY (on certain DBs). Inserts with indexes are not a performance problem at the scale of most applications. A separate messaging service won't prevent you from building a "hugely coupled monster".

Personally, I almost always start with the database as a queue. The operational overhead of running, updating, and monitoring another entire service is non trivial. If the messaging rate exceeds the database's capabilities in the future, I'll migrate then.

tartoran6y ago

If you need just one queue yes. If you have lots of queues it’s worth investing in a queue service of some sort and there are many of them out there which is a good thing but could turn into a bad thing quickly. In the past I worked at a place that had 3 different queueing services implemented by different developers and it became a pain to manage them or to even know what was on the queues.

120bits6y ago· 2 in thread

This could slightly out of context.

I'm working on a module that send notifications to a user when an alert is generated. I have PostGreSQL as the database and NodeJS is the handler and for connection pooling. Are there any good pub/sub tools that I can use. Thanks in advance.

porsager6y ago

How about simply having an after insert trigger on an alerts table that calls notify, and then you listen for that in node? It's a simple setup with less moving parts and could probably get you a long way...

120bits6y ago

Thanks, this will probably work for me. However, if the inserts are higher, I don't want to get notified that frequently. How would I add a periodic alerts to this? Thank you!

1 more reply

kiwicopple6y ago· 1 in thread

For anyone just looking for ‘plug and play’ web socket pub/sub functionality, I have been developing something that provides the functionality for PostgreSQL: https://github.com/supabase/realtime

It's an Elixir server (Phoenix) that allows you to listen to changes in your database via websockets. Basically the Phoenix server listens to PostgreSQL's replication functionality, converts the byte stream into JSON, and then broadcasts over websockets. The beauty of listening to the replication functionality is that you can make changes to your database from anywhere - your api, directly in the DB, via a console etc - and you will still receive the changes via websockets.

The article suggests Postgres’ native LISTEN/NOTIFY functionality. I tried that originally and found that NOTIFY payloads have a limit of 8000 bytes, as well a few other inconveniences.

It's still in very early stages, although I am using it in production at my company and will work on it full time starting Jan.

starik366y ago

One way to get around the 8k NOTIFY limit is to only use the capability to notify only. It would them be incumbent on the client to go fetch the data from a table somewhere. I ran into a similar limitation with SQL Server 2005 years ago and used this approach with great success.

TheCowboy6y ago· 1 in thread

One fun open source software I've played with, that I don't think many have heard of, is Deepstream.io. It attempts to be a batteries included real-time web server that works with websockets, and can function as pub/sub server and client. It has a connector for using PostgreSQL as the database. The frontend JavaScript library is really easy to get working.

https://deepstream.io/tutorials/concepts/what-is-deepstream/

https://github.com/deepstreamIO/deepstream.io

(I'm not affiliated with the project.)

_frkl6y ago

Thanks, this does really look interesting. I'd like to find something generic, lightweight to replace Kafka or Pulsar. Not sure this could be it, but it looks like it'd be worth having a look at...

zinxq6y ago

I've always considered message-queues as a close cousin (if not sibling) of databases. Arguably performing the same function with different foci. Pub/sub focusing on the "oplog". DBMS focusing on "state".

(Blockchain another "oplog" that ends up caring a lot about state eventually).

It's no wonder you can use them interchangeably in many common base cases.

linuxhansl6y ago

Perhaps it's not so much about pub/sub, but about store-and-forward.

When the "forward" part of "store-and-forward" is most important then Kafka is a fine solution.

However, when the "store" part - for example you want to be able to stream historical data again, or interact with the data in different ways - is most important I have recommended HBase (+ Phoenix) as a better solution in the past.

marco_craveiro6y ago

MessageDB was doing the rounds in reddit the other day [1]. Looks interesting for simple use cases...

[1] https://www.reddit.com/r/PostgreSQL/comments/ebu6nh/message_...

gunnarmorling6y ago

That's basically the same pattern as the "outbox pattern", e.g. listed in Chris Richardson's pattern of microservices patterns.

An alternative implementation is provided by Debezium [1], a general solution for change data capture for MySQL, Postgres, MongoDB, SQL Server and others, based on top of Apache Kafka (but can also be used with Pulsar and others).

There's support for outbox coming as part of Debezium out of the box [2].

Disclaimer: I'm working on Debezium.

[1] https://debezium.io/ [2] https://debezium.io/documentation/reference/1.0/configuratio...

nickjj6y ago

If anyone is using Elixir, Oban[0] is a job processor that uses PostgreSQL for its back-end and state management.

It's incredibly well written and I am using it in a project.

[0]: https://github.com/sorentwo/oban

vlasky6y ago

Meteor provides pub/sub with a MySQL backend using the atmosphere package vlasky:mysql.

It works by following the MySQL binary log and triggering a reactive query based on event conditions specified by the programmer, e.g. a change in a field.

https://atmospherejs.com/vlasky/mysql

slowhand096y ago

Oracle has a very advanced and flexible system for this. It is called Advanced Queueing.

Halluxfboy0096y ago

Ever use google's firebase? While not SQL -- I've always felt `tis a nice solution to persistence+async...

j / k navigate · click thread line to collapse

67 comments

49 comments · 15 top-level

inopinatus6y ago· 13 in thread

It’s curious that over those projections, we then build event stores for CQRS/ES systems with their own projections mediated by application code.

Let’s also mention the journaled filesystem on which the database logs reside. And the log structure that your SSD is using internally to balance writes.

migueloller6y ago

It is curious indeed. The first time I noticed this curiosity is when I saw Martin Kleppman’s Turning the Database Inside Out [1]. It’s a great watch (or read) and I really recommend it!

[1] https://www.confluent.io/blog/turning-the-database-inside-ou...

etaioinshrdlu6y ago

Maybe all this results in a really durable and foolproof system. I don't see how this is a bad thing. It looks like defense in depth against errors and corruption.

notduncansmith6y ago

It also smells like an example of the https://en.wikipedia.org/wiki/Inner-platform_effect

sradman6y ago

jerf6y ago

2 more replies

marco_craveiro6y ago

[1] https://github.com/confluentinc/bottledwater-pg

wenc6y ago

Just a build: Bottled Water was the original concept, but Debezium[1] actually provides a production-ready product.

I would update all references of the former to the latter.

[1] https://debezium.io/

ryeguy6y ago

It's much easier to scale kafka than a relational db. There's also an advantage to offloading the read load to another system instead of hitting the db for that, too.

1 more reply

chrisseaton6y ago

> the transaction log of a typical RDBMS is the canonical form

Do databases keep the whole transaction log forever? It seems like that could keep growing forever even when the tables stay at constant size.

MSM6y ago

dtech6y ago

> It’s curious that over those projections, we then build event stores for CQRS/ES systems with their own projections mediated by application code.

That's really logical. From the view of the application there is no transaction log, only a table. It's an implementation detail of the database.

The application wants similar guarantees a log can provide, so they build their own.

corebit6y ago

Yep yep yep. Amazing how we keep reinventing the same thing over and over and over again.

adambyrtek6y ago

I wouldn't say this is "reinventing the wheel", it's more like applying the same design/architectural pattern to solve a common problem.

rmetzler6y ago· 10 in thread

It might work, but it’s not the general case and you might spend more time to debug your table then to write the code to use a real queue.

And I’ve also seen people build their own queueing engine for a few hundred tasks per day. Why don’t they just choose one of the very good open source solutions?

blowski6y ago

tonetheman6y ago

Fewer things to manage is the key!

rumanator6y ago

to11mtm6y ago

Mind you, I'm speaking of a pub-sub type queue and not a FIFO here. You can do FIFO queues in DB as well of course, it's just not as compelling of a story nowadays.

Also way easier to look at and 'poke' a Database queue if you need to. The queries are also not really difficult to write for a general purpose use case.

SahAssar6y ago

I agree that using a traditional DB as a queue has it's pitfalls, but it also has great benefits with regards to consistency, reliability and simplicity.

If I was building a new system that required a queue I'd definitely put in the same postgres db as the rest of the data until I had a good reason not to.

andrewstuart6y ago

"Because someone might add bugs to the code, or do it wrong" isn't a reason not to take a particular approach.

mzz806y ago

If you don’t take this into consideration then you’re detracting from the business to satisfy another need.

james_s_tayler6y ago

Sure it is. If one approach is easier to mess up than the other then take the approach with less footguns.

jf226y ago

All the issues you mention are fairly easy to fix.

rmetzler6y ago

After you found them. And you find them after you become aware of problems.

And how hard it is to clean up the data at this point in time depends entirely on the kind of system you're working with.

rmrfchik6y ago· 5 in thread

Fire-Dragon-DoL6y ago

Messages will never arrive, arrive out of order and I don't remember the third one right now (messages will arrive late?)

nicois6y ago

1 more reply

rmrfchik6y ago

1 more reply

abhishekjha6y ago

EDIT: More context for the above process[1]

[1]https://www.eversql.com/faster-pagination-in-mysql-why-order...

siscia6y ago

Hummm, not sure I follow but most likely no.

It will work just fine.

3 more replies

zzzeek6y ago· 2 in thread

bradstewart6y ago

A lot has changed in the 7 years since that was written.

tartoran6y ago

120bits6y ago· 2 in thread

This could slightly out of context.

porsager6y ago

120bits6y ago

Thanks, this will probably work for me. However, if the inserts are higher, I don't want to get notified that frequently. How would I add a periodic alerts to this? Thank you!

1 more reply

kiwicopple6y ago· 1 in thread

For anyone just looking for ‘plug and play’ web socket pub/sub functionality, I have been developing something that provides the functionality for PostgreSQL: https://github.com/supabase/realtime

The article suggests Postgres’ native LISTEN/NOTIFY functionality. I tried that originally and found that NOTIFY payloads have a limit of 8000 bytes, as well a few other inconveniences.

It's still in very early stages, although I am using it in production at my company and will work on it full time starting Jan.

starik366y ago

TheCowboy6y ago· 1 in thread

https://deepstream.io/tutorials/concepts/what-is-deepstream/

https://github.com/deepstreamIO/deepstream.io

(I'm not affiliated with the project.)

_frkl6y ago

Thanks, this does really look interesting. I'd like to find something generic, lightweight to replace Kafka or Pulsar. Not sure this could be it, but it looks like it'd be worth having a look at...

zinxq6y ago

(Blockchain another "oplog" that ends up caring a lot about state eventually).

It's no wonder you can use them interchangeably in many common base cases.

linuxhansl6y ago

Perhaps it's not so much about pub/sub, but about store-and-forward.

When the "forward" part of "store-and-forward" is most important then Kafka is a fine solution.

marco_craveiro6y ago

MessageDB was doing the rounds in reddit the other day [1]. Looks interesting for simple use cases...

[1] https://www.reddit.com/r/PostgreSQL/comments/ebu6nh/message_...

gunnarmorling6y ago

That's basically the same pattern as the "outbox pattern", e.g. listed in Chris Richardson's pattern of microservices patterns.

There's support for outbox coming as part of Debezium out of the box [2].

Disclaimer: I'm working on Debezium.

[1] https://debezium.io/ [2] https://debezium.io/documentation/reference/1.0/configuratio...

nickjj6y ago

If anyone is using Elixir, Oban[0] is a job processor that uses PostgreSQL for its back-end and state management.

It's incredibly well written and I am using it in a project.

[0]: https://github.com/sorentwo/oban

vlasky6y ago

Meteor provides pub/sub with a MySQL backend using the atmosphere package vlasky:mysql.

It works by following the MySQL binary log and triggering a reactive query based on event conditions specified by the programmer, e.g. a change in a field.

https://atmospherejs.com/vlasky/mysql

slowhand096y ago

Oracle has a very advanced and flexible system for this. It is called Advanced Queueing.

Halluxfboy0096y ago

Ever use google's firebase? While not SQL -- I've always felt `tis a nice solution to persistence+async...

j / k navigate · click thread line to collapse