New Features Coming in PostgreSQL 10 (opens in new tab)

(rhaas.blogspot.com)

517 pointsioltas9y ago135 comments

135 comments

100 comments · 26 top-level

fiatjaf9y ago· 9 in thread

Ok, I'm not a database manager for enormous projects, so these changes may be great, but I don't understand them and don't care about them. Postgres is already the most awesome thing in Earth to me.

Still, if my opinion counts I think SELF-UPDATING MATERIALIZED VIEWS should be the next priority.

rhaas9y ago

The work that has been done on transition tables is intended to enable future work on automatically updated materialized views; the idea is that the system will automatically derive a query to update the view based on the deltas between the set of old rows and the set of new rows. That will take more work, though. I do agree it would be valuable. It's possible to set up similar things by writing your own triggers, and having transition tables available in PL/pgsql will make it easier, but it's not necessarily easy to figure it all out by hand for a complex view involving joins and aggregates.

pgaddict9y ago

I wonder why not to go for a changelog-based implementation. Instead of modifying the materialized view directly, write the changes into a changelog, and then update the matview in the background. More efficient, less locking issues, etc.

1 more reply

fiatjaf9y ago

Thank you.

rachbelaid9y ago

Some work started in this direction. I didn't follow closely the whole thread but I don't think that got commited in PG10.

https://www.postgresql.org/message-id/flat/20170119213859.GA...

More info in the EDB roadmap: https://wiki.postgresql.org/wiki/EnterpriseDB_database_serve...

Postgres has been amazing in shipping the foundation required to deliver complex feature.. Logical Replication is an example of it, all the piece commited in the last 6y allowed to make this patch achievable.

petepete9y ago

How do you mean? Couldn't you use a trigger to update the view?

ams61109y ago

IMHO triggers are almost always best avoided. There are some exceptions but most of the time you want changes to be explicit not happening "by magic" behind the scenes as a side-effect of something else.

mozumder9y ago

Triggers on materialized views are really error-prone and tedious. It's cache invalidation, which is hard.

phamilton9y ago

Effectively yes, but if it's that simple why not make this a built in functionality? Other DBs have it.

okket9y ago

A trigger on what? Every update, insert, delete, etc.? On every table in the view?

Even if that is possible, it may be a major performance killer. This has to be done internally, I think.

3 more replies

jacques_chester9y ago· 7 in thread

I deeply appreciate the great care that Postgres committers take in writing their merge messages.

I think of it as a sign of respect for future developers to take the time to write a clear account of what has happened.

atombender9y ago

Postgres is one of the few projects that still use a strict patch-oriented development process that's based almost entirely around mailing-list communication.

While core team members can commit directly to the repo, everyone else must submit the code changes for review to the pgsql-hackers mailing list as a clean, self-contained patch, where it's discussed and considered for inclusion. An accepted patch might be committed right away, or it will be queued up for the next scheduled "commitfest" [2], when patches are reviewed and finally committed to mainline. (I don't know how the commitfest interacts with git exactly; the commitfest database doesn't even link to git, only to email discussions.)

From the outside it seems a bit antiquated, but it's apparently been working well for them. The Postgres team is a pretty conservative bunch; they only switched from CVS to git in late 2010, for example.

They also really care about code quality, getting the design right early, and covering all possible edge cases. As a result, Postgres solid, clean, has unusually few legacy oddities, and almost never any subtle, suprising breaking changes. If you read the MySQL manual, it's absolutely littered with sloppy little breakages throughout its history: Like how, until 5.0.something, when comparing a "date" value with a "datetime" value, the time portion would be silently ignored and ('2017-04-08 14:04' = '2017-04-08') would return true; but they fixed that, and broke a lot of client code because they didn't stop to realize that a lot of developers depended on that behaviour.

[1] https://wiki.postgresql.org/wiki/Submitting_a_Patch

[2] https://commitfest.postgresql.org

lathiat9y ago

> If you read the MySQL manual, it's absolutely littered with sloppy little breakages throughout its history: Like how, until 5.0.something, when comparing a "date" value with a "datetime" value, the time portion would be silently ignored and ('2017-04-08 14:04' = '2017-04-08') would return true; but they fixed that, and broke a lot of client code because they didn't stop to realize that a lot of developers depended on that behaviour.

This is an interesting comment for two reasons. Firstly because a lot of people also complain about MySQL's archaic defaults which often stay too long because of upgrade concerns (though they fortunately are fixing a lot of them already or for MySQL 8.0 - hooray).

But also because it speaks volumes, in my opinion, about the MySQL documentation that these are documented in the first place. I worked at MySQL for 9 years and though it was always clear our manual was always a good source of information, now that I am working on Ubuntu & OpenStack it is painfully obvious just how good the MySQL documentation team and processes were compared to many other projects. Even just the version ChangeLog.

I'm not saying other projects don't get it right (and have no opinion at all about postgresql's documentation state), but MySQL seems to get it pretty right in general.

1 more reply

pgaddict9y ago

Linux kernel is another such project, I think.

One of the reasons why it's done this way (through mailing lists and not e.g. through pull requests on github) is that all the history is tracked in a way that's fully under control of the community. So it's fairly easy to find who/when submitted the patch, how it looked like, etc.

Of course, another reason is history - most of the process was established long before git, when CVS was the VCS.

scrollaway9y ago

Wine is another such project :)

But I gotta say it's only working for those projects because they have an extremely high barrier of entry to the code itself in the first place (working on projects like Wine and Postgres is scary, even though you can get started with easy stuff).

It also works for them because they have maintainers and core committers used to the workflow, already tooled on the workflow etc. But I wonder how much productivity would be gained by using a github-like flow maybe enhanced a bit.

1 more reply

js29y ago

Git itself is another such project.

anarazel9y ago

Being one of them, though not a native speaker which is more than sometimes noticeable, I'd not even describe it as caring for future developers. It's self-care. I've spent enough time staring at code changes made long ago, trying to understand the reasoning, that providing enough context for my future self is justification enough.

jacques_chester9y ago

I agree with that sentiment. I consider my future self to be an example of "other developers".

When I am working with peers on writing a commit message, I sometimes use the analogy of a newspaper. Any given newspaper is out of date very quickly. But we keep newspaper archives and store copies of every single newspaper.

Why? Because we don't know when we will need to refer to them, or which ones we will refer to. All that we know is that some of them will vital in future, as the journal of record.

And so it is with commit messages. We owe readers the courtesy of explaining our thinking.

djcj889y ago· 6 in thread

I did read the article, but I can't find any mention of addressing the "Write amplification" issue as described by Uber when they moved away from postgres. https://eng.uber.com/mysql-migration/ I had heard talk on Software Engineering Daily that this new major revision was supposed to address that.

Is this issue resolved by the new "Logical replication" feature? It doesn't seem directly related, but it seems like maybe that is what he is referring to in this blog post?

anarazel9y ago

There's a patch reducing write amplifications (when caused by indexes), by a significant degree. Unfortunately it didn't quite get ready in time for the feature freeze of 10 - as it affects the on-disk format, we considered the risk to be too high.

pavanvd9y ago

As the author of the patch I don't quite agree to it. But it's true that the patch did not receive adequate review even though most of the on-disk changes were known and coded at least 7 months before the feature freeze. So it's hard to tell which part of the patch wasn't ready. But there is always next cycle. So lets work towards getting it ready for v11.

snuxoll9y ago

Write amplification is a result of PostgreSQL's decision to not used clustered indexes, there's not much that can be done to avoid it without a massive redesign of the storage engine - though there are patches out there to reduce the penalty in some cases. In all reality though, Uber wanted a key-value store and not an RDBMS, MySQL was a better choice for this since InnoDB isn't much more than a fast K/V store (hence why MySQL uses clustered indexes).

anarazel9y ago

> Write amplification is a result of PostgreSQL's decision to not used clustered indexes, there's not much that can be done to avoid it without a massive redesign of the storage engine

I don't think that's entirely accurate - the issue is more that indexes contain pointers to the heap position (simplified) of a tuple, rather than being indirect and pointing to the primary key, which then is resolved by another index (be that clustered / primary or not).

Updates already don't have to update indexes iff none of the indexed columns change (HOT - Heap-Only-Tuples). The proposed change (WARM - write amplification reduction method), allows to avoid updating indexes on non-changing columns, even if other indexes change.

https://www.postgresql.org/message-id/CABOikdMNy6yowA+wTGK9R...

> In all reality though, Uber wanted a key-value store and not an RDBMS

Agreed on that.

evanelias9y ago

Not arguing with your assessment of Uber's requirements; but in general, why do you view InnoDB as not much more than a K/V store? And why do you equate clustered indexes with K/V stores?

InnoDB is a complex piece of software, supporting transactions, row-level locking, MVCC, schemas, secondary indexes, crash recovery, hot copy/backup, complex caching and buffering, many tunables, and extensive metrics visibility. Just because it's more appropriate for Uber's rather unusual EAV-like use-case, this doesn't mean InnoDB is a glorified K/V store.

Re: clustered indexes, it's a storage engine architecture choice with well-known trade-offs, both positive and negative. SQL Server also uses clustered indexes and is widely respected among database experts.

Regarding the topic overall, there are use-cases where Postgres is the best choice, and there are use-cases where it isn't. That doesn't inherently mean that other databases are uniformly worse. People like to trash MySQL, sometimes for completely valid reasons, but other times for FUD. But fwiw, several of the major features in Postgres 10 have already been supported in MySQL/InnoDB for a long time, in some cases for over a decade. Of course, that goes both ways; there are awesome major features that Postgres has had for a decade that MySQL still lacks.

1 more reply

frik9y ago

> massive redesign of the storage engine

Have the Postgres thought about adding support for more than one storage engine? Then they could implement new ideas in a fork, an one could run them side-by-side and migrate over to it.

https://www.postgresql.org/message-id/4CB597FF.1010403@cheap...

For example MySQL had been mocked for its old ISAM storage engine. Then MySQL added InnoDB as another storage engine, the SQL interface is the same.

2 more replies

nickpeterson9y ago· 6 in thread

Can anyone recommend a decently up to date book on postgres administration? Or are docs really the only way? I've used SQL Server for years but would likely choose postgres for an independent project if I intended to commercialize it. That said, I don't use it at work so it's hard to get in depth experience.

chillydawg9y ago

I'm not familiar with any books, but the docs really are excellent and have various sections for beginners and getting to know the system.

A good way in is to look at external tools like barman which manage dumps+streaming replication along with point-in-time restoration automatically for you rather than manually invoking all the stuff directly.

Mostly, postgres just works.

fiatjaf9y ago

There are no better docs than Postgres docs.

arc_of_descent9y ago

I understand your need for a book. I prefer to read a book when diving into a new tech. That being said, when I started out using PostgreSQL years back, there were only online docs, and I must say they are although lengthy at time, very good.

Also, pgAdmin?

pgaddict9y ago

There's PostgreSQL 9 Admin Cookbook from Simon Riggs, for example (disclosure: I work for Simon).

Packt has several other good books about PostgreSQL, but always check the author - they started publishing books authored by people entirely unknown in the community, that are "inspired" by book published before (you might also use "plagiarism" instead).

nickpeterson9y ago

Yeah packtpub is a real crapshoot. They're great in that they'll seemingly publish whatever tech subject you want to write about. The downside is they publish anything...

1 more reply

pgaddict9y ago

I just remembered there's also "PostgreSQL: Up and Running" published by O'Reilly. It deals with more stuff than just administration, but Regina O. Obe and Leo S. Hsu are good authors.

api9y ago· 6 in thread

The feature I'd really love is master selection with Raft or similar and automatic query redirection to the master for all write queries (and maybe for reads with a query keyword).

That would make it very easy and robust to cluster pg without requiring a big complicated (a.k.a. high admin overhead and failure prone) stack with lots of secondary tools.

This kind of fire and forget cluster is really the killer feature of things like MongoDB and RethinkDB. Yes people with really huge deployments might want something more tunable, but that's only like 1% of the market.

Of course those NoSQL databases also offer eventual and other weaker but more scalable consistency modes, but like highly tuned manual deployment these too are features for the 1% of the market that actually needs that kind of scale.

A fire and forget cluster-able fully consistent SQL database would be nirvana for most of the market.

mb4nck9y ago

About redirection of write queries to the master, from 10 on, you will be able to specify all members of the cluster in the connection string and demand to connect to the master (like "postgresql://host1:5432,host2:5432/somedb?target_session_attrs=read-write"); libpq will do this automatically for you then, see the parameters "host" (now plural) and "target_session_attr" in section 33.1.2. here: https://www.postgresql.org/docs/devel/static/libpq-connect.h...

About raft-based leader-election, I believe the current recommendation is to look at patroni ( https://github.com/zalando/patroni), which has been built for docker and is now being integrated with Kubernetes; however, I don't think there is an inherent limitation that it couldn't be run on bare-metal.

rachbelaid9y ago

With 2ndquadrant working on Postgres-XL (http://www.postgres-xl.org/), I think that you can be confident that you will see a lot of the features being proposed to core postgres. It will just take some times to build the building block necessary like: global index, distributed sequence, repartition ...

I quite confident that the postgresql from 5y in future will be quite different in term of storage / server topology support. I won't be surprise pg_bouncer capacity to finally make its way to core when we have a coordinator.

Postgres has steady progression (even if not fast enough for some people) but they are moving without compromising robustness of their product for the users.

api9y ago

Yes on the first one, and a big nope on the second for now. I passionately loathe Rube Goldberg machine deployments and am the kind of engineer who constantly asks "do we really need that?". I love exterminating complexity. But maybe that will change when we get to millions of concurrent users and tens of millions of devices and actually need Kubernetes to scale.

Raft is not complex. I doubt leader elect would be terribly hard to implement.

xyzzy_plugh9y ago

Can't pgbouncer/pgpool2 solve query redirection? I don't understand the desire for all-in-one solutions.

api9y ago

That desire comes from three places:

1. Minimize cognitive load by minimizing the number of things you have to learn.

2. Minimize deployment complexity and dependencies.

3. Complexity is just evil I'm general. Linear increases in complexity result in exponential increases in bugs, vulnerabilities, and failure modes. It's just combinatorics.

2 more replies

manigandham9y ago

> I don't understand the desire for all-in-one solutions

You really don't understand it? It's less moving pieces to think about, worry about, read about, deploy, maintain, fix and almost always leads to better performance and security.

Something as simple as connection pooling should've already been part of the database and query redirection is even more important to have included.

1 more reply

acdha9y ago· 6 in thread

What's the ops experience for a replicated setup like these days? i.e. assuming you want basic fault-tolerance at non-exotic size / activity levels, how much of a job is someone acquiring if, say, there are reasons they can't just use AWS RDS?

Heliosmaster9y ago

Streaming replication isn't hard at all: http://davide.im/setting-up-a-failover-database-for-postgres...

scurvy9y ago

It's not hard to setup initially, but I'll admit that it's not very good.

It's not very good in a long-lived scenario where you're changing your replication topology for routine maintenance tasks. Changing from master to replica is easy, but now you have to rebuild that original master off of the former replica now. Completely start over. You can't just start up again from a given transaction ID. MySQL's GTID implementation is much better in this regard. You can change masters and replicas all repoint them without rebuilding. You can't do that (currently) with Postgresql. It's a major pain point.

1 more reply

api9y ago

Yes but here's the problem. Consider common scenarios like:

Master goes down. Slave takes over. Master comes back. Slave goes down 10 minutes later. Repeat.

This is common in e.g. multi data center replication and is often due to transient network failures. Netflix has a great open source tool called chaos monkey that can induce lots of random failure scenarios like this or much worse. Don't get me started on transient partial failures due to latency and packet loss spikes.

The manual nature of pg replication setup makes me really nervous here. What happens when it finds itself in a state where manual intervention is needed? You are now down.

This is tolerable for big companies with dedicated SREs and DBAs and enough of them that it's easy to always have someone on call, but it's a nightmare for smaller ventures. Even for larger ventures this adds a lot of cost overhead.

Like I said elsewhere this was really the true killer feature of the more successful NoSQL document store type databases. Everything else was largely hype.

We switched recently to RethinkDB for this reason. We miss the richness of SQL (to the point that we still use PG too for warehousing and analytics) but in return we got incredible robustness across three data centers. Of course our app does not need rich queries or strong consistency 99% of the time so YMMV. For some jobs ACID and complex queries on live data are not optional.

mb4nck9y ago

Note that starting from Postgres 10 (which this thread is about), you don't need to adjust wal_level and max_wal_senders (or max_replication_slots, for that matter) anymore. You still have to enable hot_standby=on on the standbys, though.

(and it is in general a good idea to keep the configuration the same as much as possible between primary and standbys).

acdha9y ago

Thanks! I was asking here mostly out of curiousity about how people felt after running it for awhile since it has certainly sounded like it has improved massively since I last dealt with it in the 8.x era.

chillydawg9y ago

get two or three dedicated servers, stick a 2 disk mirrored raid in each one, run one primary and have barman manage your snapshots+streaming replication. hook in some cronjobs to monitor age of stuff and failure of stuff. that is pretty much it for anything less than, say, 5TB. Beyond that your copy times are too high and making snapshots becomes extremely difficult.

Normal_gaussian9y ago· 5 in thread

Extended Statistics! I was following the replication changes, but have just discovered the extended statistics and am more excited about them.

The directory renaming at the bottom of the post is interesting - I wonder if many other projects have to do things like this?

anarazel9y ago

> The directory renaming at the bottom of the post is interesting - I wonder if many other projects have to do things like this?

The background is that, over the years, a number of people deleted the pg_xlog and pg_clog directories when they noticed they're running out of space, thinking it's just server logs. Unfortunately that's the directories containing the database journal, and transaction status (committed/aborted/in-progress). Which means they'll loose data. The idea is to rename them to something that's less likely to be mistaken for unimportant data.

pgaddict9y ago

To be fair, the extended statistics available in PG10 are about the most primitive ones possible. If the PG10 stats help with your queries, great, but otherwise it's mainly laying infrastructure for the more advanced stuff - histograms, MCV lists, expression statistics, and ultimately join statistics (which is about the main source of estimation issues). Oleg also mentioned it might be useful for JSON statistics, which would be cool.

Hopefully those bits will get in faster - I first submitted the patch in 2014, just before pgconf.eu, I think. OTOH I can't really complain, because I'm shitty developer so the initial versions were far from committable. The quality requirements for PostgreSQL patches are damn high these days.

BTW if you have examples of real-world queries hurt by poor estimates, report them to pgsql-performance mailing list. It's an important piece of information about what cases to look at first. Obviously, we already have already collected various queries, but having more is good.

JoachimSchipper9y ago

Having good directory names helps a lot; I'd be very surprised if other projects didn't also need to make it clear to admins what's happening.

(On the other hand, some projects need to make things _less_ clear - https://github.com/mackyle/sqlite/blob/3cf493d4018042c70a4db... - "users would (...) call [the developers] to wake them up at night and complain".)

frik9y ago

It would be great if some Linux distros clear up the directory mess. There are directories in there with names that no ones remembers what they originally meant in the UNIX of 1970s, or what ever. For compatibility they could be just hard/soft-links to a more sane directory structure.

Well the same goes for Windows. With Win95, WinNT 3.5, WinXP, WinVista they restructured the internal directory tree and renamed things. It was okay with WinXP, just the long user folder was trouble some because of 260 chars MAX_PATH limit. But with Vista and 64-bit support the fucked up and it's now a big mess in Win7+ (syswow64, system32, registry, winsxs, dotNet folders, ... such a big mess and sometimes also waste of HDD space by duplicates of files).

barrkel9y ago

winsxs uses hard links - space wastage is more likely from more versions than just dupes. Also, many windows tools won't account t correctly for hard links in disk usage stats.

hartator9y ago· 4 in thread

I am considering more and more a move back from MongoDB to PostgreSQL. I will be missing being schema less so much though. Migrations - particularly Rails migrations - left a bad taste in my mouth. Anyone did the move recently and what are their feelings?

cwp9y ago

I did just that move when I realized that I was doing a lot of work so impose schemas on my "schemaless" data and another bunch of work to implement joins in my application.

I found the best way to do migrations is with vanilla SQL. I wrote a little tool to read migrations from SQL files in a directory, send them to the server and keep track of which ones have already been applied. Simple and easy.

The big benefit of migration is that your app code doesn't have to deal with every possible schema that you've ever used; it can rely on the data being uniform.

I'm very happy with the switch; wouldn't go back to Mongo for anything.

JohnDotAwesome9y ago

Yeah, all the things that need flexible schema can go in a jsonb column. While querying on JSON has gotten less painful, it's still a bit of a chore. But I've found that I rarely need to do that. Or if I do, I just denormalize a bit and put those fields in a regular ol' column.

We did the move maybe 4-5 years ago? At least in the JavaScript world, this makes your life so so much better.

Of course, you still have to handle migrations, but at least you have transactions :)

stemcc9y ago

You can easily have schema-less with Postgres's jsonb data type.

hartator9y ago

Not really. Postgres ORMs are not meant to do schema-less and tables still need to be created.

3 more replies

mark_l_watson9y ago· 4 in thread

I know that several RDF data stores use PostgreSQL as a backend data store. With new features like better XML support, as well as older features for storing hierarchical data, I am wishing for a plugin or extension for handling RDF with limited (not RDFS or OWL) SPARQL query support. I almost always have PostgreSQL available, and for RDF applications it would be very nice to not have to run a separate service.

I tend to view PostgreSQL as a "Swiss Army knife" and having native RDF support would reinforce that.

anarazel9y ago

As one of the people regularly working on postgres, I'm honestly a bit doubtful that it's realistic to add SPARQL frontend. Doing that well is a considerable amount of work, and there's relatively little overlap in experience between the communities. I suspect that focusing on other areas will have a bigger ROI.

But that's just my personal opinion, and other contributors and companies might very well disagree.

frik9y ago

Which RDF data store uses Postgres as DB backend? And can one import WikiData? (does it scale) (I would rather avoid these old school RDF special case stores from SematicWeb days 10 years ago.)

kuschku9y ago

https://github.com/cayleygraph/cayley is currently on the frontpage of HN, and it does use PGSQL as backend.

1 more reply

oever9y ago

Can you say which RDF data stores these are? Are they generic or special purpose?

avar9y ago· 3 in thread

This bit about ICU support v.s. glibc:

    > [...] Furthermore, at least on Red Hat, glibc regularly whacks
    > around the behavior of OS-native collations in minor releases,
    > which effectively corrupts PostgreSQL's indexes, since the index
    > order might no longer match the (revised) collation order.  To
    > me, changing the behavior of a widely-used system call in a
    > maintenance release seems about as friendly as locking a family
    > of angry racoons in someone's car, but the glibc maintainers
    > evidently don't agree.

Is a reference to the PostgreSQL devs wanting to make their index order a function of strxfrm() calls and to not have it change when glibc updates, whereas some on the glibc list think it should only be used for feeding it to the likes of strcmp() in the same process:

    > The only thing that matters about strxfrm output is its strcmp
    > ordering.  If that changes, it's either a bug fix or a bug
    > (either in the code or in the locale data).  If the string
    > contents change but the ordering doesn't, then it's an
    > implementation detail that is allowed to change.

-- https://sourceware.org/ml/libc-alpha/2015-09/msg00197.html

JoachimSchipper9y ago

Florian Weimer's reply is also interesting:

"Why do you think that? I don't see this documented anywhere, and I doubt it is something many readers of the C standard, the man page, or the glibc manual would expect.

The manual suggests to store the strxfrm output and use it for sorting. I expect that some applications put it into on-disk database indexes as a result. This will lead to subtle breakage on glibc updates.

(The larger problem is that there are definitely databases out there which use B-tree indexes in locale collation order, which break in even more subtle ways if we make minor changes to the collation order.)"

ajross9y ago

Which manual suggests storing the output of strxfrm? The glibc man page doesn't seem to.

I don't know that this is resolvable. The documented behavior of strxfrm() is just about its output properties. Improvements to the transformation algorithm would be expected to be made, if it's improvable.

If a database needs this to be static over time it needs to pick a particular transformation algorithm and specify it exactly, not just rely on whatever the C library happens to provide.

I mean, not only are PostgreSQL locale-sorted-indexes not portable across glibc releases. They aren't portable across any other system change either. No moving between distros or doing distro upgrades, etc... Those are all misfeatures probably worth fixing.

4 more replies

paulddraper9y ago

Well what do you expect?

Patch releases are for bug fixes. If you can't handle any change in behavior, including a bug fix, then you shouldn't be upgrading.

I understand the problem, and kudos to Postresql for figuring out a solution, but railing on glibc for fixing bugs in patch releases makes about as much sense as breeding raccoon families to chuck into people's cars.

lazzlazzlazz9y ago· 3 in thread

How is Postgres so consistently the best open-source DB project from features to documentation? It's unreal.

int_19h9y ago

Not just a DB project, either. I'd say it's one of the best executed (in a very broad sense of the word) open source projects around, in general.

From end user perspective, they have stable, quality releases with a predictable cycle and subsequent maintenance releases. They have great documentation - one of the best in the industry, much less open source. Things generally work as you'd expect them to, and when not (e.g. for historical or implementation reasons), you have clear and convincing explanations. And so on.

I haven't seen their developer side, but based on other people's feedback, it's also good - high quality bar for code, stringent review process etc. More importantly, they seem to be making the right (= leading to more stable quality releases with great features) technical decisions consistently, which to me is a hallmark of a very well run team.

I also can't remember any publicized "drama" around Postgres, either on the inside (dev disagreements etc), or between the team and the users. It looks like everyone's happy, or at least happy enough.

I don't know what the magic sauce is here, but it feels like many other open source projects could learn a lot from the Postgres team and community.

akurilin9y ago

I second this, the Postgres contributors are consistently setting the bar for the rest of OSS projects out there, it's consistently been my favorite part of the stack for a long time now.

Don't forget about their phenomenal #postgresql channel on Freenode. The folks working on Postgres have been gracious enough to patiently answer my not always fully baked questions for the past 5 years on there, they're a bottomless treasure trove of best practices and pragmatic advice.

dhbx99y ago

I'd say it is the best DB hands down. Open source or otherwise.

elvinyung9y ago· 3 in thread

Dumb question: does declarative partitioning pave the way for native sharding in Postgres? I'm not super super familiar, but it seems like along with some other features coming in Postgres 10, like parallel queries and logical replication, that this is eventually the goal.

rhaas9y ago

I hope that it will have that effect. We need a few other features first: partitionwise join, partitionwise aggregate, asynchronous query, and ideally hash partitioning.

qeternity9y ago

Doesn't this basically replicate all of the work done by Citus?

elvinyung9y ago

I see -- thanks! Really cool stuff.

hodgesrm9y ago· 3 in thread

Impressive feature list. Glad to see logical replication is finally making it in.

brianwawok9y ago

What is you use case for it? My only thought was sending just one table to replica to be used to do analytics on ..

rhaas9y ago

Replication across major versions, for example to upgrade without downtime. Partial replication, to distribute shared data across a series of clusters, or for analytics and reporting as you mention. Replicating the data without replicating any table bloat. Being able to do limited writes (e.g. to temporary tables) on the standby. http://rhaas.blogspot.com/2011/02/case-for-logical-replicati...

1 more reply

bladecatcher9y ago

Isn't analytics a massive usecase in itself?

iEchoic9y ago· 2 in thread

I'm so excited for table partitioning. I use table inheritance in several places in my current project, but have felt the pain of foreign key constraints not applying to inherited children. Reading about table partitioning, I'm realizing that this is a much better fit for my use case.

Postgres continues to amaze me with the speed at which they introduce the right features into such a heavily-used and production-critical product. Thanks Postgres team!

amitlan9y ago

Unfortunately, foreign keys won't be supported right away.

Read about the new feature and its limitations here: https://www.postgresql.org/docs/devel/static/ddl-partitionin...

iEchoic9y ago

Thanks, I hadn't read this. That's too bad, hopefully we'll see that in the future (if it's technically possible at all?). That'd be a huge feature for me.

smac89y ago· 2 in thread

Wow, so awesome. I do hope at some point we can see some language improvements to PLPGSQL. More basic data structures could go a long way in making that language really useful, and I still consider views/stored procedures a superior paradigm to client side sql logic

rhaas9y ago

I agree with you that stored procedures are superior to client-side logic, because it means that you can have multiple routes of access to the database and all of them enforce the same business logic. But what exactly do you mean by "more basic data structures"?

pgaddict9y ago

PL/SQL has various types of collections, for example, that are super-useful when you need to do more complicated processing without having to create temporary tables and such.

ams61109y ago· 2 in thread

A question on this statement, in the SCRAM authentication description: stealing the hashed password from the database or sniffing it on the wire is equivalent to stealing the password itself

How is that the case? That's exactly the thing that hashed passwords prevent. Of course, if it's just an MD5 hash that's feasibly vulnerable to brute-forcing today, but it's still not "equivalent" to having the clear-text password.

jhgg9y ago

The point is that you only send the hash to the database to connect. If you steal the hash, you can connect to the database using said hash, not needing the plaintext. The password might as well be the hash in this case. Hence the equivalency.

Using that scheme, all you prove is that you know the hash of the password. SCRAM allows you to prove you know the plaintext password without actually transmitting it.

xyzzy_plugh9y ago

If you steal the hash from the database, yes. I don't know how stealing the hash over-the-wire is equivalent to having the password, since it is salted (with a salt generated by the server) and is not reusable.

1 more reply

mozumder9y ago· 2 in thread

I could use a count of the number of file I/Os that each query takes, in order to optimize my queries further...

anarazel9y ago

That's been there for a while:

    EXPLAIN (ANALYZE, BUFFERS) yourquery;

If you enable track_io_timing (has some overhead on platforms with slow timestamps, e.g. older VMware), you even get timing.

If you want that aggregated, rather than for an individual query, you should look into pg_stat_statements.

mozumder9y ago

The BUFFERS count is more for row count info as it operates on large chunks of data, instead of index optimization that needs to count how many times index structures are accessed. Counting IOs directly would be more useful for tuning indexes.

1 more reply

bladecatcher9y ago· 1 in thread

This is great because I couldn't go to production with earlier releases of logical decoding. Now we don't have to depend on a third party add on!

felixge9y ago

We're currently experimenting with logical decoding in 9.6, so I'd be curious to hear what problems you've been running into.

qaq9y ago

Even a single feature from the list would make 10 an amazing release, all of them together is just unbelievable. Very happy we are using PG :)

jordanthoms9y ago

Will DDL replication for the logical replication be landing in 10 or later?

We have some use cases where logical replication would be very helpful, but keeping the schema in sync manually seems like a pain - will there be a documented workaround if DDL replication doesn't make it in?

StreamBright9y ago

For analytical loads the following is going to be great:

  While PostgreSQL 9.6 offers parallel query, this feature 
  has been significantly improved in PostgreSQL 10, with new 
  features like Parallel Bitmap Heap Scan, Parallel Index 
  Scan, and others.  Speedups of 2-4x are common with 
  parallel query, and these enhancements should allow those 
  speedups to happen for a wider variety of queries.

knv9y ago

Any recommendations for scaling Postgresql's best practices? Really appreciate it.

awinter-py9y ago

fascinating that the road to improving the expr evaluator is better opcode dispatch and jit -- same tradeoffs every programming language project is looking at right now.

qxmat9y ago

DECLARE @please VARCHAR(3) = '???';

MR4D9y ago

You guys are awesome - keep up the good work!

awinter-py9y ago

the join speedup for provably unique operands sounds awesome

j / k navigate · click thread line to collapse

135 comments

100 comments · 26 top-level

fiatjaf9y ago· 9 in thread

Ok, I'm not a database manager for enormous projects, so these changes may be great, but I don't understand them and don't care about them. Postgres is already the most awesome thing in Earth to me.

Still, if my opinion counts I think SELF-UPDATING MATERIALIZED VIEWS should be the next priority.

rhaas9y ago

pgaddict9y ago

1 more reply

fiatjaf9y ago

Thank you.

rachbelaid9y ago

Some work started in this direction. I didn't follow closely the whole thread but I don't think that got commited in PG10.

https://www.postgresql.org/message-id/flat/20170119213859.GA...

More info in the EDB roadmap: https://wiki.postgresql.org/wiki/EnterpriseDB_database_serve...

petepete9y ago

How do you mean? Couldn't you use a trigger to update the view?

ams61109y ago

mozumder9y ago

Triggers on materialized views are really error-prone and tedious. It's cache invalidation, which is hard.

phamilton9y ago

Effectively yes, but if it's that simple why not make this a built in functionality? Other DBs have it.

okket9y ago

A trigger on what? Every update, insert, delete, etc.? On every table in the view?

Even if that is possible, it may be a major performance killer. This has to be done internally, I think.

3 more replies

jacques_chester9y ago· 7 in thread

I deeply appreciate the great care that Postgres committers take in writing their merge messages.

I think of it as a sign of respect for future developers to take the time to write a clear account of what has happened.

atombender9y ago

Postgres is one of the few projects that still use a strict patch-oriented development process that's based almost entirely around mailing-list communication.

[1] https://wiki.postgresql.org/wiki/Submitting_a_Patch

[2] https://commitfest.postgresql.org

lathiat9y ago

I'm not saying other projects don't get it right (and have no opinion at all about postgresql's documentation state), but MySQL seems to get it pretty right in general.

1 more reply

pgaddict9y ago

Linux kernel is another such project, I think.

Of course, another reason is history - most of the process was established long before git, when CVS was the VCS.

scrollaway9y ago

Wine is another such project :)

1 more reply

js29y ago

Git itself is another such project.

anarazel9y ago

jacques_chester9y ago

I agree with that sentiment. I consider my future self to be an example of "other developers".

Why? Because we don't know when we will need to refer to them, or which ones we will refer to. All that we know is that some of them will vital in future, as the journal of record.

And so it is with commit messages. We owe readers the courtesy of explaining our thinking.

djcj889y ago· 6 in thread

Is this issue resolved by the new "Logical replication" feature? It doesn't seem directly related, but it seems like maybe that is what he is referring to in this blog post?

anarazel9y ago

pavanvd9y ago

snuxoll9y ago

anarazel9y ago

> Write amplification is a result of PostgreSQL's decision to not used clustered indexes, there's not much that can be done to avoid it without a massive redesign of the storage engine

https://www.postgresql.org/message-id/CABOikdMNy6yowA+wTGK9R...

> In all reality though, Uber wanted a key-value store and not an RDBMS

Agreed on that.

evanelias9y ago

Not arguing with your assessment of Uber's requirements; but in general, why do you view InnoDB as not much more than a K/V store? And why do you equate clustered indexes with K/V stores?

1 more reply

frik9y ago

> massive redesign of the storage engine

Have the Postgres thought about adding support for more than one storage engine? Then they could implement new ideas in a fork, an one could run them side-by-side and migrate over to it.

https://www.postgresql.org/message-id/4CB597FF.1010403@cheap...

For example MySQL had been mocked for its old ISAM storage engine. Then MySQL added InnoDB as another storage engine, the SQL interface is the same.

2 more replies

nickpeterson9y ago· 6 in thread

chillydawg9y ago

I'm not familiar with any books, but the docs really are excellent and have various sections for beginners and getting to know the system.

Mostly, postgres just works.

fiatjaf9y ago

There are no better docs than Postgres docs.

arc_of_descent9y ago

Also, pgAdmin?

pgaddict9y ago

There's PostgreSQL 9 Admin Cookbook from Simon Riggs, for example (disclosure: I work for Simon).

nickpeterson9y ago

Yeah packtpub is a real crapshoot. They're great in that they'll seemingly publish whatever tech subject you want to write about. The downside is they publish anything...

1 more reply

pgaddict9y ago

I just remembered there's also "PostgreSQL: Up and Running" published by O'Reilly. It deals with more stuff than just administration, but Regina O. Obe and Leo S. Hsu are good authors.

api9y ago· 6 in thread

The feature I'd really love is master selection with Raft or similar and automatic query redirection to the master for all write queries (and maybe for reads with a query keyword).

That would make it very easy and robust to cluster pg without requiring a big complicated (a.k.a. high admin overhead and failure prone) stack with lots of secondary tools.

A fire and forget cluster-able fully consistent SQL database would be nirvana for most of the market.

mb4nck9y ago

rachbelaid9y ago

Postgres has steady progression (even if not fast enough for some people) but they are moving without compromising robustness of their product for the users.

api9y ago

Raft is not complex. I doubt leader elect would be terribly hard to implement.

xyzzy_plugh9y ago

Can't pgbouncer/pgpool2 solve query redirection? I don't understand the desire for all-in-one solutions.

api9y ago

That desire comes from three places:

1. Minimize cognitive load by minimizing the number of things you have to learn.

2. Minimize deployment complexity and dependencies.

3. Complexity is just evil I'm general. Linear increases in complexity result in exponential increases in bugs, vulnerabilities, and failure modes. It's just combinatorics.

2 more replies

manigandham9y ago

> I don't understand the desire for all-in-one solutions

You really don't understand it? It's less moving pieces to think about, worry about, read about, deploy, maintain, fix and almost always leads to better performance and security.

Something as simple as connection pooling should've already been part of the database and query redirection is even more important to have included.

1 more reply

acdha9y ago· 6 in thread

Heliosmaster9y ago

Streaming replication isn't hard at all: http://davide.im/setting-up-a-failover-database-for-postgres...

scurvy9y ago

It's not hard to setup initially, but I'll admit that it's not very good.

1 more reply

api9y ago

Yes but here's the problem. Consider common scenarios like:

Master goes down. Slave takes over. Master comes back. Slave goes down 10 minutes later. Repeat.

The manual nature of pg replication setup makes me really nervous here. What happens when it finds itself in a state where manual intervention is needed? You are now down.

Like I said elsewhere this was really the true killer feature of the more successful NoSQL document store type databases. Everything else was largely hype.

mb4nck9y ago

(and it is in general a good idea to keep the configuration the same as much as possible between primary and standbys).

acdha9y ago

chillydawg9y ago

Normal_gaussian9y ago· 5 in thread

Extended Statistics! I was following the replication changes, but have just discovered the extended statistics and am more excited about them.

The directory renaming at the bottom of the post is interesting - I wonder if many other projects have to do things like this?

anarazel9y ago

> The directory renaming at the bottom of the post is interesting - I wonder if many other projects have to do things like this?

pgaddict9y ago

JoachimSchipper9y ago

Having good directory names helps a lot; I'd be very surprised if other projects didn't also need to make it clear to admins what's happening.

frik9y ago

barrkel9y ago

winsxs uses hard links - space wastage is more likely from more versions than just dupes. Also, many windows tools won't account t correctly for hard links in disk usage stats.

hartator9y ago· 4 in thread

cwp9y ago

I did just that move when I realized that I was doing a lot of work so impose schemas on my "schemaless" data and another bunch of work to implement joins in my application.

The big benefit of migration is that your app code doesn't have to deal with every possible schema that you've ever used; it can rely on the data being uniform.

I'm very happy with the switch; wouldn't go back to Mongo for anything.

JohnDotAwesome9y ago

We did the move maybe 4-5 years ago? At least in the JavaScript world, this makes your life so so much better.

Of course, you still have to handle migrations, but at least you have transactions :)

stemcc9y ago

You can easily have schema-less with Postgres's jsonb data type.

hartator9y ago

Not really. Postgres ORMs are not meant to do schema-less and tables still need to be created.

3 more replies

mark_l_watson9y ago· 4 in thread

I tend to view PostgreSQL as a "Swiss Army knife" and having native RDF support would reinforce that.

anarazel9y ago

But that's just my personal opinion, and other contributors and companies might very well disagree.

frik9y ago

Which RDF data store uses Postgres as DB backend? And can one import WikiData? (does it scale) (I would rather avoid these old school RDF special case stores from SematicWeb days 10 years ago.)

kuschku9y ago

https://github.com/cayleygraph/cayley is currently on the frontpage of HN, and it does use PGSQL as backend.

1 more reply

oever9y ago

Can you say which RDF data stores these are? Are they generic or special purpose?

avar9y ago· 3 in thread

This bit about ICU support v.s. glibc:

    > [...] Furthermore, at least on Red Hat, glibc regularly whacks
    > around the behavior of OS-native collations in minor releases,
    > which effectively corrupts PostgreSQL's indexes, since the index
    > order might no longer match the (revised) collation order.  To
    > me, changing the behavior of a widely-used system call in a
    > maintenance release seems about as friendly as locking a family
    > of angry racoons in someone's car, but the glibc maintainers
    > evidently don't agree.

    > The only thing that matters about strxfrm output is its strcmp
    > ordering.  If that changes, it's either a bug fix or a bug
    > (either in the code or in the locale data).  If the string
    > contents change but the ordering doesn't, then it's an
    > implementation detail that is allowed to change.

-- https://sourceware.org/ml/libc-alpha/2015-09/msg00197.html

JoachimSchipper9y ago

Florian Weimer's reply is also interesting:

"Why do you think that? I don't see this documented anywhere, and I doubt it is something many readers of the C standard, the man page, or the glibc manual would expect.

ajross9y ago

Which manual suggests storing the output of strxfrm? The glibc man page doesn't seem to.

If a database needs this to be static over time it needs to pick a particular transformation algorithm and specify it exactly, not just rely on whatever the C library happens to provide.

4 more replies

paulddraper9y ago

Well what do you expect?

Patch releases are for bug fixes. If you can't handle any change in behavior, including a bug fix, then you shouldn't be upgrading.

lazzlazzlazz9y ago· 3 in thread

How is Postgres so consistently the best open-source DB project from features to documentation? It's unreal.

int_19h9y ago

Not just a DB project, either. I'd say it's one of the best executed (in a very broad sense of the word) open source projects around, in general.

I don't know what the magic sauce is here, but it feels like many other open source projects could learn a lot from the Postgres team and community.

akurilin9y ago

I second this, the Postgres contributors are consistently setting the bar for the rest of OSS projects out there, it's consistently been my favorite part of the stack for a long time now.

dhbx99y ago

I'd say it is the best DB hands down. Open source or otherwise.

elvinyung9y ago· 3 in thread

rhaas9y ago

I hope that it will have that effect. We need a few other features first: partitionwise join, partitionwise aggregate, asynchronous query, and ideally hash partitioning.

qeternity9y ago

Doesn't this basically replicate all of the work done by Citus?

elvinyung9y ago

I see -- thanks! Really cool stuff.

hodgesrm9y ago· 3 in thread

Impressive feature list. Glad to see logical replication is finally making it in.

brianwawok9y ago

What is you use case for it? My only thought was sending just one table to replica to be used to do analytics on ..

rhaas9y ago

1 more reply

bladecatcher9y ago

Isn't analytics a massive usecase in itself?

iEchoic9y ago· 2 in thread

Postgres continues to amaze me with the speed at which they introduce the right features into such a heavily-used and production-critical product. Thanks Postgres team!

amitlan9y ago

Unfortunately, foreign keys won't be supported right away.

Read about the new feature and its limitations here: https://www.postgresql.org/docs/devel/static/ddl-partitionin...

iEchoic9y ago

Thanks, I hadn't read this. That's too bad, hopefully we'll see that in the future (if it's technically possible at all?). That'd be a huge feature for me.

smac89y ago· 2 in thread

rhaas9y ago

pgaddict9y ago

PL/SQL has various types of collections, for example, that are super-useful when you need to do more complicated processing without having to create temporary tables and such.

ams61109y ago· 2 in thread

A question on this statement, in the SCRAM authentication description: stealing the hashed password from the database or sniffing it on the wire is equivalent to stealing the password itself

jhgg9y ago

Using that scheme, all you prove is that you know the hash of the password. SCRAM allows you to prove you know the plaintext password without actually transmitting it.

xyzzy_plugh9y ago

1 more reply

mozumder9y ago· 2 in thread

I could use a count of the number of file I/Os that each query takes, in order to optimize my queries further...

anarazel9y ago

That's been there for a while:

    EXPLAIN (ANALYZE, BUFFERS) yourquery;

If you enable track_io_timing (has some overhead on platforms with slow timestamps, e.g. older VMware), you even get timing.

If you want that aggregated, rather than for an individual query, you should look into pg_stat_statements.

mozumder9y ago

1 more reply

bladecatcher9y ago· 1 in thread

This is great because I couldn't go to production with earlier releases of logical decoding. Now we don't have to depend on a third party add on!

felixge9y ago

We're currently experimenting with logical decoding in 9.6, so I'd be curious to hear what problems you've been running into.

qaq9y ago

Even a single feature from the list would make 10 an amazing release, all of them together is just unbelievable. Very happy we are using PG :)

jordanthoms9y ago

Will DDL replication for the logical replication be landing in 10 or later?

StreamBright9y ago

For analytical loads the following is going to be great:

  While PostgreSQL 9.6 offers parallel query, this feature 
  has been significantly improved in PostgreSQL 10, with new 
  features like Parallel Bitmap Heap Scan, Parallel Index 
  Scan, and others.  Speedups of 2-4x are common with 
  parallel query, and these enhancements should allow those 
  speedups to happen for a wider variety of queries.

knv9y ago

Any recommendations for scaling Postgresql's best practices? Really appreciate it.

awinter-py9y ago

fascinating that the road to improving the expr evaluator is better opcode dispatch and jit -- same tradeoffs every programming language project is looking at right now.

qxmat9y ago

DECLARE @please VARCHAR(3) = '???';

MR4D9y ago

You guys are awesome - keep up the good work!

awinter-py9y ago

the join speedup for provably unique operands sounds awesome

j / k navigate · click thread line to collapse