What do you suggest is the sweet spot for document size and "hotness"? Your cookbook [0] says "We suspect that an Automerge document is best suited to being a unit of collaboration between two people or a small group." Does that mean tens of kilobytes? Hundreds? More? And how much concurrent contention is viable? And is the "atom of contention" the document as a whole, or do you have any plans for merging of sub-parts?
Also, do you have support for juggling multiple transports, either concurrently or back-to-back? In particular, I'm thinking about synchronizing via the cloud when connected, and falling back to peer-to-peer when offline. In that peer-to-peer case, how many peers can I have, and can my peer network behave as a mesh, or must it stick together to some degree?
And finally, it looks like your tutorial [1] doesn't actually exist! You refer to it in a blog post [2], but it's a dead link.
[0] https://automerge.org/docs/cookbook/modeling-data/
As for network transports, you can indeed have multiple at once. I usually run a mix of in-browser transports (MessageChannels) and WebSocket connections. I suspect we'll need to do a little adjusting to account for prioritization once people really start to push on this with things like mDNS vs. relay-server connections, but the design should accommodate that just fine.
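To sketch the idea of juggling several transports at once: the toy fan-out below is NOT the automerge-repo API (which takes a list of network adapters in its config); it just models broadcasting sync messages over whichever transports are currently connected, so a dead cloud relay doesn't block peer-to-peer delivery.

```typescript
// Illustrative sketch only, not automerge-repo's actual interface.
interface Transport {
  name: string;
  connected: boolean;
  send(msg: Uint8Array): void;
}

class SyncFanout {
  private transports: Transport[];

  constructor(transports: Transport[]) {
    this.transports = transports;
  }

  // Send a sync message over every live transport; returns the
  // names of the transports that actually carried it.
  broadcast(msg: Uint8Array): string[] {
    const delivered: string[] = [];
    for (const t of this.transports) {
      if (t.connected) {
        t.send(msg);
        delivered.push(t.name);
      }
    }
    return delivered;
  }
}

// Usage: a cloud relay that is offline plus a live in-browser peer.
const sent: string[] = [];
const relay: Transport = { name: "websocket-relay", connected: false, send: () => sent.push("relay") };
const peer: Transport = { name: "message-channel", connected: true, send: () => sent.push("peer") };
const fanout = new SyncFanout([relay, peer]);
const delivered = fanout.broadcast(new Uint8Array([1, 2, 3]));
console.log(delivered); // only the connected transport's name
```

Prioritization (prefer mDNS/local peers over a relay, say) would slot in as an ordering or filtering step inside `broadcast`.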
As for the docs, my apologies. The "tutorial" was merged into the quickstart as part of extensive documentation upgrades over the last few months. We should update the link in the old blog post accordingly.
Here's a link to save you the effort: https://automerge.org/docs/quickstart/
So if I smoosh everything in my sorta “collaboration context” together into one document, are there any provisions for delta updates on the wire? Your browser-side storage format sounds like it’s compatible with that approach, but what about clients that are far apart version-wise? Are you storing full relay history and also a snapshot?
I see in your format docs [0] that you store change chunks. Are these exposed in the API for atomicity at all? Are there any atomicity guarantees?
And you discuss backends, but I don’t see any pointers to an S3 or Postgres implementation. Is that something you’re keeping closed source for your business model, or am I just missing something?
I haven’t found anything about authorization. Have you done any work there? I quite like the Firebase model, in which you can write simple validation rules that evaluate against the document itself: “only allow users who are listed in path `members` to write to this document” or whatever.
[0] https://automerge.org/automerge-binary-format-spec/#chunk-co...
The backends you see are the ones I use, but the API is a binary-blob key-value store with range queries; supporting other stores should be straightforward.
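To make that concrete, here's a hypothetical in-memory sketch of the shape of backend described: a binary-blob key-value store with range (prefix) queries. The method names and key encoding are illustrative, not the real automerge-repo storage interface, but an S3 or Postgres backend would implement the same handful of operations.

```typescript
// Illustrative sketch of a binary-blob KV store with range queries.
type StorageKey = string[]; // e.g. ["docId", "chunkType", "chunkId"]

class InMemoryStorage {
  private blobs = new Map<string, Uint8Array>();

  private static encode(key: StorageKey): string {
    return key.join("\u0000"); // unambiguous separator for prefix matching
  }

  save(key: StorageKey, data: Uint8Array): void {
    this.blobs.set(InMemoryStorage.encode(key), data);
  }

  load(key: StorageKey): Uint8Array | undefined {
    return this.blobs.get(InMemoryStorage.encode(key));
  }

  // Range query: everything under a key prefix,
  // e.g. all stored chunks belonging to one document.
  loadRange(prefix: StorageKey): Uint8Array[] {
    const p = InMemoryStorage.encode(prefix) + "\u0000";
    const out: Uint8Array[] = [];
    for (const [k, v] of this.blobs) {
      if (k.startsWith(p)) out.push(v);
    }
    return out;
  }

  removeRange(prefix: StorageKey): void {
    const p = InMemoryStorage.encode(prefix) + "\u0000";
    for (const k of [...this.blobs.keys()]) {
      if (k.startsWith(p)) this.blobs.delete(k);
    }
  }
}

// Usage: chunks for two documents; range-load one document's chunks.
const store = new InMemoryStorage();
store.save(["doc1", "snapshot", "a"], new Uint8Array([1]));
store.save(["doc1", "incremental", "b"], new Uint8Array([2]));
store.save(["doc2", "snapshot", "c"], new Uint8Array([3]));
console.log(store.loadRange(["doc1"]).length); // 2
```

In S3 the prefix query maps naturally onto `ListObjectsV2` with a key prefix; in Postgres, onto a `WHERE key LIKE prefix || '%'` over a bytea table.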
Authentication isn’t exactly left as an exercise for the reader, but it is an area of active work. I would say securing access to a URL via whatever mechanism you’re used to should be fine for client-server applications, and peer-to-peer folks seem to mostly have their own ideas.
https://www.youtube.com/watch?v=Mr0a5KyD6BU
Also, check out the local-first unconf that happened right after StrangeLoop 2023:
https://github.com/LoFiUnconf/stlouis2023
Ink & Switch is doing such interesting stuff. Their after party at StrangeLoop was so cool.
Edit: Typo `autosurgeon-repo-rs` to `automerge-repo-rs` and link. https://github.com/automerge/automerge-repo-rs
The API is still a little clunky, with hydrating and reconciling, and it's not as clean as the automerge-repo one, especially with those React examples.
Disclaimer: I’m a co-author and the paper is focused on a different CRDT framework, but the point is that it measures Yjs and Automerge side by side.
The benchmark that concerns me (and that I'm pleased with our progress on!) is that you can edit an entire Ink & Switch long-form essay with Automerge and that the end-to-end keypress-to-paint latency using Codemirror is under 10ms (next frame at 100hz).
While these kinds of benchmarks are incredibly appreciated and absolutely drive us to work on optimizing the problems they uncover, we try to work backwards from experienced problems in real usage as our first priority.
> In our research, we've found that editing is usually serial or asynchronous.
Medium-to-large-size company with a town hall = many people editing a document at the same time. Workshop at a company or a university with a modest size classroom = many people editing a document at the same time. I can't tell you how many times our web-based collaborative code editors would fall over during talks with small audiences we would give back in the days when I led the Scala Center.
Just because one of the benchmarks you have seen (of a multitude of benchmarks) breaks Automerge by stressing it in what we believe is the most stressful scenario possible (multiple concurrent users, which is sort of the point of concurrency/collaboration frameworks) does not make it artificial or worth so flippantly discarding.
> long-form essay with Automerge and that the end-to-end keypress-to-paint latency using Codemirror is under 10ms (next frame at 100hz)
Not at all what we measured.
I'd just like to register here that Yjs is the framework most widely used "in real usage" (your words) and not automerge (for many reasons, not just performance.)
I've seen Matt's work and I think it's quite reasonable to benchmark a concurrent datastructure under concurrent load. Placing systems under high load, even just as a limit study, is how we reveal scalability bottlenecks, optimize them, and avoid pathologies. It's part of good engineering.
If your work can produce more representative workloads from the real world, then they could add to the field's knowledge with new benchmarks.
We use co-editing far more commonly than serial editing.
Coming from a background of XP (extreme programming, pair programming) and a Pivotal Labs style approach to co-thinking, even for executive work we require everyone in a meeting (whether at conference table or remote) to be in the document being shared, and instead of giving feedback, comment or edit in place.
We care a LOT about how laggy this gets, how coherent it remains, and whether it blows up and has to be restarted or, worse, reverted.
If a firm culture "whiteboards" by having one person at the board and everyone else surfing HackerNews, they might not be exercising this. If a firm culture is that whiteboards are a shared activity, everyone gathered around holding their own marker, or even just grabbing it from each other, they might need to exercise CRDTs this way.
Put another way, if you "share" in a conference room with an HDMI cable to a TV, or share in Teams or Zoom by window sharing, you may not be a candidate.
If you "share" by dropping a link to the document in a chat, and see by the cursors and bubbles who is following along, you are a candidate.
. . .
In "Upwelling" you describe an introverted and solitary creative process, before revealing a sufficient quality update to others.
That is certainly a valid use case for unspooling thoughts from one brain, and if those are the wilds you are observing, it makes sense that that's what you'd observe in the wild.
It is not, however, the most productive approach for inventing solutions to logic puzzles with accuracy and correctness in fewer passes, nor for almost any other "group" activity. So maybe your "not what we see in the wild" should be qualified by "but we're actually not looking for live collaboration; we're looking for post-drafting merge".
That said, now the choice of the term "auto-merge" is much clearer, advertising your use case right on the tin, if one thinks about it.
So thanks for the upwelling link, repeated here for convenience:
That being said, I love everything automerge is doing and hope this pace will keep up!
We have built a variety of projects with Automerge, both publicly and privately, including, most recently, Tiny Essay Editor (https://tiny-essay-editor.netlify.app/), a markdown-with-comments editor by Geoffrey Litt.
That said, sponsoring the Automerge team helps us build faster and is always welcome. (Thanks to our current and past sponsors for their support!)
E.g. a personal note-taking app where the user will never have any collaborators, but where they expect the app to work fully offline on multiple devices and reliably sync up when they come online.
Automerge is not VC-backed software. Indeed, for a number of years Automerge was primarily a research project used within the lab. Over the last year, it has matured into production software under the supervision of Alex Good. The improved stability and performance have been a great benefit to both our community and internal users. Our intention is to run the project as sponsored open source for the foreseeable future, and thus far we have done so thanks to the support of our sponsors and through some development grants.
Ink & Switch's research interests drive a lot of Automerge development but funding from sponsors allows us to work on features that are not research-oriented or to accelerate work that we'd like to do but that doesn't have current research applications. If you adopt Automerge for a commercial project, I'd encourage you to join the sponsors of Automerge to ensure its long-term viability.
For applications with more document-structured data, you can now produce inverse patches using Automerge.diff to go between any two points. To implement a reasonable undo in this environment you can record whatever document heads you consider useful undo points and then patch between them.
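A toy model of that undo pattern, with plain objects standing in for documents and array indices standing in for Automerge heads (in real code you would record the document's heads at each undo point and compute the patches between two of them with Automerge.diff; the shape of the history is the same):

```typescript
// Toy model only: indices stand in for Automerge heads, and the
// "inverse patch" is computed by naive field comparison rather
// than by Automerge.diff.
type Doc = Record<string, string>;

class UndoHistory {
  private snapshots: Doc[] = [];

  // Record the current state as an undo point (the "heads").
  mark(doc: Doc): number {
    this.snapshots.push({ ...doc });
    return this.snapshots.length - 1;
  }

  // The changes needed to move the state at `from` back to the
  // state at `to` — i.e. an inverse patch for undo.
  diff(from: number, to: number): Partial<Doc> {
    const a = this.snapshots[from];
    const b = this.snapshots[to];
    const patch: Partial<Doc> = {};
    for (const k of Object.keys({ ...a, ...b })) {
      if (a[k] !== b[k]) patch[k] = b[k];
    }
    return patch;
  }
}

// Usage: mark two undo points, then patch from the newer back to
// the older one.
const history = new UndoHistory();
const v0 = history.mark({ title: "Draft" });
const v1 = history.mark({ title: "Final" });
const undoPatch = history.diff(v1, v0);
console.log(undoPatch); // a patch restoring title to "Draft"
```

The nice property of doing undo this way is that you only record heads you consider meaningful undo points, rather than every keystroke.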
To expand slightly on why the problem remains unsolved: there was a robust discussion at the conference about what the expected behaviour of "undo" ought to be, even in simple cases.
I think it's cool, but I still see CRDTs as very niche.
I also want "local-first", but what I really want is something closer to how traditional desktop apps just open, edit, and save files, not real-time collaboration that is already set up before I add my first collaborator.