What is behind the thought that graph databases are going to grow so much in the next few years? To me they've always had a niche use... Are they really going to be ubiquitous (like this funding seems to assume?)
As for those specific figures, I'm guessing there's enough wiggle room in "data and analytics innovations" (emphasis mine) to find or project almost any trend one wishes. What are data analytics innovations? Why, it's the set of things that will see 80% use of graph technologies! "Graph technologies" is also so potentially-vague that it could plausibly be 100% of almost anything related to software.
Relational data may be a hassle, but it's a hassle you end up having to deal with anyway at some point.
I can see a graph database as being a useful place to stash a ton of shitty data as an initial place to start an ETL but I can't imagine using it as a system of record except in very limited situations.
The opportunity I understood after using Neo was that the big product play would be a kind of mental shift for enterprise data analysts whose jobs live in Excel/Power BI today, with power users on Cognos, and less for devops/SaaS companies/etc. I over-use Apple as an example, but if Apple entered enterprise data products, Neo would be the kind of underlying tech for it: an Apple'ey analytics tool would have users producing and reasoning about their data with graphs instead of tables. Imagine a kind of Photoshop for data, or a fundamental conceptual change from spreadsheets to graphs. They aren't as competitive as a data tool, but I think they are unrivaled as a knowledge tool.
The tech is really great, but the product piece appears to have been a challenge because the use cases for graphs have been very enterprise'y, which has limited adoption because people who operate at that higher business logic level of abstraction that graphs enable are not the people picking and adopting new technologies. The growth will come from younger people who learned python in high school, and have a more data centric world view. Maybe that's the play.
Anyway, as a user I can see why they got participation on an F round. Imo, they've solved the what/how/why and have done some amazing science and engineering, and what I hope that money buys them is some magic.
> my impression of dgraph was they had a bunch of unnecessary and poor abstractions in their documentation
I'm surprised to hear that. Dgraph uses GraphQL (and DQL, a fork of GraphQL) as its query language, which is a much more widely adopted language than Cypher. Dgraph users really like the simplicity and intuitiveness of the language and the ease of use of the DB.
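For anyone who hasn't seen it, the appeal is that Dgraph's GraphQL layer generates the CRUD operations from a plain type definition. A minimal sketch (the `Person` type and field names here are invented for illustration):

```graphql
# Schema: Dgraph auto-generates add/query/update/delete operations from this.
type Person {
  id: ID!
  name: String! @search(by: [term])
  friends: [Person]
}

# One of the auto-generated queries: find people named "alice" and their friends.
query {
  queryPerson(filter: { name: { anyofterms: "alice" } }) {
    name
    friends { name }
  }
}
```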
I'm curious what was confusing in documentation.
Literally every dgraph user must necessarily know the answer to that already, or maybe they just mentally black-box it and work around it, but at the time my impression was that non-users don't know this, and if I'm adopting a whole new taxonomy I need extra incentives to know it's worthwhile. It's probably an excellent and even superior technology, but what read as auteurism in the product at the time made me reconsider how much time I wanted to invest before encountering another one.
Anyway, coming from being a Cypher user, the learning curve for the use case of "I want to create nodes of different types with attributes, and relationships of different types with attributes, then CRUD those vertices with a Flask app" felt a bit steep after that.
SQLite would do the trick, but I wanted consistency from my business logic to a grammar to a data model. It's very easy to encounter graphs and just think we're not smart enough for them or our problem isn't graphy enough, but given the ease with which I could encode a grammar into Cypher, I reluctantly gave up on dgraph. That said, I'm not a Gremlin/TinkerPop fan either; from a top-down user use case, it wasn't satisfying.
DGraph has a lot of users and customers who love your product and the smartest people I knew recommended it to me, so my issues might not register, but there were a few experiences going through the tutorials that made me wary I was sinking costs into it relative to my use case, e.g. I have 1 week to build a PoC Flask app with a graph on the back end, and then scale it if the customer cares. That's what I used Neo for, and didn't use dgraph for, even though I figured I'd hire developers to rewrite it for dgraph if it got off the ground.
Anyway, long way round, but I'm a long time believer and user of graph techs and want everyone in that market to succeed.
But I do appreciate all the effort Neo4j put for years in educating us all on graph databases, use cases, and just drawing attention and awareness.
Neo4J has been very meh in my experience, but they are the biggest.
That said, the 2nd system never got off the ground; I quit the badly run startup before finishing it. And now that I have a bit more experience with Neo4j, I'd say it would have been a bear to fully implement. Java is too heavy and Neo4j is a memory pig. It works, and I can't say it is bad or even iffy like TinkerPop, but it is "Enterprise Software" and everything that is associated with that meme.
I have been using TigerGraph for my latest research into modeling the schedules etc. of rail transport. It is much faster than Neo4j and requires far less memory: I can store every bit of data I need in it, unlike with Neo4j, which would need multiple 64GB-RAM servers. Its programming language is also pretty nice once you get the hang of it.
So I'd recommend TigerGraph. The downside is that it is not as 'plug and play' as Neo4j, does not have all the mindshare/fancy bells and whistles, and is entirely C++/Unix based. So having some UNIX sysadmin experience is helpful unless you want to use their cloud solution.
I think there's plenty of room to disagree with this view that modeling graph data in SQL is not "logical enough". Though to be fair, there seems to be some ongoing work on adding some "property"-based layer to bare SQL in order to make common graphs-and-properties oriented queries a bit more ergonomic.
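As a sketch of that view, here is one common way to fake a property graph on top of a plain relational database: a node table and an edge table, with properties stashed as JSON. The schema and data are invented for illustration (using SQLite via Python's stdlib, assuming a build with the JSON functions available):

```python
import json
import sqlite3

# Hypothetical property-graph-on-SQL schema: one node table, one edge table,
# with properties stored as JSON text in a single column.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE nodes (id INTEGER PRIMARY KEY, props TEXT);
    CREATE TABLE edges (src INTEGER, dst INTEGER, label TEXT, props TEXT);
""")
conn.execute("INSERT INTO nodes VALUES (1, ?)", (json.dumps({"name": "alice"}),))
conn.execute("INSERT INTO nodes VALUES (2, ?)", (json.dumps({"name": "bob"}),))
conn.execute("INSERT INTO edges VALUES (1, 2, 'KNOWS', '{}')")

# "Who does alice know?" is just a pair of joins.
rows = conn.execute("""
    SELECT json_extract(n2.props, '$.name')
    FROM nodes n1
    JOIN edges e  ON e.src = n1.id AND e.label = 'KNOWS'
    JOIN nodes n2 ON n2.id = e.dst
    WHERE json_extract(n1.props, '$.name') = 'alice'
""").fetchall()
print(rows)  # [('bob',)]
```

The ergonomics argument is about exactly this: the query works, but every hop costs a join and a `json_extract`, which is what the "property layer" proposals for SQL are trying to smooth over.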
File under, "not sure if a very good joke, or serious".
I'm leaning toward the former. "New tech lead" is the give-away (or is it?).
I'll also say that working on the entire graph, if you need to, is difficult. They're not oriented around working on the whole graph, more around fragments that you've pared down through your query modifiers, so if you know you're going to be doing a lot of work that requires touching the entire graph, that may change the performance characteristics a lot for you.
I like it and would use it again but there are rough edges to work around still and it is young so know your use case and know the trade offs you're making.
My concerns basically range around memory consumption, query language and language ecosystem.
Edit: Oh and I guess around like functional extensibility. The last time I used a graph DB I had to export from the db itself to HDFS and use Spark to do things like PageRank and I'd rather be able to write that natively in their query lang or some like UDF equivalent.
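For context, the kind of whole-graph computation meant here is something like PageRank: a toy power-iteration version fits in a few lines of plain Python (graph and node names made up), which is roughly what you'd want to express natively in a query language or UDF instead of round-tripping through HDFS and Spark.

```python
def pagerank(edges, damping=0.85, iters=50):
    """Toy power-iteration PageRank over a list of (src, dst) edges."""
    nodes = {n for e in edges for n in e}
    out = {n: [d for s, d in edges if s == n] for n in nodes}
    rank = {n: 1.0 / len(nodes) for n in nodes}
    for _ in range(iters):
        new = {n: (1 - damping) / len(nodes) for n in nodes}
        for n in nodes:
            if out[n]:
                share = damping * rank[n] / len(out[n])
                for d in out[n]:
                    new[d] += share
            else:  # dangling node: spread its rank across all nodes
                for d in nodes:
                    new[d] += damping * rank[n] / len(nodes)
        rank = new
    return rank

ranks = pagerank([("a", "b"), ("b", "c"), ("c", "a"), ("a", "c")])
# ranks sum to ~1.0; "c" collects the most in-links in this toy graph
```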
This approach of extending PostgreSQL is very appealing to me. There is a great deal of value in the PostgreSQL stack that doesn't need to be reinvented just to deliver a graph database and query language. How much easier is it to adopt graph database techniques when it is simply an extension of database technology nearly everyone is already running? Conceivably one might find some future version of PostgreSQL RDS (and other cloud PostgreSQL services) delivers Cypher.
That being said, I thought about porting it to PostgreSQL with Apache AGE vs. using Neo4j for a project, because it's faster, at least for this use case. Easier said than done, though.
If you want to play with graphs and linked-data it's super cool. There is also structr[2] that builds CMIS / Alfresco ECM like functionality atop neo4j with graaljs scripts.
Seems the concept of having fluid relationships is appealing for querying but not structuring/storing... which seems like a disconnect.
I have only seen a few Neo4J systems in serious production workloads and they were ALL on logistics... I'm not sure that it's being positioned (or interpreted) as a nice simple solution to start out on.
Edit: I just checked out neo4j "bloom", and it's definitely a good way to make graph more accessible - they should continue to build further on it.
It's also torturing the definition of "query language." There is no equivalent of "join", or any other typical query feature such as aggregation, grouping, sorting, filtering. GraphQL has as much to do with graphs or query languages as my smart TV has to do with intelligence. It's RPC, but RPC fell out of fashion when SOAP/WSDL/XML died.
When I started back then in 2016 with it, it was pretty cool how directly graphql mapped to the graph model in the db.
Community Edition is hobbled to the point where I wouldn't recommend anyone run it in production.
Or if you absolutely need on premise and are small there is the startup program for free enterprise licenses (https://neo4j.com/startups/)
Many companies, like the one I'm at, have the opposite use case -- many, geo-distributed, tiny graphs and multiple (read: 3-5) pre-prod environments. They simply don't have a pricing model that supports customers like us.
They wanted to charge us something like 10% of our ARR for something that was just a component of one microservice.
We did evaluate Neo4j, but set it aside due to its slowness and its complex query language (Cypher). It was a really awkward language, super awkward.
We also evaluated arangodb and we found it much better than Neo4j. Performance was good and its query language was better too.
What we realised in the process is that adopting graph databases is a cultural transformation as well. SQL is much better understood, better adopted, and better supported by the community.
Ultimately we implemented the use case in Postgres, and thank God we did it that way. IMO, we can still get all the benefits of graphs with SQL databases with little effort.
We had a team member who had used Neo4J professionally for years and could not figure it out. And we only had one; every other teammate and new volunteer had to be trained in a strange new way of thinking about databases and a new query syntax. Setting it up to run locally for development was a difficult process. Progress was slow and our code to access the database was messy. We kept being promised that, in exchange for these heavy burdens, Neo4J would do amazing things for us once we started doing graph queries, but we never got there because it couldn't do the basics.
We rewrote the project to run on PostgreSQL. Five tables, properly indexed, lightning fast, easy to set up and understandable by anyone. A hundred million rows and it didn't break a sweat, on the lowest tier of machine. Even graph queries were straightforward and quick.
Advice: Don't use Neo4J as your primary store, and avoid it altogether if you want volunteer or casual contributors. For us, it was all costs and no benefits.
There's more than abundant amounts of capital in private equity, so the only real reason to go public is to create liquidity for early founders/investors/employees who want to cash out. Given that, arguably you could say going public, instead of raising private capital, is the smell. Or at least an attempt to top-tick the valuation, e.g. WeWork.
It means there's a lot of capital being dumped into trying to find some hidden source of profit and it's getting harder and harder to find it.
It's the capital equivalent of going from finding oil in your back yard to blasting it out of tar sands in the Canadian tundra. Sure the capital/oil keeps flowing, but the inherent unsustainability of the system starts to show its face more clearly.
With relational databases, you can join on anything at any time, so you can explore new relationships as you go.
Why isn’t this sufficient to explore novel relationships?
I was thinking more about technical reasons in terms of the storage layer. The query syntax seems to be the least interesting part of a database, to be perfectly honest.
Yikes
> largest investment in a private database company
I guess this is one of those PR moves that is trying to make something lame sound good? If your customer portfolio includes Walmart, Volvo and AstraZeneca why are you raising money a 6th time?
I take your point that this is a really late round of funding, but this doesn't mean they've caught on like they want to yet.
By that time, the share is pretty much what it's worth, but 100 times the round-A price, if all went well of course. Over 90% of the time, it didn't go well. Who knows how that will end for Neo4j, but the investors have their eggs in many other baskets anyway.
What matters isn't showing profit anymore, not even significant revenue to justify further funding. All you need is some appealing growth figures, sometimes not even that, just a convincing argument that hyper growth is on the horizon.
At some point millions are put into advertising and a strong sales force to grow revenue manyfold. In the enterprise market, the trick often works pretty well.
If you are using a relational DB, you can use recursion to achieve the same effect without having to bring Neo4j and Cypher into your stack.
A simple example of implementing a hierarchical graph data structure on postgres and exposing it via graphql can be found on the hasura blog.
https://hasura.io/blog/authorization-rules-for-multi-tenant-...
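The "just use recursion" point boils down to `WITH RECURSIVE`. A minimal sketch of walking a hierarchy upward (table name and data are invented; SQLite via Python's stdlib stands in for Postgres, since both support recursive CTEs):

```python
import sqlite3

# Invented org hierarchy: each row points at its parent.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE org (id INTEGER PRIMARY KEY, parent INTEGER, name TEXT);
    INSERT INTO org VALUES (1, NULL, 'ceo'), (2, 1, 'vp'), (3, 2, 'eng');
""")

# Recursive CTE: start at 'eng' and repeatedly join to the parent row,
# i.e. graph traversal without a graph database.
rows = conn.execute("""
    WITH RECURSIVE chain(id, parent, name) AS (
        SELECT id, parent, name FROM org WHERE name = 'eng'
        UNION ALL
        SELECT o.id, o.parent, o.name FROM org o JOIN chain c ON o.id = c.parent
    )
    SELECT name FROM chain
""").fetchall()
names = [r[0] for r in rows]
print(names)  # walks up the chain: eng -> vp -> ceo
```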
Has anyone tried that? Would love any notes/pointers!
Join data across the two to get the best of both, basically. Hasura doesn't support Neo4j natively yet, but maybe you could use Neo4j's GraphQL wrapper as an input to Hasura?
One thing that is much easier to model and query, or rather more natural and simple, is authorization and other granular questions you might have about how users and data are connected.
A thing that I can’t wrap my head around however is temporal data modeling with graphs. Haven’t seen or thought of anything too satisfying yet, that meshes well with how I think about graphs. Whereas in SQL it is more explored and clear to me.
I agree that their marketing is very aggressive, but this tech has quite some merit.
The capability of sales to sell a product to massive companies for a use case that we're actually not very good at was unbelievable.