How Netflix uses Java

(infoq.com)

241 points · ivanche · 2y ago · 196 comments

102 comments · 17 top-level

ValtteriL2y ago· 22 in thread

>Netflix observed a 20% increase of CPU usage on JDK 17 compared to JDK 8. This was mostly due to the improvements in the G1 garbage collector.

Help me here, why do GC improvements cause CPU increase?

blackoil2y ago

I think this is a 20% improvement in CPU utilization: earlier the app was memory-bound and/or GC was consuming CPU. Now the app has 20% more CPU available, so it should be doing correspondingly more work. This could definitely have been written more clearly.

moffkalast2y ago

> Bakker provided a retrospective of their JDK 17 upgrade that provided performance benefits, especially since they were running JDK 8 as recently as this year. Netflix observed a 20% increase of CPU usage

Seems like it's exactly that, OP cropped out the relevant bit where they list it having an overall performance benefit for that extra CPU time. Otherwise it could be assumed that it just hogs more CPU to get the same result.

bunderbunder2y ago

I haven't dealt with this side of Java in a while, but it reflects my experience poking at Java 8 performance. At some (surprisingly early) point you'd hit a performance wall due to saturating the memory bus.

A new GC could alleviate this by either going easier on the memory itself, or by doing allocations in a way that achieves better locality of reference.

jillesvangurp2y ago

Most modern GCs trade off CPU usage and latency. Less latency means the CPU has to do more work on e.g. a separate thread to figure out what can be garbage collected. JDK 8 wouldn't have had the G1 collector (I think, or at least a really old version of that) and they would have probably been using one of the now deprecated garbage collectors that would be collecting less often but have a more open ended stop the world phase. It used to be that this would require careful tuning and could get out of hand and start taking seconds.

The new ZGC uses more CPU but it provides some hard guarantees that it won't block for more than a certain amount of milliseconds. And it supports much larger heap sizes. More CPU sounds worse than it is because you wouldn't want to run your application servers anywhere near 100% CPU typically anyway. So, there is a bit of wiggle room. Also, if your garbage collector is struggling, it's probably because you are nearly running out of memory. So, more memory is the solution in that case.

BinaryRage2y ago

The figure is about the overall improvement, not sure why that reads increase.

On JDK 8 we are using G1 for our modern application stack, and we saw a reduction in CPU utilisation with the upgrade with few exceptions (saw what I believe is our first regression today: a busy wait in ForkJoinPool with parallel streams; fixed in 19 and later it seems).

G1 has seen the greatest improvement from 8 to 17 compared to its counterparts, and you also see reduced allocation rates due to compact strings (20-30%), so that reduces GC total time.

It's a virtuous cycle for the GRPC services doing the heavy lifting: reduced pauses means reduced tail latencies, fewer server cancellations and client hedging and retries. So improvements to application throughput reduce RPS, and further reduce required capacity over and above the CPU utilisation reduction due to efficiency improvements.

JDK 21 is a much more modest improvement upgrading from 17, perhaps 3%. Virtual threads are incredibly impressive work, and despite having an already highly asynchronous/non-blocking stack, expect to see many benefits. Generational ZGC is fantastic, but losing compressed oops (it requires 64-bit pointers) is about a 20% memory penalty. Haven't yet done a head to head with Genshen. We already have some JDK 21 in production, including a very large DGS service.
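As a rough illustration of why compact strings (mentioned above) can cut allocation: since JDK 9 a string whose characters all fit in Latin-1 is stored with one byte per character instead of two. The sketch below only estimates the character payload. It ignores object and array headers and assumes compact strings are left at their default (enabled), so it will not reproduce the 20-30% figure quoted above. The class and method names are made up for this example.

```java
// Illustrative sketch only: estimates character payload, not real heap usage.
final class CompactStringSketch {
    /** Estimated payload bytes with compact strings (JDK 9+ default behaviour). */
    static int compactPayloadBytes(String s) {
        for (int i = 0; i < s.length(); i++) {
            if (s.charAt(i) > 0xFF) {
                // One char outside Latin-1 means the whole string is stored as UTF-16.
                return 2 * s.length();
            }
        }
        // Every char fits in a single byte.
        return s.length();
    }

    /** Payload bytes before compact strings: always two bytes per char. */
    static int legacyPayloadBytes(String s) {
        return 2 * s.length();
    }

    public static void main(String[] args) {
        String latin = "netflix";
        String nonLatin = "\u65e5\u672c\u8a9e"; // three CJK characters
        System.out.println(latin + ": " + legacyPayloadBytes(latin) + " -> " + compactPayloadBytes(latin));
        System.out.println("non-Latin: " + legacyPayloadBytes(nonLatin) + " -> " + compactPayloadBytes(nonLatin));
    }
}
```

Services that mostly handle ASCII identifiers, headers and JSON keys would see close to the full halving of string payload; text-heavy non-Latin workloads would see little change.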

ahoka2y ago

I don't think he meant that.

Macha2y ago

A somewhat common problem is to be limited by the throughput of CPU heavy tasks while the OS reports lower than expected CPU usage. A lot of companies/teams just kind of handwave it away as "hyperthreading is weird", and allocate more machines. Actual causes might be poor cache usage causing programs to wait on data to be loaded from memory, which depending on the CPU metrics you use, may not show as CPU busy time.

For companies at much smaller scale than netflix where employee time is relatively more costly than computer time, this might even be the right decision. So you might end up with 20 servers at 50% usage, but using 10 servers will take twice as long but still appear to be at 50% usage.

If the bottlenecks and overhead are reduced such that it's able to make more full use of the CPU, you might be able to reduce to e.g. 15 machines at 75% CPU usage. Consequently the increased CPU usage represents more efficient use of resources.

CraigJPerry2y ago

>> while the OS reports lower than expected CPU usage

>> which depending on the CPU metrics you use, may not show as CPU busy time

If your userspace process is waiting on memory (be that cache, or RAM) then you’ll show as CPU busy when you look in top or whatever - even though if you look under the covers such as via perf counters, you’ll see a lack of instructions executed.

The CPU is busy in this case and the OS won’t context switch to another task, your stalled process will be treated as running by the OS. At the hardware thread level then it will hopefully use the opportunity to run another thread thanks to hyper threading but at the OS level your process will show user space cpu bound. You’ll have to look at perf counters to see what’s actually happening.

>> you might end up with 20 servers at 50% usage, but using 10 servers will take twice as long but still appear to be at 50% usage.

Queue theory is fascinating, the latency change when dropping to half the servers may not be just a doubling. It depends on queue arrival rate and processing time but the results can be wild, like 10x worse.
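To make the queueing point concrete, here is a minimal sketch using the textbook M/M/c model (Poisson arrivals, exponentially distributed service times), which is an assumption and not a description of any real Netflix service. The arrival and service rates are hypothetical: 9 requests per unit time against servers that each handle 1 per unit time, so 20 servers run at 45% utilisation and 10 servers at 90%.

```java
// Hypothetical sketch: M/M/c queue, not a model of any particular production system.
final class QueueingSketch {
    /**
     * Erlang C: probability that an arriving request has to wait.
     * offeredLoad = arrivalRate / serviceRate (in "busy servers").
     */
    static double erlangC(int servers, double offeredLoad) {
        if (offeredLoad >= servers) {
            throw new IllegalArgumentException("unstable: offered load must be below the server count");
        }
        // Erlang B via the standard recursion, starting from zero servers.
        double b = 1.0;
        for (int k = 1; k <= servers; k++) {
            b = offeredLoad * b / (k + offeredLoad * b);
        }
        double rho = offeredLoad / servers; // per-server utilisation
        return b / (1.0 - rho * (1.0 - b));
    }

    /** Mean time a request spends queued before service starts. */
    static double meanQueueWait(int servers, double arrivalRate, double serviceRate) {
        double waitProbability = erlangC(servers, arrivalRate / serviceRate);
        return waitProbability / (servers * serviceRate - arrivalRate);
    }

    public static void main(String[] args) {
        double arrivalRate = 9.0;  // hypothetical requests per unit time
        double serviceRate = 1.0;  // hypothetical requests per server per unit time
        System.out.println("20 servers: mean queue wait = " + meanQueueWait(20, arrivalRate, serviceRate));
        System.out.println("10 servers: mean queue wait = " + meanQueueWait(10, arrivalRate, serviceRate));
    }
}
```

Under these assumptions, halving the fleet increases the mean queueing delay by far more than a factor of two, because waiting time grows non-linearly as utilisation approaches 100%.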

xorcist2y ago

When you put it like that, yes. Hardware is cheap and all that. In practice I think that an organization that doesn't understand the software it is developing has a people problem. And people problems generally can't be solved with hardware.

If somebody knows how to make that insight actionable, let me know. No, hiring new people is not the answer. In all likelihood that swaps one hard problem for an even harder one.

edpichler2y ago

To free memory. Also, 20% increase is not 20% in total. It's 20% when you go from 10 to 12 cpu usage, or from 50 to 60, for instance.
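The relative-versus-absolute distinction can be checked with a few lines of arithmetic. This is only a sketch using the two examples above; the class and method names are made up.

```java
// Illustrative only: names are invented for this sketch.
final class CpuIncrease {
    /** Relative change, in percent, from one utilisation figure to another. */
    static double relativePercent(double before, double after) {
        return (after - before) * 100.0 / before;
    }

    /** Absolute change, in percentage points. */
    static double percentagePoints(double before, double after) {
        return after - before;
    }

    public static void main(String[] args) {
        // Both cases are a 20% relative increase, but very different absolute jumps.
        System.out.printf("10 -> 12: %.1f%% relative, %.1f points%n",
                relativePercent(10, 12), percentagePoints(10, 12));
        System.out.printf("50 -> 60: %.1f%% relative, %.1f points%n",
                relativePercent(50, 60), percentagePoints(50, 60));
    }
}
```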

_the_inflator2y ago

Well done.

I always appreciate numbers and the differentiation between relative and absolute numbers in this case.

"We doubled our workforce in one week!" - CEO's first hire... ;)

matsemann2y ago

The CPU can do more tasks without being limited by memory pressure, perhaps?

I guess it depends on if they mean "we used 20% more CPU for the same output", or "we could utilize the CPUs 20% more".

paulbakker2y ago

It’s a 20% improvement. So less time spent on GC.

znpy2y ago

> Help me here, why do GC improvements cause CPU increase?

In Java 8 (afaik) there were pretty much no generational or concurrent garbage collectors, so garbage collection would happen in a stop-the-world manner: all work gets brought to a halt, garbage collection happens, then the work can resume.

If you have a better GC, you need shorter and less frequent stop-the-world pauses.

Hence the code can run on cpu for more time, getting you higher cpu usage.

Higher cpu usage is often actually good in situations like this: it means you're getting more work done with the same cpu/memory configuration.
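If you want to see how much time your own JVM attributes to garbage collection, the standard `java.lang.management` beans expose it. The sketch below is only a starting point: `getCollectionTime()` is the approximate accumulated elapsed time each collector reports, which is not the same thing as stop-the-world pause time, and the set of collector beans differs between GCs and JDK versions. The class name is made up.

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;

// Minimal sketch: reports what the JVM itself exposes, nothing more.
final class GcTimeSketch {
    /** Sum of reported collection time in milliseconds, skipping collectors that report -1 (undefined). */
    static long totalGcMillis() {
        long total = 0;
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            long millis = gc.getCollectionTime();
            if (millis > 0) {
                total += millis;
            }
        }
        return total;
    }

    public static void main(String[] args) {
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            System.out.printf("%s: %d collections, %d ms%n",
                    gc.getName(), gc.getCollectionCount(), gc.getCollectionTime());
        }
        long uptimeMillis = ManagementFactory.getRuntimeMXBean().getUptime();
        System.out.printf("reported GC time: %d ms of %d ms uptime%n", totalGcMillis(), uptimeMillis);
    }
}
```

Comparing this figure before and after a JDK upgrade, under the same load, is one way to check whether "less time spent on GC" holds for a given service.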

dboreham2y ago

Java8 was at least a decade into generational and concurrent GC. It does STW once in a while though which may be what you meant.

tpm2y ago

I read it as a good thing: GC improvements -> more available memory -> more work done by the CPU. But still would be interested in more detail.

groestl2y ago

Because the memory / I/O is not the bottleneck anymore, and the CPU can now run optimally.

jjtheblunt2y ago

I haven’t seen the specific profiling data, but it’s possible that the garbage collector is running a collection thread, concurrently with regular processing threads, and thereby preventing entire world synchronization points which would idle processor cores.

ahoka2y ago

Higher CPU usage paradoxically means better performance. When I last did OPS we used to watch total CPU usage of all services and if it was not 100%, then we started to look for a bottleneck to fix.

radomir_cernoch2y ago

Also interested! We saw basically the exact opposite. :-)

pyeri2y ago

It's like hiring more workers to accomplish the exact same output as before. "See, I achieved 20% growth in my targets!", some recruiter will say!

groestl2y ago

No, it's like improving a form to minimize the need for follow-up questions to the customer, and now seeing your workers (the same you had before) processing 20% more forms instead of waiting for responses.

inparen2y ago· 22 in thread

Spring Boot and Spring cloud for backend & graphql for the win. ;-)

RamblingCTO2y ago

No, just no. Performance and debugging are just plain horrible. The spring team loves to force you into their automagic shit and this bean stuff is so annoying. You almost got no compile time safety in this stack. It's the bane of my existence. I'd like to know that a compiled program will run. That seems virtually impossible with java/spring boot.

StevePerkins2y ago

I'm not sure what "no compile time safety in this stack" even means in the context of a strongly-typed compiled language.

If you are referring to the dependency injection container making use of reflection, then Spring Native graduated from experimental add-on to part of the core framework some years ago. You can now opt for Quarkus/Micronaut-style static build-time dependency injection, and even AOT compilation to Go-style native executables, if you're willing to trade off the flexibility that comes with avoiding reflection. For example, not being able to use any of the "@ConditionalOnXXX" annotations to make your DI more dynamic.

(Personally, I don't believe that those trade-offs are worth it in most cases. And I believe that all the Spring magic in the universe doesn't amount to 10% of what Python brings to the table in a minimal Django/Flask/FastAPI microservice. But the option is there if your use case truly calls for it.)

Honestly, I've never run into anyone who considers Spring to be "the bane of their existence", where the real issue wasn't simply that the bulk of their experience was in something else. Where they weren't thrown into someone else's project, and resent working with decisions made by other people, but don't want to either dig in and learn the tech or else search for a new job where they get to make the choices on a greenfield project.

pylua2y ago

Spring is basically a standard in itself and it is easier to hire people in it. It also normalizes large pieces of the backend application so even though they are written by different people they are similar.

Once you learn the annotation based configuration it also saves a lot of time.

The performance is valid but it will only keep improving.

Fabricio202y ago

It's funny to see this perspective! I used to work in a few companies locally who had adopted the early java-ee style for their applications and my experience is exactly the opposite. When going to spring I'm usually diagnosing issues on the application layer (ie: business issues, not framework issues), while on the java-ee applications I was often having to fix issues down at the custom persistence layer each company had, etc.. I see where you come from having looked at the "old" spring stack (non -boot), and I can see people getting mad over the configuration hell and how stuff is hidden behind xml.. Much like how java-ee is!

dimgl2y ago

I completely agree with this. Spring was an absolute nightmare during the short period of time where I had the misfortune of using it. It also didn't help that the codebase was a monstrosity... classes following no design patterns and having 40k lines. But still...

nameless9122y ago

That has not been my experience on the inside - I spend most of my days working on a Spring Boot based service at Netflix and frankly it's one of the most effortless environments I've ever worked in. Granted, there's a lot of ecosystem support from the rest of the company, but things are very low effort, and generally very predictable. I can usually drop a breakpoint in a debugger in exactly the right spot and find a problem immediately.

vmaurin2y ago

The issue with Spring ecosystem is that people use it without knowing why or which problem it solves but because almost everyone is using it. And most of the time, they don't need Spring (maybe a company like Netflix did, but it didn't prove to be the right choice at the end)

didntcheck2y ago

It's not quite as good as compile-time or type-based guarantees, but IME configuration errors with Spring are almost always flagged up immediately on application startup. As long as you have at least one test that initializes the application context (I.e. @SpringBootTest) then this should be caught easily

krooj2y ago

This is just... ignorance; your argument is basically, "I don't understand/want to learn how X works; therefore, X must be garbage"

smrtinsert2y ago

Performance and debugging are simple, and compile-time safety is Java's core domain. I think you're over-focusing on proxying or enhancement of beans, but if you look at the documentation for a reasonable amount of time there's really nothing to it.

misja1112y ago

FYI, you can still use XML based configuration in Spring. The choice is yours. See https://docs.spring.io/spring-framework/docs/4.2.x/spring-fr...

I agree it is not common to do it, most teams follow the autoconfiguration madness.

bedobi2y ago

100% agree, Java and Spring are a mess and there's no justifiable reason to use them in 2023 (and no, "that's what we've always used" isn't a good justification)

Like srsly even DropWizard is better than Spring lol, let alone other even simpler frameworks like Ktor which is built on a much improved language over Java

wing-_-nuts2y ago

What do you propose as an alternative? Something like Micronaut trades more compile time for stricter checks and faster runtime. Do you use something like that?

twh2702y ago

We've adopted Quarkus and it's been a breath of fresh air. Excellent all around, DX, performance, features, it's all been good.

Cthulhu_2y ago

Spring is a safe and reliable choice I'd say; not the most exciting, but neither code nor frameworks should be exciting, they're used to solve a problem, they shouldn't become the problem itself.

GraphQL is interesting to me, I thought the clients were pretty similar across all platforms, meaning their API usage should also be similar enough to not need the flexible nature of GraphQL. But then, it allows for a lot more flexibility and decoupling - if a client needs an extra field, the API contract does not need to be updated, and not all clients need to be updated at once. Not all clients will be updated either, they will need to support 5-10+ year old clients that haven't updated yet for whichever reason.

m_0x2y ago

> not the most exciting

It was exciting when J2EE was dominating.

robertlagrant2y ago

Well, if the field is not available then new backend code will need to be written, resolvers, integrations, etc. But it does allow UIs to take less info over the wire, and either fewer joins need to be done or fewer performance-oriented APIs need building, as you say.

krooj2y ago

The stack is tremendously productive, but history has taught me a few things when dealing with Spring:

1. It's always best to start people off with plain old spring, even with an XML context, such that they understand the concepts at play with higher level abstractions like Boot. Hell, I even start with a servlet and singletons to elucidate the shortcomings of rolling your own.

2. Don't fall prey to hype around new projects in the Spring ecosystem, such as their OAuth2 implementation, since they often become abandonware. It's always best to take a wait and see approach.

3. Spring Security is/was terrible to read, understand, and extend ;)

inparen2y ago

Ha ha, spring security is tricky and there's a high chance it may surprise someone while "boot"strapping a new project. But once done, it is out of the way.

I did not like much of the XML, because it always seemed like a lot of duplication. All you're doing is copying bean definitions and changing the bean id and class/interface most of the time. But it became a non-issue over time. Now spring boot has made it really easy with all those annotations.

olavgg2y ago

I am a big fan of Spring Boot, it's one of the few frameworks that just works and lets me focus 100% on solving business problems. I've tried Micronaut, Quarkus, Dropwizard, but they slow me down too much compared to just using Spring Boot.

For me delivering business value is the most important metric when I am comparing frameworks. Spring Boot wins every time.

ramon1562y ago

May I recommend Symfony? You get the advantages of Spring but also the nicer things of PHP :-)

baby2y ago

I had to review a Spring application once and that convinced me never to work with Java ever again

zeruch2y ago· 15 in thread

Early on (15+ years ago) I spent a few weeks there on contract and I noticed they used Java EVERYWHERE, and not always well. They had a CS app named after a key Star Wars character that was in all likelihood a breach of the Geneva Convention. A code atrocity with the performance of a sloth on its 8th bong rip with a UX from hell.

jedberg2y ago

If it helps that CS app was rewritten about 10 years ago (when I worked there, but not on that app) in part due to the complaints you mention. It's totally true that most resources were spent on customer facing apps. Internal apps were definitely not of the same quality, because they didn't need to be.

zeruch2y ago

Good to hear. The fact that in 2005 you had an app that required seemingly petabytes of memory to operate, and put on machines barely powerful enough to play minesweeper, was in and of itself a series of bad decisions...but the app itself, and its layout were just maddening. It's like MC Escher was the UX lead.

sillywalk2y ago

"A code atrocity with the performance of a sloth on its 8th bong rip with a UX from hell."

Sounds like Apple Music.

hbn2y ago

For some reason whenever I'm on my work's VPN, Apple Music lets me play 1 album and then the next time I try to start a song it will tell me I'm not logged in and I'll have to force quit and relaunch (frequently a few times) before it will let me play another album.

Apple Music is the only app that has this problem.

fmntf2y ago

I was thinking of Jira

Arrath2y ago

Evocative description there, bravo

civilitty2y ago

It's so evocative that I don't even care about Netflix or Java anymore.

I just want to know where I can buy a bong ripping sloth [1] and whether they're legal in California.

[1] https://imgur.com/a/S3NVS16

myvoiceismypass2y ago

When I was there a decade ago, it started becoming more polyglot friendly (node apps had to use a jvm sidecar to do internal communications originally!)

jedberg2y ago

My team wrote some of the Python libraries for internal services just so we could avoid that Java sidecar! It took 10 times longer to boot the sidecar than the Python app.

dt3ft2y ago

Netflix should make a documentary about this.

Tim256592y ago

Ha..Ha.Ha

bruh22y ago

What does CS stand for here? I guess it's not computer science?

Also, that description made me lmao, thanks

jedberg2y ago

Customer service.

khalilravanna2y ago

Another reminder that acronyms are pretty terrible for communication. Every time I onboard with a new org there’s a whole new set of acronyms to learn that’s barely faster than typing out the unabbreviated version. Nice to save a couple seconds when the cost is only a bunch of people not able to follow along when people are communicating.

To be clear: not ragging on OP in particular at all but more at the widespread practice at a company level.

zeruch2y ago

CS = Customer Support/Care in that regard.

jarym2y ago· 6 in thread

Interesting the article jumps straight from REST to GraphQL and forgets Falcor[0] - Netflix's alternative vision for federated services. For a while it looked like it might be a contender to GraphQL but it never really seemed to take off despite being simpler to adopt.

[0] https://netflix.github.io/falcor/

paulbakker2y ago

Falcor is actually part of the "old" architecture described in the talk. Because it's mostly unknown and no longer used I didn't go into the details of it.

Falcor was developed at the time Facebook was developing GraphQL in-house. It has similar concepts, but never took off the way GraphQL did.

parthdesai2y ago

Netflix themselves have moved off falcor though

https://netflixtechblog.com/migrating-netflix-to-graphql-saf...

lfkdev2y ago

`Sad Prime noises`

dustingetz2y ago

iirc falcor predated graphql

ppseafield2y ago

I was at the React Rally conference where Falcor was publicly announced in August of 2015. I recall that Facebook gave a GraphQL presentation right before.

It seems GraphQL was first announced publicly in February 2015.

baby2y ago

Probably because most people don't want to work with Java

madaxe_again2y ago· 5 in thread

Ah, the way they break out artwork calls explains the weird behaviour I see with my U.K. Netflix account in Portugal - English titles, Portuguese posters, regardless of language preferences.

yurishimo2y ago

Yes because you’re being served the Portugal catalog. English is simply a localization setting that can be applied to any region.

dewey2y ago

Then this must be an edge case with UK, as within the EU you would get your "home catalog": https://europa.eu/youreurope/citizens/consumers/internet-tel...

definitelyauser2y ago

Yet you tend to lose english subtitles when travelling in certain regions.

dewey2y ago

How does the artwork explain that? Wouldn't they just need to call the artwork service with your current language preference instead of the default language of your current geolocation?

giraffe_lady2y ago

Internationalization vs localization in the wild.

yayitswei2y ago· 3 in thread

I heard Clojure is fairly popular at Netflix as well.

jvican2y ago

Not true. Clojure use is very rare.

yayitswei2y ago

Good to know, thanks. I don't have insider knowledge but at least from various posts it looks like there's some healthy usage at scale, e.g. https://news.ycombinator.com/item?id=18345341, https://news.ycombinator.com/item?id=18348295

Things may have changed in the last 5 years, though.

technion2y ago

I think this should be assumed for any "x company uses y uncommon language heavily" argument that you read online.

edejong2y ago· 3 in thread

Interesting, no mention of Scala at all. Did Netflix say goodbye to Scala altogether?

paulbakker2y ago

We never really used it, aside from some niche use cases. It’s always been Java primarily.

phendrenad22y ago

Interesting. At one point it was fairly well "known" that Netflix used Scala. They presented at Scala conferences and had lots of open roles mentioning Scala. And Scala fans used Netflix as an example, claiming that they used Scala for recommendations, APIs, etc. But maybe it was all a psyop.

rickette2y ago

You're probably confusing twitter with netflix, the former is/was a scala shop.

dlhavema2y ago· 2 in thread

Most of the postings for backend positions at Netflix I've seen call out nodejs. Can I assume they do both? Is one legacy and the other newer stuff, or are they more complementary?

Anyone on the inside know?

nameless9122y ago

Things are certainly more of a blend now than what's presented in this presentation, but the presenter is a big Java platform guy here. I would say ~70% of the services I interact with on a day to day basis are Java, another 20% in Node, and then the last 10% is a hodgepodge of Python, Go, and more esoteric stuff.

It varies from team to team; the "Studio" organization that supports creating Netflix content does lots of nodeJS due to the perception that it's faster to iterate on a UI and API together if they're both in the same language. On my team, we're very close to 50/50 due to managing a bunch of backend, business process type systems (Java), and a very complex UI (with a NodeJS backing service to provide a graphql query layer). Regardless, the tooling is really quite good, so interacting with a Node service is roughly identical to interacting with a Java service is roughly identical to interacting with anything else. We lean into code generation for clients pretty heavily, so graphQL is a good fit, but gRPC and Swagger are still used pretty frequently.

dlhavema2y ago

Thanks for responding. That's good insight

agilob2y ago· 2 in thread

Is this the talk? Looks like this is it https://www.youtube.com/watch?v=5dpLVvRpPPs

paulbakker2y ago

That's an older version of the same talk. Things have moved a bit since, Java 21 and such, but mostly the same.

edpichler2y ago

Apparently, it is.

talent_deprived2y ago· 2 in thread

It's too bad they are using Gradle and Intellij, used both before, went back to Maven and Eclipse. Personal preference I guess.

ryanianian2y ago

IntelliJ is far and away above any other IDE and is well worth the paid license imho. It's a professional tool written by and for professionals. VSCode and Eclipse have some inertia in very particular workflows/tools and a few different (valid) ways of operating, but nothing is as polished and cohesive as the JetBrains tools.

Gradle, however, is a dumpster-fire of footguns and obtuse and non-debug-able DSLs.

chii2y ago

gradle lets you get something custom working quickly, because it's basically a script with code you want to execute.

For small projects, it works fine, as long as the complexity doesn't grow beyond a certain point for the small project (aka, doesn't grow to a big project), and is maintained by the same person.

For a large project, i do not like gradle at all. Maven is a much better build tool, since standardization is the best thing since sliced bread.

smrtinsert2y ago· 2 in thread

Not surprised about Rx. Rx is great at the UI layer imho, or anything with streams. For microservices, I don't see how it would have ever fit, since microservices should be as simple as possible doing just one thing.

yCombLinks2y ago

Netflix created RXjava

smrtinsert2y ago

created or ported? I thought it was created elsewhere.

coding1232y ago· 1 in thread

Every company that went down the grpc route will be doing hacks for the next 10 years until they eventually get rid of it.

kuratkull2y ago

Please provide a comprehensive substantiation for your comment.

dewey2y ago

In case you are wondering what LOLOMO stands for, it's "List of List of Movies".

yafetn2y ago

Netflix’s DGS framework for GraphQL is nice to work with but we’ve been frustrated with some prioritization choices by the team. For instance, if you’re using Kotlin, it’s impossible to define and pass scalars to the latest version of the client. There’s a year-old issue highlighting this problem that’s been ignored it seems.

https://github.com/Netflix/dgs-codegen/issues/455

geodel2y ago

This seems an entirely unsurprising, standard Java setup. Perhaps it is the proximity to Hollywood that rubs some glamor off on Netflix's bog-standard enterprise tech stack.

gamaralf2y ago

It seems that this article should be titled "How Netflix uses JVM" (not "Java").

The article is superficial, mentions Java, but it seems that Groovy had a more important role there. But in the end, it really talks about the JVM.

It reads like a PR piece from an Oracle and Netflix partnership to promote Java. Oracle have done that before.

talent_deprived2y ago

When they say "applications" do they mean stuff with more meat than microservices? If they're mostly microservices, 2800 seems low to me for someone with the recognition factor of Netflix.

j / k navigate · click thread line to collapse

196 comments

102 comments · 17 top-level

ValtteriL2y ago· 22 in thread

>Netflix observed a 20% increase of CPU usage on JDK 17 compared to JDK 8. This was mostly due to the improvements in the G1 garbage collector.

Help me here, why do GC improvements cause CPU increase?

blackoil2y ago

moffkalast2y ago

bunderbunder2y ago

A new GC could alleviate this by either going easier on the memory itself, or by doing allocations in a way that achieves better locality of reference.

jillesvangurp2y ago

BinaryRage2y ago

The figure is about the overall improvement, not sure why that reads increase.

G1 has seen the greatest improvement from 8 to 17 compared to its counterparts, and you also see reduced allocation rates due to compact strings (20-30%), so that reduces GC total time.

2 more replies

ahoka2y ago

I don't think he meant that.

Macha2y ago

CraigJPerry2y ago

>> while the OS reports lower than expected CPU usage

>> which depending on the CPU metrics you use, may not show as CPU busy time

>> you might end up with 20 servers at 50% usage, but using 10 servers will take twice as long but still appear to be at 50% usage.

xorcist2y ago

If somebody knows how to make that insight actionable, let me know. No, hiring new people is not the answer. In all likelihood that swaps one hard problem for an even harder.

1 more reply

edpichler2y ago

To free memory. Also, 20% increase is not 20% in total. It's 20% when you go from 10 to 12 cpu usage, or from 50 to 60, for instance.

_the_inflator2y ago

Well done.

I always appreciate numbers and the differentiation between relative and absolute numbers in this case.

"We doubled our workforce in one week!" - CEO's first hire... ;)

matsemann2y ago

The CPU can do more tasks without being limited by memory pressure, perhaps?

I guess it depends on if they mean "we used 20% more CPU for the same output", or "we could utilize the CPUs 20% more".

paulbakker2y ago

It’s a 20% improvement. So less time spent on GC.

1 more reply

znpy2y ago

> Help me here, why do GC improvements cause CPU increase?

If you have a better GC, you have shorter and less frequent needs to do a stop the world pause.

Hence the code can run on cpu for more time, getting you higher cpu usage.

Higher cpu usage is often actually good in situations like this: it means you're getting more work done with the same cpu/memory configuration.

dboreham2y ago

Java8 was at least a decade into generational and concurrent GC. It does STW once in a while though which may be what you meant.

1 more reply

tpm2y ago

I read it as a good thing: GC improvements -> more available memory -> more work done by the CPU. But still would be interested in more detail.

groestl2y ago

Because the memory / I/O is not the bottleneck anymore, and the CPU can now run optimally.

jjtheblunt2y ago

ahoka2y ago

Higher CPU usage paradoxically means better performance. When I last did OPS we used to watch total CPU usage of all services and if it was not 100%, then we started to look for a bottleneck to fix.

radomir_cernoch2y ago

Also interested! We saw basically the exact opposite. :-)

pyeri2y ago

It's like hiring more workers to accomplish the exact same output as before. "See, I achieved 20% growth in my targets!", some recruiter will say!

groestl2y ago

inparen2y ago· 22 in thread

Spring Boot and Spring cloud for backend & graphql for the win. ;-)

RamblingCTO2y ago

StevePerkins2y ago

I'm not sure what "no compile time safety in this stack" even means in the context of a strongly-typed compiled language.

pylua2y ago

Once you learn the annotation based configuration it also saves a lot of time.

The performance is valid but it will only keep improving.

Fabricio202y ago

dimgl2y ago

nameless9122y ago

vmaurin2y ago

didntcheck2y ago

krooj2y ago

This is just... ignorance; your argument is basically, "I don't understand/want to learn how X works; therefore, X must be garbage"

smrtinsert2y ago

misja1112y ago

FYI, you can still use XML based configuration in Spring. The choice is yours. See https://docs.spring.io/spring-framework/docs/4.2.x/spring-fr...

I agree it is not common to do it, most teams follow the autoconfiguration madness.

bedobi2y ago

100% agree, Java and Spring are a mess and there's no justifiable reason to use them in 2023 (and no, "that's what we've always used" isn't a good justification)

Like srsly even DropWizard is better than Spring lol, let alone other even simpler frameworks like Ktor which is built on a much improved language over Java

wing-_-nuts2y ago

What do you propose as an alternative? Something like Micronaut trades more compile time for stricter checks and faster runtime. Do you use something like that?

twh2702y ago

We've adopted Quarkus and it's been a breath of fresh air. Excellent all around, DX, performance, features, it's all been good.

Cthulhu_2y ago

Spring is a safe and reliable choice I'd say; not the most exciting, but neither code nor frameworks should be exciting, they're used to solve a problem, they shouldn't become the problem itself.

m_0x2y ago

> not the most exciting

It was exciting when J2EE was dominating.

robertlagrant2y ago

krooj2y ago

The stack is tremendously productive, but history has taught me a few things when dealing with Spring:

inparen2y ago

Ha ha, Spring Security is tricky, and there's a high chance it will surprise someone while "boot"strapping a new project. But once that's done, it's out of the way.

olavgg2y ago

For me delivering business value is the most important metric when I am comparing frameworks. Spring Boot wins every time.

ramon1562y ago

May I recommend Symfony? You get the advantages of Spring but also the nicer things of PHP :-)

baby2y ago

I had to review a Spring application once and that convinced me never to work with Java ever again

zeruch2y ago· 15 in thread

jedberg2y ago

zeruch2y ago

sillywalk2y ago

"A code atrocity with the performance of a sloth on its 8th bong rip with a UX from hell."

Sounds like Apple Music.

hbn2y ago

Apple Music is the only app that has this problem.

fmntf2y ago

I was thinking of Jira

Arrath2y ago

Evocative description there, bravo

civilitty2y ago

It's so evocative that I don't even care about Netflix or Java anymore.

I just want to know where I can buy a bong ripping sloth [1] and whether they're legal in California.

[1] https://imgur.com/a/S3NVS16

myvoiceismypass2y ago

When I was there a decade ago, it started becoming more polyglot friendly (node apps had to use a jvm sidecar to do internal communications originally!)

jedberg2y ago

My team wrote some of the Python libraries for internal services just so we could avoid that Java sidecar! It took 10 times longer to boot the sidecar than the Python app.

dt3ft2y ago

Netflix should make a documentary about this.

Tim256592y ago

Ha..Ha.Ha

bruh22y ago

What does CS stand for here? I guess it's not computer science?

Also, that description made me lmao, thanks

jedberg2y ago

Customer service.

khalilravanna2y ago

To be clear: not ragging on OP in particular at all but more at the widespread practice at a company level.

zeruch2y ago

CS = Customer Support/Care in that regard.

jarym2y ago· 6 in thread

[0] https://netflix.github.io/falcor/

paulbakker2y ago

Falcor is actually part of the "old" architecture described in the talk. Because it's mostly unknown and no longer used I didn't go into the details of it.

Falcor was developed at the time Facebook was developing GraphQL in-house. It has similar concepts, but never took off the way GraphQL did.

parthdesai2y ago

Netflix themselves have moved off Falcor though

https://netflixtechblog.com/migrating-netflix-to-graphql-saf...

lfkdev2y ago

`Sad Prime noises`

dustingetz2y ago

iirc falcor predated graphql

ppseafield2y ago

I was at the React Rally conference where Falcor was publicly announced in August of 2015. I recall that Facebook gave a GraphQL presentation right before.

It seems GraphQL was first announced publicly in February 2015.

baby2y ago

Probably because most people don't want to work with Java

madaxe_again2y ago· 5 in thread

Ah, the way they break out artwork calls explains the weird behaviour I see with my U.K. Netflix account in Portugal - English titles, Portuguese posters, regardless of language preferences.

yurishimo2y ago

Yes because you’re being served the Portugal catalog. English is simply a localization setting that can be applied to any region.

dewey2y ago

Then this must be an edge case with UK, as within the EU you would get your "home catalog": https://europa.eu/youreurope/citizens/consumers/internet-tel...

definitelyauser2y ago

Yet you tend to lose English subtitles when travelling in certain regions.

dewey2y ago

How does the artwork explain that? Wouldn't they just need to call the artwork service with your current language preference instead of the default language of your current geolocation?
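The distinction being asked about can be sketched as a simple fallback rule: use the viewer's explicit language preference when one exists, and only otherwise fall back to the default language of the region they are currently in. This is purely hypothetical; the `ArtworkLocale` class and its `resolve` method are invented for illustration and say nothing about how Netflix's artwork service actually works.

```java
import java.util.Locale;
import java.util.Optional;

class ArtworkLocale {
    /**
     * Hypothetical rule: an explicit user preference wins;
     * the region's default language is only a fallback.
     */
    static Locale resolve(Optional<Locale> userPreference, Locale regionDefault) {
        return userPreference.orElse(regionDefault);
    }

    public static void main(String[] args) {
        Locale portugalDefault = Locale.forLanguageTag("pt-PT");

        // A UK account with an English preference, currently browsing from Portugal.
        System.out.println(resolve(Optional.of(Locale.UK), portugalDefault).toLanguageTag()); // en-GB

        // No stored preference: fall back to the region default.
        System.out.println(resolve(Optional.empty(), portugalDefault).toLanguageTag()); // pt-PT
    }
}
```

Under this rule the posters would follow the account's language rather than the geolocation, which is the behaviour the question expects.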

giraffe_lady2y ago

Internationalization vs localization in the wild.

yayitswei2y ago· 3 in thread

I heard Clojure is fairly popular at Netflix as well.

jvican2y ago

Not true. Clojure use is very rare.

yayitswei2y ago

Things may have changed in the last 5 years, though.

technion2y ago

I think this should be assumed for any "x company uses y uncommon language heavily" argument that you read online.

edejong2y ago· 3 in thread

Interesting, no mention of Scala at all. Did Netflix say goodbye to Scala altogether?

paulbakker2y ago

We never really used it, aside from some niche use cases. It’s always been Java primarily.

phendrenad22y ago

rickette2y ago

You're probably confusing Twitter with Netflix; the former is/was a Scala shop.

dlhavema2y ago· 2 in thread

Most of the postings for backend positions at Netflix I've seen call out Node.js. Can I assume they do both? Is one legacy and the other newer stuff, or are they more complementary?

Anyone on the inside know?

nameless9122y ago

dlhavema2y ago

Thanks for responding. That's good insight

agilob2y ago· 2 in thread

Is this the talk? Looks like this is it https://www.youtube.com/watch?v=5dpLVvRpPPs

paulbakker2y ago

That's an older version of the same talk. Things have moved a bit since, Java 21 and such, but it's mostly the same.

edpichler2y ago

Apparently, it is.

talent_deprived2y ago· 2 in thread

It's too bad they are using Gradle and IntelliJ. I've used both before and went back to Maven and Eclipse. Personal preference, I guess.

ryanianian2y ago

Gradle, however, is a dumpster-fire of footguns and obtuse and non-debug-able DSLs.

chii2y ago

gradle lets you get something custom working quickly, because it's basically a script with code you want to execute.

For small projects, it works fine, as long as the complexity doesn't grow beyond a certain point for the small project (aka, doesn't grow to a big project), and is maintained by the same person.

For a large project, i do not like gradle at all. Maven is a much better build tool, since standardization is the best thing since sliced bread.

smrtinsert2y ago· 2 in thread

yCombLinks2y ago

Netflix created RxJava

smrtinsert2y ago

created or ported? I thought it was created elsewhere.

coding1232y ago· 1 in thread

Every company that went down the gRPC route will be doing hacks for the next 10 years until they eventually get rid of it.

kuratkull2y ago

Please provide a comprehensive substantiation for your comment.

dewey2y ago

In case you are wondering what LOLOMO stands for, it's "List of List of Movies".

yafetn2y ago

https://github.com/Netflix/dgs-codegen/issues/455

geodel2y ago

This seems like an entirely unsurprising, standard Java setup. Perhaps it's the proximity to Hollywood that lets some glamor rub off on Netflix's bog-standard enterprise tech stack.

gamaralf2y ago

It seems that this article should be titled "How Netflix uses the JVM" (not "Java").

The article is superficial: it mentions Java, but it seems that Groovy had a more important role there. In the end, it really talks about the JVM.

It reads like a PR piece from an Oracle and Netflix partnership to promote Java. Oracle has done that before.

talent_deprived2y ago

When they say "applications" do they mean stuff with more meat than microservices? If they're mostly microservices, 2800 seems low to me for someone with the recognition factor of Netflix.
