Walmart Node.js Memory Leak

(joyent.com)

466 points · btmills · 12y ago · 75 comments

47 comments · 17 top-level

diminoten · 12y ago · 9 in thread

I'm actually looking into a segfault issue deep in the bowels of a C++ addon we have in node.js (anyone in #node.js will have seen me over the past few weeks ask about it), but what reading this makes me realize is how woefully underequipped I am to hunt for problems of this nature.

My problem is likely in one of our addons, but this kind of debugging, this whole genre of problem solving is entirely beyond me. How do I get to this level? What do I need to learn? To study?

It's just a little depressing to read something like this and see how far the road ahead goes, despite how far I've already traveled...

JoachimSchipper · 12y ago

Debugging severe memory corruption or memory leaks is annoying, and can occasionally take a lot of time, but it's not necessarily that bad. Here are some pointers that may be helpful.

Tools: valgrind and gdb are obvious. But don't forget your compiler! Crank up the warnings, and look through LLVM-clang's -fsanitize=<foo> and warning options. (Also, if you're already on OpenBSD, check out the "S" flag to malloc; if you're on Solaris, check out, well, the blog post.) Finally, Boehm's conservative garbage collector has a "find memory leaks" mode, which looks useful for those cases where you can't get valgrind working. If all else fails, shovel through the memory dump looking for repeated patterns.

Testing: try to reproduce the problem; the first iteration may look something like "it runs out of memory after 36 hours". Then simplify: for instance, the author of the article could have asked "does this still happen if the server closes the connection immediately, without sending any data" and would have found the bug very quickly. (Of course, you're likely to ask a lot of wrong questions before hitting on the right one; experience and a full knowledge of the system you're working on are useful but not sufficient.) Questions like "does this happen more quickly if we ping 100 times per second instead of once every ten minutes" are often useful as well. (Finally, just printing memory usage every N seconds is helpful.)

Coding: be careful when writing code. The usual ways of improving code quality (e.g. code reviews) work to reduce memory leaks, too. Try to run a multiple-hour soak test every so often during development (preferably on a CI server); it's a lot easier to debug "hey, we suddenly run out of memory after yesterday's commits" than "well, something goes wrong in production". If you're doing new development, consider alternatives to malloc() - arena/pool allocation (e.g. libtalloc) is convenient and very fast if your memory use is tree-like (e.g. a connection owns a request, which owns some memory to sort the data before returning it). In C, goto a single chunk of cleanup-and-return code rather than duplicating the cleanup at every place where you exit from the function.
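The soak-test and "print memory usage every N" suggestions above can be sketched in a few lines. This is only a minimal Python illustration under stated assumptions, not anything from the article: `soak`, `looks_leaky`, and the two workloads are hypothetical names, and `tracemalloc` only sees allocations made through Python's own allocator, so it would not catch a leak inside a native addon like the one in the blog post.

```python
import tracemalloc


def soak(workload, iterations, sample_every):
    """Run `workload` repeatedly and return traced-memory samples in bytes."""
    tracemalloc.start()
    samples = []
    try:
        for i in range(1, iterations + 1):
            workload()
            if i % sample_every == 0:
                # get_traced_memory() returns (current, peak) in bytes.
                current, _peak = tracemalloc.get_traced_memory()
                samples.append(current)
    finally:
        tracemalloc.stop()
    return samples


def looks_leaky(samples, tolerance_bytes):
    """Crude check: did usage grow by more than the tolerance over the run?"""
    return len(samples) >= 2 and samples[-1] - samples[0] > tolerance_bytes


_retained = []  # hypothetical module-level state that is never trimmed


def leaky_workload():
    # Every call keeps another 10 kB buffer alive forever.
    _retained.append(bytearray(10_000))


def clean_workload():
    # The buffer becomes garbage as soon as the call returns.
    bytearray(10_000)


if __name__ == "__main__":
    for name, workload in [("leaky", leaky_workload), ("clean", clean_workload)]:
        samples = soak(workload, iterations=200, sample_every=50)
        print(name, samples, looks_leaky(samples, tolerance_bytes=1_000_000))
```

Run from CI after each day's commits, a check like this turns "something goes wrong in production" into a failing build, at least for leaks the tracer can see.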

benihana · 12y ago

>Tools: valgrind and gdb are obvious.

The fact that someone is saying they don't know where to start seems to indicate this isn't true.

alanctgardner2 · 12y ago

In my experience from debugging C programs:

- Use valgrind (or gdb)! Your segfault should be simpler to find than a memory leak, because you know what line the segfault happens on.

- If you have a value that's getting mangled (pointer getting overwritten by a write to another address) and you can't figure out why, use watchpoints to see when that address is getting touched. http://sourceware.org/gdb/onlinedocs/gdb/Set-Watchpoints.htm...

- Find a minimal program to reproduce the problem. It's gross, but I used to actually just take a copy of the code and cut things out until the bug stopped, then look at the last thing I cut. You can do this as a binary search - only run the first half, check for the bug, only run the second half, check for the bug, repeat on the buggy half.

As I said, segfaults are a lot easier than this kind of problem (not that they're easy when you start out). Don't be discouraged! I would help out too, but you'd need to send everything to reproduce the bug (client code, server code, server platform, etc.)
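The cut-things-out-and-retest idea from the last bullet can be sketched as a bisection. This is a rough Python illustration with loud assumptions: `find_culprit` and the part names are made up, the code assumes exactly one part is responsible, and it assumes any subset of parts can be run on its own with a deterministic result. Real code rarely splits that cleanly.

```python
def find_culprit(parts, triggers_bug):
    """Return the single part whose presence makes `triggers_bug` true.

    Assumes exactly one culprit and that `triggers_bug(subset)` is
    deterministic for any subset of `parts`.
    """
    candidates = list(parts)
    if not triggers_bug(candidates):
        raise ValueError("bug does not reproduce with all parts enabled")
    while len(candidates) > 1:
        mid = len(candidates) // 2
        first_half = candidates[:mid]
        # Keep whichever half still shows the bug.
        if triggers_bug(first_half):
            candidates = first_half
        else:
            candidates = candidates[mid:]
    return candidates[0]


if __name__ == "__main__":
    # Hypothetical component names; the predicate stands in for
    # "build and run with only these parts, then check for the crash".
    parts = ["auth", "cache", "logging", "ping_handler", "metrics"]
    print(find_culprit(parts, lambda subset: "ping_handler" in subset))
```

With n parts this needs roughly log2(n) runs instead of n, which matters when each run takes hours to show the problem.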

justin66 · 12y ago

> It's gross, but I used to actually just take a copy of the code and cut things out until the bug stopped, then look at the last thing I cut.

That's not gross.

stiff · 12y ago

I can't claim to be at the level of the Joyent guys presented here, but I think taking an Operating Systems class and a Computer Architecture class, or reading the respective textbooks, helps. At the same time you have to be familiar with the particular OS you happen to use, probably up to the point of reading and having a basic understanding of the source code of the most important subsystems (virtual memory, process scheduling, filesystem handling, TCP/IP stack) and understanding what the system calls are and what they do. Then you need to know the wide range of tools the given OS offers for examining things, so that you do not get hopelessly stuck in the face of an emergency: you often have to investigate a crash while it happens to even be able to reproduce it, so you need to know how to examine a running process, etc. For Linux this means knowing stuff like:

http://en.wikipedia.org/wiki/Strace

http://en.wikipedia.org/wiki/Lsof

http://en.wikipedia.org/wiki/Vmstat

http://en.wikipedia.org/wiki/Netstat

http://en.wikipedia.org/wiki/DTrace

http://en.wikipedia.org/wiki/Tcpdump

http://en.wikipedia.org/wiki/Magic_SysRq_key

https://perf.wiki.kernel.org/index.php/Main_Page

...

There is a big bunch of tools in the OS that very few developers know; sysadmins know more, but they often don't understand the OS and use the tools without understanding their output too well.

Some confirmation of what I have written here is the fact that Joyent forked OpenSolaris to create an OS precisely to make it easier to do things of this kind:

http://wiki.smartos.org/display/DOC/Why+SmartOS+-+ZFS%2C+KVM...

> In 2005, Sun Microsystems open sourced Solaris, its renowned Unix operating system, eventually to be released as a distribution called OpenSolaris. Among the earliest adopters and most effective advocates of OpenSolaris was Ben Rockwood, who wrote The Cuddletech Guide to Building OpenSolaris in June, 2005 – the first of his many important contributions to the nascent OpenSolaris community. Meanwhile, Joyent's CTO Jason Hoffman was frustrated by the inability of most operating systems to answer seemingly-simple questions like: "Why is the server down? When will it be back up? ... Now that it's back up, why is my database still slow?"

> Jason knew that these questions would be a lot easier to answer on Solaris-based systems, and recognized Sun's open-sourcing initiative as a huge opportunity.

barrkel · 12y ago

You need determination and experience, and some knowledge of how code is compiled at a low level.

Tools like those described in the article are handy, but aren't absolutely necessary. They save a lot of time, but the same effects can usually be gotten by more laborious means.

You have a segfault. You should know where in the code it's occurring already; it's either an access to bad memory with the instruction pointer (IP) at the point of access, or it's an attempt to execute code with the IP pointing at the bad memory, in which case the top of the stack (or, depending on calling convention, one of the registers) normally contains the place where it came from (necessarily, since the code expected to be returned to).

There are ways to turn an instruction pointer into a line number when you have appropriate debug info, if you can't get the program running under a debugger.

Given the line number, segfaults can typically be split into three categories: plain bad logic, use after free, and memory corruption. The last is hardest to find IME, most easily done using a debugger and hardware breakpoints on memory address modifications, but you need a stable repro and a consistent memory allocator that gives predictable addresses for every rerun.

If any of the above is meaningless to you, it should give you some clues as to where you need to research.

jenandre · 12y ago

Crashes (that can be reliably reproduced) should be way easier to debug than memory leaks.

a) build a `debug` version of node (building node from source creates a node_g version which is a debug version)

b) build a `debug` version of all of the c++ addons in your node_modules folder (node-gyp build -d for each addon)

c) start gdb with the debug version of node: `gdb ./node_g`

d) in gdb, run your node script using `run <script.js>` -- add any other options

e) wait for it to crash, and then type `bt` - you'll see the location of the crash which should give you a good starting place.

diminoten · 12y ago

Yeah I got this far, but the stack trace doesn't have the symbols. The guy I'm working with said it might be because the addons use dynamically linked libraries instead of statically linked libraries...

By the way, I think a jenandre used to work at my company.

ryanobjc · 12y ago

I find you need two things:

- the "troubleshooter" mentality/thinking pattern

- extensive system knowledge

I haven't figured out how to teach #1, except maybe for "don't have anyone to ask for help" and #2 is self explanatory.

davidw · 12y ago · 7 in thread

I looked at node.js for a system I'm involved with creating, but ultimately we went with Erlang just because it's been around a lot longer and is more stable in terms of things like this. We're working on a semi-embedded system that will not always be on-line or accessible for debugging. We also considered Go, which probably would have been more familiar to C++ guys, but it was also deemed a bit immature even if it seems like a very pleasant language to work with.

Cool writeup though!

pron · 12y ago

Of the three options you've considered, Erlang is clearly the best choice, but why haven't you even considered Java (or any other JVM language)? When it comes to monitoring, profiling or debugging a long-running application, nothing comes close to the JVM. And, needless to mention, it's extremely mature and stable.

A Java memory leak can be solved in a matter of minutes, or – if it's especially complex – in a couple of hours tops. You can take a heap dump and analyze it with Eclipse Memory Analyzer, and if you need allocation stack-traces, you instrument your code with VisualVM. All of this can be done remotely and without stopping the app.

Flight Recorder, which has recently been added to the HotSpot VM, even gives you instrumentation with hardly any performance penalty (though it requires a commercial license if used in production).

davidw · 12y ago

Standard Java is a memory hog, and memory usage was a consideration: it is a semi-embedded system. Plus, no one on our team knows it that well.

I know there are small-footprint Javas, but then you're maybe wandering off the beaten path...

girvo · 12y ago

I know for a similar project I did, I ruled out Java as I (and my team at the time) were not very productive in it. It mattered in that instance, but might not in some others. Depends on your team's skill set, I guess!

andreypopp · 12y ago

2 or 3 years ago I hit a memory leak in Erlang's stdlib's httpc... just saying.

rcb · 12y ago

This is impressive work by the Joyent team!

I've seen two sources of memory leaks in Erlang based systems: 1) unbounded process message queues, and 2) passing binaries across process (pid) boundaries.

Many beginning erlangers run into these, and they're relatively easy to identify and correct. With a little practice, these become easy patterns to recognize and avoid.

As far as httpc, I'm unaware of that bug -- but I can say that I recently worked on a commercial product that leveraged httpc as a core component of the service, and it worked fine.

lpgauth · 12y ago

I wouldn't recommend httpc to anyone... It's really slow and doesn't scale at all. There are plenty of better alternatives: lhttpc, ibrowse, hackney.

gcb1 · 12y ago

is inet part of stdlib?

rcthompson · 12y ago · 5 in thread

Ironically, this page hangs Chrome indefinitely when I try to load it. Luckily it only hangs the tab so I can still close it. I guess I'll fire up Firefox to see if I can actually read the article.

Edit: Actually, it loads fine in a private browsing tab, so it must be a bad interaction with some extension. Oh well.

dfc · 12y ago

I am curious why you find this ironic? What is your definition of irony?

pritambaral · 12y ago

Chrome uses V8. Chrome is the primary user of V8.

Not supporting OP's definition of irony, whatever it is, just speculating how OP could've thought of it.

somethingnew · 12y ago

The opposite of wrinkly.

tfb · 12y ago

It loads instantly for me. I'm using Chrome 31.0.1650.57 on Windows 7.

rcthompson · 12y ago

31.0.1650.57 on OSX 10.9. Firefox handles it just fine. Could be some interaction with an extension I have installed.

ilaksh · 12y ago · 4 in thread

I think there are still quite a few C and C++ programmers out there. To me this is a great example of why it is better software engineering to write a server in something like Node.js. Because rather than having a million code bases with potential memory leaks like this one, there is just the Node code. In ordinary JavaScript code it's impossible to cause a problem like that.

sbov · 12y ago

It is fairly easy to create a long running server in a GC'd language that will continually consume more memory. Some don't like to call it a memory leak, which is why I put it the way I did, but the effect is the same.

At the end of the day, the more that you think this is impossible the more likely your programs will experience it. So please don't think that your program is immune to this because you use Javascript.
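The kind of growth described above usually comes from data that stays reachable, so the collector is never allowed to free it. Here is a minimal sketch in Python rather than JavaScript, with hypothetical class names: a per-request cache that is never evicted grows with every distinct key, while a size-capped variant stays flat.

```python
from collections import OrderedDict


class UnboundedCache:
    """Keeps every result forever; memory grows with the number of distinct keys."""

    def __init__(self):
        self._entries = {}

    def get(self, key, compute):
        if key not in self._entries:
            self._entries[key] = compute(key)
        return self._entries[key]

    def __len__(self):
        return len(self._entries)


class BoundedCache:
    """Same interface, but evicts the least recently used entry past a cap."""

    def __init__(self, max_entries):
        self._entries = OrderedDict()
        self._max_entries = max_entries

    def get(self, key, compute):
        if key in self._entries:
            self._entries.move_to_end(key)  # mark as most recently used
            return self._entries[key]
        value = compute(key)
        self._entries[key] = value
        if len(self._entries) > self._max_entries:
            self._entries.popitem(last=False)  # drop the oldest entry
        return value

    def __len__(self):
        return len(self._entries)


if __name__ == "__main__":
    unbounded, bounded = UnboundedCache(), BoundedCache(max_entries=100)
    for request_id in range(1000):  # stands in for a long-running server
        unbounded.get(request_id, lambda k: "response-%d" % k)
        bounded.get(request_id, lambda k: "response-%d" % k)
    print(len(unbounded), len(bounded))
```

Nothing here is a leak in the C sense, since every entry is still referenced, but from the outside the process looks the same: memory climbs until something restarts it.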

tantalor · 12y ago

Good example might be a server process which never releases memory, so the longer it runs the more memory it "consumes". That is, the maximum memory required to handle any previous request.

This might be a well known solved problem, but I have heard it mentioned before.

sebcat · 12y ago

It's all a matter of what you're trying to do. If you have a nice test framework with good coverage and automated build tests running stuff by valgrind, you mitigate the risk of having memory leaks.

Is it worth all the extra effort, when you could just go with a language that does GC at runtime? Sometimes it is, it depends on the use case and it depends on the people.

I've done a project rewrite from C to Java where the Java implementation performed a lot better and consumed less memory than the C one. Some of the performance gain was because I chose better algorithms and limited DB interaction, but some of the gain came from having immutability guarantees whereas the C code would just copy a lot of data structures where immutability was not guaranteed. A lot of time in the C based project was spent doing mallocs and frees and memcpys for nothing. This is poor project design, but poor design happens and Java has some protection against that due to promoting encapsulation to a greater extent than C by default.

I am 100% certain that if the original project would've been better designed and managed, it would've kicked ass because having it in C would have allowed us to have a smaller memory footprint which would've meant a greater monetary profit in the end for this particular project over time due to system constraints.

What it comes down to is that if you have a good team that understands the required dev process of a mid-sized C project and is proficient enough to implement such a project without doing too much "quick fixin'", it can be worth it. If you're limited by the size and/or competence of the team (not everyone can be a rockstar; I certainly am not) or limited in turnaround time for the product, choosing C will probably not be in your best interest. But with the right people around, and if there's a monetary gain in doing things efficiently with the hardware you have, C is still an awesome tool to have in your toolbox.

Most of the time, in the world of SaaS and web based solutions, using C doesn't make a lot of sense except for some bits of core functionality. That's why I like languages with good C bindings. Knowing e.g., Python and C, you really can get the best of both worlds.

IMHO, YMMV, &c

EDIT: inserted some newlines

_random_ · 12y ago

...or you can pick a language that is better than both: GC + strong and static - Scala/C# etc.

ryanseys · 12y ago · 3 in thread

And a one-line fix. Damn that must be satisfying.

yen223 · 12y ago

Reminds me of that old joke:

The office photocopier broke down, so the manager called in a repairman. The repairman takes one look at the machine, draws an 'X' at the problem part, and hands the manager a bill for $500. The manager was shocked at the price, and demanded an itemized bill. The repairman simply wrote:

    Marking the 'X'              -   $1
    Knowing where to put the 'X' - $499

lstamour · 12y ago

I started Googling the Picasso "principle" about it being a lifetime to know how to do it, but it turned into Googling this one instead. Found a snippet, "Karl Steinmetz (German-born, U.S citizen), the well known electrical engineer who worked out many details of a.c. theory and was responsible largely for the adoption of a.c. for commercial use, was once called in by the General Electric Company to examine a poorly performing transformer. After a few minutes, Steinmetz marked an x on the transformer core and said, “It will work if you take off the turns from this x to the end.” The prescription worked well, and Steinmetz later sent G.E. a bill for his service of $10,000. The company official thought the bill excessive and asked for the itemization. Steinmetz then sent them a more detailed bill: For putting x on transformer core : $1; for knowing where to put the x: $9999." It's funny that in today's world, both Picasso and Steinmetz take "minutes" to do this, but in perhaps earlier tellings, it took hours for Picasso to do his work and days for Steinmetz: http://edisontechcenter.org/CharlesProteusSteinmetz.html

Scottopherson · 12y ago

Man I'd be shocked too if the repairman only drew an 'X' on the problem part instead of repairing the problem part.

batbomb · 12y ago · 1 in thread

Can anyone tell me if there is a reason for this in bash?

     DEST=~~/public/walmart.graphs

stewars · 12y ago

Not bash. '~~' gets replaced with the value of the MANTA_USER environment variable by the manta command line tool mput.

atomical · 12y ago · 1 in thread

I assume that they can restart the server at intervals or use load balancing. A few months of developer time for something like this seems excessive unless he was working on something else as well.

spyc3r · 12y ago

As a former software engineer at Walmart I can tell you that a few months for something like that is nothing to them. They employ several thousand devs at the home office. Having one of them focus on a bug like this isn't an issue in terms of time or money. In their minds it's worth it given the scale of the enterprise.

ambirex · 12y ago

Thank you, I really enjoy detailed write-ups like this. It is fascinating to see how an engineer approaches an elusive problem.

jzwinck · 12y ago

I'd like to read more about how we can prevent this class of error going forward. Could stronger typing or RAII or some other feature or trick have made the bug apparent at compile time?

I made a very basic Node.js module in C++ with V8 and it was surprisingly difficult to make a good (idiomatic JS behaviour, believably bug-free) wrapper for a straightforward class and factory method. I say this coming from Boost Python and Luabind, where there are some tricky parts to bind complex classes, but simple ones are easy enough, and once written, obviously correct.

city41 · 12y ago

I've been running an extremely simple Node application on 0.10.18 for a while now and it has a very gradual memory leak. My code is just a few dozen lines, and it all seems pretty innocent. I am also using Hapi, so I thought maybe Hapi has a leak in it somewhere. Now I wonder if I have the same leak as Walmart here. I just now upgraded to 0.10.22 and am curious to see where I end up. If the leak goes away then hot damn, I got lucky :)

charlieflowers · 12y ago

FYI, a typo -- "illusive" -> "elusive". (haven't read further yet, just wanted to let you know).

aaronbrethorst · 12y ago

Wonderful blog post; major props for the engineering time expenditure. But why do you have an Olark chat widget that says "Contact Sales"? I don't want to have anything to do with those schlubs! If anything, I want to talk to serious engineers like you!

Perhaps a better call to action would be:

* Talk to us about how we can solve your problems

* Chat with us

* We can help you too

* What's up?

patrickg_zill · 12y ago

That is pretty impressive - I love how they could use DTrace to scope out what was going on.

retr0h · 12y ago

I've always loved the debugging tools in solaris (smartos or whatever now).

joeblau · 12y ago

Excellent details on the sleuthing that went on to find this error. I think it's great that there are great tools available to debug errors like this and your write up helps me in learning more about how to go about properly debugging my Node apps.

jnazario · 12y ago

cool writeup. while not a node.js user, i love these sorts of tours of system internals - i always learn a lot, both specific tools and also processes of using them.

thanks for the details, very articulate and useful stuff.

jokoon · 12y ago

we know that node.js is a bad piece of software, you don't need to remind us about it all the time

(down vote me)
