We're hoping there are also a bunch of other interesting side effects of enabling Valgrind for Go, in particular seeing how we can use it to track how the runtime handles memory (hopefully correctly!).
edit: also strong disclaimer that this support is still somewhat experimental. I am not 100% confident we are properly instrumenting everything, and it's likely there are still some errant warnings that don't fully make sense.
But I wonder why it's not trivial to throw a bunch of different inputs at your cipher functions and measure that the execution times are all within an epsilon tolerance?
I mean, you want to show constant time of your crypto functions, why not just directly measure the time under lots of inputs? (and maybe background Garbage Collection and OS noise) and see how constant they are directly?
Also, some CPUs have a counter for conditional branches (which the rr debugger leverages); you could sample it before and after and make sure the number of conditional branches does not change between decrypts -- as that AGL post mentions, identical branching is important for constant time.
Finally, it would also seem trivial to track the first 10 decrypts, take their maximum time plus a small tolerance of a few extra nanoseconds, and pad every following decrypt by a few nanoseconds (executing noops) to force constant time when it varies.
And you could add an assert that anything over that established upper bound crashes the program, since it is violating the constant-time property. I suppose the real difficulty is if the OS deschedules your execution and throws off your timing check...
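Roughly what I have in mind, as a Go sketch (decrypt is a stand-in for whatever primitive is under test; the warmup count and slack are arbitrary, and a sleep stands in for the noop padding):

```go
package main

import (
	"crypto/rand"
	"fmt"
	"time"
)

// decrypt is a placeholder for the constant-time primitive under test.
func decrypt(input []byte) {
	_ = input
}

func main() {
	const warmup = 10
	var bound time.Duration

	for i := 0; i < 1000; i++ {
		input := make([]byte, 64)
		if _, err := rand.Read(input); err != nil {
			panic(err)
		}

		start := time.Now()
		decrypt(input)
		elapsed := time.Since(start)

		switch {
		case i < warmup:
			// Establish an upper bound from the first few calls, plus slack.
			if d := elapsed + 50*time.Nanosecond; d > bound {
				bound = d
			}
		case elapsed > bound:
			// Exceeding the bound suggests a constant-time violation -- or
			// just the OS descheduling us, which is the hard part.
			panic(fmt.Sprintf("decrypt took %v, bound %v", elapsed, bound))
		default:
			// Pad the remainder so every call takes (at least) the bound.
			time.Sleep(bound - elapsed)
		}
	}
}
```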
My guess is because the GC introduces pauses and therefore nondeterminism in measuring the time anything takes.
Love that they have taken this route; this is the way bootstrapped toolchains should be: minimal building blocks and everything else in the language itself.
Assembly isn't that hard; those of us who grew up around 8-bit home computers were writing Z80 and 6502 assembly at 10 - 12 years old, while having fun cracking games and laying the roots of the demoscene.
The older I get the more I value commit messages. It's too easy to just leave a message like "adding valgrind support", which isn't very useful to future readers doing archaeology.
Otherwise the relevant warnings get swamped by a huge number of irrelevant warnings.
This is why running Valgrind on Python code does not work.
So you are confirming the problem, but treating ignoring it as if that were the solution for everyone?
Valgrind(-memcheck) is an extremely important tool in memory-unsafe languages.
Having said that, it saved my ass a lot of times, and I’m very grateful that it exists.
From a quick glance, it seems that Go is now registering the stacks and emitting stack change commands on every goroutine context switch. This is most likely enough to make Valgrind happy with Go's scheduler.
Also, strictly speaking all Go programs are multithreaded. The inability to spawn a single-threaded Go program is actually a huge issue in some system tools like container runtimes and requires truly awful hacks to work around. (Before you ask, GOMAXPROCS=1 doesn't work.)
I'd be interested to know why Valgrind vs the Clang AddressSanitizer and MemorySanitizer. These normally find more types of errors (like use-after-return) and I find them significantly faster than Valgrind.
I'm not sure if this will work though, will it @bracewel?
(Valgrind using a CPU emulator allows for a lot of interesting things, such as also emulating cache behavior and whatnot; it may be slow and have other drawbacks -- it has to be updated every time the instruction set adds a new instruction for instance -- but it's able to do things that aren't usually possible otherwise precisely because it has a CPU emulator!)
Disclaimer: I work on continuous profiling for Datadog and contribute to the profiling features in the runtime.
Usually I go with pprof, like basic stuff, and it helps. I would NOT say memory leaks are the biggest or most common issue I see; however, as time goes on and services become more complicated, what I often see in the metrics is RAM getting eaten and never freed, so the app eats more and more memory over time and only a restart helps.
It's hard to call it a memory leak in the "original meaning of memory leak", but the memory does not get cleaned up because of the choices I made, and I want to understand how to do better.
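In case it helps anyone, the basic pprof setup I mean is just this (a minimal sketch; localhost:6060 is the conventional address, not a requirement):

```go
package main

import (
	"log"
	"net/http"
	_ "net/http/pprof" // registers the /debug/pprof/* handlers on the default mux
)

func main() {
	// Expose the profiling endpoints; in a real service this runs
	// alongside the rest of the application.
	log.Fatal(http.ListenAndServe("localhost:6060", nil))
}
```

Then "go tool pprof http://localhost:6060/debug/pprof/heap", comparing inuse_space snapshots taken at two points in time, usually shows where the growth is.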
Thanks for the tool!
In Java, heap fragmentation is usually considered a separate issue, but I understand Go has a non-moving garbage collector, so you can lose memory to pathological allocation patterns that overly fragment the heap and require constantly allocating new pages. I could be wrong about this since I don't know a lot about Go, but heap fragmentation can cause trouble for long-running programs with certain types of memory allocation.
Besides that, applications can leak memory by stuffing things into a collection (a map or list) and then not cleaning it up when entries become "stale". The references are live from the perspective of the garbage collector but dead from the application's perspective. Weak references exist to solve this problem when you expose an API that stores something but can't know when it goes out of scope. I wouldn't consider this to be common, but if you are building a framework or any kind of platform code you might need to reach for it at some point. Some crazy folks also intern every string they encounter "for performance reasons", and that can obviously lead to what less crazy folk would consider a memory leak. Other folk stick a cache around every client and might not tune the cache parameters, leading to unnecessary memory pressure...
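A contrived sketch of that stale-entry pattern (the sessions map and the sizes are made up for illustration):

```go
package main

import "time"

type session struct {
	data    []byte
	lastUse time.Time
}

// sessions grows forever if nothing ever deletes stale entries: every value
// stays reachable from the map, so the GC can never collect it.
var sessions = map[string]*session{}

func touch(id string) {
	s, ok := sessions[id]
	if !ok {
		s = &session{data: make([]byte, 1<<20)}
		sessions[id] = s
	}
	s.lastUse = time.Now()
}

// evict is the part that's easy to forget: drop entries the application
// considers dead, even though they're still "live" to the GC.
func evict(maxAge time.Duration) {
	for id, s := range sessions {
		if time.Since(s.lastUse) > maxAge {
			delete(sessions, id)
		}
	}
}

func main() {
	touch("a")
	evict(time.Hour)
}
```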
Most other widely used GCed languages don’t allow the use of arbitrary interior pointers (though most GCs can actually handle them at the register level).
It can happen when your variables have too long a lifespan, or when you have a cache where the entries are not properly evicted.
A common one I see is opening a big file, creating a "new slice" over a subset of it, and then using the "new slice" while expecting the old large object to be dropped.
Except the "new slice" is just a reference into the larger slice, so it's never marked unused.
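In code, that pattern looks something like this (the file name is hypothetical; bytes.Clone is one common fix):

```go
package main

import (
	"bytes"
	"os"
)

func header(path string) ([]byte, error) {
	data, err := os.ReadFile(path) // may be hundreds of MB
	if err != nil {
		return nil, err
	}
	// (assumes the file is at least 64 bytes)

	// Leaky: the returned slice shares data's backing array, so the whole
	// file stays reachable for as long as the 64-byte "header" is held:
	//   return data[:64], nil

	// Fix: copy out the bytes you need so the big buffer can be collected.
	return bytes.Clone(data[:64]), nil
}

func main() {
	_, _ = header("big.bin")
}
```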
- you can create deadlocks
- spawn goroutines without making sure they have proper exit criteria
- use slices of large objects in memory and pass them around (e.g. read files in a loop and pass along only a slice of the whole buffer)
- and so on

Do you think they will enable Valgrind if there are no leaks?
As it is, the only way to currently handle that is with "-gcflags -m=3" or using something like the VSCode Go plugin, via the "ui.codelenses" and "ui.diagnostic.annotations" configurations.
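For instance, a toy case where that escape-analysis output matters (the exact message text varies by Go version, but it's something like "moved to heap: x"):

```go
package main

// escapes forces x onto the heap: its address outlives the call frame.
// `go build -gcflags=-m` flags this, and -m=3 adds the full reasoning chain.
func escapes() *int {
	x := 42
	return &x
}

func main() {
	_ = escapes()
}
```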
In Go, never launch a goroutine unless you know exactly how it will be cleaned up.
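The usual discipline is to tie every goroutine to something that can stop it, e.g. a context; a minimal sketch:

```go
package main

import (
	"context"
	"fmt"
	"time"
)

// worker exits when its context is cancelled; without the ctx.Done() case
// it would block on the channel forever and leak.
func worker(ctx context.Context, jobs <-chan int) {
	for {
		select {
		case <-ctx.Done():
			fmt.Println("worker: shutting down")
			return
		case j := <-jobs:
			fmt.Println("worker: got", j)
		}
	}
}

func main() {
	ctx, cancel := context.WithCancel(context.Background())
	jobs := make(chan int)

	go worker(ctx, jobs)
	jobs <- 1

	cancel() // guarantees the goroutine has a way to exit
	time.Sleep(10 * time.Millisecond)
}
```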
Don't get me wrong, I love Valgrind, and I used it extensively in my past life as a C developer. Though the fact that Go needs Valgrind feels like a failure of the language or the ecosystem. I've been doing Rust for ~6 years now, and haven't had to reach for Valgrind even once (I think a team member may have used it once).
I realize that's probably because of cgo, and maybe it's in fact a step forward, but I can't help but feel like it is a step backwards.
At least that's why I wrote that original comment.
Which is sad because I like the language and find it useful. But a part of the community does a disservice with comments like your parent comment. It's often on the cusp of calling people who code in Go "stupid". But I digress.
I guess there's also callgrind that may be useful for Gophers.
When I used it before, I was working on a Rust module that was loaded in by nginx. This was before there were official or even community bindings for nginx... so there was a lot of opportunity for mistakes.
I also seldom need something like this in Java, .NET or node, until a dependency makes it otherwise.
I guess maybe the failure is not the addition of it (it's useful for people writing the bindings), but rather how happy everyone in the thread is about it (which suggests it's more useful than it should be, due to a failure in the ecosystem).
This isn't so much about leaks. The most important thing this will enable is correct analysis of uninitialised memory: without annotations, memory that gets recycled will not be correctly poisoned. I imagine that it will also be useful for the other tools (except cachegrind and callgrind).
Memcheck (the main tool) has shortcomings (very slow, does not detect all kinds of errors). Its strongest point is that it does not need an instrumented build. That can be particularly important if you have issues in 3rd party libraries that you can't build. Its other strong point is that it checks for both addressability and initialisedness at the same time.
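To make the cgo case mentioned elsewhere in the thread concrete, here's the sort of thing memcheck catches with no instrumented build at all (a minimal sketch):

```go
package main

/*
#include <stdlib.h>
*/
import "C"

func main() {
	// Allocate on the C heap; this memory is invisible to Go's GC and
	// will never be collected automatically.
	buf := C.malloc(64)
	_ = buf

	// Without a matching C.free(buf), running the binary under
	// `valgrind --leak-check=full` reports the 64 bytes as definitely lost.
}
```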
My favourite feature is using GDB with Valgrind+vgdb. That allows you to see what memory is addressable and/or initialised from within GDB.
Apple have been making big changes that keep breaking things, and Valgrind has not kept up. Louis Brunner has done an amazing job, more or less single-handedly managing to keep the basic flow working.
But maybe others will find a way to use it. Who knows?