Using unwrap() in Rust is okay (opens in new tab)

(blog.burntsushi.net)

212 pointscp93y ago197 comments

197 comments

78 comments · 16 top-level

dcsommer3y ago· 19 in thread

I totally agree with de-emphasizing the old "recoverable" vs. "unrecoverable" dichotomy (https://blog.burntsushi.net/unwrap/#what-about-recoverable-v...). Every time I've heard programmers (especially in the context of exceptions) try to define it, I've found it imprecise and open to debate.

When invariant violations or mistakes by programmers (aka bugs) are detected, the program should halt as it is in an inconsistent state and continuing could be very dangerous (think privacy/security/data corruption). Otherwise, don't halt (handle it or have the caller handle it).

jcranmer3y ago

The criteria I tend to prefer is "expected" versus "unexpected" errors. I/O errors, especially network errors, are things that are going to be expected under reasonable operation, therefore it make sense that code should handle them. Similarly, user input resulting in incorrectly formatted code should be reasonably expected and therefore handled.

But the same kinds of failures might not be reasonably expected in other circumstances--I wouldn't expect that the internal configuration files of an application should occur in reasonable operation, and therefore it makes sense to panic if they're corrupted... even if the cause is an I/O operation on a local disk, or parsing some JSON or TOML or INI or whatnot file.

One implication of this is that it needs to be easy for any error system to promote an "expected error" into an "unexpected error"--which is what unwrap/expect does. The recoverable/unrecoverable error suggests that there ought to be no reason to do this, but there is absolutely a reason to do so: what category an error falls into is ultimately decided by the context of the error, not the generation of the error itself.

vbezhenar3y ago

I write Java and for me it's not exactly "expected" versus "unexpected".

There are unexpected errors for sure. For example "StackOverflowError" which could be thrown from any method call.

There should be unexpected errors handler which does some sane thing in given circumstances. Usually it involves logging error and its details (stacktrace, may be something else) and returning some kind of generic error to the caller (e.g. HTTP 500).

But the thing is, this handler very often suitable for handling errors that I expect but I don't want to bother handling them. For example those errors might be rare enough that writing code would be a net negative (every line of code is maintenance burden and error handling code is maintenance burden power 2). And I'm totally OK with those errors being handled in generic way.

Even if it's user input sometimes. For example user could be me. And I know input format. I don't want to write error handling for myself, all I need is to prevent data corruption. Getting HTTP 500 InvalidNumberFormatException is totally fine in some situations.

And language should provide means for writing that kind of code. At least that's my opinion and that's what I truly miss in those languages with explicit error handling of every function call.

You might call it lazy coding. I call it reasonable coding.

Exceptions for the win!

1 more reply

saghm3y ago

> The criteria I tend to prefer is "expected" versus "unexpected" errors

The way I've often heard this phrased is "exceptions are for exceptional behavior", and it's always rubbed me the wrong way a bit (although maybe this is partially just because I don't think wordplay is a sufficient argument to do something; I've made similar arguments in the past to friends who sung the praises of "no-shave November" and "thirsty Thursdays"). From digging a bit deeper when I've heard this opinion espoused, it seems like it mostly boils down to the fact that exceptions tend not to be as efficient as happy-path code, so using them for circumstances that are too common is not going to lead to good performance. I guess I don't really find this a subtle enough concept to warrant needing to introduce another abstraction layer into the discussion, especially one that's much vaguer like "is this behavior expected?" If there's a performance concern, I think it's much better addressed directly rather than shifting the discussion to a proxy.

1 more reply

Someone3y ago

> I/O errors, especially network errors, are things that are going to be expected under reasonable operation, therefore it make sense that code should handle them

It makes some sense, but often, code should do relatively little and process should do most of the handling. Recovering from I/O errors and properly testing the recovery code can take huge amounts of time and effort.

Often, aborting the program and rerunning it or even restarting a service is the best way to handle these because properly handling them in code costs time better spent on other things. Logging a decent error message and aborting may be the better choice.

But of course, that depends on the use case. Databases need lots of recovery code for I/O errors and deadlock recovery, for example, even for cases that occur maybe once every year.

(And yes, process nowadays is often automated, making it code again, but IMO, that’s a different kind of code)

taeric3y ago

I'm a fan of, "is this going to a user, or to another part of the calling program?" If expecting it to go to a user, exception make sense. Need to bubble it all the way up to the part of the system that is closest to the user.

1 more reply

merb3y ago

Network errors might be retryable/routable differently, but often (especially when starting out) should probably returned to the User. I mean if s3 is down you can retry the call but often it is down then

2 more replies

preseinger3y ago

> I wouldn't expect that the internal configuration files of an application should occur in reasonable operation, and therefore it makes sense to panic if they're corrupted.

Whether an error returned from a fn is expected or unexpected has to be a property of the fn signature and language conventions in isolation. If semantic or other locally-unknowable details influence fault classification and error control flow at call sites, your program program becomes unmaintainable over time.

So like if you have fn parse_config that takes a string, or a file descriptor, or whatever -- it should return a Result<Config, Error> and yield an error for any un-parseable input. There is no reason this fn should ever panic.

IMO -- call stacks are sacrosanct, making control flow visible and obvious is one of if not the most important thing to optimize in nontrivial programming contexts.

throwawaymaths3y ago

It's kind of the opposite. If you have a network error, often you shouldn't bother to try handling it, the best strategy is often to zero out your state, give up and try again later.

By contrast, you should give up parsing a JSON if it's a config file read on startup but probably not if it's user input.

alerighi3y ago

> When invariant violations or mistakes by programmers (aka bugs) are detected, the program should halt as it is in an inconsistent state and continuing could be very dangerous (think privacy/security/data corruption). Otherwise, don't halt (handle it or have the caller handle it).

Well it's not always the case. There are situations in which if you detect errors you want the program to continue running, and have only that particular functionality to fail.

I tend to write resilient code, since I work in embedded systems and what you never want is the system to crash. Halting a CPU on an invariant violation (i.e. and assert failing) is something useful for debugging (you trigger the debugger and you then analyze why it happened), but something you generally don't want in production.

Bette to have a ton of checks more and in case of an invariant violation (that maybe is resulting from a programmer mistake, but there is always the possibility of hardware memory corruption errors) to return an error and handle it in some ways (for example restart the task that returned the error, trying to go back to the last working state).

burntsushi3y ago

> There are situations in which if you detect errors you want the program to continue running, and have only that particular functionality to fail.

Yes, like a web server. If a request handler fails by panicking, in a Rust program, you catch the panic, respond with a 500 error and log the panic somewhere. But you continue serving other requests.

I talked about this in the blog post.

The problem with your strategy is that it requires you to be aware of your own mistakes. That doesn't sound like a robust strategy, unless you're investing huge resources into sophisticated tooling and have drastically restricted the expressivity of your programming environment. That exists and is fine, and I even addressed that in the blog post too.

2 more replies

mook3y ago

> I tend to write resilient code, since I work in embedded systems and what you never want is the system to crash.

That might be okay too depending on what your system is. I've had a cell phone (the monochrome dumb kind) display an assertion error at me, that was kind of cool. A screw was coming loose, so there were hardware issues. Worst case, I'd have to bring it to the store and get it repaired or replaced; there was no threat of injury or large monetary damage.

stormbrew3y ago

To me the real issue is this is an extremely forced binary and there's really at least three meaningful categories (especially in software with a UI of any sort):

- unactionable invariant violation (poisoned mutex, hard memory errors): crash immediately, something that should ever happen happened and there's no way to either handle or present the error to the user in a meaningful way.

- unactionable (at the call site) but normal errors (couldn't open a file, disconnected from the remote end of a connection, etc): these need to be propagated up to where they can be turned into actionable information for a user, ideally. This is rarely a thing the call site where it happened can usefully do.

- immediately actionable and normal errors (user input didn't validate, file user wanted to open doesn't exist, connection failed but can be retried with a backoff, etc). These need to be handled at the call site or maybe one or two levels up.

You need an exception-like mechanism (or at least a process for emulating one, a la go MRV or C errno) to handle the second case, you often want it for the third case, but it never really makes sense to use it for the first.

That said, I think in non-test rust code you should use expect instead of unwrap, because sometimes invariants do trip and that little tiny extra bit of info can make a huge difference to resolving it.

hinkley3y ago

Recoverable vs unrecoverable comes down to requirements. Certain companies known for having software that 'just works' tend to have both very few unrecoverable errors and very conservative feature sets to help facilitate that short list.

It is very clearly a choice, even if many people are deciding by default. By not tackling an issue, you've chosen to have that issue.

dcsommer3y ago

I think we have compatible views. Each layer of the software must decide it's requirements and handle errors appropriately per requirements. You're right I didn't articulate when to handle an issue locally vs. pass it up. I think that's where requirements (and also explicit API guarantees) come into play.

I do think that APIs that "overpromise" by not returning the errors they do not handle to the caller, and instead halt or throw an exception, do their users a disservice in the long-run. These just become undocumented cases that bite you later on. Better libraries have all these conditions baked into the API itself.

2 more replies

simion3143y ago

Don't exception also halt your program if you ignore them?

Also if using a library I don't want a bug in it to bring my program down, then I am forced to use workarounds like create a child process to use the library, start the child process from the main process and check on it to see if it fails or succeeds, that would be bad for performance and ugly.

arcticbull3y ago

Yep, there's not really any such thing (IME) as a 'recoverable' error - except with respect to I/O.

There's either I/O errors - or there's logic errors. A failure with logic should nuke due to the app being in an inconsistent state; trust is lost. An I/O error should fail softly.

jcelerier3y ago

> There's either I/O errors - or there's logic errors. A failure with logic should nuke due to the app being in an inconsistent state; trust is lost.

nah. in GUI apps for instance you want the failure in the logic of a sub-sub-function to just tell the error "wops" when the button that triggered the action was clicked, not nuke the app (unless you hate your users). e.g. imagine a 3D software which allows to do mesh operations - user clicks on the "Smooth the mesh" button somewhere. Programmer forgot to handle a division by zero in some degenerate case of the smoothing computation which ends up leading to an exception: a value becomes zero, someone used unsigned integers for n in an "n - 1" computation which ends up in a call to array_of_floats.resize(0xffffffffffffffff) (and a likely std::bad_alloc being thrown if you're in c++).

The original mesh is unchanged as the operation waits until the computation is complete to replace the old mesh with the new.

If you ever decide to crash in this situation I am sure you will have great reviews on 3D modeling software comparisons.

3 more replies

ithkuil3y ago

A parser library can return an error when the input is wrong. It doesn't necessarily mean the app is in an inconsistent state when that happens; it all depends on what the application does. It follows that aborting is often a sensible decision in application code and rarely in library code.

1 more reply

fatherzine3y ago

Partially true. In practice people implement multiplexed servers, for many reasons, including performance / throughput. A logic failure should nuke _the offending request_, not the entire server with all the unrelated concurrent requests.

1 more reply

cedws3y ago· 15 in thread

The problem is that Rust developers don't just use unwrap() when it should panic. I've seen plenty of "production grade" crates basically just unwrap because the author didn't know how to handle it gracefully or just wanted to get the code compiling, then forgot about it.

burntsushi3y ago

That is a problem. Another similar problem is that programmers write buggy code. I don't see anything particularly special about 'unwrap()'. 'unwrap()' is a common manifestation of it. But as others have pointed out to you, this same phenomenon manifests everywhere and in other programming languages, but through different means.

carlmr3y ago

This is why I write in a 2-pass manner quite often. In the first pass I use unwrap a lot to get the functionality going. In the second pass I search for all the unwrap and expect and take a moment to think about in what cases these may happen.

This works for me really well to get a prototype going, but then have a solid program later. Because the unwrap/expect is so easy to search for, that you can really postpone the error handling until later.

dureuill3y ago

not sure why you're getting downvoted. maybe the scope of what you're saying ('plenty of "production grade" crates unwrap [when they shouldn't]') calls for a reference? Or maybe because you're singling out Rust developers while the issue is certainly observed in all languages with similar mechanisms (see unchecked exceptions abuse in java, or aborting asserts abuse in C)

Anyway i wouldn't say "plenty", but i did came across crates (parsers :/) that would unwrap on malformed input. the workaround is to encapsulate their use in a catch_unwind.

for the record, i had a similar issue in a c++ lib where the author elected to abort on the unsupported input, so i'm somewhat thankful that the idiomatic mechanism is panic (which is recoverable if needs be) in Rust

deathanatos3y ago

Guilty as charged, and honestly, I wish I had a good mechanism/function call to separate "I'm just being lazy here, this is a TODO" from "actually, this should never happen.

  .map_err(|_| unimplemented!()).unwrap()

or something. (I hope you get the gist of that, as I'm 99% sure that code doesn't compile. Error(E) -> !, Ok(T) -> T)

dureuill3y ago

You could go:

    result.unwrap_or_else(|| todo !())

(unimplemented! is for missing functionality in a given version, todo! is for missing functionality during development)

But using unwrap for these kinds of TODO is ok, code review won't let unjustified unwraps to pass

burntsushi3y ago

Just write 'expect("FIXME")' or similar. :-)

1 more reply

librexpr3y ago

Maybe it would have been a good idea to have two names for unwrap, one which would mean "I'm certain that this value will always be okay", and another which would mean "I'm taking a shortcut because I'm writing a script or just want it to compile for now". Maybe a longer name like "assert_valid()" for the one where you're sure it's okay. That might make it easier to find the places where shortcuts were taken and forgotten.

burntsushi3y ago

Consider using anyhow. At that point, unwrap isn't a shortcut anymore except for one place: you are in a deeply nested function call and you suddenly realize you need to bubble up an error a few layers. Now you have to change a bunch of function signatures.

I don't find myself in that position too frequently. Certainly not enough to warrant two identical but differently named functions.

In that case, I would suggest coming up with your own pattern. Perhaps an unwrap() with a FIXME comment. Or a expect("FIXME").

1 more reply

r3un13y ago

There is! `expect()`

1 more reply

jeffrallen3y ago

Maybe if Rust code wasn't so hard to get working at all, then more of it would work correctly (including checking errors instead of giving up and unwrapping).

1 more reply

speed_spread3y ago

This is why .unwrap() should have been named .or_panic(). The current wording makes it sound innocuous, even if you know what it does.

dymk3y ago

I'm confused, you seem to be saying opposite things. You're saying authors don't use unwrap when it should panic, and in the next sentence, you say they use unwrap (which causes a panic) when the failure could be gracefully recovered.

c7DJTLrn3y ago

"don't just"

2 more replies

bern44443y ago

When writing a library, wouldn't the prudent thing to do be to return a Try/Either or have the method accept a function to invoke in case of an error (like an IOC solution)?

ironmagma3y ago

But luckily when that time does come to refactor the code to handle the error case, the Rust compiler does a lot to help you make sure it’s handled properly.

ComputerGuru3y ago· 5 in thread

One runtime panic I wish was a compile-time error in the rust standard library is the use of an incorrect memory order, eg Ordering::Release with AtomicBool::load().

It would have been fairly trivial to set up generic constraints specifying if a read, write, or read-write ordering semantic is expected and to fail to compile if it wasn’t met.

nynx3y ago

This is going to happen at some point now that const generics arrived.

ComputerGuru3y ago

Likely only once we finally get const parameters specified as regular arguments.

It doesn’t need const at all, though. You just need three traits and either first class enum variants as types or a pseudo enum (mod/struct Ordering with ZST structs Acquire, Release, etc)

pitaj3y ago

This could be a good application of a Clippy lint.

oconnor6633y ago

I think the reason it's not a compile-time error is that it's actually possible to select your ordering at runtime, like this:

    use std::sync::atomic::*;
    let x = AtomicU64::new(0);
    let ordering = if rand::random() {
        Ordering::Relaxed
    } else {
        Ordering::SeqCst
    };
    x.fetch_add(1, ordering);

ComputerGuru3y ago

Yes, I tried (a few months ago) mocking a PR for this that refactored the enum variants into ZST structs and that was the only sticking point for backwards compatibility.

richardwhiuk3y ago· 5 in thread

I think the `expect()` bad examples are something of a strawman.

The `.expect()` for regex for example would say what the regex is matching for.

I think it'd be desirable to have a `.unwrap_with_context("Context: {}")`, and the you'd get `Context: Inner Panic Info`.

burntsushi3y ago

So you're saying that the 'expect()' message when a regex compilation error occurs should be a translation from a terse domain specific language to bloviating prose? :-)

What 'expect()' message would you write for this regex? https://github.com/BurntSushi/ucd-generate/blob/6d3aae3b8005...

I think 'unwrap()' there is perfectly appropriate.

> I think it'd be desirable to have a `.unwrap_with_context("Context: {}")`, and the you'd get `Context: Inner Panic Info`.

Why?

richardwhiuk3y ago

.expect("UnicodeData::from_str regex failed to compile")

With this, even with out backtrace, you can work out what happened.

Without it, you just know that some regex somewhere is invalid.

1 more reply

richardwhiuk3y ago

Just to be clear, I agree with everything else you've said, and you've produced some awesome Rust libraries.

1 more reply

xvedejas3y ago

When you unwrap/expect on an error, the inner error value is already printed. Are you suggesting something more? Reference the implementation details from the article:

    impl<T, E: std::fmt::Debug> Result<T, E> {
      pub fn unwrap(self) -> T {
        match self {
          Ok(t) => t,
          Err(e) => panic!("called `Result::unwrap()` on an `Err` value: {:?}", e),
        }
      }
    }

vincenv3y ago

You can get that with anyhow [1] by calling unwrap after `.with_context`.

[1]: https://docs.rs/anyhow/latest/anyhow/

koala_man3y ago· 5 in thread

tl;dr: Author convincingly argues that Rust `unwrap()` (Java `Optional.get`, Haskell `fromJust`) is fine when you either have checked that the call will not fail, or when you're in a unit test or similar where a panic is a helpful result.

(And, conversely, that it's not fine to use it to avoid doing real error handling)

drexlspivey3y ago

Is this supposed to be controversial? Even rust docs say so https://doc.rust-lang.org/book/ch09-03-to-panic-or-not-to-pa...

burntsushi3y ago

Nope. But people are thoroughly confused by it. Comes up all of the time. And a lot of people have more extremist positions. Banning unwrap. Or banning any panicking branches at all. That's why my blog post covers an example where we convert a function that never panics (sensible) to a function whose signature says it might return an error, but it actually will never return an error (bonkers).

People advocate for this. After publishing this article, it almost seems like people are more confused than I thought. Idk.

Anyway, no, this blog is not meant to be controversial. It is meant to untangle knots.

1 more reply

OJFord3y ago

Buried in your 'or similar' is the more controversial and helpful (IMO) use - the infamous 'this should never happen' exceptions.

burntsushi3y ago

No, that isn't buried in the "or similar" of the GP. They mention that explicitly with "checked that the call will not fail." The "or similar" refers to documentation examples and prototyping/one-off scripts.

marcosdumay3y ago

Well, if you want a definitive answer, it's "it depends". Those errors are not all alike.

cmrdporcupine3y ago· 3 in thread

The article has nuance. It's good.

But the language has made it too easy to unwrap/expect and panic.

This escape hatch should exist, for sure. But it ought to be more explicit, and more of a pain.

It is far too easy to reach for this tool.

burntsushi3y ago

Your focus on unwrap/expect seems arbitrary. Why aren't you commenting on the ease with which 'slice[i]' and 'x * y' fail? Or alloc failure? Is slice index syntax not also too easy to reach for?

I ask because I suspect you've got a bit of motte and bailey going on here. The motte is "hey let's make unwrap/expect more verbose because we want people to be REALLY sure," but the bailey is "let's actually make everything that can panic a lot more verbose and totally change the character of the language and make it a lot less practical."

I'd encourage you to read the "lint" section near the end: https://blog.burntsushi.net/unwrap/#should-we-lint-against-u...

cmrdporcupine3y ago

Your response is kind of harsh, and you're making a bit of an unsympathetic straw man out of what I said.

IMHO there's a qualitative difference in the programmer's expectations when indexing a slice vs calling a function which has been explicitly written to return an error condition. Years of convention causes us to expect indexing errors and to write defensively. But the implied contract of a Rust function with a Result is that the user should do probably do something with the result other than panic in most cases.

I agree that panicing is a legit option. And I agree with the scenarios laid out in the article. And I also don't think lint is the right way to handle it.

But I'm currently in a codebase that is full of unwraps all over -- which the developers did for expediency "get this thing shipped" reasons -- and that (and other codebases I've seen) is what leads me to the conclusion that the ergonomics of putting unwrap right out there in our faces aren't ideal.

Hell, even calling it "result_or_panic" would have perhaps made casual users of it pause and think about what they were doing. There are likely syntactical tools that could have been put in place to really make the user think before creating a panic.

(FWIW safety isn't my primary reason for preferring Rust. I'd be fine with "C++ with a ML-style type system." The general tamping down of footguns is great, though)

1 more reply

lmm3y ago

> Is slice index syntax not also too easy to reach for?

It absolutely is. Modern language design should be discouraging getting an element by index; there are usually better alternatives e.g. iterating through a datastructure, or using combinators like zip to build the datastructure/view you need.

2 more replies

sitkack3y ago· 2 in thread

One can set the env var themselves in main(), many errors are transient, best to capture it the first time.

    // use std::env;
    env::set_var("RUST_BACKTRACE", "full");

burntsushi3y ago

Hah. Ironically, set_var might be deprecated at some point and replaced with an unsafe alternative. (Long story. Short story is that it's currently unsound. If you have C code trying to read from the environment at the same time you end up with UB. If everything is Rust code though, then I believe you're fine.)

sitkack3y ago

I was trying to find a "turn on backtrace" as an official api but couldn't find one in 90s of looking. I see the backtrace crate for catching them in user space in process, which is nice.

What would you recommend? Forking and execing yourself to set the env var? backtrace::enable_full() or something to that effect would be nice.

1 more reply

dont_panic_3y ago· 2 in thread

my problem with panic is that it is like walking in a mine field, you don't know which function will blow up at any given time, and it's not checked by the compiler.

If there was some sort of signal to mark a function as panicking (& vice versa), that would be nice.

burntsushi3y ago

This is like saying, "my problem with bugs is that they're like walking in a mine field, you don't know which function might have a bug in it or not."

pornel3y ago

Not really. It's possible to verify that a call graph can't call panic anywhere, and there exist 3rd party hacky solutions for this already. Rust is just lacking first-class features for this.

1 more reply

ArrayBoundCheck3y ago· 2 in thread

Does this mean using C inside of rust is ok? I'm pretty sure the original team kept hitting memory problems and it was in fact not ok. Browsers crash often enough that I don't want unwrap making it worse

TheDong3y ago

> Does this mean using C inside of rust is ok? I'm pretty sure the original team kept hitting memory problems and it was in fact not ok

This reads a little like flamebait.

Memory unsafety is bad. panic is _not_ a memory unsafe operation, it does not result in memory problems.

This post is not related to memory safety in any way.

ArrayBoundCheck3y ago

The third sentence was the point. I don't want browsers being less reliable and terminating randomly due to unwraps

2 more replies

jeffrallen3y ago· 1 in thread

When you've gotten to the point of saying, "well, it's ok to panic in example code" you've already lost the game. Novice programmers learn from example code, and novice programmers are an order of magnitude more common than experienced ones. A programming ecosystem that depends on 9 out of every 10 people being able to intuit that which the other 1 understands is not an ecosystem that's going to produce good code.

Rust is a minefield of bear traps laid by experts, and I fear for the future of our industry if Go and Java programmers are required by some quirk of network effects or first mover advantage or whatever to starting programming (badly) in Rust.

burntsushi3y ago

Did you read the blog? I addressed all of that.

schubart3y ago· 1 in thread

> When checking preconditions, make sure the panic message relates to the documented precondition, perhaps by adding a custom message. For example,

> `assert!(!xs.is_empty(), "expected parameter 'xs' to be non-empty")`.

This panics with

> thread 'main' panicked at 'expected parameter 'xs' to be non-empty', src/main.rs:79:5

Without the custom message it's

> thread 'main' panicked at 'assertion failed: !xs.is_empty()', src/main.rs:79:5

Given that panics should be for bugs, i.e. interpreted by developers, I'd say the second message is clear enough and a custom message just adds noise in the source code.

burntsushi3y ago

I don't disagree. Definitely depends on how opaque the assertion test is. In this case, yeah, probably don't need a message. Pithy illustrative examples are hard.

yuan433y ago· 1 in thread

The author notes that API simplicity might be a reason to avoid pushing invariants to compile time:

> What do I mean by “API simplicity?” Well, this panic could be removed by moving this runtime invariant to a compile time invariant. Namely, the API could provide, for example, an AhoCorasickOverlapping type, and the overlapping search routines would be defined only on that type and not on AhoCorasick. Therefore, users of the crate could never call an overlapping search routine on an improperly configured automaton. The compiler simply wouldn’t allow it.

> But this adds a lot of additional surface area to the API. And it does it in really pernicious ways. For example, an AhoCorasickOverlapping type would still want to have normal non-overlapping search routines, just like AhoCorasick does. It’s now reasonable to want to be able to write routines that accept any kind of Aho-Corasick automaton and run a non-overlapping search. In that case, either the aho-corasick crate or the programmer using the crate needs to define some kind of generic abstraction to enable that. Or, more likely, perhaps copy some code.

> I thus made a judgment that having one type that can do everything—but might fail loudly for certain methods under certain configurations—would be best. The API design of aho-corasick isn’t going to result in subtle logic errors that silently produce incorrect results. If a mistake is made, then the caller is still going to get a panic with a clear message. At that point, the fix will be easy.

What I gather from this is that the author chose to define a type (call it A) with an attribute that when set in a certain way will cause certain functions to panic. This was preferred to the alternative (two types, A and B) with functions specific to each and where panic was not possible.

This kind of design decision comes up a lot, so understanding the reasoning here could be helpful in a lot of situations. Unfortunately, the passage is less than clear due to lack of source code inline and the highly-specific nature of the problem. An example with source code using more accessible algorithms might be an improvement here.

That said, I'm skeptical that the full range of approach was considered. I sometimes find that the presence of unwrap is a smell pointing to types that have not been fully fleshed out.

As an extreme case, consider a struct whose fields contained diverse data (numbers, colors, enumerated values), but which are all defined as strings. It will be very easy to put this struct into an inconsistent runtime state because nothing can be checked at compile time. The type itself is anemic. Replacing strings with more constrained types eliminates opportunities for panic - possibly all of them.

I get that the whole point is "at what cost?" All I'm saying is that the tradeoffs aren't clear from the example in the passage.

burntsushi3y ago

Well, what isn't clear? What do you want to know? I'll try again.

An Aho-Corasick automaton can be built in a few different ways. How it's built changes how matches are reported: https://docs.rs/aho-corasick/latest/aho_corasick/enum.MatchK...

It also turns out that some match kinds are more amenable to other types of searches, such as overlapping searches. Overlapping searches report every possible match, but leftmost "match kinds" specifically prune certain matches from the automaton. An overlapping search with a "leftmost" match kind produces weird results that are difficult to characterize.

So, when an automaton is configured with a "leftmost" match kind, you have a choice: allow overlapping searches or disallow them. I chose to disallow them. Once you make that choice, you then must choose whether to disallow them at compile time or disallow them at runtime. I chose runtime, for the reasons stated.

If I chose compile time, then I'd need a new `AhoCorasickOverlapping` type which provides the overlapping search routines in addition to the non-overlapping search routines. Then I could get rid of the overlapping search routines on `AhoCorasick`. I'd then also need to add a new build method[1] to the `AhoCorasickBuilder` that let you build an overlapping automaton.

> That said, I'm skeptical that the full range of approach was considered. I sometimes find that the presence of unwrap is a smell pointing to types that have not been fully fleshed out.

I am a fallible human. I might be wrong. So sure, be skeptical!

> As an extreme case, consider a struct whose fields contained diverse data (numbers, colors, enumerated values), but which are all defined as strings. It will be very easy to put this struct into an inconsistent runtime state because nothing can be checked at compile time. The type itself is anemic. Replacing strings with more constrained types eliminates opportunities for panic - possibly all of them.

I'm not sure what this is a case of? Like yeah, I agree, that sounds bad?

> I get that the whole point is "at what cost?" All I'm saying is that the tradeoffs aren't clear from the example in the passage.

Understood. Small illustrative examples are hard. Especially API design. API design is a very domain specific thing. I could probably write an entire blog post on the API design of just the aho-corasick crate. It has gone through many iterations and many lessons have been learned. (And I still have at least one more iteration to go.) I tried to distill down one small part of it in order to talk about the idea of not pursuing literally every possible compile time restriction because sometimes keeping invariants maintained at runtime leads to a simpler API. If you accept that principle already, then all is well.

But some people think the entire farm should be bet on pushing every possible thing to a compile time invariant, regardless of the cost. I am not one of those people and I think it leads to bad API design. And it's very relevant to this topic because if you don't push something to compile time, then, well, you probably need a panicking branch somewhere.

[1]: https://docs.rs/aho-corasick/latest/aho_corasick/struct.AhoC...

nnoitra3y ago· 1 in thread

This blog post will age out in 6 months.

burntsushi3y ago

Why? It's basically an elaboration of the same advice I gave eight years ago. And I was not the first.

ww5203y ago

I generally just use expect(), assert!, or unreachable! rather than simply unwrap() to document the unexpected invalid state.

renewiltord3y ago

Very neat. Just discovered anyhow and the context stuff from this post.

berryton3y ago

The Rust book has a small discussion about his in chapter 9.3 (https://doc.rust-lang.org/book/ch09-03-to-panic-or-not-to-pa...).

I think it is good to have some references (the blog post and the book) for when the horde comes after you.

j / k navigate · click thread line to collapse

197 comments

78 comments · 16 top-level

dcsommer3y ago· 19 in thread

jcranmer3y ago

vbezhenar3y ago

I write Java and for me it's not exactly "expected" versus "unexpected".

There are unexpected errors for sure. For example "StackOverflowError" which could be thrown from any method call.

And language should provide means for writing that kind of code. At least that's my opinion and that's what I truly miss in those languages with explicit error handling of every function call.

You might call it lazy coding. I call it reasonable coding.

Exceptions for the win!

1 more reply

saghm3y ago

> The criteria I tend to prefer is "expected" versus "unexpected" errors

1 more reply

Someone3y ago

> I/O errors, especially network errors, are things that are going to be expected under reasonable operation, therefore it make sense that code should handle them

But of course, that depends on the use case. Databases need lots of recovery code for I/O errors and deadlock recovery, for example, even for cases that occur maybe once every year.

(And yes, process nowadays is often automated, making it code again, but IMO, that’s a different kind of code)

taeric3y ago

1 more reply

merb3y ago

2 more replies

preseinger3y ago

> I wouldn't expect that the internal configuration files of an application should occur in reasonable operation, and therefore it makes sense to panic if they're corrupted.

IMO -- call stacks are sacrosanct, making control flow visible and obvious is one of if not the most important thing to optimize in nontrivial programming contexts.

throwawaymaths3y ago

It's kind of the opposite. If you have a network error, often you shouldn't bother to try handling it, the best strategy is often to zero out your state, give up and try again later.

By contrast, you should give up parsing a JSON if it's a config file read on startup but probably not if it's user input.

alerighi3y ago

Well it's not always the case. There are situations in which if you detect errors you want the program to continue running, and have only that particular functionality to fail.

burntsushi3y ago

> There are situations in which if you detect errors you want the program to continue running, and have only that particular functionality to fail.

Yes, like a web server. If a request handler fails by panicking, in a Rust program, you catch the panic, respond with a 500 error and log the panic somewhere. But you continue serving other requests.

I talked about this in the blog post.

2 more replies

mook3y ago

> I tend to write resilient code, since I work in embedded systems and what you never want is the system to crash.

stormbrew3y ago

To me the real issue is this is an extremely forced binary and there's really at least three meaningful categories (especially in software with a UI of any sort):

hinkley3y ago

It is very clearly a choice, even if many people are deciding by default. By not tackling an issue, you've chosen to have that issue.

dcsommer3y ago

2 more replies

simion3143y ago

Don't exception also halt your program if you ignore them?

arcticbull3y ago

Yep, there's not really any such thing (IME) as a 'recoverable' error - except with respect to I/O.

There's either I/O errors - or there's logic errors. A failure with logic should nuke due to the app being in an inconsistent state; trust is lost. An I/O error should fail softly.

jcelerier3y ago

> There's either I/O errors - or there's logic errors. A failure with logic should nuke due to the app being in an inconsistent state; trust is lost.

The original mesh is unchanged as the operation waits until the computation is complete to replace the old mesh with the new.

If you ever decide to crash in this situation I am sure you will have great reviews on 3D modeling software comparisons.

3 more replies

ithkuil3y ago

1 more reply

fatherzine3y ago

1 more reply

cedws3y ago· 15 in thread

burntsushi3y ago

carlmr3y ago

dureuill3y ago

Anyway i wouldn't say "plenty", but i did came across crates (parsers :/) that would unwrap on malformed input. the workaround is to encapsulate their use in a catch_unwind.

deathanatos3y ago

Guilty as charged, and honestly, I wish I had a good mechanism/function call to separate "I'm just being lazy here, this is a TODO" from "actually, this should never happen.

  .map_err(|_| unimplemented!()).unwrap()

or something. (I hope you get the gist of that, as I'm 99% sure that code doesn't compile. Error(E) -> !, Ok(T) -> T)

dureuill3y ago

You could go:

    result.unwrap_or_else(|| todo !())

(unimplemented! is for missing functionality in a given version, todo! is for missing functionality during development)

But using unwrap for these kinds of TODO is ok, code review won't let unjustified unwraps to pass

burntsushi3y ago

Just write 'expect("FIXME")' or similar. :-)

1 more reply

librexpr3y ago

burntsushi3y ago

I don't find myself in that position too frequently. Certainly not enough to warrant two identical but differently named functions.

In that case, I would suggest coming up with your own pattern. Perhaps an unwrap() with a FIXME comment. Or a expect("FIXME").

1 more reply

r3un13y ago

There is! `expect()`

1 more reply

jeffrallen3y ago

Maybe if Rust code wasn't so hard to get working at all, then more of it would work correctly (including checking errors instead of giving up and unwrapping).

1 more reply

speed_spread3y ago

This is why .unwrap() should have been named .or_panic(). The current wording makes it sound innocuous, even if you know what it does.

dymk3y ago

c7DJTLrn3y ago

"don't just"

2 more replies

bern44443y ago

When writing a library, wouldn't the prudent thing to do be to return a Try/Either or have the method accept a function to invoke in case of an error (like an IOC solution)?

ironmagma3y ago

But luckily when that time does come to refactor the code to handle the error case, the Rust compiler does a lot to help you make sure it’s handled properly.

ComputerGuru3y ago· 5 in thread

One runtime panic I wish was a compile-time error in the rust standard library is the use of an incorrect memory order, eg Ordering::Release with AtomicBool::load().

It would have been fairly trivial to set up generic constraints specifying if a read, write, or read-write ordering semantic is expected and to fail to compile if it wasn’t met.

nynx3y ago

This is going to happen at some point now that const generics arrived.

ComputerGuru3y ago

Likely only once we finally get const parameters specified as regular arguments.

It doesn’t need const at all, though. You just need three traits and either first class enum variants as types or a pseudo enum (mod/struct Ordering with ZST structs Acquire, Release, etc)

pitaj3y ago

This could be a good application of a Clippy lint.

oconnor6633y ago

I think the reason it's not a compile-time error is that it's actually possible to select your ordering at runtime, like this:

    use std::sync::atomic::*;
    let x = AtomicU64::new(0);
    let ordering = if rand::random() {
        Ordering::Relaxed
    } else {
        Ordering::SeqCst
    };
    x.fetch_add(1, ordering);

ComputerGuru3y ago

Yes, I tried (a few months ago) mocking a PR for this that refactored the enum variants into ZST structs and that was the only sticking point for backwards compatibility.

richardwhiuk3y ago· 5 in thread

I think the `expect()` bad examples are something of a strawman.

The `.expect()` for regex for example would say what the regex is matching for.

I think it'd be desirable to have a `.unwrap_with_context("Context: {}")`, and the you'd get `Context: Inner Panic Info`.

burntsushi3y ago

So you're saying that the 'expect()' message when a regex compilation error occurs should be a translation from a terse domain specific language to bloviating prose? :-)

What 'expect()' message would you write for this regex? https://github.com/BurntSushi/ucd-generate/blob/6d3aae3b8005...

I think 'unwrap()' there is perfectly appropriate.

> I think it'd be desirable to have a `.unwrap_with_context("Context: {}")`, and the you'd get `Context: Inner Panic Info`.

Why?

richardwhiuk3y ago

.expect("UnicodeData::from_str regex failed to compile")

With this, even with out backtrace, you can work out what happened.

Without it, you just know that some regex somewhere is invalid.

1 more reply

richardwhiuk3y ago

Just to be clear, I agree with everything else you've said, and you've produced some awesome Rust libraries.

1 more reply

xvedejas3y ago

When you unwrap/expect on an error, the inner error value is already printed. Are you suggesting something more? Reference the implementation details from the article:

    impl<T, E: std::fmt::Debug> Result<T, E> {
      pub fn unwrap(self) -> T {
        match self {
          Ok(t) => t,
          Err(e) => panic!("called `Result::unwrap()` on an `Err` value: {:?}", e),
        }
      }
    }

vincenv3y ago

You can get that with anyhow [1] by calling unwrap after `.with_context`.

[1]: https://docs.rs/anyhow/latest/anyhow/

koala_man3y ago· 5 in thread

(And, conversely, that it's not fine to use it to avoid doing real error handling)

drexlspivey3y ago

Is this supposed to be controversial? Even rust docs say so https://doc.rust-lang.org/book/ch09-03-to-panic-or-not-to-pa...

burntsushi3y ago

People advocate for this. After publishing this article, it almost seems like people are more confused than I thought. Idk.

Anyway, no, this blog is not meant to be controversial. It is meant to untangle knots.

1 more reply

OJFord3y ago

Buried in your 'or similar' is the more controversial and helpful (IMO) use - the infamous 'this should never happen' exceptions.

burntsushi3y ago

marcosdumay3y ago

Well, if you want a definitive answer, it's "it depends". Those errors are not all alike.

cmrdporcupine3y ago· 3 in thread

The article has nuance. It's good.

But the language has made it too easy to unwrap/expect and panic.

This escape hatch should exist, for sure. But it ought to be more explicit, and more of a pain.

It is far too easy to reach for this tool.

burntsushi3y ago

Your focus on unwrap/expect seems arbitrary. Why aren't you commenting on the ease with which 'slice[i]' and 'x * y' fail? Or alloc failure? Is slice index syntax not also too easy to reach for?

I'd encourage you to read the "lint" section near the end: https://blog.burntsushi.net/unwrap/#should-we-lint-against-u...

cmrdporcupine3y ago

Your response is kind of harsh, and you're making a bit of an unsympathetic straw man out of what I said.

I agree that panicing is a legit option. And I agree with the scenarios laid out in the article. And I also don't think lint is the right way to handle it.

(FWIW safety isn't my primary reason for preferring Rust. I'd be fine with "C++ with a ML-style type system." The general tamping down of footguns is great, though)

1 more reply

lmm3y ago

> Is slice index syntax not also too easy to reach for?

2 more replies

sitkack3y ago· 2 in thread

One can set the env var themselves in main(), many errors are transient, best to capture it the first time.

    // use std::env;
    env::set_var("RUST_BACKTRACE", "full");

burntsushi3y ago

sitkack3y ago

I was trying to find a "turn on backtrace" as an official api but couldn't find one in 90s of looking. I see the backtrace crate for catching them in user space in process, which is nice.

What would you recommend? Forking and execing yourself to set the env var? backtrace::enable_full() or something to that effect would be nice.

1 more reply

dont_panic_3y ago· 2 in thread

my problem with panic is that it is like walking in a mine field, you don't know which function will blow up at any given time, and it's not checked by the compiler.

If there was some sort of signal to mark a function as panicking (& vice versa), that would be nice.

burntsushi3y ago

This is like saying, "my problem with bugs is that they're like walking in a mine field, you don't know which function might have a bug in it or not."

pornel3y ago

Not really. It's possible to verify that a call graph can't call panic anywhere, and there exist 3rd party hacky solutions for this already. Rust is just lacking first-class features for this.

1 more reply

ArrayBoundCheck3y ago· 2 in thread

TheDong3y ago

> Does this mean using C inside of rust is ok? I'm pretty sure the original team kept hitting memory problems and it was in fact not ok

This reads a little like flamebait.

Memory unsafety is bad. panic is _not_ a memory unsafe operation, it does not result in memory problems.

This post is not related to memory safety in any way.

ArrayBoundCheck3y ago

The third sentence was the point. I don't want browsers being less reliable and terminating randomly due to unwraps

2 more replies

jeffrallen3y ago· 1 in thread

burntsushi3y ago

Did you read the blog? I addressed all of that.

schubart3y ago· 1 in thread

> When checking preconditions, make sure the panic message relates to the documented precondition, perhaps by adding a custom message. For example,

> `assert!(!xs.is_empty(), "expected parameter 'xs' to be non-empty")`.

This panics with

> thread 'main' panicked at 'expected parameter 'xs' to be non-empty', src/main.rs:79:5

Without the custom message it's

> thread 'main' panicked at 'assertion failed: !xs.is_empty()', src/main.rs:79:5

Given that panics should be for bugs, i.e. interpreted by developers, I'd say the second message is clear enough and a custom message just adds noise in the source code.

burntsushi3y ago

I don't disagree. Definitely depends on how opaque the assertion test is. In this case, yeah, probably don't need a message. Pithy illustrative examples are hard.

yuan433y ago· 1 in thread

The author notes that API simplicity might be a reason to avoid pushing invariants to compile time:

That said, I'm skeptical that the full range of approach was considered. I sometimes find that the presence of unwrap is a smell pointing to types that have not been fully fleshed out.

I get that the whole point is "at what cost?" All I'm saying is that the tradeoffs aren't clear from the example in the passage.

burntsushi3y ago

Well, what isn't clear? What do you want to know? I'll try again.

An Aho-Corasick automaton can be built in a few different ways. How it's built changes how matches are reported: https://docs.rs/aho-corasick/latest/aho_corasick/enum.MatchK...

> That said, I'm skeptical that the full range of approach was considered. I sometimes find that the presence of unwrap is a smell pointing to types that have not been fully fleshed out.

I am a fallible human. I might be wrong. So sure, be skeptical!

I'm not sure what this is a case of? Like yeah, I agree, that sounds bad?

> I get that the whole point is "at what cost?" All I'm saying is that the tradeoffs aren't clear from the example in the passage.

[1]: https://docs.rs/aho-corasick/latest/aho_corasick/struct.AhoC...

nnoitra3y ago· 1 in thread

This blog post will age out in 6 months.

burntsushi3y ago

Why? It's basically an elaboration of the same advice I gave eight years ago. And I was not the first.

ww5203y ago

I generally just use expect(), assert!, or unreachable! rather than simply unwrap() to document the unexpected invalid state.

renewiltord3y ago

Very neat. Just discovered anyhow and the context stuff from this post.

berryton3y ago

The Rust book has a small discussion about his in chapter 9.3 (https://doc.rust-lang.org/book/ch09-03-to-panic-or-not-to-pa...).

I think it is good to have some references (the blog post and the book) for when the horde comes after you.

j / k navigate · click thread line to collapse