Concurrent Programming, with Examples (opens in new tab)

(begriffs.com)

330 pointsbegriffs6y ago88 comments

88 comments

51 comments · 11 top-level

Random_ernest6y ago· 14 in thread

The article is very nice, thanks a lot for it. Especially since I hear the word concurrency and parallelism often thrown around without any distinction.

Very off topic, but I have read several times the argument that the rise of functional programming is due to it's easy concurrency (since functions don't have side effects) and that concurrency becomes more and more important due to moores law being dead (i.e. we can't scale the hardware up, we have to add cores to our processors).

Could someone with more experience comment on that? Is concurrency really easier in functional languages and is the rising importance of concurrency a valid reason to look into functional programming?

clarry6y ago

> Is concurrency really easier in functional languages and is the rising importance of concurrency a valid reason to look into functional programming?

I think it depends on who's writing the code. A little bit of shared mutable state here and there isn't gonna kill you, but if the program's architecture is poor, that'll blow up and spread everywhere and the next thing you know is you're spending half your time worrying about data races, deadlocks, and a locking nightmare.

Shared-nothing architectures fix that; your synchronization will happen over a narrow range of well defined primitives (e.g. message queues), though this doesn't prevent race conditions.

Functional programming that encourages pure functions & immutable state is a bit of a straight jacket that keeps you from making a mess that blows up. I think it helps, and I wish $stuffAtWork were written in such a language because I'm tired of wrestling with terrible architecture (that I'm not really allowed to fix, given the time constraints).

The other thing that makes a difference is what your problem looks like. Sometimes concurrency is absolutely trivial, like dispatching sections of an array for threads to perform (independent) compute on, and come back with a result. Sometimes it's way more complicated...

toolslive6y ago

Yes. Concurrency is easier in functional languages.

Many of them provide a concurrency monad which allows you to write expressions like this:

   read_from_socket_into_buffer params >>= 
   process_buffer >>= 
   ...

Where you roughly read the '>>=' as follows: "the left side might take an arbitrary amount of time to produce a result. While it's off and doing its thing, go and do something else. Once it has produced the result, take it and feed it as a parameter to function on the right."

BTW, It's quite easy to implement your own concurrency monad, however: the real work is the wrapping of all the blocking system calls into this framework.

1 more reply

Athas6y ago

Concurrency is all about simultaneous effects. A language like Haskell is nice enough because it makes effects explicit, so you at least know where things can go wrong, but if you have a bunch of concurrent effectful Haskell threads running at the same time, you have the usual issues with nondeterminism. Functional languages to tend to emphasize higher-level concurrency patterns than old-school shared mutable state, so in practice it works OK.

Specifically, concurrent Haskell is good and also quite fast. It doesn't quite compare to Erlang in the concurrency department although the languages are very different in other ways, so there are reasons to prefer Haskell anyway. I think this shows that just having a functional language that emphasizes effect-free programming is not sufficient. If you want a good concurrent language, then you have to design it like that, and not simply rely on the absence of shared mutable state.

In practice, people write massively concurrent systems all the time, in the form of dynamic web sites that handle many concurrent requests, and synchronise access to a shared mutable state via a transactional database of some kind. These don't depend on fancy languages, but on a rigid architecture that tries to isolate synchronisation issues in the web server and database components.

While this article is about concurrency, I'll also add a note on parallelism: Parallel functional programming has IMO not been shown to work well in practice. While it is trivial to parallelise more or less any pure functional program correctly, it is very hard to actually gain meaningful speedup from it. The single point of parallelism is performance, and most current implementations of functional languages (including all general purpose functional languages I know of) simply have too many bottlenecks or pitfalls to scale for computational work as well in practice as one might expect in theory. For example, concurrent Haskell is fast because most of the actual work tends to be IO and GHC has an excellent parallel IO manager and scheduler, while parallel Haskell is hampered by, among other things, a slow garbage collector.

hopia6y ago

Do you refer to Erlang excelling in distributed concurrency, due to how its built to rely on the actor model? Or Erlang also excels in concurrency on a single machine?

Btw, what are some of those other things hampering Haskell's performance with parallel computing? Wouldn't the garbage collector mainly produce pauses instead of really destroying parallel performance? Although a concurrent garbage collector has recently been merged to the nightly GHC.

1 more reply

zelphirkalt6y ago

What you heard is correct. If you have pure functions and provided, that you have working primitives of concurrency, you can go to any place in you code, where you call a pure function and change the code, so that the function runs in a new concurrency (or parallelism) primitive. For example in a new thread or process. The places where this is useful can be discovered by profiling.

Another advantage is, that you can unit test each function on its own, without setting up a whole host of preconditions. That's because of referential transparency.

imtringued6y ago

You are confusing Moores' Law with Dennard scaling. Without Moores' Law you're not adding more transistors and therefore more cores.

011000116y ago

I'm just an idiot EE who learned to program, but it seems like functional languages without side-effects accomplish their task by basically working on data structures stored on the stack. That basically means each call stack has its own data structure. If you wrote threaded code so that each thread had its own data structure, and worked off a shared set of read-only data structures, you wouldn't need to worry about locking.

IME locking isn't really all that hard until you need to squeeze out performance. If you can handle one big dumb lock around everything, it's easy. The finer grained your locking gets, the harder it is to get right(even then, lock hierarchies can help to a great degree).

unlinked_dll6y ago

One big dumb lock only protects against data races. There are other classes of concurrency bugs that require smart locking to avoid, and it's always a pain.

For example, iterator invalidation. Which is probably the most common thing I've seen when people use dumb locks to make a concurrent program sound. Either your iterators need to be aware of the lock and hold it through their lifetime (which means practically only one iterator can exist, which is undesirable) or your datastructure needs to be aware of outstanding iterators to it and update them (which means you have added logic to simple mutations, which is also undesirable).

It's problems like that you look for in non-functional languages that require library authors to say things like "this is/isn't thread safe" in their documentation, while providing those guarantees manually. And as a consumer of the library you need to be aware of these things and look for them, and what an error looks like when someone messes up.

I don't know what you're trying to say about stacks/the call stack in functional languages, because they implement their patterns in variety of ways - including locking. It's just about providing particular guarantees in the API, and the benefits of those guarantees are that it prevents entire classes of bugs like iterator invalidation and data races.

There are also all sorts of clever data structures you can use that don't require duplication of the underlying data. The easiest is a singly linked list without insertion. No locks required.

bcrosby956y ago

The problem is that if you have one big dumb lock around everything then you have no actual concurrency or parallelism.

1 more reply

toast06y ago

> Could someone with more experience comment on that? Is concurrency really easier in functional languages and is the rising importance of concurrency a valid reason to look into functional programming?

Only you know if concurrency is relavant to your work. A lot of internet stuff is deeply concurrent, but there's a lot of sequential work out there, too.

I have experience with Erlang. It's amazing for concurrency, but the reason is really not anything it gives you, but everything it prevents you from doing, or requires you to do.

Erlang's immutability is important because it means you can't save a reference to something and have it magically updated; if you want a value updated somewhere, you need to send the current value or request the current value at time of use; explicit synchronization makes concurrency clearer, and helps influence system design to reduce synchronization. Immutability also makes message passing easier; the content of a message can't change after it's composed which means the implementation is free to make a copy of the message if it's convenient and as a programmer, there's no need to consider the effects of changing something that you sent to another process. Immutability also makes garbage collection very simple; it's impossible to form a looped datastructure, so there's no need for mark and sweep, a simple copying collector is sufficient. Individual stacks per Erlang process (like a green thread) means GC pauses don't stop the world, only individual processes, and pause length is bounded and proportional to the memory use of the process.

You could certainly set up a similar environment in C or whatever traditional language; the problem would be enforcing your system design on your own code, and partitioning off library code (including libc and/or kernel syscalls) to ensure it doesn't impact your system design either.

leethargo6y ago

I don't have more experience, necessarily, but I think the argument builds on the facts that pure functions don't have side-effects, which avoids many of the problems of simultaneous data editing, and also the use of immutable data structures, which can be shared freely (in a read-only manner).

jimbokun6y ago

> Is concurrency really easier in functional languages

You can use the same approaches in non-functional languages. You can only use immutable data structures, and pure functions everywhere. Then you get the same benefits of easier reasoning about concurrent execution, as you get from a functional programming language.

So the point of using an actual functional language, is that the compiler pushes you very strongly towards programming this way. It's kind of like using a language with built in garbage collection. You could implement a garbage collector in C, and use it everywhere in your program where you allocate and release memory. But having it build into the language frees you up from having to think about it very much.

dirtydroog6y ago

I don't get the immutable data structures thing though. If I have a vector of 1 billion numbers and I need to sort it it'll be unfeasible to make a copy every time I swap elements. RAM isn't infinite either.

So, deep down in the guts of the language's libraries there must be mutable operations. Which kind of defeats the whole point.

I don't get point of functional programming. Machines don't work that way.

8 more replies

Heliosmaster6y ago

Yes. I can say that doing concurrency in Clojure is particularly easy, check for yourself: https://www.braveclojure.com/core-async/

bmn__6y ago· 6 in thread

Next article in the series: now that you know about the dangerous/complicated primitives, don't ever touch them again. Instead use the high-level safe concurrency/parallelism mechanisms in your programming language: futures/promises, nurseries, channels, observers, actors, monitors. Ideally, these should be built-in, but a library whose API composes well into most programs will also do.

Data races can be statically removed by carefully restricting certain parts of the language design, see Pony. https://tutorial.ponylang.io/#what-s-pony-anyway

Bonus: learn aspects of deadlocking by playing a game: https://deadlockempire.github.io/

closeparen6y ago

I disagree, first you need to program a few toy projects with the dangerous primitives, to really internalize how tricky they are.

It's because of my concurrent C homework assignments that I was able to really appreciate Go's channels in my first internship.

keymone6y ago

“Never touch them again” means don’t reach for those primitives when higher level tools are available. Learning about them is fine.

It’s like with cryptography - don’t roll your own solutions just because you know what xor is. You’ll fail miserably.

throw513196y ago

I am just getting into concurrency. Is there any point to use threads or forkjoinpool in Java anymore? Should you always just use CompletableFutures and Suppliers?

rrss6y ago

Somebody has to maintain the operating systems and a metric ton of system software written in C. It cannot be magically all switched to Pony or Rust overnight.

swsieber6y ago

See also Rust for the data races portion.

Though IIUC Pony is also supposed to prevent deadlocks.

Groxx6y ago

There are no locks in Pony, so yes: no deadlocks. (livelocks / other kinds of "lack of progress" are of course still possible tho)

thallukrish6y ago· 5 in thread

My experience is, single threaded execution and being able to replicate that with local data for each instance and remote lookup at a fine grained data level when needed, is a more easier way to maintain the code. Cocurrency and all those synchronisation is damn hard to code and debug.

tabtab6y ago

In my opinion it's overhyped. I'll probably take point hits for claiming it, but so be it. Let the truth ring.

Why copy techniques meant for Netflix and Facebook when your org or app is most likely 1/1000th their size. Phallic size jealousy at work.

Most concurrent and parallel work can and should be done on a true-and-tried RDMBS for most orgs and apps. Use transactions/rollbacks properly and let the RDBMS manage most the grunt work instead of reinvent the wheel in app code.

K.I.S.S. and use-the-right-tool-for-the-job.

thallukrish6y ago

If you want extreme scale, most data in memory, such as a search engine processing millions of documents running data pipelines, then RDBMS isn't the way to go. For other MVC types, for a reasonable scale out, straight forward models with RDBMS should do.

1 more reply

cle6y ago

It has tradeoffs and can easily result in higher complexity, eg if you need a shared cache to minimize latency/memory.

I like the attitude the article takes. To successfully use concurrency, you should understand the tools available to you and their tradeoffs so you can make the right decision for your requirements.

pjmlp6y ago

When one has nice tooling like Visual Studio graphical debugging, not so much.

tester7566y ago

Its it?

https://marketplace.visualstudio.com/items?itemName=AdamWulk...

011000116y ago· 4 in thread

> The sched_yield() puts the calling thread to sleep and at the back of the scheduler’s run queue.

Not necessarily, but it is fine for this purpose I suppose. See https://news.ycombinator.com/item?id=21959692

Glad to see lock hierarchies mentioned. Barriers are new to me so that was nice.

IMO, it would be nice to at least have a mention of lock-free techniques and their advantages and disadvantages.

Joker_vD6y ago

The main disadvantage of lock-free techniques is that you have to write code without any critical sections, that is, you have to properly manage arbitrary interleaving of actions. It's hard enough to manage staleness/inconsistency of data at high (business logic) level, never mind the low level where the code is not even executed in the written order.

011000116y ago

Also lock free usually means atomics which means memory fences. Those can be slow. It's another tool you should have available though.

1 more reply

agapon6y ago

Regarding sched_yield, another common (and, IMO, superior) approach to that issue is to drop the first lock and then acquire the second lock after the try-lock on it failed. Then you drop the second lock and retry the whole sequence again.

scott_s6y ago

Agreed that sched_yield() is unlikely to be the right approach. A better approach is to use nanosleep() with an exponential backoff.

jayd166y ago· 3 in thread

No mention of volatile variables or the concept of stale cpu cache reads when a value is written to from another core. I think its a pretty common and fundamental concept that should be in a write up such as this.

bonzini6y ago

If you use the standard blocking synchronization primitives, you cannot have stale reads. If you don't, the right way to introduce them would be with the C11 memory model relationships (synchronizes-with, happens-before), volatile shouldn't be touched with a 10 foot pole except for synchronization with signal handlers.

rrss6y ago

> stale cpu cache reads

Every multiprocessor system nowadays has coherent caches for normal memory, meaning that if you have reads that you perceive as stale, it isn't because of the cache, but because the hardware doesn't guarantee sequential consistency.

You can have problems due to relaxed consistency on a system without caches, and the problems on systems with caches aren't due to the caches.

There are situations when you have to worry about lack of coherency for other memory types, but AFAIK these are rarely exposed to userspace.

011000116y ago

I agree it would be nice to add. Is that something you need to worry about when using the standard synchronization APIs though? They all handle memory fencing for you, no?

rrss6y ago· 2 in thread

Does anyone know the history behind the distinction between concurrency and parallelism presented here? The most frequent reference I see is Pike's "Concurrency is not parallelism" talk, but I'm curious who first came up with this distinction.

rrss6y ago

More food for thought, as I've been thinking about this:

1. std::thread::hardware_concurrency

This is the number of threads that can execute in parallel, no?

2. "Memory-level parallelism"

How many memory operations can be "outstanding" at once - seems comparable to a single core issuing multiple disk reads. The memory operations aren't really serviced simultaneously, they just have overlapping lifetimes.

For more fun, some people refer to the case where performance is limited by the amount of memory-level parallelism available as "concurrency-limited": https://sites.utexas.edu/jdm4372/2018/01/01/notes-on-non-tem...

pjc506y ago

You can have concurrency without parallelism per the definition of the article - on a single processor system with timeslicing, for example.

SIMD systems effectively give you parallelism without concurrency - only one instruction is executing, but it's operating on multiple dataflows.

Your linked definition of "concurrency limited" seems to refer to utilisation. In the scenario described, how effectively the processor can be utilised depends on how many concurrent tasks it has in progress so it has something to do while one of them is waiting for a cache miss.

inaseer6y ago· 2 in thread

There is a good body of knowledge around dealing with concurrency issues within a single process. We've tools (locks, semaphores ...) to deal with the complexity as well as programming paradigms which help us write code which minimizes data races. It's interesting to realize that in a world with an increasing number of micro-services manipulating shared resources (a shared database, shared cloud resources), or even multiple nodes backing a single micro-service all reading and writing to shared resources, similar concurrency bugs arise all the time. Unlike a single process where you can use locks and other primitives to write correct code, there is no locking mechanism we can use to protect access to these global shared resources. We have to be more thoughtful so we write correct code in the presence of pervasive concurrency, which is easier said than done.

abjKT26nO86y ago

> Unlike a single process where you can use locks and other primitives to write correct code, there is no locking mechanism we can use to protect access to these global shared resources.

Databases provide transactions. This mechanism is also an inspiration for a synchronisation model called Software Transactional Memory proposed for Haskell, and used as "the" synchronisation model in Clojure. Locks and semaphores are rather lower-level primitives and it's harder for humans to reason about them with an ease comparable to using CSP or STM.

inaseer6y ago

Yes, database transactions should be heavily leveraged wherever possible. We've often had to write services which create multiple resources in response to user requests. As an example, create an entry in the database and trigger the creation of, say, an Azure storage account. Transactions across independent services and resources don't work and correctness requires thoughtful design. In the more general case, whenever your service talks to more than micro-service to complete an operation, you will probably have to think through issues of consistency and transactionality.

highhedgehog6y ago· 2 in thread

Is anyone aware of good examples that can be used to explain and implement parallelism/concurrency that are not the bankers? I have seen it too many times.

giu6y ago

The dining philosophers problem comes to mind, which originally was formulated by Dijkstra [0]. You can find implementations in different languages at Rosetta code [1]

[0] https://en.wikipedia.org/wiki/Dining_philosophers_problem [1] https://rosettacode.org/wiki/Dining_philosophers

highhedgehog6y ago

Thank you! Actually I saw that too.

I was hoping for more real life examples where you can see the effects of concurrency parallelism.

moring6y ago· 2 in thread

I'm a bit disappointed that the article doesn't explain the need for a memory/consistency model and how it interacts with CPU caches. Locks are the easy part, and the article makes you think that with them you can now write at least simple concurrent programs.

Why is that? I'm pretty sure that the author's intention is not to equip the readers with the tools to make buggy programs, yet that is exactly what happens here.

011000116y ago

Don't the standard synchronization APIs documented in the article handle memory barriers for you?

rrss6y ago

Yes. As long as you use correctly-constructed synchronization primitives (e.g. pthreads), you don't need to worry about memory consistency.

When you need to start worrying is when you start implementing your own synchronization (either rolling your own primitives or going lock-free).

'moring needs to clarify what they are talking about. It's perfectly possible to write correct code using pthreads on modern hardware with no understanding of memory consistency.

latrasis6y ago

Thank you for the great read! Wondering how io_uring would be put in place of this situation...would be very interested in the authors review: https://kernel.dk/io_uring.pdf

Jahak6y ago

Interesting article and a great blog

j / k navigate · click thread line to collapse

88 comments

51 comments · 11 top-level

Random_ernest6y ago· 14 in thread

The article is very nice, thanks a lot for it. Especially since I hear the word concurrency and parallelism often thrown around without any distinction.

clarry6y ago

> Is concurrency really easier in functional languages and is the rising importance of concurrency a valid reason to look into functional programming?

Shared-nothing architectures fix that; your synchronization will happen over a narrow range of well defined primitives (e.g. message queues), though this doesn't prevent race conditions.

toolslive6y ago

Yes. Concurrency is easier in functional languages.

Many of them provide a concurrency monad which allows you to write expressions like this:

   read_from_socket_into_buffer params >>= 
   process_buffer >>= 
   ...

BTW, It's quite easy to implement your own concurrency monad, however: the real work is the wrapping of all the blocking system calls into this framework.

1 more reply

Athas6y ago

hopia6y ago

Do you refer to Erlang excelling in distributed concurrency, due to how its built to rely on the actor model? Or Erlang also excels in concurrency on a single machine?

1 more reply

zelphirkalt6y ago

Another advantage is, that you can unit test each function on its own, without setting up a whole host of preconditions. That's because of referential transparency.

imtringued6y ago

You are confusing Moores' Law with Dennard scaling. Without Moores' Law you're not adding more transistors and therefore more cores.

011000116y ago

unlinked_dll6y ago

One big dumb lock only protects against data races. There are other classes of concurrency bugs that require smart locking to avoid, and it's always a pain.

There are also all sorts of clever data structures you can use that don't require duplication of the underlying data. The easiest is a singly linked list without insertion. No locks required.

bcrosby956y ago

The problem is that if you have one big dumb lock around everything then you have no actual concurrency or parallelism.

1 more reply

toast06y ago

Only you know if concurrency is relavant to your work. A lot of internet stuff is deeply concurrent, but there's a lot of sequential work out there, too.

I have experience with Erlang. It's amazing for concurrency, but the reason is really not anything it gives you, but everything it prevents you from doing, or requires you to do.

leethargo6y ago

jimbokun6y ago

> Is concurrency really easier in functional languages

dirtydroog6y ago

So, deep down in the guts of the language's libraries there must be mutable operations. Which kind of defeats the whole point.

I don't get point of functional programming. Machines don't work that way.

8 more replies

Heliosmaster6y ago

Yes. I can say that doing concurrency in Clojure is particularly easy, check for yourself: https://www.braveclojure.com/core-async/

bmn__6y ago· 6 in thread

Data races can be statically removed by carefully restricting certain parts of the language design, see Pony. https://tutorial.ponylang.io/#what-s-pony-anyway

Bonus: learn aspects of deadlocking by playing a game: https://deadlockempire.github.io/

closeparen6y ago

I disagree, first you need to program a few toy projects with the dangerous primitives, to really internalize how tricky they are.

It's because of my concurrent C homework assignments that I was able to really appreciate Go's channels in my first internship.

keymone6y ago

“Never touch them again” means don’t reach for those primitives when higher level tools are available. Learning about them is fine.

It’s like with cryptography - don’t roll your own solutions just because you know what xor is. You’ll fail miserably.

throw513196y ago

I am just getting into concurrency. Is there any point to use threads or forkjoinpool in Java anymore? Should you always just use CompletableFutures and Suppliers?

rrss6y ago

Somebody has to maintain the operating systems and a metric ton of system software written in C. It cannot be magically all switched to Pony or Rust overnight.

swsieber6y ago

See also Rust for the data races portion.

Though IIUC Pony is also supposed to prevent deadlocks.

Groxx6y ago

There are no locks in Pony, so yes: no deadlocks. (livelocks / other kinds of "lack of progress" are of course still possible tho)

thallukrish6y ago· 5 in thread

tabtab6y ago

In my opinion it's overhyped. I'll probably take point hits for claiming it, but so be it. Let the truth ring.

Why copy techniques meant for Netflix and Facebook when your org or app is most likely 1/1000th their size. Phallic size jealousy at work.

K.I.S.S. and use-the-right-tool-for-the-job.

thallukrish6y ago

1 more reply

cle6y ago

It has tradeoffs and can easily result in higher complexity, eg if you need a shared cache to minimize latency/memory.

I like the attitude the article takes. To successfully use concurrency, you should understand the tools available to you and their tradeoffs so you can make the right decision for your requirements.

pjmlp6y ago

When one has nice tooling like Visual Studio graphical debugging, not so much.

tester7566y ago

Its it?

https://marketplace.visualstudio.com/items?itemName=AdamWulk...

011000116y ago· 4 in thread

> The sched_yield() puts the calling thread to sleep and at the back of the scheduler’s run queue.

Not necessarily, but it is fine for this purpose I suppose. See https://news.ycombinator.com/item?id=21959692

Glad to see lock hierarchies mentioned. Barriers are new to me so that was nice.

IMO, it would be nice to at least have a mention of lock-free techniques and their advantages and disadvantages.

Joker_vD6y ago

011000116y ago

Also lock free usually means atomics which means memory fences. Those can be slow. It's another tool you should have available though.

1 more reply

agapon6y ago

scott_s6y ago

Agreed that sched_yield() is unlikely to be the right approach. A better approach is to use nanosleep() with an exponential backoff.

jayd166y ago· 3 in thread

bonzini6y ago

rrss6y ago

> stale cpu cache reads

You can have problems due to relaxed consistency on a system without caches, and the problems on systems with caches aren't due to the caches.

There are situations when you have to worry about lack of coherency for other memory types, but AFAIK these are rarely exposed to userspace.

011000116y ago

I agree it would be nice to add. Is that something you need to worry about when using the standard synchronization APIs though? They all handle memory fencing for you, no?

rrss6y ago· 2 in thread

rrss6y ago

More food for thought, as I've been thinking about this:

1. std::thread::hardware_concurrency

This is the number of threads that can execute in parallel, no?

2. "Memory-level parallelism"

pjc506y ago

You can have concurrency without parallelism per the definition of the article - on a single processor system with timeslicing, for example.

SIMD systems effectively give you parallelism without concurrency - only one instruction is executing, but it's operating on multiple dataflows.

inaseer6y ago· 2 in thread

abjKT26nO86y ago

> Unlike a single process where you can use locks and other primitives to write correct code, there is no locking mechanism we can use to protect access to these global shared resources.

inaseer6y ago

highhedgehog6y ago· 2 in thread

Is anyone aware of good examples that can be used to explain and implement parallelism/concurrency that are not the bankers? I have seen it too many times.

giu6y ago

The dining philosophers problem comes to mind, which originally was formulated by Dijkstra [0]. You can find implementations in different languages at Rosetta code [1]

[0] https://en.wikipedia.org/wiki/Dining_philosophers_problem [1] https://rosettacode.org/wiki/Dining_philosophers

highhedgehog6y ago

Thank you! Actually I saw that too.

I was hoping for more real life examples where you can see the effects of concurrency parallelism.

moring6y ago· 2 in thread

Why is that? I'm pretty sure that the author's intention is not to equip the readers with the tools to make buggy programs, yet that is exactly what happens here.

011000116y ago

Don't the standard synchronization APIs documented in the article handle memory barriers for you?

rrss6y ago

Yes. As long as you use correctly-constructed synchronization primitives (e.g. pthreads), you don't need to worry about memory consistency.

When you need to start worrying is when you start implementing your own synchronization (either rolling your own primitives or going lock-free).

'moring needs to clarify what they are talking about. It's perfectly possible to write correct code using pthreads on modern hardware with no understanding of memory consistency.

latrasis6y ago

Thank you for the great read! Wondering how io_uring would be put in place of this situation...would be very interested in the authors review: https://kernel.dk/io_uring.pdf

Jahak6y ago

Interesting article and a great blog

j / k navigate · click thread line to collapse