OCaml 4.03 will, “if all goes well”, support multicore (opens in new tab)

(sympa.inria.fr)

185 pointswting11y ago113 comments

113 comments

64 comments · 11 top-level

tangled11y ago· 11 in thread

It's interesting to read Xavier's annual statement on why there will never be multi core support in OCaml: http://mirror.ocamlcore.org/caml.inria.fr/pub/ml-archives/ca...

AceJohnny211y ago

When was the latest iteration of post from? The page displays date as NaN...

Edit: I think I found it [1] That iteration was from 2002. I'd be curious to see if his opinion has evolved in 12 years.

Also, interesting to see game developer Chris Hecker [2] in that thread.

[1] http://caml.inria.fr/pub/ml-archives/caml-list/2002/11/threa... search for "Why systhreads?" and "Xavier Leroy". Also, damn their website's broken.

Better link, I wish GMane had better Googlejuice: http://thread.gmane.org/gmane.comp.lang.caml.general/16381/f...

[2] http://en.wikipedia.org/wiki/Chris_Hecker

gnuvince11y ago

Around 2001 I think?

rwmj11y ago

2002

trentnelson11y ago

    > To make things worse, non-blocking I/O is done completely differently
    > under Unix and under Win32.  I'm not even sure Win32 provides enough
    > support for async I/O to write a real user-level scheduler.

sigh, VMS got the link between processes, threads, I/O and waitable events (specifically, the link between tying the completion of future I/O to subsequent computation) right from day one. And by virtue of Cutler, therefore, so did NT, and thus, Windows.

UNIX did not. The core concept of separating the work (computation to be done after an event occurs) from the worker[1] (the thread that performs the work) is absent; the manifestation of that is the lack of good, completion-oriented asynchronous I/O primitives. Instead of being able to say to the kernel "here, do this, then let me know when you're done"[2] and moving on to the next piece of work in the queue, you have to do the elaborate non-blocking multiplex dance for socket I/O, palm file I/O off onto a separate set of threads that can block (or do AIO) and generally manage all threading and concurrency primitives yourself.

It took me ten years of UNIX systems programming to suddenly grasp the elegance of the VMS/NT/Windows approach a few years ago. It provides you with everything you need to optimally exploit all your cores for work that is both heavily compute bound and I/O bound.

It has been fascinating to see the difference in performance between Linux and Windows in practice with PyParallel when Windows kernel primitives are exploited properly:

https://speakerdeck.com/trent/pyparallel-pycon-2015-language....

And more recently, with 10Gbe hardware at home:

Linux lwan (the top performer on Techempower Framework Benchmark):

    [trent@zebra/ttypts/1(~s/wrk)%] time ./wrk --timeout 120 --latency -c 256 -t 12 -d 30 http://10.0.0.2:8080/plaintext
    Running 30s test @ http://10.0.0.2:8080/plaintext
      12 threads and 256 connections
      Thread Stats   Avg      Stdev     Max   +/- Stdev
        Latency     5.34ms    7.46ms 197.13ms   82.40%
        Req/Sec    14.41k   364.49    18.82k    76.61%
      Latency Distribution
         50%  398.00us
         75%    9.01ms
         90%   17.50ms
         99%   28.03ms
      5178617 requests in 30.10s, 0.93GB read
    Requests/sec: 172048.49
    Transfer/sec:     31.67MB

Windows PyParallel:

    [trent@zebra/ttypts/1(~s/wrk)%] time ./wrk --timeout 120 --latency -c 256 -t 12 -d 30 http://10.0.0.2:8080/plaintext
    Running 30s test @ http://10.0.0.2:8080/plaintext
      12 threads and 256 connections
      Thread Stats   Avg      Stdev     Max   +/- Stdev
        Latency     1.52ms    9.38ms 492.43ms   99.33%
        Req/Sec    18.37k     1.01k   22.75k    73.50%
      Latency Distribution
         50%    1.09ms
         75%    1.28ms
         90%    1.56ms
         99%    5.18ms
      6598900 requests in 30.10s, 1.03GB read
    Requests/sec: 219236.69
    Transfer/sec:     34.92MB
    ./wrk --timeout 120 --latency -c 256 -t 12 -d 30   106.30s user 138.87s system 814% cpu 30.114 total

[1]: https://speakerdeck.com/trent/parallelism-and-concurrency-wi...

[2]: https://speakerdeck.com/trent/pyparallel-how-we-removed-the-...

e12e11y ago

Did you try to run that code under ReactOS too? I would assume (I've not checked) that they follow the NT kernel design -- so should have similar "architectural" performance -- even if I doubt they've had as much time to hand-tune the details.

It'd be interesting if running under the VMS/NT thread/fork model could be seen as a reason to deploy some apps on ReactOS rather than Linux/BSD. Would also be interesting if one could see any difference running a multi-core KVM guest on ReactOS vs a Linux/BSD guest/container/jail. Although I suppose one would need to dedicate a hw nic to see any real results (avoiding the host OS/VM scheduler etc)?

Note-to-self: something to play with...

1 more reply

pascal_cuoq11y ago

Xavier's first sentence states that they two operating systems have a visibly different philosophy, not that one is better than the other. The second sentence should be interpreted in the context of this first sentence: if you try to emulate Unix's primitive with Windows', and especially if you want to do this and write a user-level scheduler that does not occasionally deadlock without reason, you will get stuck in a couple of places.

This doesn't mean that Windows' philosophy does not give you optimal performance in PyParallel. It simply means that OCaml had chosen for its low-level system primitives a Unix model and that it was difficult to make a Windows version of the same primitives so that OCaml programmers could write this kind of program portably between Windows and Unix.

NOTE: without, at the time it is in my timezone, looking up the full post, I have to say that I don't think that the quoted two sentences have anything to do with the discussion. It seems to me that the two sentences assume that a multicore (multiprocessor, at the time the post was written) OCaml runtime is not available, and discusses the options to still provide threads. A user-level scheduler is one option to provide threads to OCaml programs without a concurrent OCaml runtime. Another option is to use Windows' native threads and superior philosophy for blocking primitives to run each OCaml thread as a native thread (although at most one of these will be running at any given time. All the others will be waiting on the heap mutex).

OCaml ended up providing threads under Windows and a Unix-like “Unix” module around 1996-ish, way before the linked discussion. So thanks for the explanation about VMS, but I think it is off-topic, too.

NOTE 2: I have now read the original post. You should, too. It starts with:

> Threads have at least three different purposes:

> 1- Parallelism on shared-memory multiprocessors.

> 2- Overlapping I/O and computation (while a thread is blocked on a network

> read, other threads may proceed).

>3- Supporting the "coroutine" programming style

> (e.g. if a program has a GUI but performs long computations,

> using threads is a nicer way to structure the program than

> trying to wrap the long computation around the GUI event loop).

> The goals of OCaml threads are (2) and (3) but not (1) (for reasons

> that I'll get into later)

What makes it relevant to the current discussion is (1), but Xavier is discussing (2) and (3) at the time of the quote you chose to take out of context.

1 more reply

gtk4011y ago

Would you mind explaining what the link between NT and VMS is?

2 more replies

plorkyeran11y ago

> Of course, all this SMP support stuff slows down the runtime system even if there is only one processor, which is the case for almost all our users...

> What about hyperthreading? Well, I believe it's the last convulsive movement of SMP's corpse :-)

Oh how things have changed. This was written before it was clear just how much of a disaster the P4 was, so it was a pretty reasonable position at the time.

rodgerd11y ago

He was hardly the only one thinking that way - I remember Gabe Newell being scathing about multicore/multiprocessing when the PS3 and Xbox 360 were released. All a fad, waste of time.

bitmadness11y ago

That was back in 2002! I don't think he'd take the same stance now...

istvan__11y ago

I think he was saying it is unlikely.

"In summary: there is no SMP support in OCaml, and it is very very unlikely that there will ever be. If you're into parallelism, better investigate message-passing interfaces."

dorfsmay11y ago· 11 in thread

I took a really hard look at OCaml a year ago, as I was running into performance issues with python. Lack of multicore support made me give up on it.

Now that Rust is around and supporting multicore, that's probably where I'll be investing my time.

I'd love to hear feedback from people who have used both Rust and OCaml.

diginux11y ago

I have started playing with Rust and have used OCaml on and off, but recently really diving hard into it building a REST-based application from top to bottom in OCaml, including all the infrastructure bits. We are slowly making our stuff open source as it becomes useable: https://github.com/afiniate

In short, OCaml is a mature language that has been used for decades in commercial applications. I feel OCaml is the next progression for the people that got excited about distributed systems via the Erlang path and want more of the safety and reasoning that comes from a strongly/statically-typed language like OCaml. Rust may or may not take off, but I am confident OCaml will remain viable for the foreseeable future, and probably gain slow, but steady popularity as engineers see all the cool things you can do like MirageOS: http://openmirage.org/

lobster_johnson11y ago

Isn't the Unicode situation in OCaml more or less the same as in Erlang and Ruby 1.8, ie. "string" is just a byte string, and there's no native encoding support?

Last I checked, there was decent third-party library support in Batteries. I imagine it would be painful if you were to use Batteries' "UTF8.t" string type and had to interface with some other library that used "string" or some other string solution (like Camomile)?

2 more replies

WaxProlix11y ago

Don't forget the .NET OCaml, F#!

1 more reply

saosebastiao11y ago

I've long admired OCaml, but never used it really. I picked up Rust a while ago and actually made some use of it. I love the language a lot, but it is still too immature and low level for my current situation. I ended up using F# as the compromise, which has been amazing.

There are some areas where OCaml is more advanced than F# (functors, the codegen from the optimized compiler, lack of the msbuild barf sandwich, less hacky on non-Windows platforms), but there are also plenty of areas where F# is more advanced than OCaml (computation expressions, code interoperability, real 32 and 64 bit integers, agents, multicore runtime).

I would say that if OCaml was ideal for you except for the lack of parallelism, then you should definitely check out F# before you go all the way to Rust. Rust is awesome and for the right use case you should use it, but F# is a lot closer to OCaml than Rust is.

eatonphil11y ago

I am the author of the major web frameworks for ocaml [0]. On of the nicest things about OCaml is the identical system calls. I built my entire OWebl platform by reading through C servers.

On the other hand, Rust (like Erlang) reinvents and wraps a lot of these calls in ways that are not immediately obvious. (Or at least, not AS immediately obvious as they are in OCaml.)

This is such a tremendous aid because there are nearly limitless documents and examples of the Unix API.

[0] http://github.com/eatonphil/owebl

WaxProlix11y ago

I'm in the same place sort of, and Golang and Julia seem like the obvious higher-performance transitions from Python. What about Rust makes you consider it so highly?

dorfsmay11y ago

To be honest the main reasons are:

* network effect / hype

One of the main reason. It sounds lame, but I can justify to a customer re-writing a project in Rust because they've heard about it and they will be able to hire people who have either used it or at the very least will be interested in learning it. Also, they network/hype means that we are going to see good libraries emerge fairly quickly.

* multicore support

A year ago I was ready to move to OCaml, bought books, started to learn it, but the multicore situation was worse than python. Today we hear that "there is a good chance it's going to get multicore support". In Rust, it's already there, and it's not and after thought.

Rust also happened to be slightly faster for most things according to micro-benchmarks, but we know how reliable those are, and it's we're not talking order of magnitudes here, although it is early and we can hope it gets even better.

yarrel11y ago

The Rust hype machine has kicked into high gear. D suddenly seems under-promoted.

iamd3vil11y ago

Take a look at Nim Language. (www.nim-lang.org) A High Performance lang with "Pythonesque" syntax.

2 more replies

e_d_g_a_r11y ago

What were you doing that you needed multicore?

dorfsmay11y ago

Pivoting hundreds of millions json entries and upload the results to an object store.

I ended up using a combination of python threads and processes, which was more work I wanted to do, and still relatively slow.

1 more reply

almosthaskeller11y ago· 11 in thread

Looked into a static FP language recently. Was torn between OCaml and Haskell. Leaned more toward Haskell than OCaml. Mainly because OCaml feels like it was hacked together, with a lot of very strange and inconsistent syntax and poorly thought out semantics. That said, I haven't chosen either yet, because Haskell has its own share of oddities that I'm still not comfortable with. But at least it feels more pure and consistent and well thought out in its syntax and semantics.

elihu11y ago

Ocaml and Haskell are both good languages. Haskell probably has a bigger community and more momentum at this point. I switched from Ocaml to Haskell long ago because I wanted parallelism.

The implementation philosophy of the two languages is pretty different, despite being superficially similar in terms of syntax. Ocaml is pretty predictable -- you can look at code and have a pretty good idea of what kind of code the compiler is going to generate.

Haskell is a lot more opaque. Between laziness and a more rigid type system, ghc can do some pretty crazy code transformations. In general, this is a good thing, but it can make performance questions harder to figure out.

I think that Ocaml is easier to learn, but Haskell is more fun, and I've learned more from using it.

ufo11y ago

Haskell libraries also tend to have more levels of abstraction then Ocaml ones, in my experience. Ocaml libraries don't tend to use things like monad transformers or lenses.

alextgordon11y ago

After many years of struggling with Haskell, I've all but given up on it, because of the broken record semantics. They seem to be able to find time to implement monad comprehensions or whatever paper fodder is most in vogue, but you still can't have two datatypes with the same field name in the same module. It's not a serious language.

elihu11y ago

That is also true of Ocaml, and probably every other ML derived language that treats accessor functions as ordinary functions (rather than using the C-style dot operator). There is a type-directed name resolution proposal that would remove this limitation in Haskell, but it would probably make the typechecker a lot more complicated.

The can't-reuse-field-names thing is annoying, but claiming that it "isn't a serious language" because they made a design choice that doesn't meet your exact expectations seems kind of closed-minded to me.

3 more replies

djur11y ago

Having two datatypes with the same field name in scope doesn't work well with type inference. OCaml permits it, but it requires type annotations, and in practice it seems like developers solve the problem the same way Haskell does -- separate modules.

That said, working with modules is a lot nicer in OCaml than in Haskell, so it's a less painful solution.

issaria11y ago

A record field accessor is just a normal function, are you also expecting define methods with the same name within the same module? If you want an elegant solution, write a typeclass. You have to write you java in the haskell way. https://wiki.haskell.org/Name_clashes_in_record_fields

e_d_g_a_r11y ago

What do you mean hacked together? It looks like most any other ML for the most part.

And what poorly thought out semantics?

jallmann11y ago

ML actually has a formally specified semantics. Haskell does not.

brians11y ago

OCaml doesn't either.

1 more reply

codygman11y ago

Do you mean:

http://sml-family.org/sml97-defn.pdf

1 more reply

creichert11y ago

Both are great languages and worth learning. All programming languages have quirks.

j_baker11y ago· 5 in thread

Does OCaml not already support multicore? Is concurrency green thread based? Even at that, there's nothing stopping a user from starting multiple processes....

murbard211y ago

Yes, the threads are green threads: only one can run at a time. There's also an Async framework in Jane Street's Core library and there's LWT.

My understanding is that GC is hard with multithread, particularly in a functional language where it's going to do some heavy lifting and needs to be very performant.

Refefer11y ago

That's not really true. Erlang's VM is fantastic at GC with thousands upon thousands of green processes multiplexed onto the system threads, allowing soft realtime performance. Similarly, Haskell's Parallel Strategies library works well with the Parallel GC. Immutability makes this a whole lot easier.

Or were you referring to OCaml in particular?

2 more replies

bad_user11y ago

GC can be hard with single threading as well. Even if you don't have concurrency to worry about, it still has to be incremental - meaning to do its work in small batches and provide a guarantee that the process doesn't get blocked for more than X millis.

Also, if you can rely on the data-structures stored in your heap to be persistent, then you can tune the GC for it. The problem is that you need to make assumptions about the life-cycle of those data-structures. For example, the persistent data-structures being used in Scala or Clojure can be pretty heavy for the JVM's garbage collectors because they tend to produce junk that is neither short-term or long-term, thus invalidating the assumptions with which the JVM was built with. And generally that's OK, because the JVM's GCs can cope pretty well and if the need to optimize arises, well both Scala and Clojure are hybrids (just like OCaml), so you can just use mutable stuff if by profiling you see problems. So the theory is known and a decent concurrent GC can be built.

bad_user11y ago

Starting multiple processes sucks though, you know, for the kind of use cases for which OCaml should be well suited for. This is because OS processes are more expensive than threads and communication and synchronization between such processes gets very expensive.

istvan__11y ago

on this argument you could state that Erlang's processes the way to go because OS threads are way more expensive. If I have a problem that I can easily make parallel than I could just start up N OCaml processes and send each process a chunk of work and this would not be much more inefficient than the thread based implementation. On the top of that, I don't like to create a tonn of threads in any application, I much rather have a fixed number of threads and using channels to send them work than dynamically (on demand) creating threads or processes.

1 more reply

DonPellegrino11y ago· 4 in thread

More information is available

in my original post in r/ocaml: https://www.reddit.com/r/ocaml/comments/36ninh/403_scheduled...

in the repost in r/programming: https://www.reddit.com/r/programming/comments/36ppx0/ocaml_4...

DonPellegrino11y ago

For those asking "How the hell does OCaml not support multicore in 2015????", this is my reply, crossposted from /r/ocaml:

You can make OS level threads, but they can't be both running at the same time due to the GIL (Global Interpreter Lock). Then why are they even there you might ask? Because it allows you to do a blocking call on a thread and to keep executing other stuff in the main thread. Other languages that have a GIL (and the same restriction) are Javascript (including Node.js), Ruby and Python.

Now, IN PRACTICE, things are a bit different. You're never gonna make your own thread to block on things. You're gonna use Lwt to manage all your concurrency so you can do tons of blocking stuff at the same time and combine the tasks nicely without ending up in a Node.js-style "callback hell".

But still, even with tons of concurrency, you don't have parallelism. It's all you need for 98% of your programs, but if you then need to do heavy number-crunching it won't be enough. This is the exact same situation that happens in Node.js, Python, etc, except that OCaml is massively faster than those languages, so even some CPU-bound work is acceptable because OCaml is really performant.

Currently, there's 2 options if you wanna do CPU-bound work: you can use ctypes to call C code easily (from Lwt_preemptive) and then release the lock from within C with caml_release_runtime_system(), so your C code will be truly parallel (and running in the thread pool automatically managed by Lwt_preemptive), and you can call caml_acquire_runtime_system() before returning the result back to OCaml to get the lock back and merge back with the normal code.

The second option is to do an oldschool fork() and communicate with message-passing. Or have a master that manages workers and communicates with ZMQ, HTTP, TCP, IPC, etc. Or use a library that does it all for you like parmap, Async Parallel, etc etc.

What this "multicore support" means is that you'll be able to have threads in the same process that run in parallel because the GIL is going away. In practice it'll probably be implemented directly into Lwt so you'll be able to do something with Lwt_preemptive and just tell it to run some function in a separate thread and then use >>= to handle its result. It's gonna be simpler than both options I described above.

Again, more technical information is available in my r/ocaml post

jwatzman11y ago

> The second option is to do an oldschool fork() and communicate with message-passing. Or have a master that manages workers and communicates with ZMQ, HTTP, TCP, IPC, etc. Or use a library that does it all for you like parmap, Async Parallel, etc etc.

I work on the Hack language typechecker at Facebook. The typechecker is written in OCaml, and since it needs to operate on the scale of Facebook's codebase (tens of millions of lines of code), it's a pretty performance-sensitive program. We needed real parallelism, but doing it with fork() and IPC was too costly for us, both in terms of storage (if you aren't careful you end up duplicating a bunch of data) and CPU (serializing/deserializing OCaml data structures to send over IPC is CPU-intensive).

We ended up doing something somewhat more interesting. Before we fork(), we mmap a MAP_ANON|MAP_SHARED region of memory -- that region will be backed by the same physical frames in each child after we fork, so writes to it in one child process will be visible in the others. We use a little bit of C code to safely manage the shared-memory concurrency here.

The code for this all open source (along with the rest of the typechecker, HHVM runtime, etc) if you want to take a look: https://github.com/facebook/hhvm/blob/master/hphp/hack/src/h...

I also gave a tech talk a while ago on internals of the type system and typechecker; the latter part starts here: https://www.youtube.com/watch?v=aN22-V-b8RM&feature=youtu.be...

1 more reply

eurleif11y ago

>Other languages that have a GIL (and the same restriction) are Javascript (including Node.js)

Not quite true; JS just doesn't support threads at all. It's asynchronous and single-threaded. In node.js's case, an event loop uses a system call like epoll or kqueue to wait for many events at a time, and dispatches those events to the correct callbacks.

You can do parallelism in JS with Web Workers, and they do use native OS threads, but they lack shared memory, and can only communicate using message passing. So from the perspective of the JS code, they behave more like processes than threads. No GIL, in any case.

1 more reply

sampo11y ago

> Node.js, Python, etc, except that OCaml is massively faster than those languages

The numerical benchmark table in http://julialang.org/ suggests that JavaScript is quite a number crunching beast, within 2x-3x of C performance.

1 more reply

nextos11y ago· 4 in thread

I'm considering OCaml for a new project where C++ would be the typical choice. Think algorithms handling massive amounts of data, and some numerics.

I have some experience with ML, Haskell & Lisp. OCaml is appealing because it is quite efficient and predictable. Does it have the bit of laziness Clojure has that makes functional programming easy with large data?

gmfawcett11y ago

Yes, there is support for laziness (see "streams"). A couple things to keep in mind: floating point values in Ocaml are boxed (floats are actually pointers to float data on the heap), and integers are one bit shorter than native types (31 or 63 bits) due to the way that Ocaml values are tagged internally. The native compiler generates good, predictable, but fairly simple code: few optimizations are applied (although there is active work underway, in the "flambda" project, that will significantly change this). Also, there is of course a garbage collector, though it is quite efficient in most cases. These factors may or may not be a performance issue in your own project.

DonPellegrino11y ago

In practice the generated code is already extremely fast and the 1-bit shorter ints help make the GC one of the fastest I've seen. If you do a lot of floating point calculations, you can put your floats in an array and they'll become unboxed.

shriphani11y ago

Function application in OCaml is eager.

However, implementing laziness is trivially accomplished.

raphaelss11y ago

It already comes with support for lazy evaluation.

http://caml.inria.fr/pub/docs/manual-ocaml/extn.html#sec216

1 more reply

SniperOwl11y ago· 3 in thread

If Jane Street Capital has it their way, multicore support "will definitely go well".

diginux11y ago

Are you from JSC by chance? Is this from an authoritative source, or an just an observation of likeliness? Would love if they indeed do push hard for this.

tomjen311y ago

It sounds like such a company might also have the capital to pay people to make it work.

It is pretty stupid for a language not to have multicore support in 2015. Javascript has it (in its own, somewhat broken way).

rubiquity11y ago

> Javascript has it (in its own, somewhat broken way).

No it doesn't. Also OCaml is from a pre-multicore era. Even Erlang wasn't multicore from the start, SMP was added in 2005.

1 more reply

feld11y ago· 2 in thread

Seems like a major feature to put in a point release...

I don't understand why some projects have such bizarre versioning methodology.

LeonidasXIV11y ago

This is not really a point release. OCaml versions a little different, a version number has three parts: Super-Major.Major.Patch. Super-Major releases are incredibly rare, the last one was the bump to 4, which was done since the language now supports GADTs (while staying compatible with OCaml 3.x). I don't even know what caused the bump from 2.x to 3.00. Then the Major part is a normal release in which many features may be added. The format is always two digits, of which the first may as well be a 0. The Patch part is just for fixes, stuff that was broken and overlooked when the release was done.

So OCaml 4.03.0 is basically 4.3.0 in a Python-esque versioning scheme (remember how many changes were done between Python 2.2.0 and 2.7.0?).

feld11y ago

Thanks for clearing that up

timruffles11y ago· 2 in thread

Great! This, plus the lack of libraries, put me off. Will take a new look!

LeonidasXIV11y ago

You'll be delighted to hear that OPAM curently features >800 libraries, too.

kristianp11y ago

Any libraries in particular?

avsm11y ago

If you'd like to see the approach we're taking at OCaml Labs in order to build multicore, read KC's blog post here:

http://kcsrk.info/ocaml/multicore/2015/05/20/effects-multico...

The core idea is incredibly exciting (to us, anyway). Rather than baking in a specific multicore scheduler, we're allowing pluggable schedulers written in OCaml. They use algebraic effects to allow an independent scheduler to compose concurrency among OCaml threads. This will ensure that the OCaml runtime remains lean, and even allow applications to define their own strategies for concurrent scheduling.

tempodox11y ago

Yupeee!

j / k navigate · click thread line to collapse

113 comments

64 comments · 11 top-level

tangled11y ago· 11 in thread

It's interesting to read Xavier's annual statement on why there will never be multi core support in OCaml: http://mirror.ocamlcore.org/caml.inria.fr/pub/ml-archives/ca...

AceJohnny211y ago

When was the latest iteration of post from? The page displays date as NaN...

Edit: I think I found it [1] That iteration was from 2002. I'd be curious to see if his opinion has evolved in 12 years.

Also, interesting to see game developer Chris Hecker [2] in that thread.

[1] http://caml.inria.fr/pub/ml-archives/caml-list/2002/11/threa... search for "Why systhreads?" and "Xavier Leroy". Also, damn their website's broken.

Better link, I wish GMane had better Googlejuice: http://thread.gmane.org/gmane.comp.lang.caml.general/16381/f...

[2] http://en.wikipedia.org/wiki/Chris_Hecker

gnuvince11y ago

Around 2001 I think?

rwmj11y ago

2002

trentnelson11y ago

    > To make things worse, non-blocking I/O is done completely differently
    > under Unix and under Win32.  I'm not even sure Win32 provides enough
    > support for async I/O to write a real user-level scheduler.

It has been fascinating to see the difference in performance between Linux and Windows in practice with PyParallel when Windows kernel primitives are exploited properly:

https://speakerdeck.com/trent/pyparallel-pycon-2015-language....

And more recently, with 10Gbe hardware at home:

Linux lwan (the top performer on Techempower Framework Benchmark):

    [trent@zebra/ttypts/1(~s/wrk)%] time ./wrk --timeout 120 --latency -c 256 -t 12 -d 30 http://10.0.0.2:8080/plaintext
    Running 30s test @ http://10.0.0.2:8080/plaintext
      12 threads and 256 connections
      Thread Stats   Avg      Stdev     Max   +/- Stdev
        Latency     5.34ms    7.46ms 197.13ms   82.40%
        Req/Sec    14.41k   364.49    18.82k    76.61%
      Latency Distribution
         50%  398.00us
         75%    9.01ms
         90%   17.50ms
         99%   28.03ms
      5178617 requests in 30.10s, 0.93GB read
    Requests/sec: 172048.49
    Transfer/sec:     31.67MB

Windows PyParallel:

    [trent@zebra/ttypts/1(~s/wrk)%] time ./wrk --timeout 120 --latency -c 256 -t 12 -d 30 http://10.0.0.2:8080/plaintext
    Running 30s test @ http://10.0.0.2:8080/plaintext
      12 threads and 256 connections
      Thread Stats   Avg      Stdev     Max   +/- Stdev
        Latency     1.52ms    9.38ms 492.43ms   99.33%
        Req/Sec    18.37k     1.01k   22.75k    73.50%
      Latency Distribution
         50%    1.09ms
         75%    1.28ms
         90%    1.56ms
         99%    5.18ms
      6598900 requests in 30.10s, 1.03GB read
    Requests/sec: 219236.69
    Transfer/sec:     34.92MB
    ./wrk --timeout 120 --latency -c 256 -t 12 -d 30   106.30s user 138.87s system 814% cpu 30.114 total

[1]: https://speakerdeck.com/trent/parallelism-and-concurrency-wi...

[2]: https://speakerdeck.com/trent/pyparallel-how-we-removed-the-...

e12e11y ago

Note-to-self: something to play with...

1 more reply

pascal_cuoq11y ago

NOTE 2: I have now read the original post. You should, too. It starts with:

> Threads have at least three different purposes:

> 1- Parallelism on shared-memory multiprocessors.

> 2- Overlapping I/O and computation (while a thread is blocked on a network

> read, other threads may proceed).

>3- Supporting the "coroutine" programming style

> (e.g. if a program has a GUI but performs long computations,

> using threads is a nicer way to structure the program than

> trying to wrap the long computation around the GUI event loop).

> The goals of OCaml threads are (2) and (3) but not (1) (for reasons

> that I'll get into later)

What makes it relevant to the current discussion is (1), but Xavier is discussing (2) and (3) at the time of the quote you chose to take out of context.

1 more reply

gtk4011y ago

Would you mind explaining what the link between NT and VMS is?

2 more replies

plorkyeran11y ago

> Of course, all this SMP support stuff slows down the runtime system even if there is only one processor, which is the case for almost all our users...

> What about hyperthreading? Well, I believe it's the last convulsive movement of SMP's corpse :-)

Oh how things have changed. This was written before it was clear just how much of a disaster the P4 was, so it was a pretty reasonable position at the time.

rodgerd11y ago

He was hardly the only one thinking that way - I remember Gabe Newell being scathing about multicore/multiprocessing when the PS3 and Xbox 360 were released. All a fad, waste of time.

bitmadness11y ago

That was back in 2002! I don't think he'd take the same stance now...

istvan__11y ago

I think he was saying it is unlikely.

"In summary: there is no SMP support in OCaml, and it is very very unlikely that there will ever be. If you're into parallelism, better investigate message-passing interfaces."

dorfsmay11y ago· 11 in thread

I took a really hard look at OCaml a year ago, as I was running into performance issues with python. Lack of multicore support made me give up on it.

Now that Rust is around and supporting multicore, that's probably where I'll be investing my time.

I'd love to hear feedback from people who have used both Rust and OCaml.

diginux11y ago

lobster_johnson11y ago

Isn't the Unicode situation in OCaml more or less the same as in Erlang and Ruby 1.8, ie. "string" is just a byte string, and there's no native encoding support?

2 more replies

WaxProlix11y ago

Don't forget the .NET OCaml, F#!

1 more reply

saosebastiao11y ago

eatonphil11y ago

I am the author of the major web frameworks for ocaml [0]. On of the nicest things about OCaml is the identical system calls. I built my entire OWebl platform by reading through C servers.

On the other hand, Rust (like Erlang) reinvents and wraps a lot of these calls in ways that are not immediately obvious. (Or at least, not AS immediately obvious as they are in OCaml.)

This is such a tremendous aid because there are nearly limitless documents and examples of the Unix API.

[0] http://github.com/eatonphil/owebl

WaxProlix11y ago

I'm in the same place sort of, and Golang and Julia seem like the obvious higher-performance transitions from Python. What about Rust makes you consider it so highly?

dorfsmay11y ago

To be honest the main reasons are:

* network effect / hype

* multicore support

yarrel11y ago

The Rust hype machine has kicked into high gear. D suddenly seems under-promoted.

iamd3vil11y ago

Take a look at Nim Language. (www.nim-lang.org) A High Performance lang with "Pythonesque" syntax.

2 more replies

e_d_g_a_r11y ago

What were you doing that you needed multicore?

dorfsmay11y ago

Pivoting hundreds of millions json entries and upload the results to an object store.

I ended up using a combination of python threads and processes, which was more work I wanted to do, and still relatively slow.

1 more reply

almosthaskeller11y ago· 11 in thread

elihu11y ago

Ocaml and Haskell are both good languages. Haskell probably has a bigger community and more momentum at this point. I switched from Ocaml to Haskell long ago because I wanted parallelism.

I think that Ocaml is easier to learn, but Haskell is more fun, and I've learned more from using it.

ufo11y ago

Haskell libraries also tend to have more levels of abstraction then Ocaml ones, in my experience. Ocaml libraries don't tend to use things like monad transformers or lenses.

alextgordon11y ago

elihu11y ago

3 more replies

djur11y ago

That said, working with modules is a lot nicer in OCaml than in Haskell, so it's a less painful solution.

issaria11y ago

e_d_g_a_r11y ago

What do you mean hacked together? It looks like most any other ML for the most part.

And what poorly thought out semantics?

jallmann11y ago

ML actually has a formally specified semantics. Haskell does not.

brians11y ago

OCaml doesn't either.

1 more reply

codygman11y ago

Do you mean:

http://sml-family.org/sml97-defn.pdf

1 more reply

creichert11y ago

Both are great languages and worth learning. All programming languages have quirks.

j_baker11y ago· 5 in thread

Does OCaml not already support multicore? Is concurrency green thread based? Even at that, there's nothing stopping a user from starting multiple processes....

murbard211y ago

Yes, the threads are green threads: only one can run at a time. There's also an Async framework in Jane Street's Core library and there's LWT.

My understanding is that GC is hard with multithread, particularly in a functional language where it's going to do some heavy lifting and needs to be very performant.

Refefer11y ago

Or were you referring to OCaml in particular?

2 more replies

bad_user11y ago

istvan__11y ago

1 more reply

DonPellegrino11y ago· 4 in thread

More information is available

in my original post in r/ocaml: https://www.reddit.com/r/ocaml/comments/36ninh/403_scheduled...

in the repost in r/programming: https://www.reddit.com/r/programming/comments/36ppx0/ocaml_4...

DonPellegrino11y ago

For those asking "How the hell does OCaml not support multicore in 2015????", this is my reply, crossposted from /r/ocaml:

Again, more technical information is available in my r/ocaml post

jwatzman11y ago

The code for this all open source (along with the rest of the typechecker, HHVM runtime, etc) if you want to take a look: https://github.com/facebook/hhvm/blob/master/hphp/hack/src/h...

I also gave a tech talk a while ago on internals of the type system and typechecker; the latter part starts here: https://www.youtube.com/watch?v=aN22-V-b8RM&feature=youtu.be...

1 more reply

eurleif11y ago

>Other languages that have a GIL (and the same restriction) are Javascript (including Node.js)

1 more reply

sampo11y ago

> Node.js, Python, etc, except that OCaml is massively faster than those languages

The numerical benchmark table in http://julialang.org/ suggests that JavaScript is quite a number crunching beast, within 2x-3x of C performance.

1 more reply

nextos11y ago· 4 in thread

I'm considering OCaml for a new project where C++ would be the typical choice. Think algorithms handling massive amounts of data, and some numerics.

gmfawcett11y ago

DonPellegrino11y ago

shriphani11y ago

Function application in OCaml is eager.

However, implementing laziness is trivially accomplished.

raphaelss11y ago

It already comes with support for lazy evaluation.

http://caml.inria.fr/pub/docs/manual-ocaml/extn.html#sec216

1 more reply

SniperOwl11y ago· 3 in thread

If Jane Street Capital has it their way, multicore support "will definitely go well".

diginux11y ago

Are you from JSC by chance? Is this from an authoritative source, or an just an observation of likeliness? Would love if they indeed do push hard for this.

tomjen311y ago

It sounds like such a company might also have the capital to pay people to make it work.

It is pretty stupid for a language not to have multicore support in 2015. Javascript has it (in its own, somewhat broken way).

rubiquity11y ago

> Javascript has it (in its own, somewhat broken way).

No it doesn't. Also OCaml is from a pre-multicore era. Even Erlang wasn't multicore from the start, SMP was added in 2005.

1 more reply

feld11y ago· 2 in thread

Seems like a major feature to put in a point release...

I don't understand why some projects have such bizarre versioning methodology.

LeonidasXIV11y ago

So OCaml 4.03.0 is basically 4.3.0 in a Python-esque versioning scheme (remember how many changes were done between Python 2.2.0 and 2.7.0?).

feld11y ago

Thanks for clearing that up

timruffles11y ago· 2 in thread

Great! This, plus the lack of libraries, put me off. Will take a new look!

LeonidasXIV11y ago

You'll be delighted to hear that OPAM curently features >800 libraries, too.

kristianp11y ago

Any libraries in particular?

avsm11y ago

If you'd like to see the approach we're taking at OCaml Labs in order to build multicore, read KC's blog post here:

http://kcsrk.info/ocaml/multicore/2015/05/20/effects-multico...

tempodox11y ago

Yupeee!

j / k navigate · click thread line to collapse