CUDA yes. Cg yes. HLSL yes. Verilog yes. VHDL yes. C++ or Java with MapReduce yes. PHP and MySQL with memcached yes. Erlang yes, and it really is functional inside each process, which is exactly the level where you aren't getting any parallelism. Octave or R, potentially, but not today, as far as I know. Mathematica yes, and it, too, is mostly functional.
In theory, side effects are what make parallelism hard, so languages whose semantics are side-effect-free (which F#, Mathematica, and Erlang are not) should make it easy. Or so we all thought in 1980. We then spent twenty years or so trying to make that happen, and it basically didn't work.
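To make that concrete, here's a minimal C sketch (f, a, and N are made up for illustration, and the OpenMP pragma assumes you compile with something like -fopenmp): the loop parallelizes safely only because f touches nothing but its argument.

    #include <stdio.h>

    #define N 1000000
    static double a[N];

    /* A pure function: it reads and writes nothing but its argument. */
    static double f(double x) { return x * x + 1.0; }

    int main(void) {
        for (int i = 0; i < N; i++) a[i] = i;

        /* Because f has no side effects, the iterations are independent,
           so they can run in any order, or all at once.  OpenMP just
           makes that freedom explicit: */
        #pragma omp parallel for
        for (int i = 0; i < N; i++)
            a[i] = f(a[i]);

        printf("%f\n", a[N - 1]);
        return 0;
    }

If f instead appended to a log or bumped a shared counter, running the iterations concurrently would change the program's behavior, and no compiler could make this transformation for you. That's the sense in which side effects are the obstacle.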
There are basically four kinds of parallelism within easy reach today. There's SIMD, like MMX, SSE, 3DNow!, AltiVec, and the like; you'd think that data-parallel languages and libraries like NumPy and Octave would be all over this, but except for Mathematica, that doesn't seem to be happening (there's a hand-written example after this paragraph). There's running imperative code on a bunch of tiny independent processors that share no data; AFAIK that's what the shader languages are doing. There's instruction-level parallelism on a superscalar processor, which benefits mostly from things like contiguous arrays in memory; or maybe what Sun is doing with Niagara, where the processor pretends to be a bunch of tiny, slow, independent processors. And then there's splitting up your data across a shared-nothing cluster, which is how every high-traffic web site works, and which is what MapReduce makes simpler.
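For a taste of what the first kind looks like when you write it by hand, here's a minimal SSE sketch in C (x86-specific; the function name and the assumption that n is a multiple of 4 are mine): one instruction performs four additions, which is exactly the sort of code a data-parallel language could be emitting for you.

    #include <xmmintrin.h>  /* SSE intrinsics, x86 only */

    /* Add two float arrays four lanes at a time; assumes n % 4 == 0. */
    void add_arrays(float *dst, const float *x, const float *y, int n) {
        for (int i = 0; i < n; i += 4) {
            __m128 a = _mm_loadu_ps(&x[i]);            /* load 4 floats */
            __m128 b = _mm_loadu_ps(&y[i]);
            _mm_storeu_ps(&dst[i], _mm_add_ps(a, b));  /* 4 adds at once */
        }
    }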
Uh, and then there's designing your own hardware or programming FPGAs, which is what Verilog and VHDL are for.
Languages like OCaml (I don't know anything about F# except that it's like OCaml, but for the CLR) have no special advantage in any of these scenarios. They don't even have the theoretical advantage of being side-effect-free, which would let you speculatively multithread them without breaking the language semantics. And in most of the scenarios I described they have serious practical disadvantages: they need unpredictable amounts of memory, they carry massive libraries, and they use pointers all over the place. Using pointers all over the place kills your locality of reference and your ILP. Massive libraries and unpredictable memory use make it impossible to run them inside your GPU, and mean they can't run on an FPGA (except by using external memory, like the awesome Reduceron). And nothing about the language semantics helps with SIMD either.
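Here's a minimal C sketch of that locality/ILP point (the types and names are mine, not any particular runtime's): both functions compute the same sum, but the first has the pointer-chasing layout a boxed, garbage-collected heap tends to produce, while the second has the contiguous layout superscalar hardware likes.

    #include <stddef.h>

    /* Pointer-chasing: each load depends on the previous one, so the
       pipeline stalls, and every node is a potential cache miss. */
    struct node { int value; struct node *next; };

    int sum_list(const struct node *p) {
        int s = 0;
        for (; p != NULL; p = p->next)
            s += p->value;
        return s;
    }

    /* Contiguous: addresses are predictable, the prefetcher keeps the
       cache warm, and the out-of-order core gets real ILP to exploit. */
    int sum_array(const int *a, size_t n) {
        int s = 0;
        for (size_t i = 0; i < n; i++)
            s += a[i];
        return s;
    }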
So, sheesh, go read Alan Bawden's dissertation or whatever, but don't go around claiming that ML (or even Haskell) is going to magically make your algorithms parallel. We tried that. It didn't work. We're trying something else now.
And please, PHP and MySQL? For computation? Are you serious?