Show HN: Prometeo – a Python-to-C transpiler for high-performance computing

(github.com)

166 points · zanellia · 4y ago · 140 comments

93 comments · 30 top-level

lvass · 4y ago · 11 in thread

Cython, pypy, micropython, nuitka, shedskin, ironpython, graalpython, jython, mypyc, pyjs, skuptjs, brython, activepython, stackless, transcrypt, cinder and many more I don't remember.

They're all practically useless or delegated to specific tasks. At this point you'd need to present incredible evidence that an alternative compiler can be useful. Personally I find it comical how many developers are still eluded by a promise of performant python. I hope you achieve your goals, good luck.

zanellia (OP) · 4y ago

The point of prometeo is not to obtain a "performant Python". Python is used merely as a host language for an embedded domain specific language. You could do the same thing with any other language with a mature library for AST analysis :)
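A minimal sketch of what "host language plus AST analysis" can look like, using Python's standard `ast` module. This is not prometeo's code: the helper names `emit_expr` and `emit_function`, the type table, and the restriction to single-`return` functions are all assumptions made for illustration.

```python
import ast

# Hypothetical mapping from Python annotations to C types (illustration only).
C_TYPES = {"int": "int", "float": "double"}
# Only operators whose meaning carries over directly to C are accepted.
C_OPS = {ast.Add: "+", ast.Sub: "-", ast.Mult: "*"}


def emit_expr(node):
    """Translate a small arithmetic expression tree into C source text."""
    if isinstance(node, ast.Name):
        return node.id
    if (isinstance(node, ast.Constant)
            and isinstance(node.value, (int, float))
            and not isinstance(node.value, bool)):
        return repr(node.value)
    if isinstance(node, ast.BinOp) and type(node.op) in C_OPS:
        left, right = emit_expr(node.left), emit_expr(node.right)
        return f"({left} {C_OPS[type(node.op)]} {right})"
    raise NotImplementedError(f"unsupported expression: {type(node).__name__}")


def emit_function(source):
    """Translate one fully annotated, single-return function into C."""
    func = ast.parse(source).body[0]
    if not isinstance(func, ast.FunctionDef):
        raise NotImplementedError("expected a function definition")
    if len(func.body) != 1 or not isinstance(func.body[0], ast.Return):
        raise NotImplementedError("expected a body made of a single return")
    params = ", ".join(
        f"{C_TYPES[arg.annotation.id]} {arg.arg}" for arg in func.args.args
    )
    ret_type = C_TYPES[func.returns.id]
    body = emit_expr(func.body[0].value)
    return f"{ret_type} {func.name}({params}) {{ return {body}; }}"


print(emit_function(
    "def axpy(a: float, x: float, y: float) -> float:\n    return a * x + y\n"
))
```

The input stays valid Python, so it can still be run and debugged in the interpreter; anything outside the tiny supported subset is rejected instead of being translated incorrectly.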

lvass · 4y ago

Which makes this thread's title at least confusing.

Most of the ones you list require dynamic linking and so are hard to use in specialized environments.

His project seems to generate generic C code, which is much easier to port to any weird platform. In fact, it might be perfect for my use case, where dynamic linking is just extra attack surface.

I understand that the project is still in the early stages, but I will be paying close attention to it. If at some point it will be possible to write "regular" Python in it (minus most of the standard library and imports), then it could be a candidate for an edge computing platform.

m_ke · 4y ago

Numpy, Numba and PyTorch seem to be doing ok.

gh02t · 4y ago

Cython is also pretty successful, obviously, though I don't think it quite fits in OP's list given that it's more about writing extensions than replacing your entire Python code/stack. But I do agree with OP's sentiment, even as someone who writes a lot of Python.

Der_Einzige · 4y ago

Not sure what you mean by "eluded by a promise of performant python". Tensorflow and Pytorch do a great job thank you very much.

so much effort to match the performance of lower level languages that it would have actually been easier to use those directly :)

Zababa · 4y ago

I'm not sure, most people aren't writing ASM these days because the compilers are good enough for most cases. Compilers are great.

Tozen · 4y ago

Good point. And you don't have to go that low. Maybe go use Object Pascal, Nim, or Vlang. I know... the libraries. But a lot of them are bindings of C libraries. So, you can create bindings in other languages too or use Python from those languages. There are various options.

zanellia (OP) · 4y ago

I would disagree on "easier" :) Ever spent half a day debugging a segfault?

staticautomatic · 4y ago

SpaCy is pretty incredible evidence.

OulaX · 4y ago · 9 in thread

Each programming language has its purpose.

C code is performant and that is a fact. Python code is not.

When building mission critical systems why don't programmers just use C itself instead of coding in another programming language and having it transpiled for them? Why introduce such tools all the time?

I am against this because the tools programmers use are becoming too bloated compared to 10-20 years ago.

Want to build an Android App? Use Java/Kotlin.

Want to build an iOS App? Use Swift.

Want to build a Web App? Use a Single JS Framework (Why millions of frameworks?)

Want to build a Windows Desktop App? Use C#.NET Either with WinForms or WPF.

I really see tools and technologies coming up all the time to solve a problem that most of the time doesn't exist.

packetlost · 4y ago

You've clearly never been near a lab environment. Python is a dominant language in university labs and runs a lot more real-time systems than you think. Grad students rarely have industry experience and don't necessarily have the know-how to write C code effectively, so it's a question of resources and ecosystem. NumPy, matplotlib, pandas, scikit, TensorFlow, etc. are all huge draws for the scientific and ML communities.

PaulDavisThe1st · 4y ago

> runs a lot more real-time systems than you think

soft real time systems, for sure. if it runs any hard real time systems, get out of the lab.

We could of course debate the boundary between hard and soft, but I'd rather not.

zanellia (OP) · 4y ago

I think many people who have at least once first prototyped a numerical algorithm in a high-level language (say Python, Julia, MATLAB?) and then implemented it in C, can relate to the experience of transitioning from error messages of the type: "dimension mismatch for XYZ" to "segmentation fault". That's in my opinion a strong motivation to build tools that can automate certain parts of the development process.

Writing C code directly is a good option, as long as your code is not too complex to develop, maintain and extend.

And, again, Python here is intended to be the host language for an embedded domain specific language that gets compiled into C. It does not need to be efficient; it needs to be expressive and easy to analyse and transpile.
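The contrast between a "dimension mismatch" message and a segmentation fault can be illustrated with a plain-Python matrix product that validates shapes before touching any element. This is only a sketch: `matmul` is a hypothetical helper working on lists of lists, not part of prometeo or of any library.

```python
def matmul(a, b):
    """Multiply two matrices stored as lists of rows, checking shapes first."""
    rows_a, cols_a = len(a), len(a[0])
    rows_b, cols_b = len(b), len(b[0])
    if cols_a != rows_b:
        # The failure is reported in terms of the math, not of memory addresses.
        raise ValueError(
            f"dimension mismatch: ({rows_a}x{cols_a}) @ ({rows_b}x{cols_b})"
        )
    return [
        [sum(a[i][k] * b[k][j] for k in range(cols_a)) for j in range(cols_b)]
        for i in range(rows_a)
    ]


A = [[1.0, 2.0], [3.0, 4.0]]          # 2x2
B = [[1.0, 0.0, 0.0]]                 # 1x3, incompatible with A

try:
    matmul(A, B)
except ValueError as err:
    print(err)
```

The equivalent hand-written C loop would typically read past the end of `B` without any check, which is where the "segmentation fault" experience described above tends to come from.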

adgjlsfhk1 · 4y ago

Note that the whole point of Julia is that it saves you the rewrite. There is Julia code running on top supercomputers that gives speed competitive to C/C++/Fortran. You will have to put in some work to get Julia code to be that fast, but it is usually dramatically easier than a rewrite in a different language.

klyrs · 4y ago

The problem that this language solves is that it automatically sorts out the memory usage for you. That isn't a problem for me; I've been programming in C for decades. But it is a problem for most python programmers who don't have a lick of C experience, but want to get C performance. It drastically lowers the barrier of entry.

zanellia (OP) · 4y ago

For what it's worth, I have developed code for this kind of application exclusively in C for ~5 years (let's say 20% of my working time). I still think that debugging a segfault that you could have avoided is not very productive, and that motivated me to look into possible alternatives.

GekkePrutser · 4y ago

Yes and it will also prevent common memory management bugs that can lead to code injection.

GekkePrutser · 4y ago

It doesn't have to be production. Maybe it's for a research project where you just need the extra performance.

Everything has a cost. This may not be ideal but learning to do C properly as an experienced Python dev will have a time cost as well. This may just be the best way to get from A to B.

I remember when I did a one-off project with a PIC microcontroller. I only had an assembler and I spent 2 days getting nowhere.

Then I found a C compiler and I had the whole thing running in 2 hours. The compiler turned out to be much more efficient in speed as well as code size than my hand-written assembler.

Zababa · 4y ago

> When building mission critical systems why don't programmers just use C itself instead of coding in another programming language and having it transpiled for them?

Why C and not assembler?

> Why introduce such tools all the time?

C compilers are one of those tools.

adsharma · 4y ago · 3 in thread

This is awesome! The direction of using a subset of python, while leveraging the user base and static typing to accomplish some other everyday task in a different language is very legit IMO.

I took a cursory look at:

https://github.com/zanellia/prometeo/blob/master/prometeo/cg...

It seems quite similar in spirit to

https://github.com/adsharma/py2many/blob/main/pyrs/transpile...

I haven't been spending much time on py2many the last few months (started a new job). Let me know if any of it sounds useful - especially the ability to transpile to 7-8 languages including Julia, C++ and Rust.

zanellia (OP) · 4y ago

Pretty cool! How do you manage multiple target languages with a single AST parser? Do you use an intermediate AST?

No intermediate AST. To understand the various stages of transpilation and separation of language specific and independent rewriters, this file is a good starting point:

https://github.com/adsharma/py2many/blob/main/py2many/cli.py...

pure_simplicity · 4y ago

That's a good question to put in the discussion tab on their GitHub repo, that way others that are interested can find it too :)

pure_simplicity · 4y ago · 3 in thread

Have you considered using Nim? It's a great language that has some similarities to python and compiles to C code. It is a very convenient and powerful language to use, coming from Python.

zanellia (OP) · 4y ago

I did look into Nim, but, given Python's maturity/popularity, I decided to stick with it as host language for the DSL.

pure_simplicity · 4y ago

Well, you give yourself a whole lot of extra work by attempting to create a Python to C transpiler, basically creating an even less mature and less popular language ecosystem than Nim has, when you could use Nim and get most of the benefits (statically typed, compiled to C) out of the box and enjoy their growing ecosystem. It seems that Prometeo is currently more limited than Nim.

Maybe I'm not understanding your use case well enough and maybe your approach is actually a locally optimal solution, and I honestly greatly respect your effort and want to see your project succeed, cuz I love Python too. I even wanted to make a faster python myself at some point. Just, the more I learn about Nim, the more I appreciate its design decisions and think to myself: this is the faster version of python I'm looking for. Now, we just need the large package ecosystem that python has, but I'm willing to both wait and participate in making it come about.

This presentation about writing keyboard firmware in Nim may be helpful, if you're willing to give Nim some more consideration:

https://youtu.be/dcHEhO4J29U

There is also another talk about embedded programming in Nim from the same conference, here:

https://youtu.be/rlZ4ALGAU1M

fault1 · 4y ago

How about zig? From what I've heard, zig is supposed to be a "better" drop in replacement for C.

rich_sasha · 4y ago · 3 in thread

Soo... it takes Python syntax and produces a C program, with no links back to Python - is that right? It uses a strict subset of Python, so that Prometeo programs are valid Python, but not necessarily the opposite. Is that fair?

Do you envisage this being a conduit for tight loop optimisation in Python? Or is it rather "you'd like a C program but can't write C good"?

And if the former, how do you compare to Nuitka and Cython? I read your README but couldn't quite make sense of it :)

zanellia (OP) · 4y ago

> Soo... it takes Python syntax and produces a C program, with no links back to Python - is that right? It uses a strict subset of Python, so that Prometeo programs are valid Python, but not necessarily the opposite. Is that fair?

yep

> Do you envisage this being a conduit for tight loop optimisation in Python? Or is it rather "you'd like a C program but can't write C good"?

There are already plenty of options for calling high-performance libraries from Python. Now 1) interpreting Python programs that use, e.g., NumPy, can be slow. 2) Compiling these programs using, e.g., Cython or Nuitka, can speed up the code across calls to high-performance libraries, but the resulting code will still rely on the Python runtime library, which can be slow/unreliable in an embedded context.

Coming to the second part of the question, writing C code directly is definitely an option, but, after doing a bit of that, I realized how tedious/error prone it is to develop/maintain/extend relatively complex code bases for embedded scientific computing (e.g. this one https://github.com/acados/acados). Or, to put it as Bjarne Stroustrup once said, "fiddling with machine addresses and memory is rather unpleasant and not very productive". The good news seemed to be that many of the code structures necessary to write that type of code are rather repetitive and can hopefully be generated automatically to some extent.

> And if the former, how do you compare to Nuitka and Cython? I read your README but couldn't quite make sense of it :)

This table (from the README) shows some computation times for Nuitka, prometeo, Python and PyPy.

CPU times in [s]:

Python 3.7 (CPython): 11.787
Nuitka: 10.039
PyPy: 1.78
prometeo: 0.657

Other than performance, the main difference is, again, the runtime library dependency.
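To put the quoted CPU times in relative terms, the ratios can be computed directly. The numbers below are copied from the comment above; the helper name `slowdown_vs` is made up for this sketch.

```python
# CPU times in seconds, as quoted from the README in the comment above.
cpu_times = {
    "CPython 3.7": 11.787,
    "Nuitka": 10.039,
    "PyPy": 1.78,
    "prometeo": 0.657,
}


def slowdown_vs(baseline, times):
    """Return each entry's CPU time divided by the baseline's CPU time."""
    return {name: t / times[baseline] for name, t in times.items()}


for name, ratio in slowdown_vs("prometeo", cpu_times).items():
    print(f"{name}: {ratio:.1f}x the prometeo time")
```

On these figures CPython takes about 18x, Nuitka about 15x and PyPy about 2.7x the prometeo time, for this one benchmark only.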

rich_sasha · 4y ago

Right. Gotcha. So Prometeo isn't another "make Python fast again" project, but rather an orthogonal effort to write fast (C) programs, but in a high-level Python-like language. Thanks.

BBC-vs-neolibs · 4y ago

And Cython? (Not CPython)

savant_penguin · 4y ago · 3 in thread

For a matrix size 50 it beats Julia by a factor of 10 wow

https://github.com/zanellia/prometeo/blob/master/benchmarks/...

adgjlsfhk1 · 4y ago

This isn't that meaningful since the "Julia" version is just calls to OpenBlas/LAPACK which are known to have relatively high overhead for small matrices. I'd be much more interested in seeing a comparison vs LoopVectorization/StaticArrays which are Julia libraries specifically optimized for small matrices.

zanellia (OP) · 4y ago

Good point, it should be easy to add Julia to the Fibonacci benchmark. Here is the Python code https://github.com/zanellia/prometeo/blob/master/examples/fi...

shele · 4y ago

The Julia code has some performance flaws in the hot loop (like making a superfluous copy of a matrix before just reading it).

zcw100 · 4y ago · 3 in thread

Have you thought of targeting WebAssembly? If you're going from Python/prometeo -> C you could always make the extra step of Python/prometeo -> C -> WASM, but I wonder if there would be an advantage to skipping the intermediate C.

zanellia (OP) · 4y ago

Python to ASM would actually be really cool and would guarantee performance gains for small matrices, but it would require quite some implementation effort. Not sure about WASM.

Why WASM? It would be a pessimization compared to just transpiling to C if performance is the goal. WASM also is restricted to 128-bit vector instructions.

zcw100 · 4y ago

Because wasm doesn't support Python and it might be nice to be able to write WASM in a Python like language.

up6w6 · 4y ago · 3 in thread

Yes, I'm gonna talk about Julia...

It's kind of sad how much effort is put into creating new Python compilers to make it slightly faster, while the compile latency that people hate about Julia is not tackled because of the lack of manpower to improve Julia's interpreter.

https://youtu.be/IlFVwabDh6Q?t=2530 (tl;dr: the Julia interpreter is currently about 500x slower than JIT-compiled code, and there is a lot of low-hanging fruit that could easily give it a 10x speedup; this could make it more viable to switch between compiler and interpreter depending on the work)

zanellia (OP) · 4y ago

Personally, I think Julia is great - just don't know it well enough to write a package that takes Julia ASTs and generate C code from them :) There could totally be a Julia implementation of the main idea behind prometeo (Julia per se does not solve the problem that prometeo aims at solving).

adgjlsfhk1 · 4y ago

You can just use `@code_llvm` to generate LLVM code, or `@code_native` to generate assembly. Does that do what you need?

fault1 · 4y ago

the problem with Julia in the use case of OP is really the fact that it is garbage collected (and perhaps also how its GC is tuned). You can work to eliminate allocations, but the memory determinism problem is more important in real time control and embedded systems. see for example, this video: https://www.youtube.com/watch?v=dmWQtI3DFFo

It's kind of why C is still king in this space.

chriswarbo · 4y ago · 2 in thread

Very interesting! What are the similarities/differences compared to RPython (as used by PyPy)?

https://rpython.readthedocs.io/en/latest/rpython.html

loeg · 4y ago

Looks like RPython is a bigger language that doesn’t target an embedded use case without a Python runtime. Though I may be mistaken - I am not super familiar with RPython.

chrisseaton · 4y ago

RPython programs can be compiled to a standalone executable without a Python runtime - it's what PyPy is written in, for example.

sys_64738 · 4y ago · 2 in thread

Seems like Python-to-C++ would translate more of the language to like-for-like concepts more easily.

zanellia (OP) · 4y ago

Right, for sure I would not need to re-invent the machinery to translate a class into a glorified C struct. The whole thing started with C in mind for portability arguments, but it might be a good idea to keep an eye on C++ as an option.

4w4s · 4y ago

But some "embedded platform" tool-chains do not support C++.

4w4s · 4y ago · 2 in thread

Nice job! Is this aimed at single core/thread computations, or is the prometeo layer also a way to write basic parallel code in a more "user friendly" way?

zanellia (OP) · 4y ago

For the time being, it targets single core/thread applications only.

Do you have access to builtins and intrinsics? Are there any plans?

The single threaded thing is not an issue because you can still call the same function on each CPU and use the CPU ID to target parts of the computation, like a compute kernel function.
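The "same function on each CPU, indexed by CPU ID" pattern amounts to each worker deriving its own slice of the data from its ID. A minimal sketch follows, with the workers simulated sequentially; `chunk_bounds` and `partial_sum` are hypothetical names, and how the function actually gets launched on each core is left out entirely.

```python
def chunk_bounds(cpu_id, n_cpus, n_items):
    """Return the half-open index range [start, stop) owned by one CPU."""
    base, extra = divmod(n_items, n_cpus)
    # The first `extra` CPUs take one additional item each.
    start = cpu_id * base + min(cpu_id, extra)
    stop = start + base + (1 if cpu_id < extra else 0)
    return start, stop


def partial_sum(data, cpu_id, n_cpus):
    """The single-threaded kernel: it only touches this CPU's slice."""
    start, stop = chunk_bounds(cpu_id, n_cpus, len(data))
    return sum(data[start:stop])


data = list(range(10))
n_cpus = 4
# Simulate the four CPUs one after another and combine their results.
partials = [partial_sum(data, cpu_id, n_cpus) for cpu_id in range(n_cpus)]
print(partials, sum(partials))
```

Because the slices are disjoint and cover the whole input, the kernel itself needs no locking; only the final combination step has to gather the partial results.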

amkkma · 4y ago · 2 in thread

Regarding all the questions about Julia:

There's ongoing work to reduce runtime dependencies of Julia (for example in 1.8, you can strip out the compiler and metadata), but then it's only approaching Go/Swift and other static languages with runtimes.

Generating standalone runtime free LLVM is another path, that is actually already pretty mature as it's what is being done for the GPU stack.

Someone just has to retarget that to cpu LLVM, and there's a start here: https://github.com/tshort/StaticCompiler.jl/issues/43

zanellia (OP) · 4y ago

That's quite cool. Maybe the whole thing can be rewritten in Julia too at some point. I just know too little about Julia to judge.

amkkma · 4y ago

Well, IMO it can definitely be rewritten in Julia, and more easily than in Python, since Julia allows hooking into the compiler pipeline at many levels of the stack. It's lispy and built from the ground up for codegen, with libraries like Metatheory.jl (https://github.com/JuliaSymbolics/Metatheory.jl) that provide high-level pattern matching with e-graphs. The question is whether it's worth your time to learn Julia to do so.

You could also do it at the LLVM level: https://github.com/JuliaComputingOSS/llvm-cbe

One cool use case is https://github.com/JuliaLinearAlgebra/Octavian.jl, which relies on LoopVectorization.jl to do transforms on the Julia AST beyond what LLVM does. Because of that, Octavian.jl, a pure Julia linear algebra library, beats OpenBLAS on many benchmarks.

xapata · 4y ago · 2 in thread

That's a compiler. I don't understand the desire to create a new word when the old one is fine.

zanellia (OP) · 4y ago

"A program that translates between high-level languages is usually called a source-to-source compiler or transpiler" from https://en.wikipedia.org/wiki/Compiler.

xapata · 4y ago

Yes, a compiler. The fact there's a note in Wikipedia doesn't change my view.

cycomanic · 4y ago · 2 in thread

How does it compare to pythran? Except for the fact that it's c and not c++?

zanellia (OP) · 4y ago

Not sure how easy it would be to make the code generated by Pythran standalone, i.e., no dependency on the Python runtime library. Any Pythran expert? :)

cycomanic · 4y ago

Pythran code is standalone, i.e. no dependency on the Python runtime AFAIK.

pella · 4y ago · 1 in thread

Nice project.

small comment - related to the benchmarks:

- in Julia: there is a newer Riccati solver (in a package)

https://github.com/andreasvarga/MatrixEquations.jl/blob/mast...

https://github.com/andreasvarga/MatrixEquations.jl

giaf · 4y ago

The benchmark in prometeo is a discrete-time Riccati recursion (as opposed to a continuous-time Riccati equation) algorithm. And it is the exact same algorithm implemented in all languages, making the comparison fairer, as the only variable is the implementation itself.
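For readers unfamiliar with the term, a discrete-time Riccati recursion is a backward sweep over the horizon that updates a cost matrix P. The sketch below uses the scalar case so that it stays in plain Python; it is not the benchmark code, and the function name and parameter values are assumptions chosen for illustration.

```python
def riccati_recursion(a, b, q, r, p_terminal, horizon):
    """Scalar discrete-time Riccati recursion for x+ = a*x + b*u.

    Stage cost is q*x^2 + r*u^2. Starting from the terminal weight,
    each backward step computes
        k = (b*p*a) / (r + b*p*b)
        p = q + a*p*a - a*p*b*k
    """
    p = p_terminal
    for _ in range(horizon):
        k = (b * p * a) / (r + b * p * b)   # feedback gain at this stage
        p = q + a * p * a - a * p * b * k   # cost-to-go weight one step earlier
    return p


# With a = b = q = r = 1 the fixed point satisfies p^2 = p + 1.
print(riccati_recursion(a=1.0, b=1.0, q=1.0, r=1.0, p_terminal=1.0, horizon=100))
```

In the matrix case the divisions become linear solves and the products become matrix multiplications, which is why small dense linear algebra dominates the run time of this kind of benchmark.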

klyrs · 4y ago · 1 in thread

Kneejerk reaction as an enthusiastic Cython developer: "bah, another crappy (subset of Python)-to-C compiler."

After reading: this is really cool. If I understand this, I think you should be able to beat Cython without breaking a sweat. I'm quite excited to use this.

zanellia (OP) · 4y ago

hahaha thanks!

loeg · 4y ago · 1 in thread

To get ahead of the obvious question I had and I’m sure others will, this is from the README:

> Cython is a programming language whose goal is to facilitate writing C extensions for the Python language. In particular, it can translate (optionally) statically typed Python-like code into C code that relies on CPython. Similarly to the considerations made for Nuitka, this makes it a powerful tool whenever it is possible to rely on libpython (and when its overhead is negligible, i.e., when dealing with sufficiently large scale computations), but not in the context of interest here.

I.e., it’s a python-like DSL that does not depend on the Python runtime.

Thanks for sharing OP, this is pretty cool.

zanellia (OP) · 4y ago

Right, that's indeed the main reason I could not simply use Cython or Nuitka (or Julia?). The Python runtime library will do all kinds of operations that are not real-time/embedded friendly, such as garbage collection, memory allocation/de-allocation and so on, in the background.

cossatot · 4y ago · 1 in thread

I'm curious what an example use case is for scientific computing on an embedded device. Is this for real-time analysis on a data logger or something?

Many of us think of clusters as high-performance scientific computing, which are about as far from embedded as it gets.

Please note that I am not being snarky, just curious!

zanellia (OP) · 4y ago

Thanks for the question! My background is in numerical optimization for optimal control. Projects like this https://github.com/acados/acados motivated the development of prometeo. It's mostly about solving optimization problems as fast as possible to make optimal decisions in real-time.

MR4D · 4y ago · 1 in thread

Looks like this could be pretty nice.

I noticed your disclaimer at the bottom of the linked page [0], and wanted to get an idea of how far you were looking to take this. Will it go beyond maths into normal functions (string handling, etc) ? Do you eventually plan on supporting most of python - for instance, do you think I could write a web server using your tool in the future?

[0] - "Disclaimer: prometeo is still at a very preliminary stage and only few linear algebra operations and Python constructs are supported for the time being."

zanellia (OP) · 4y ago

Unfortunately, I think that writing a transpiler for general Python programs might be rather difficult without resorting to approaches used, e.g., in Cython/Nuitka. Among other things, computing the worst-case heap usage could be quite cumbersome/computationally heavy for a general program without "constraints". I'd be happy to hear what others think about the topic though.
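To see why "constraints" make a worst-case heap bound tractable, consider a toy analysis that only accepts matrix allocations with literal dimensions. This is a deliberately simplified sketch and not prometeo's actual analysis: the constructor name `pmat` is used here purely for illustration, `worst_case_heap_bytes` is a made-up helper, and loops and function calls are ignored.

```python
import ast


def worst_case_heap_bytes(source, elem_size=8):
    """Sum the bytes requested by every pmat(rows, cols) call in the source.

    Assumes each call site runs at most once and that every element takes
    `elem_size` bytes; both are simplifications made for this sketch.
    """
    total = 0
    for node in ast.walk(ast.parse(source)):
        is_pmat_call = (
            isinstance(node, ast.Call)
            and isinstance(node.func, ast.Name)
            and node.func.id == "pmat"
        )
        if not is_pmat_call:
            continue
        dims = node.args
        literal_dims = len(dims) == 2 and all(
            isinstance(d, ast.Constant) and isinstance(d.value, int) for d in dims
        )
        if not literal_dims:
            # Without literal sizes the bound cannot be read off the syntax.
            raise ValueError(
                f"line {node.lineno}: pmat dimensions must be integer literals"
            )
        total += dims[0].value * dims[1].value * elem_size
    return total


program = "A = pmat(10, 10)\nB = pmat(10, 2)\n"
print(worst_case_heap_bytes(program))
```

As soon as sizes depend on run-time values, recursion, or data-dependent loops, a purely syntactic pass like this stops being enough, which is one way to read the difficulty described above for general Python programs.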

cerved · 4y ago · 1 in thread

Looks like a cool project!

I can't speak much about the code itself or the aims of the project. Personally I would recommend more informative commit messages.

I do this myself, especially working on personal stuff, but writing commit messages that succinctly explain what each commit does is a good practice and gives a serious impression.

I often find myself hacking away and periodically going back to flesh out messages using rebase.

zanellia (OP) · 4y ago

Thanks for the suggestion. Until now it's been a lot of discussion with friends and colleagues and much less actual collaboration on code writing - I might have drifted into bad practices.

nurettin · 4y ago · 1 in thread

One major problem is that the error messages need a lot of work. Why aren't class variables and static methods accepted? I can't know that if your code just throws an exception while iterating over some dictionary.

zanellia (OP) · 4y ago

I agree that error handling is one of the main things to be improved. The problem is that in some cases the AST walker ends up in unhandled states and prometeo throws a generic exception with a line number only. Are you looking at something in particular? With basically 0 users at the moment, this kind of feedback is quite useful.
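One common way to avoid "unhandled state" failures in an AST walker is to reject unsupported node types explicitly, naming the construct along with the line. The sketch below uses the standard `ast.NodeVisitor`; the class names and the allowed-node list are assumptions for illustration and do not reflect how prometeo is implemented.

```python
import ast


class UnsupportedConstruct(Exception):
    """Raised when the source uses a construct outside the supported subset."""


class SubsetChecker(ast.NodeVisitor):
    # Hypothetical whitelist: just enough for small arithmetic functions.
    ALLOWED = (
        ast.Module, ast.FunctionDef, ast.arguments, ast.arg, ast.Return,
        ast.BinOp, ast.Name, ast.Constant, ast.Load,
        ast.Add, ast.Sub, ast.Mult,
    )

    def visit(self, node):
        if not isinstance(node, self.ALLOWED):
            line = getattr(node, "lineno", "unknown")
            raise UnsupportedConstruct(
                f"line {line}: '{type(node).__name__}' is not part of the "
                "supported subset"
            )
        # Recurse into children; each child goes through this same check.
        self.generic_visit(node)


def check(source):
    """Raise UnsupportedConstruct for the first unsupported node found."""
    SubsetChecker().visit(ast.parse(source))


check("def add(a: int, b: int) -> int:\n    return a + b\n")  # accepted

try:
    check("class A:\n    x = 1\n")
except UnsupportedConstruct as err:
    print(err)
```

Running such a check before code generation means the user sees which construct was rejected, instead of an exception raised from deep inside the generator.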

4w4s · 4y ago · 1 in thread

It seems a convenient, high-level way to use highly optimized C libraries with minimal overhead, both in terms of execution time (i.e. vs standard interpreted Python) and in terms of runtime size/complexity (see Julia).

zanellia (OP) · 4y ago

That's correct. I'd say one of the fundamental differences between the two lies in the fact that the code generated by prometeo does not depend on a runtime library (which is somewhat fundamental for embedded applications, e.g., embedded optimization). From prometeo's README:

Finally, although it does not use Python as source language, we should mention that Julia too is just-in-time (and partially ahead-of-time) compiled into LLVM code. The emitted LLVM code relies however on the Julia runtime library such that considerations similar to the one made for Cython and Nuitka apply.

throw5399375930 · 4y ago · 1 in thread

Great project, but terrible name, considering how popular Prometheus is.

zanellia (OP) · 4y ago

fair enough :p I might change it in the future.

marmaduke · 4y ago · 1 in thread

Stand-alone is a very useful concept. I don’t like deploying Python stacks much. Wouldn’t that additionally mean you could target CL, CUDA or Sycl variants of C?

zanellia (OP) · 4y ago

I'd say that's possible in principle - definitely not there at the moment though (and not even planned).

sergius · 4y ago · 1 in thread

How does this compare with Nim and MicroPython?

zanellia (OP) · 4y ago

I thought about using Nim as a host language for the DSL for a while, but then decided to rely on Python simply because it is more mature (and I had already partially figured out how to manipulate Python ASTs to generate C code).

omegalulw · 4y ago · 1 in thread

Do you have benchmarks against numpy on big computations (10-1000s)?

zanellia (OP) · 4y ago

No, that's not the timescale of interest, I would say. However, if the big chunk of computations is delegated to HPC libraries I would say that NumPy could be rather competitive there (although still not easy to embed). If instead you need to run many times the same piece of code where a large fraction is pure Python, of course, it would not change the picture with respect to the "small" computations scenario.

dom96 · 4y ago · 1 in thread

This is really cool. Just a bit of pedantry: is Python higher-level than C? If so this isn't a transpiler but a compiler :)

zanellia (OP) · 4y ago

Fair enough - it's blurred I'd say. I see C as a lower-level, and yet still high-level, language, if compared to Python :)

zanellia (OP) · 4y ago

Hi all,

prometeo is an experimental modeling tool for embedded high-performance computing. prometeo provides a domain specific language (DSL) based on a subset of the Python language that allows one to conveniently write scientific computing programs in a high-level language (Python itself) that can be transpiled to high-performance self-contained C code easily deployable on embedded devices.

The package is still rather experimental, but I hope this concept could help make the development of software for high-performance computing (especially for embedded applications) a little easier.

What do you think of it? Looking forward to receiving comments/suggestions/criticism :)

b20000 · 4y ago

just write your code in c or c++ and be done with it. if you need math libs there are plenty out there for anything you can imagine. python will go the way java went many years ago.

BBC-vs-neolibs · 4y ago

A brief comparison distinguishing it from Cython would be most welcome.
