
0 points · schaefer · 2y ago · 0 comments

The Jetson AGX Orin Developer Kit [1] has 64 GB of unified 256-bit LPDDR5 memory.

It costs $2,000 and might get some people someplace interesting.

[1]: https://developer.nvidia.com/embedded/learn/getting-started-...


5 comments · 4 top-level

gautamcgoel · 2y ago · 1 in thread

LPDDR5 doesn't have nearly as much memory bandwidth as GDDR6.

skavi · 2y ago

Per chip? Not the full story when discussing a system which can integrate multiple. The Orin has more memory bandwidth than an RTX 4050 even though the latter uses GDDR6. The M3 Max has double the bandwidth of the Orin, but also uses LPDDR5.
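To make the per-chip versus per-system point concrete, here is a rough sketch of the peak-bandwidth arithmetic: bus width in bytes times the per-pin data rate. Only the Orin's 256-bit bus comes from this thread; the other bus widths and all data rates below are assumptions taken from public spec sheets (the RTX 4050 entry is the laptop part, the M3 Max entry is the top memory configuration), so treat the output as theoretical peaks, not measurements.

```python
def peak_bandwidth_gbs(bus_width_bits: int, data_rate_mtps: float) -> float:
    """Theoretical peak bandwidth in GB/s: bytes per transfer times transfers per second."""
    bytes_per_transfer = bus_width_bits / 8
    return bytes_per_transfer * data_rate_mtps / 1000  # MB/s -> GB/s


# (bus width in bits, data rate in MT/s per pin).
# ASSUMED values from public spec sheets, not from the thread,
# except the Orin's 256-bit bus width.
SYSTEMS = {
    "Jetson AGX Orin (LPDDR5)": (256, 6400),
    "RTX 4050 Laptop (GDDR6)": (96, 16000),
    "M3 Max, top config (LPDDR5)": (512, 6400),
}

BANDWIDTH = {name: peak_bandwidth_gbs(*spec) for name, spec in SYSTEMS.items()}

for name, gbs in BANDWIDTH.items():
    print(f"{name}: {gbs:.1f} GB/s")
```

Under those assumptions GDDR6 is about 2.5x faster per pin, but the narrow 96-bit bus more than cancels that out, and doubling the LPDDR5 bus width doubles the total.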

wmf · 2y ago

Orin is kind of expensive for what it does. I think you'd be better off with a Mac Studio for $2,400 at this point.

cma · 2y ago

5 FP32 TFLOPS. If you're not doing sparse low-precision inference, that is about in line with a mid-to-high-end 2014 Nvidia consumer card (GTX 980), now a decade old.

For running sparsified/quantized Llama 2 it might be good; not sure about fine-tuning. I didn't see any FP16 numbers.
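For reference, a back-of-the-envelope version of the FP32 comparison: peak throughput is roughly CUDA cores times 2 FLOPs per cycle (one fused multiply-add) times clock speed. The core counts and clocks below are assumptions from public spec sheets, not figures from this thread, and real workloads land well under these peaks.

```python
def peak_fp32_tflops(cuda_cores: int, clock_ghz: float) -> float:
    """Theoretical peak FP32 TFLOPS, counting one FMA (2 FLOPs) per core per cycle."""
    flops_per_cycle = cuda_cores * 2
    return flops_per_cycle * clock_ghz / 1000  # GFLOPS -> TFLOPS


# (CUDA cores, clock in GHz). ASSUMED values from public spec sheets:
# Orin at its maximum GPU clock, GTX 980 at its reference boost clock.
GPUS = {
    "Jetson AGX Orin 64GB": (2048, 1.3),
    "GTX 980": (2048, 1.216),
}

TFLOPS = {name: peak_fp32_tflops(*spec) for name, spec in GPUS.items()}

for name, tflops in TFLOPS.items():
    print(f"{name}: {tflops:.2f} FP32 TFLOPS")
```

With those inputs the two land within about 10% of each other, which is where the "in line with a GTX 980" estimate comes from.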

eurekin · 2y ago

Thanks! That is very interesting.

Here's a direct amazon link: https://www.amazon.com/dp/B0BYGB3WV4

And a running demo: https://forums.developer.nvidia.com/t/llama-2-llms-w-nvidia-...
