undefined | Better HN

0 pointsselfhoster111y ago0 comments

It helps to be able to run the model locally, and currently this is slow or expensive. The challenges of running a local model beyond say 32B are real.

0 comments

1 comments · 1 top-level

rightbyte1y ago

Ye the compressed version is not nearly as good.

I would be fine though with like 10 times the wait time. But I guess consumer hardware need some serius 'ram pipeline' upgrade for big models to be run at crawl speeds.

j / k navigate · click thread line to collapse