Any reason you're doing that vs. using Lambda Labs / Replicate / together.ai / Banana.dev, etc.?
There are a lot of good model deployment platforms that would make it easy to call your model behind a hosted endpoint.
--
If you do want to self-host - there are some great libraries like https://github.com/lm-sys/FastChat and https://github.com/ggerganov/llama.cpp that might be helpful.
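Either way, a lot of these options (FastChat's API server, and hosted platforms too) can expose an OpenAI-style chat completions endpoint, so the client code stays roughly the same whichever route you pick. A minimal sketch, assuming an OpenAI-compatible API - the `base_url` and model name are placeholders, not anything specific to your setup:

```python
import json
import urllib.request


def build_chat_request(prompt: str, model: str = "vicuna-7b") -> dict:
    # OpenAI-style chat payload; the model name is a placeholder
    return {"model": model, "messages": [{"role": "user", "content": prompt}]}


def extract_reply(response_json: dict) -> str:
    # Pull the assistant's text out of an OpenAI-style response
    return response_json["choices"][0]["message"]["content"]


def chat(base_url: str, prompt: str) -> str:
    # base_url is wherever your endpoint lives,
    # e.g. "http://localhost:8000/v1" for a local FastChat API server
    req = urllib.request.Request(
        f"{base_url}/chat/completions",
        data=json.dumps(build_chat_request(prompt)).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return extract_reply(json.loads(resp.read()))
```

If you later swap self-hosting for a hosted platform (or vice versa), only `base_url` and the model name need to change.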
If none of these solves your issue - feel free to email me and I'm happy to help you figure something out - krrish@berri.ai