^
I didn't mean splitting llama itself across machines (though that is a thing with llama.cpp), but rather a pool of clients and servers that make requests and process them:
https://lite.koboldai.net/
A few users with half-decent PCs can serve a much larger group of people, and the "lesser" hosts can serve smaller models to "earn" access to the larger ones.
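The pool idea can be sketched roughly like this (a hypothetical toy model, not the actual KoboldAI Horde API; the class names, the flat 10-kudos reward, and the priority rule are all made up for illustration): clients drop prompts into a shared queue, volunteer workers poll it and generate text, and serving jobs earns "kudos" that bumps your own requests up the queue.

```python
class Pool:
    """Toy volunteer-compute pool: serve others' jobs, earn queue priority."""

    def __init__(self):
        self.queue = []   # pending (user, prompt) jobs
        self.kudos = {}   # user -> credit earned by hosting

    def submit(self, user, prompt):
        # Jobs from users with more kudos jump ahead in line.
        self.queue.append((user, prompt))
        self.queue.sort(key=lambda job: -self.kudos.get(job[0], 0))

    def poll(self, worker):
        # A volunteer host picks up the next job and earns kudos for it.
        if not self.queue:
            return None
        user, prompt = self.queue.pop(0)
        self.kudos[worker] = self.kudos.get(worker, 0) + 10  # arbitrary reward
        return f"completion for {user!r}: {prompt} ... (served by {worker})"
```

The real Horde adds model metadata, job matching, and anti-abuse checks on top, but the core loop is just this: contribute compute, get priority back.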