tarruda
1mo ago
I'm only interested in the local, single-user use case. Plus, I use a Mac Studio for inference, so vLLM is not an option for me.
mycall
1mo ago
You can still get concurrency gains [0] in a local, single-user (multi-agent) use case by running vLLM on your Mac Studio.
[0] https://youtu.be/Ze5XLooTt6g?t=658