Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
1M Tokens/s: Scaling Qwen 3.5 27B on 96 B200 GPUs with vLLM | Better HN
1M Tokens/s: Scaling Qwen 3.5 27B on 96 B200 GPUs with vLLM
(opens in new tab)
(medium.com)
3 points
m4r1k
1mo ago
0 comments
Share
0 comments
No comments yet.