2. A collection of reproducible LLM inference engine benchmarks: SGLang vs. vLLM (github.com) GitHub | 1 point by zhwu, 1y ago | 0 comments
3. Efficient GPU Resource Management for ML Workloads Using SkyPilot, Kueue on GKE (github.com) GitHub | 2 points by zhwu, 1y ago | 0 comments
4. New Recipe: Serving Llama-2 with VLLM's OpenAI-Compatible API Server (github.com) GitHub | 1 point by zhwu, 2y ago | 0 comments
7. Serving LLM 24x Faster on the Cloud with VLLM and SkyPilot (blog.skypilot.co) | 12 points by zhwu, 2y ago | 1 comment
8. Biologists are moving to the clouds with SkyPilot from UC Berkeley (twitter.com) | 5 points by zhwu, 3y ago | 0 comments
9. Vicuna releases its secret of finding available A100s on the cloud to train it (twitter.com) | 4 points by zhwu, 3y ago | 2 comments