zhwu | Better HN

zhwu

37 karma | Joined October 5, 2022 | 40 submissions

Recent submissions

1

VRAM Ghost Busting: Who You Gonna Close()?

(hcompany.ai)

3 points | zhwu | 2d ago | 0 comments

2

A collection of reproducible LLM inference engine benchmarks: SGLang vs. vLLM

(github.com)

1 point | zhwu | 1y ago | 0 comments

3

Efficient GPU Resource Management for ML Workloads Using SkyPilot, Kueue on GKE

(github.com)

2 points | zhwu | 1y ago | 0 comments

4

New Recipe: Serving Llama-2 with VLLM's OpenAI-Compatible API Server

(github.com)

1 point | zhwu | 2y ago | 0 comments

5

Train Your Own Vicuna on Llama-2

(github.com)

3 points | zhwu | 2y ago | 0 comments

6

Guide on fine-tuning your own Vicuna on Llama-2

(twitter.com)

9 points | zhwu | 2y ago | 0 comments

7

Serving LLM 24x Faster on the Cloud with VLLM and SkyPilot

(blog.skypilot.co)

12 points | zhwu | 2y ago | 1 comment

8

Biologists are moving to the clouds with SkyPilot from UC Berkeley

(twitter.com)

5 points | zhwu | 3y ago | 0 comments

9

Vicuna releases its secret of finding available A100s on the cloud to train it

(twitter.com)

4 points | zhwu | 3y ago | 2 comments