Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
TensorRT-LLM runtime now open-source
(opens in new tab)
(github.com)
4 points
mmoskal
1y ago
1 comments
Save
Share
1 comments
1 comments · 1 top-level
top
newest
oldest
mmoskal
OP
1y ago
Previously, the "Executor" runtime was shipped as binary blobs. This is the bit that schedules requests and manages KV cache (similar to vLLM or SGLang server).
j
/
k
navigate · click thread line to collapse