1Benchmarking LLM Inference Back Ends: VLLM, LMDeploy, MLC-LLM, TensorRT-LLM, TGI (opens in new tab)(bentoml.com)15chaoyu1y ago1
2Show HN: ML Serving orchestration framework on Kubernetes (opens in new tab)(github.com)2chaoyu3y ago0
3BentoML: The easiest way to build Machine Learning APIs (opens in new tab)(github.com)4chaoyu5y ago1