1Major upgrades to Ray Serve: 88% lower latency and 11.1x higher throughput (opens in new tab)(anyscale.com)2robertnishihara1mo ago1
2SkyRL brings Tinker to your GPUs (2025) (opens in new tab)(novasky-ai.notion.site)24robertnishihara2mo ago5
3vLLM large scale serving: DeepSeek 2.2k tok/s/h200 with wide-ep (opens in new tab)(blog.vllm.ai)147robertnishihara3mo ago54
4Massively Parallel Agentic Simulations with Ray (opens in new tab)(anyscale.com)2robertnishihara8mo ago0
5Deploy DeepSeek‑R1 with VLLM and Ray Serve on Kubernetes (opens in new tab)(anyscale.com)1robertnishihara9mo ago0
6An Open Source Stack for AI Compute: Kubernetes and Ray and PyTorch and VLLM (opens in new tab)(anyscale.com)1robertnishihara9mo ago0
9AsyncFlow: An Asynchronous Streaming RL Framework for LLM Post-Training (opens in new tab)(arxiv.org)4robertnishihara10mo ago0
11Large-Scale Deployment of Ray in Tencent's Weixin AI Infrastructure (opens in new tab)(anyscale.com)2robertnishihara10mo ago0
12Uv and Ray: Pain-Free Python Dependencies in Clusters (opens in new tab)(anyscale.com)44robertnishihara10mo ago10
13Roll: Reinforcement Learning Optimization for Large-Scale Learning (opens in new tab)(github.com)1robertnishihara11mo ago0
14An Open Source Stack for AI Compute: Kubernetes and Ray and PyTorch and VLLM (opens in new tab)(anyscale.com)1robertnishihara11mo ago0
15Uv and Ray: Pain-Free Python Dependencies in Clusters (opens in new tab)(anyscale.com)1robertnishihara1y ago0