1How to get GLM 5.2 to 280 tokens per second (opens in new tab)(baseten.co)3mikejulietbravo4d ago1Save
2Show HN: Automatically Build Nvidia TRT-LLM Engines (opens in new tab)(baseten.co)2mikejulietbravo1y ago0Save
3Show HN: 60% higher tokens per second for 70B custom LLMs (opens in new tab)(baseten.co)1mikejulietbravo1y ago0Save
4Show HN: Baseten Chains – Framework and SDK for Multi-Model AI Products (opens in new tab)(baseten.co)9mikejulietbravo2y ago5Save
5Open Source Inference Engine Baseten Raises $40M from IVP, Spark and Greylock (opens in new tab)(baseten.co)2mikejulietbravo2y ago1Save
6Launch a Personal Diffusion 2.0 Server on the Cloud (opens in new tab)(lightning.ai)4mikejulietbravo3y ago1Save
7Show HN: How to Build a Stable Diffusion text-to-image generator (OSS) (opens in new tab)(lightning.ai)8mikejulietbravo3y ago9Save
8PyTorch Core Technical Lead Joins Lightning AI to Build PyTorch Team (opens in new tab)(lightning.ai)11mikejulietbravo3y ago4Save