Show HN: Made a batching LLM API for a project. Mistral 200 tk/s on RTX 3090 (github.com) | muttled | 2y ago