1Low-Latency Inference with Speculative Decoding on D-Matrix Corsair and GPU (opens in new tab)(gimletlabs.ai)1nserrino3mo ago0Save
2The emerging role of SRAM-centric chips in AI inference (opens in new tab)(gimletlabs.ai)3nserrino3mo ago0Save
3Speeding up PyTorch inference on Apple devices with AI-generated Metal kernels (opens in new tab)(gimletlabs.ai)187nserrino9mo ago30Save
4Show HN: Pixie, open source observability for Kubernetes using eBPF (opens in new tab)(github.com)GitHub6nserrino4y ago3Save
6Observing HTTP/2 Traffic Is Hard, but eBPF Can Help (opens in new tab)(blog.px.dev)91nserrino4y ago4Save
9Horizontal Pod Autoscaling with Custom Metrics in Kubernetes (opens in new tab)(blog.px.dev)4nserrino4y ago0Save
10Open sourcing Pixie under Apache 2.0 license (opens in new tab)(blog.px.dev)108nserrino5y ago18Save