1Launch HN: RunAnywhere (YC W26) – Faster AI Inference on Apple Silicon (opens in new tab)(github.com)GitHub240sanchitmonga223mo ago153Save
2Fastest LLM decode engine on Apple Silicon. 658 tok/s on M4-max,beats mlx by 19% (opens in new tab)(runanywhere.ai)5sanchitmonga223mo ago3Save
3Show HN: On-device browser agent (Qwen) running locally in Chrome (opens in new tab)(github.com)GitHub19sanchitmonga225mo ago3Save
4Runanywhere – Make every CPU and GPU count (opens in new tab)(github.com)GitHub5sanchitmonga2210mo ago2Save
5PrependAI – Solving the Data Ingestion Nightmare for AI Agents (opens in new tab)(prepend.dev)4sanchitmonga221y ago1Save