1. Kimi K2.5 on 2 Mac Studios (<$20K) running at 24 TPS (twitter.com) | 6 | alexandercheema | 1mo ago | 4
2. Clustering Nvidia DGX Spark and M3 Ultra Mac Studio for 4x Faster LLM Inference (twitter.com) | 8 | alexandercheema | 5mo ago | 1
3. Clustering Nvidia DGX Spark and M3 Ultra Mac Studio for 4x Faster LLM Inference (blog.exolabs.net) | 5 | alexandercheema | 5mo ago | 0
4. Meta AI: "The Future of AI Is Open Source and Decentralized" (twitter.com) | 214 | alexandercheema | 1y ago | 111
5. Two MacBooks is All You Need: Running Llama 3 405B using Apple MLX and Exo (twitter.com) | 11 | alexandercheema | 1y ago | 2