1. Show HN: Control your X/Twitter feed using a small on-device LLM (imbue.com) · 15 · kanjun · 2mo ago · 3
2. Tri Dao, Stanford: On FlashAttention and sparsity, quantization (imbue.com) · 2 · kanjun · 2y ago · 0