2"Optimal Cognitive Core"- specialized 1.7B model for grounded question answering (opens in new tab)(huggingface.co)6dryarzeg20d ago0Save
3Step 3.7 Flash – 198B-A11B MoE vision-language model (opens in new tab)(huggingface.co)5dryarzeg24d ago0Save
4Rotary GPU: Exploring Local Execution for Large MoE Models Under Limited VRAM (opens in new tab)(arxiv.org)arXiv41dryarzeg24d ago4Save