1. ZAYA1-8B: Frontier intelligence density, trained on AMD (zyphra.com) | 3 points by cmitsakis 3d ago | 0 comments
2. DeepSeek could be valued at up to $50B in first fundraising (reuters.com) | 3 points by cmitsakis 4d ago | 0 comments
3. Odysseys: Benchmarking Web Agents on Realistic Long Horizon Tasks (odysseys-website.pages.dev) | 1 point by cmitsakis 11d ago | 0 comments
5. Qwen3.6-35B-A3B: Agentic coding power, now open to all (qwen.ai) | 1274 points by cmitsakis 24d ago | 532 comments
6. The Axios supply chain attack used individually targeted social engineering (simonwillison.net) | 48 points by cmitsakis 1mo ago | 12 comments
7. Round Robin: license that's share-alike for improvements and permissive for apps (roundrobinlicense.com) | 2 points by cmitsakis 5mo ago | 2 comments
8. Aisuite – Simple, unified interface to multiple Generative AI providers (github.com) | 4 points by cmitsakis 1y ago | 1 comment
11. Würstchen: Fast Diffusion for Image Generation (huggingface.co) | 21 points by cmitsakis 2y ago | 2 comments
12. Medusa: Simple Framework for Accelerating LLM Generation (github.com) | 1 point by cmitsakis 2y ago | 0 comments
13. GPT Can Solve Mathematical Problems Without a Calculator (arxiv.org) | 2 points by cmitsakis 2y ago | 0 comments
14. TinyLlama project aims to pretrain a 1.1B Llama model on 3T tokens (github.com) | 201 points by cmitsakis 2y ago | 60 comments