1. 1-Bit and Ternary Bonsai Image 4B: Image Generation for Local Devices (prismml.com) | 3 points by xenova, 29d ago | 0 comments
2. ML-intern: open-source ML engineer that reads papers, trains and ships models (github.com) | 3 points by xenova, 2mo ago | 0 comments
3. Kokoro WebGPU: Real-time text-to-speech 100% locally in the browser (huggingface.co) | 227 points by xenova, 1y ago | 53 comments
4. Transformers.js v3: WebGPU Support, New Models and Tasks, and More (huggingface.co) | 1 point by xenova, 1y ago | 0 comments
5. SAM 2: Segment Anything in Images and Videos (github.com) | 824 points by xenova, 1y ago | 147 comments
6. PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding (github.com) | 1 point by xenova, 2y ago | 0 comments
7. Mamba: New SSM arch with linear-time scaling that outperforms Transformers (github.com) | 6 points by xenova, 2y ago | 2 comments
8. Count tokens used by GPT-4 and Llama for large texts (> 50k characters) (huggingface.co) | 2 points by xenova, 2y ago | 1 comment
9. Making real-time ML-powered web games with Transformers.js (huggingface.co) | 2 points by xenova, 2y ago | 1 comment
10. MMS: Scaling Speech Technology to 1000 languages demo (huggingface.co) | 1 point by xenova, 3y ago | 0 comments
11. Whisper Web: ML-powered speech recognition in the browser (twitter.com) | 3 points by xenova, 3y ago | 1 comment