2Training and Finetuning Embedding Models with Sentence Transformers v3 (opens in new tab)(huggingface.co)2cubie1y ago0
3Embedding Quantization: 25-45x retrieval speedup, 32x or 4x less memory usage (opens in new tab)(huggingface.co)4cubie2y ago0