1Storage: How to deduplicate at intra and inter-files level (opens in new tab)(old.reddit.com)2lhoestq6h ago0Save
3Dataset streaming for distributed SOTA model training now more efficient (opens in new tab)(huggingface.co)2lhoestq8mo ago0Save
5A prompt is worth 1000 data points: combining GPT3 prompting and fine-tuning (opens in new tab)(huggingface.co)3lhoestq5y ago0Save
6Datasets: Release 1.3 brings dataset versioning, on-the-fly transforms and more (opens in new tab)(reddit.com)2lhoestq5y ago0Save