And they also have ready-to-use scripts for A LOT of the usual datasets: https://huggingface.co/datasets
including LAION 400M and LAION 2B: https://huggingface.co/datasets/laion/laion2B-en
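
For what it's worth, here is a rough sketch of how you could peek at one of these without downloading the whole thing, assuming the `datasets` library is installed (`pip install datasets`). The helper name `peek` and its `loader` parameter are made up for illustration, and the `"train"` split is an assumption; `load_dataset(..., streaming=True)` is the library's actual streaming entry point. Whether a given repo is still public or needs you to accept terms on the Hub is something to check on its dataset page.

```python
from itertools import islice


def peek(repo_id, n=3, split="train", loader=None):
    """Return the first n records of a Hub dataset as a list of dicts.

    `peek` and `loader` are hypothetical names for this sketch; `loader`
    only exists so the function can be exercised without network access.
    """
    if loader is None:
        # Imported lazily so this file loads even if `datasets` is missing.
        from datasets import load_dataset as loader

    # streaming=True yields records lazily instead of downloading every
    # file first, which matters for datasets at LAION scale.
    stream = loader(repo_id, split=split, streaming=True)

    # Take only the first n records from the lazy stream.
    return list(islice(stream, n))
```

Something like `peek("laion/laion2B-en")` would then return the first few metadata rows, provided the repo is accessible from your account.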