This isn't a hypothetical. I've used this exact workflow for this exact use case. I'm telling you it's no good.
I run a search engine crawler and my average across 100M docs is about 7 Kb when compressed with zstd (fs block size is typically 4 Kb). Some much larger than that of course, but many smaller still. HTML in general compresses absurdly well.