Websites don’t whitelist their crawlers, they maintain custom bypasses for a wide variety of websites.
If the websites were inclined to whitelist these crawlers, they’d also whitelist archive.org which is actually easy to whitelist. Archive.is is not