To be completely fair - that's not ecological use either. Decoding a video stream must be more electricity-consuming as comparing to just doing a native audio stream.
YouTube can (and I believe does) serve simple audio streams for scenarios where you're only listening to music/podcasts. Try poking around a YouTube link with yt-dlp, they definitely have audio-only sources available.