> Spotify has actual humans with ears
Yeah the actual humans with ears are me and you. They use play-time as the basis for recommendation as well as content-similarity, and put effort into de-biasing this data for, e.g., position bias. This is the same for YouTube etc.
Spotify is one of the best examples of a modern large-scale recommender system, for me.