There's something to that idea, because the art of optimizing a site for curiosity has mostly to do with avoiding repetition, and the most repetitive comments ought to be the most machine-learnable.
https://hn.algolia.com/?dateRange=all&page=0&prefix=true&sor...
https://hn.algolia.com/?dateRange=all&page=0&prefix=false&so...
https://hn.algolia.com/?dateRange=all&page=0&prefix=false&so...