0 points · minitech · 2y ago · 0 comments

No, that wasn’t the point at all. Compressibility was used to determine similarity.


1 comment · 1 top-level

And what it accidentally showed was that NCD between individual digits in the training set is a really terrible distance metric for classification.
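
For anyone not familiar with the term: NCD is the normalized compression distance. As it is usually defined, C(x) is the compressed length of x under some real compressor and xy is the concatenation of the two inputs. Which compressor the article used isn't stated in this thread, so treat C as a placeholder.

```latex
\mathrm{NCD}(x, y) = \frac{C(xy) - \min\{C(x),\, C(y)\}}{\max\{C(x),\, C(y)\}}
```

Values near 0 mean the compressor finds the two inputs highly redundant with each other; values near 1 mean it finds almost nothing shared.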

You can do classification with KNN, which is obvious. You can also do classification with compression, which is less obvious, and neat. This approach tries to combine them in a way that doesn't work.
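
To make the combination being discussed concrete, here is a rough sketch of nearest-neighbour classification that uses NCD as the distance. It is not the article's code: zlib is an assumed stand-in compressor, and `csize`, `ncd`, and `nearest_label` are hypothetical names made up for illustration.

```python
import zlib


def csize(data: bytes) -> int:
    """Compressed length in bytes (zlib is an assumed stand-in compressor)."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalized compression distance between two byte strings."""
    cx, cy, cxy = csize(x), csize(y), csize(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


def nearest_label(query: bytes, train: list[tuple[bytes, str]]) -> str:
    """1-NN: return the label of the training sample with the smallest NCD."""
    return min(train, key=lambda item: ncd(query, item[0]))[1]
```

Each distance is computed between the query and a single training sample, which is the setup criticized above. Small inputs like single digit images give the compressor very little to work with, and fixed format overhead takes up a large share of every compressed length.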
