For example, the author of this very article has done such an analysis. Here's her R notebook:
https://github.com/propublica/compas-analysis/blob/master/Co...
Her analysis shows (within the limitations of the frequentist paradigm) that:
a) The predictor is useful: score_factorHigh and score_factorMedium both have p-values that are essentially zero.
b) The predictor is not strongly racially biased: race_factorAfrican-American:score_factorHigh and the other interaction ("bias") terms have p-values > 0.05.
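For anyone unfamiliar with how those interaction terms carry the bias question: the main score coefficient gives the log-odds effect of a "High" score for the baseline group, and the race:score interaction measures how much that effect differs for African-American defendants. A minimal sketch with made-up coefficients (the real fitted values are in the notebook):

```python
import math

# Hypothetical log-odds coefficients, purely for illustration -
# the actual estimates are in the linked R notebook.
coef_score_high = 1.5    # score_factorHigh (effect for the baseline race group)
coef_interaction = 0.2   # race_factorAfrican-American:score_factorHigh

# A "High" score multiplies the odds of the outcome by exp(coef) for the
# baseline group; the interaction term shifts that multiplier for the
# African-American group. A near-zero interaction means the score works
# the same way for both groups - i.e., no evidence of differential bias.
odds_ratio_baseline = math.exp(coef_score_high)
odds_ratio_african_american = math.exp(coef_score_high + coef_interaction)
```

So the "p > 0.05" finding above is a statement about terms like coef_interaction: the data don't give strong evidence that it differs from zero.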
Look, I'd love it if we required such algorithms to be open source. I'm a huge proponent of both open science and open government. Nevertheless, there is an entire discipline devoted to evaluating predictive algorithms without needing to care about their details - it's called "machine learning".
The wonderful thing about statistics is that even a highly biased person (such as the author of this article) can still reach a correct conclusion that goes against their biases.
People talk about misuse of p-values, but this takes the cake.
Also, this analysis is certainly a useful addition to the literature on this system, but it's one analysis, and regardless of your philosophical stance on p-values, a p-value of .057 in the presence of multiple testing isn't the most convincing thing.
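To make the multiple-testing point concrete, here's a sketch of the standard Holm step-down correction applied to a hypothetical family of p-values (0.057 is the value mentioned above; the others are made up for illustration):

```python
def holm_adjust(pvals):
    """Return Holm step-down adjusted p-values, in the original order."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, i in enumerate(order):
        # Multiply the k-th smallest p-value by (m - k), cap at 1,
        # and enforce monotonicity across the sorted sequence.
        adj = min(1.0, (m - rank) * pvals[i])
        running_max = max(running_max, adj)
        adjusted[i] = running_max
    return adjusted

# Hypothetical family of bias-related tests from one model.
pvals = [0.057, 0.21, 0.48, 0.74]
adjusted = holm_adjust(pvals)
```

With even a small family of tests, the 0.057 gets inflated well above any conventional threshold, which is why a single borderline p-value from a model with many terms shouldn't settle the question either way.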
Having said that, the use of non-open predictive systems is a problem in criminal-justice settings. Maybe this thing is biased, but the only way to find out and fix it is to do these sorts of analyses and have this sort of discussion.
The problem isn't the use of prediction systems, it's the use of them without open academic scrutiny, without correcting any biases that emerge.