undefined | Better HN

0 pointsbehnamoh4y ago0 comments

i think a more general question is how would you measure explainability when you see it? is there some sort of metric for that?

0 comments

1 comments · 1 top-level

visarga4y ago

One weak point I see - this tool only measures how much an individual input token would be changed to decrease the loss. The reason a token might have large gradients might be related to how many times it appears in the training set and how consistent is the training set with the evaluation set, not just how much the prediction disagrees with the target label.

So it jointly measures data coverage, consistency and data to target fit. Just my intuition, might be wrong.

j / k navigate · click thread line to collapse