I understand you know how real-world spam filters work, but this is a different use case.
I'm not trying to generate spam. I'm talking about an IM app where you have your contacts, and you can talk with each other and only see the legit messages.
I can elaborate, but please continue in a non-adversarial way if possible.