[Edit to add linky: http://u.cs.biu.ac.il/~koppel/papers/male-female-text-final.... ]
'Brown & Levinson’s (1987) polite-ness theory takes into account an individual’s efforts to preserve the “face(s)” of others with whom one communicates. For example, they propose impersonalizing the speaker and hearer by avoiding the pronouns I and you, using past tense to create distance and time, diminishing the force of speech by using hedge words such as perhaps, using slang to convey ingroup membership, and using inclusive forms (we and let’s) to include speaker and hearer.'
In English, subjunctive case usually requires a to, and people asking permission use the subjunctive a lot. Is that the signal he's finding? Is that the best NLP can do?
That said, the last time I dabbled in IR, my algorithm choked on stopwords. I should revisit it.
http://infoblog.stanford.edu/2008/08/spotsigs-are-stopwords-...
Maybe for determining social hierarchies for mind share in product recommendations or advertising?
Yeah, I can't think of one noncreepy way to use this.
IR aims to retrieve information and not sentiments from the text. This kind of work is related to sentiment analysis, where use of stops words as signal is common. Thus the whole argument that some holy criteria in IR has been shown to false is clearly incorrect. Even original authors don't make such assertions.