Among other things, "a synopsis of the SMILES notation, its use as a molecule identifier, a way to reproduce my data set, and a couple of possible alternatives for getting something similar" is something I would be interested in. And, considering the upvotes I got for my grandparent comment, something that other people would be interested in as well.
Also: http://hnnotify.com/
By comparison, an email provides the full context, and is easier to integrate into a workflow. For example, I can drag an attachment directly into my editor. A gist requires additional steps.
Regarding hnnotify.com, I enjoy the ability to let go of most HN threads after a couple of days. This thread one of a handful of exceptions. Can I really subscribe to one-and-only-one thread? I don't see that's it's worthwhile to set up a third-party account and active the service for a rare event. In any case, if it takes a month for bmh100 to evaluate the code then the HN thread will be closed, so there's only a narrow window for which this service is useful.
I do not share your optimism in the random contributions of others. To start, it's not like I haven't talked about this before. See https://bitbucket.org/dalke/smilez and http://www.dalkescientific.com/writings/diary/archive/2007/0... (under "Compressing SMILES") for two examples. Have I gotten any feedback about them? No. So why put more effort into hoping for a one-in-a-million event, which is what you suggest, instead of optimizing the chance of getting a followup from someone who specifically expressed interest? Experience says that I should optimize for the latter.
What is your interest in the SMILES notation that can't be resolved through https://en.wikipedia.org/wiki/Simplified_molecular-input_lin... ? I would be glad to tell you more. I have worked with different aspects of SMILES for over 15 years and co-authored the OpenSMILES specification. I have also written many blog posts about different aspects of how to work with SMILES. And gotten few followups.
What skill set do you have, that I might tailor a response? Are you comfortable installing from source, do you prefer one of the GNU/Linux packaging systems, or Mac/homebrew? Or are you happiest with extracting data from a database dump? My 'synopsis .. of possible alternatives' was more an offer to follow up on any of those options, but was of itself incomplete. It works because email has the implied statement that I will respond to further questions.
If you don't have specific interest, are more generically wanting to be informed, then perhaps you can understand why I would prefer to use other mechanism, like my blog posts, which are more likely to get the kinds of responses I'm looking for than spending time tuning an off-topic HN comment.
I consider the possibility of a random person coming across something and finding it interesting a worthy goal in and of itself. You do not.
This is a rather fundamental difference, and as such I do not think that anything I say will reconcile the matter.
Experience says that both categories are low. Perhaps there's a 1:10,000 change for a member of class 1, and a 1:500 chance for a member in class 2.
If I do as you suggest, I might raise that to 10:10,000 and 10:500.
However, my belief is that directed email has a higher stickiness, because of the reasons I mentioned earlier. I believe those statistics become 1:10,000 (ie, unchanged) and 15:500, respectively.
If you work the math out, you'll see that it's overall better to send the directed email.
Another option is to do both, which you'll see is what I did for the question that was asked. Your complaint is that I should haven't sent additional information in private mail, which is odd given that HN's own guidelines suggest that there are HN-related questions that are inappropriate to post and should instead be done by email.
You have also stated that I do not "consider the possibility of a random person coming across something and finding it interesting". This simply isn't true, as you can tell from the analysis above, and from the two pages I linked to two pages where I have posted information meant for random strangers to hopefully identify.
You've come across like you are irritated for being left out of the conversation. I've suggested a few topics I could discuss, but how can I say more when you haven't expressed any specific interest about the problem (either on SMILES or short word compression). When writing, it's good to have a target audience in mind. Should I assume a basic understanding of arithmetic compression, or start from the basics? Do I need to explain state machines? And so on.
My above analysis left out the work factor. Rather than write 40 different essays, each aimed for a different set of strangers (chemist background, CS background, math background, web dev background), etc. and with at best a 1:100 chance of success, it's a better use of my time to just work on the code. I believe I could do what I want in about 2 months.