I've also started looking into the software itself (a heavily customised Drupal, but with no changes to core files, so not a fork).
What else would you investigate? Not so much from a business point of view but from... any other point of view :)
Cheers
At the same time, I'm not an expert in the field, hence my question: would you go for manual scraping, or are you aware of any more sophisticated tools that I could try out? I'm aware of the well-known OSS ones (Weka, Scrapy, Beautiful Soup) but, unless I'm missing something, they fall into the "write as many rules as you have different websites" category.
TA!