"Big Data" and "Data Scientist" are the latest buzzwords like Web 2.0 and Java were during their ORA hype-r eras.
There is a lot more data. There are a lot more tools and technology to deal with data. There is a need for high quality people that understand how to handle data. But there are no rockstars, there are people that have passion and spent years learning and teaching their craft. But they aren't rock stars. The best may get paid as much as a very well off doctor or business owner, but they aren't going to fill stadiums around the world, sell millions of t-shirts or be targets of media gossip columns.
Technology at a basic level amplifies agency, certain techniques and their resultant technologies have benefited individuals and citizens more than State agencies and with others it's been the inverse. I think we are moving into an stage where there is potential for massive recentralisation of control in various domains.
But this is HN, not a political discussion forum, most of the 'frighteningly ambitious ideas' to grace these pages are to do with pushing out more ads or other such pablum. PG isn't a philanthropist and YC isn't a charitable foundation; HN is naturally aligned with those objectives.
[1]https://en.wikipedia.org/wiki/History_of_statistics#Etymolog...
:)
how about we just use the oh so boring "professional" instead? i want respect from society for my work, not a cute pet title co-opted from other industries, where they can use these terms with a straight face and without rolled eyes from the audience.
I've spent bits of my career working with fairly large data sets at one time or another, and providing discovery, insight and analytic tools into that data, but very little of it seems to have anything remotely to do with what the job descriptions for "Data Scientist" are asking for.
Consider this, building a very large graph of the internet, then using various models on that graph to find unique and actionable insights: such as finding routing bottlenecks for a video delivery service, involves lots of data, lots of scientific like exploration, yet isn't a "data scientist" job by the job reqs.
How about this, building a text parser that can finely categorize and make recommendations for a research organization based on millions of grant proposals, all categorized into various "mission silos" that research organization is built around. Not a "data scientist" job.
Analyze multi-lingual news stories to build a real-time alert system for conflict analysts. Not a "data scientist" job.
Building a tool that can scan multi-spectral aerial imagery and automatically extrapolate man-made structures from natural, catalog all of the different vehicle makes and models, and generate a predictive model of commuting patterns, or make recommendations for housing development based on perceived socio-economic conditions? Not a data-scientist job.
Collecting information on who propositions who from a dating web site, normalizing the data for population and writing a report on the findings? That's a "data scientist's" job.
It's not that that kind of work isn't valuable, only that there are so many other kinds of things that involve what might intuitively be called "data science" that calling just the one discipline "data science" is doing a disservice to what should be an amazing discipline -- part Computer Scientist, Part Analyst.
But I think it's important to not underestimate the shift that has been happening. The race to automate tasks has always been accelerating, but it hit an inflection point a few years ago when mainstream business people realized what current technology can achieve. Pretty much every industry I've seen has been or can be drastically transformed by better data management and predictive analytics.
It's going to affect all of us, so I think it's worth following closely.
IMO something is accelerating/changing and we hit an inflection point. It does help to give it a denominator. It's human, not just marketing. I'm sure one time people thought compsci and software engineering were redundant or pretentious. I'm still ambivalent about the name data scientist.
What can we learn that we can apply to technology in general? I feel like "programmers" and "IT" tend to be undervalued while Data Science tends to be accurately or even overvalued.
Given that Data Scientist is just a sexier rephrasing of Analyst, my suggestion would be to retitle Programmers as Computational Engineers. Or use Lisp.
"Did some growth hacking on the hedge funds, bagged it, 100% organic product..."