Mark Cuban on Why You Need to Study Artificial Intelligence (opens in new tab)

(bothsidesofthetable.com)

191 pointsapplecore9y ago130 comments

130 comments

107 comments · 36 top-level

minimaxir9y ago· 20 in thread

The title is clickbait, but the problem with the religious fervor of machine learning/deep learning is that entrepreneurs/VC keep arguing that deep learning is magic and can solve any problem if you just stack enough layers. (see also: https://www.reddit.com/r/ProgrammerHumor/comments/5si1f0/mac...)

Meanwhile, statistical methods for non-image/text data with identifiable features can often work better than neural networks, but they are not as sexy. (Good discussion on HN about this: https://news.ycombinator.com/item?id=13563892)

hasbroslasher9y ago

It's worth noting that ML doesn't just mean neural networks - those just get the most hype. I'm also curious if you could give an example of traditional models (presumably logreg, SVM, decision forest/trees) outperforming DNN on classification tasks. I read your thread and agree with the points, but I'm curious to see if anyone's outlined examples of when DNN fail in particular (not necessarily RNN/CNN).

Aside - I think it's really ironic for people with no technical know how to go on a talk show and say "learn ML or you're toast". First, the inertia of the world economy is much greater than 3 years. Second, it's unrealistic to ask every 7-11 cashier to go learn ML. Third, programming is a complex art and we're surely not 3 years away from the singularity. Most people I know who have some ML knowledge also realize how far we are from a computer program that can read code, read specs, and then decide how to implement or fix existing code. And last, even if all of the above were moot, I think it's more likely that we write a program that successfully helms a Fortune 500 company before we write one that produces or improves on software. The former could be more easily reduced to a game of chess while the latter requires some massive novel research into "creativity". Maybe it's Mark Cuban who should be learning ML!

willholloway9y ago

I would argue we are already inside the singularity.

As far as ML, we have seen this before. When the tools get cheaper and easier to use, adoption goes up. RoR + cheap cloud servers and easy VC financing greatly increased the number of startups.

Now TensorFlow and the Nvidia Pascal architecture have made ML accessible and fast.

If there is one thing history has shown us, it is that the Hacker News comment section is full of myopic pessimists with an axe to grind (no offense peers!) -- just go back and look at the Dropbox launch thread here.

Even if ML doesn't meet all the hype (it won't for a time, and then later on it will probably exceed it and create things we can't even dream about right now) there will be an entire wave of ML startups, products and jobs.

There is a ton of money to be made on the ML train guys. It's the new social-local-mobile. Much will be shit, some will change all our lives for the better.

I'm old enough now to know how this works.

Edit PS.

I've heard listenable, interesting music completely generated with tensorflow. I have seen artwork that is arguably as good as anything any of the greats have put out generated through similar methods, and we are in the infancy of machine learning.

Machines have beaten humans at chess, go and most impressively in my opinion Jeopardy!

ML is real and it's here and you should leverage it as much as you can, all the knowledge and tools are free.

1 more reply

AndrewKemendo9y ago

>Most people I know who have some ML knowledge also realize how far we are from a computer program that can read code, read specs, and then decide how to implement or fix existing code.

That's exactly the wrong way to think about it IMO. My vision of Machine Programming is that it's a fluid OS/UI that adapts application architecture to how people actually use them. So for example you have "Deep Excel" which is trained first to give the basic functionality of a spreadsheet, but as each person uses it, it gets data about most used/combined UI elements, follow on actions etc... that "rewire" through RL the UX to behave in the way that the user indicates it wants to behave.

So each instance of the program running would be unique to each user and constantly changing to make it more adaptive to their needs. Network these in a deeper system and you can iterate on design and functionality of products way faster.

2 more replies

cookiecaper9y ago

Yeah, we just came across someone on HN trying to write a testing DSL with NLP. [0] He's apparently gotten funding for this.

To be frank, the tech community has really been leaving a sour taste in my mouth the last few years. Everyone is anxious to bandwagon on these buzzwords that provide little to no benefit or are applicable only to a small segment of the tech population.

In most cases, even a superficial understanding of the problem space should make it obvious that $This_Weeks_Sexy_Solution is a very bad fit. Do you need a server that persists data and is individually addressable? Then why are you using Kubernetes and Docker, which are still struggling to figure out these very basic things? k8s and Docker have very specific uses, but unless you're Google, they're probably not the right fit for your production environment right now.

This phenomenon seemed to hit a critical mass with document databases and single-page apps, and it continues to iterate with every open-source release that comes out of Google or Facebook. Since Google released TensorFlow last year and the compounded hype of [much worse than advertised attempts at] conversational speech recognition in Siri and Alexa, the "machine learning" bandwagon is starting to try to edge into the spotlight, and it sounds like it's already a mandatory part of any VC pitch.

Fortunately, machine learning is pretty hard and you get into hairy math practically right away, so I don't think the legs on this one will last as long. But we'll see. We'll at least have a lot of faux-ML going around and a lot of people making spurious claims on their resumes.

The economic collisions that make Silicon Valley and the tech industry in general a hive for inexperienced and insecure youth are bearing some really interesting effects this way. How can a company that's not "blown about by every wind of [tech fad]" fully exploit its relative sanity for competitive advantage?

[0] https://news.ycombinator.com/item?id=13572415 ; my reply downthread at https://news.ycombinator.com/item?id=13573978

eldavido9y ago

I've noticed this and I think it's an expression of a more general human urge to feel more important / significant than you really are.

It's the same urge that makes us all think we're better drivers than average, better looking than average, etc. It's the urge that make people move to Silicon Valley/Hollywood/etc., because everyone's going to "make it" even though statistically, that isn't possible. It's why every parent with the means pays through the nose to get their kids into Harvard, why SUV sales are high even though most people don't off-road, and why people buy technical outerwear for climbing, etc. if they never go near a mountain.

Everyone has a constructed image of themselves and we're lying if we say we don't try to burnish it. I think using all this tech is an expression of the desire for status. The problem is, it really can be a time-consuming, frustrating, customer-data-losing, unsupportable, waste of resources. That's bad for business, bad for your health (spending time fixing shit rather than eating well or going to the gym), and it can make your friends and family irritated with you, too.

I'm building a product right now on the most boring stack possible: python, postgres, pyramid. I've been there/done that on the whole sharded-MongoDB-Redis-Memcache-900 machines-AWS-Docker-startup-ops nightmare-use all the things (we actually used every single thing in that list at my last job). It was unreliable, a huge pain in the ass, a little bit fun, but ultimately quite counterproductive. These days, my bets are on things like the JVM, the .NET CLR, and Postgres. As Knish from Rounders would say, "My kids eat".

Another thing to keep in mind: a lot of this is resume padding. Yes, this is completely unprofessional, reckless behavior, but it happens, many recruiters are keyword-driven, and as interviewers many of us are impressed with a keyword-laden resume. We do this to ourselves as an industry in how we hire, who we let speak at events, and who we praise. It's nobody's fault but our own.

IsaacL9y ago

I was amazed to see this comment, like a meteorite of sanity crashing into HN.

My sibling post is on the right lines - a lot of people in tech are chasing a delusional self-image - though I don't think that's true of everyone. People can make the effort to be self-aware and honest with themselves. I think the rapid expansion of tech in the last few years attracted many, many youngsters, hucksters, delusional types, and so on, and people with maturity and experience became a minority.

thearn49y ago

For whatever reason, statistics seems to be in the midst of a serious branding problem. Maybe traditional frameworks of inference are too conservative for non-researcher's tastes?

I'm curious what Fisher, Neyman, and Pearson would say about the current state of the field. Especially considering how often they and other statisticians disagreed with each other throughout the 20th century.

estro9y ago

I believe your branding hypothesis is correct, and all of this hype around neural nets and machine learning is creating a startup ecosystem ripe for market correction. I've heard a million times that most machine learning problems are solvable by regression, and I expect to hear that a million times more.

1 more reply

digitalzombie9y ago

> For whatever reason, statistics seems to be in the midst of a serious branding problem.

Many of them don't know how to code. And the stat program of most of these universities doesn't require any real coding classes. The professor give you code templates and you tweak it to get the result. But the stat classes are really good to handle data.

Data science in general like Chapman Hill and UC Irvine seems like a really wide range of stuff from data analyst, a lil programming, data visualization, etc... I don't feel like it goes deep enough in what Statistic does imo.

Most are stuck with Tableau and SAS. Pharmacy companies uses SAS. Startup companies and tech companies uses Python and R.

I'm speaking from one data point sample, my current stat grad program. My undergrad is comp sci and I did a few years in the tech industry before coming back to grad school.

1 more reply

zebrafish9y ago

I have no experience with the entrepreneur/VC side of this coin, but Data Scientists know the differences between traditional regression based ML and CNNs. We know that doing image recognition or NLP is best suited to nvidia/tensorflow/keras/deeplearning/buzzword/buzzword. We also know that SVMs & Random Forests & regression lines work just as well as they always have to make predictions based on your click-thru data.

Maybe we should explain the cost/benefit of the buzzwords vs. the science?

logfromblammo9y ago

It's simple. More buzzwords equals more money. So whatever it is you're really doing, just find some way to semi-plausibly attach the latest hot buzzwords to it.

Most of the "dinosaurs" around here should know by now that you don't get paid to do what works; you get paid to do what the boss/customer wants. (And that's not always going to be exactly what they asked for, either.)

As long as you continue to throw money at me, I'll continue to chase your wild geese, hunt your snipes, fish for your red herrings, and joust with your wind turbines. Most of us here can learn fast enough to always stay one or two steps ahead of the boss on whatever topic they might latch on to. And that becomes genuine experience if it somehow manages to graduate past the business fad phase.

Eridrus9y ago

To say something in defence of the deep learning religion: it's a very flexible framework for expressing lots of different ideas and the ability to train all those ideas jointly if you can frame them in terms of SGD updates is very appealing.

And even if you don't want to use SGD to fit your model, the deep learning machinery can be reused for other things like bayesian inference, eg PyMC3 and Edward are built on Theano/TensorFlow respectively.

There have even been papers showing differentiable trees integrated into the deep learning framework. There is work on how to input graphs.

Maybe this is just my own desire for unifying frameworks shining through, but joint training seems to have real benefits and feature learning has clearly had a large impact on reinforcement learning.

Maybe it's not the best tool for day to day data science work, but that tends towards what tools are accessible and good enough, rather than what is best.

pif9y ago

Your comment sounds wise, but with all those acronyms and technical terms I can't be sure :-)

ska9y ago

It's not even that clean. A lot of the buzzword impact causes mass confusion on the applicability of techniques. Lots of people are trying to apply deep learning techniques on too small amounts of data (and truth) with fairly predictable results ... and throwing around at best questionable performance numbers with little validation.

CNN's are great for some image recognition/classification problems, but that's just the start of the conversation.

lottin9y ago

Sexy is in the eye of the beholder. Seriously, I think the problem of ML is that it often goes against the so-called principle parsimony, which plays a critical role in statistical modelling. Especially in many areas where the whole point isn't strictly to predict future outcomes, but to understand how things work and to test hypotheses. Understanding leads to better prediction anyways. If you can predict something but you don't know why or how, I would take such predictions with a grain of salt.

jackcosgrove9y ago

I think you have the cart before the horse regarding understanding and prediction. Prediction precedes understanding in importance and sometimes methodology, since if you can't predict outcomes correctly your understanding, however well fleshed-out, will be inaccurate.

Interestingly, the parsimony principle is lent empirical weight by bad results obtained from overfitting.

Machine learning enables us to multiply many more entities than we can using our conscious thought processes. The various image recognition models use many variables that enable results impossible with other techniques. It does not violate parsimony if you can obtain better results on a large data set.

I can predict that if a car drives by it will not teleport to some other location, but will rather continue along its path. I've been able to predict this since I was a child. I did not need to study the laws of motion to do so. I would not take such a prediction with a grain of salt.

1 more reply

randcraw9y ago

I think that's because the majority of folks using ML care only about discriminative ML, not generative. A black box ML technique that delivers low predictive errors is good enough. To those ends, parsimony is largely irrelevant.

I suspect that others using ML, who are inclined to test hypotheses, induce cause, and discover unknowns (via generative ML), are far fewer.

So it's hardly surprising that the eye of discriminative MLers dominates in defining 'sexiness'.

mattnewton9y ago

Just call it "statistical machine learning" or some other vacuous term and people who know enough will respect it (maybe after rolling their eyes a bit), and people who don't will be happy to hear you are using machine learning.

wakkaflokka9y ago

I call it the Jimmy Johns method of machine learning - just keep stacking those layers.

russdill9y ago

Once it's good enough to solve as many problems as VCs claim, you won't need anyone to program it :p

deepnotderp9y ago· 9 in thread

I'm going to go against the grain here and (gasp) not hate on deep learning. People should realize that although many older statistical methods and "traditional" machine learning methods such as LDA, SVMs and decision trees may be good enough for business tasks, they are not the cutting edge of AI research. I think people are forgetting exactly how difficult image classification and object detection was before the advent of deep learning. People hating on "stack more layers" forget that "stack more layers" is EXACTLY what improved imagenet performance to such as massive extent. ResNets pushed the limits by figuring out how to stack more layers in a beneficial way.

And let's take a look at AlphaGo, how would you do that with SVMs or decision trees? Just get over the fact that deep learning provides a level of "intuition" (Go's search space is famously greater than the estimated number of stars in the universe).

I think that a part of the problem is that older ML PhDs are angry that deep learning is so easy (until the learning rate fails to provide convergence of course...) and would prefer that their preferred methods would still reign supreme.

I'll end this wall of text on the note that OpenAI's Ian Goodfellow notes that all projects at OpenAI use deep learning right now, but they are not dogmatic and will consider other approaches if they work well. I think this is the path that should be taken. On the other hand, I also see a bright future for uniting traditional techniques with deep learning such as attaching a SVM to a CNN and combining decision trees with CNNs both of which have resulted in good results.

dmreedy9y ago

>> I think that a part of the problem is that older ML PhDs are angry that deep learning is so easy (until the learning rate fails to provide convergence of course...) and would prefer that their preferred methods would still reign supreme.

I think that's definitely a part of it, and I feel that way sometimes myself (not that I'm a PhD). But there's another side of that reluctance that lies on the axis of model accountability and explicability. A lot of modern ML/Deep Learning doesn't -feel- like we're understanding anything any more than we did ten years ago. Yes our black-box results are better according to the tests we've laid out for them, but there's something more slippery about the 'why', beyond the handwave of 'complexity'. Maybe this is just the way it will be going forward (in the spirit of Quantum's "shut up and calculate"), but it is not easy to give up something that you can wrap your head around with something that kind of just takes care of itself, especially if you're in the business of seeking knowledge instead of results.

deepnotderp9y ago

I'm not sure I fully buy the "explainability" argument against deep learning since our goal is human level intelligence and human intelligence isn't easily explained either.

On the other hand, we're nowhere near human level intelligence in most tasks (Go, Poker and image classificaiton non withholding) so I can understand the argument for explainability from a practical perspective. I think we're making some good progress in that direction though, and I'll list them below:

1) Attention maps in CNNs can tell us what the net is usually looking at.

2) "Attentive Explanations" use attention mechanisms to point to the object of interest to generate an explanation for VQA tasks, check the paper (warning PDF): https://arxiv.org/pdf/1612.04757

3) A recent project used a similar explanation mechanism that forced the network to output "what it was thinking" while playing an Atari game.

4) NTMs (neural turing machines) allow weighted memory access, which alleviates the back box issue to some extent.

4 more replies

cr0sh9y ago

I understand and sympathize with your thoughts; I'm not an ML or deep learning expert, but I do agree somewhat with Mark Cuban's assertion that software developers need to get familiar with the technology.

I'm currently enrolled in the Udacity Self-Driving Car Engineer Nanodegree, and while that program is focused on applying ML techniques to a specific problem domain, I can see where and how to apply what we've learned so far to other problem domains. Furthermore, I have in the past taken other MOOC courses (Ng's ML Class in 2011, and Udacity's CS373 in 2012) which taught more generalized approaches (more of the "standard" ML concepts in Ng's course, less so in the CS373 course - but I should note that even in the course I am taking currently, one or two of the lessons did discuss and talk about GD and other simple mathematical models for ML for classification, etc).

I recently completed (late) the "behavioral cloning" project, where we had to implement a CNN to drive a simulated car around a track. At first, it seemed like it would be a simple challenge - slap the NVIDIA model on the thing, gather a bunch of data for training/validation, and it should all "just work".

If only.

I struggled immensely with that one; when trying to explain the problem to "outsiders", I kept running into the issue (which I knew about) of how to describe the fact that this system wasn't something I could easily explain. I can't tell someone "oh change that rule and tweak that test and it does this" - because it doesn't. Its not that simple. I know this, but conveying it to someone else isn't easy. I've found that these systems, for their simplicity compared to, say, a mouse's brain - share the same kind of "black box" issues - I can't tell you why my car likes to veer at a certain point, then recover. I never drove like that with my training data, but for some reason, somehow - that's what it learned.

From a business perspective - this is where things will fall down. We (that is, those people in the deep learning ML domain space) can't explain how these models work. In some ways, they don't seem to be deterministic. In theory, post training, they should be - but in reality, short of perfect data being fed in - the data they get in their inputs is never exactly the same - their ability to generalize and fit this imperfect data, with a result that correct, within a level of error - is a strength, not a weakness. But try to convey this to someone where money or exact results are on the line...

Researchers can't even explain how the C.Elegans connectome works, and it is vastly simpler and completely mapped (interestingly, embodying the connectome as an ANN in a robot produces a machine that acts and can be trained like a C.Elegans - we don't how or why, though).

We humans expect determinism from our machines, because for so long that is how machines have operated; the same input produces the same output. Our machines now, though, can produce different outputs for the (seemingly) same input, and that worries us (perhaps rightfully so, especially in the case of a self-driving anything carrying passengers!). Interestingly, we give a "pass" to human beings doing these same tasks; that is, we don't expect perfect results from these non-deterministic neural networks we hire or employ. We don't worry (much - though we try with imperfect models) about how one person decides to do something one way or another, even if they can't explain themselves.

But we create machines that do that same - and it worries us. I sometimes wonder if that says more about us than about the machines...

sgt1019y ago

Yup, can't explain it, don't understand it.. let's let it loose on pediatric surgery what can go wrong!

feral9y ago

You can argue against Cuban's point that anyone who doesn't know machine learning will be "dinosaur in 3 years", while also being enthusiastic about deep learning.

> "traditional" machine learning methods ... may be good enough for business tasks, they are not the cutting edge of AI research

That's true - but so? Under discussion is whether every developer should drop what they are doing and learn ML/DL, not what the cutting edge of AI research is?

In that context a lot of the low-hanging fruit for the next while is going to involve actually deploying simpler ML, with all the non-ML work that entails - and maybe using off-the-shelf DL to extract information from unstructured data.

> older ML PhDs are angry that deep learning is so easy

That's very ungenerous. People tend to do PhDs in things like ML (often making sacrifices to do so) because they want to advance the field - not to make it harder! People are excited. They are wary about hype causing another AI winter, or a lot of important projects failing for silly reasons.

deepnotderp9y ago

I agree that Cuban is being foolish here, I was just talking about the technical perspective.

reachtarunhere9y ago

Agree on the idea of Classical Machine Learning experts being a little pissed regarding the average joe getting better results with such ease. A lot of these people have put in years of study/work in those techniques and it must be sad to see some form of sudden paradigm shift making it worth less.

On the other hand it is true that DL is not required for many (most?) companies as they don't work with the kind of problems you list (vision, robotics)

romaniv9y ago

>And let's take a look at AlphaGo, how would you do that with SVMs or decision trees?

This is a poor example. There are Go programs that work very well and use exclusively MCTS without any neural networks.

deepnotderp9y ago

This is a fallacy, AlphaGo didn't increase very much with additional hardware at all, it's elo would still have been higher than lee se-dol's elo.

And yes, MCTS worked well for Go, but notice how the combination boosted it to new levels? And deep nets without MCTS also worked very well on Go, outperforming MCTS only approaches (by FAIR and was the top before being dethroned by AlphaGo).

1 more reply

itg9y ago· 9 in thread

Good luck with that. Any place doing serious ML will require the person to have a PhD or have publications and presentations at conferences like NIPS/ICML. Even most CS grads with a bachelors do not have the math background required unless they double majored in math or stats.

This is more VC/founders who are hyping up AI and need more ML folks so they can drive down costs.

minimaxir9y ago

The notion that you can only do "serious ML" if you have a PhD/published papers is a No True Scotsman fallacy.

The modern tools for ML/deep learning are accessible to all open source and well documented. And as I note in my top-level comment, old-fashioned statistical methods like linear regression are more than sufficient for real-world business problems, and definitely do not require a PhD to grok.

vidarh9y ago

This is what I keep telling people too - that the "old-fashioned" statistical methods aren't applied to more than a tiny little fraction of the problems they could be applied to yet.

jbooth9y ago

I didn't double major in math or stats. I squeaked through linear algebra without understanding the material, and then re-studied it independently a few years ago and actually understood a little more.

Yet, I can still apply machine learning to solve Ax=b problems. More importantly, I can use business analysis and write code to transform business problems into an Ax=b problem, and then optimize it.

You don't need a PhD to grok optimizing a vector to transform a matrix of inputs into a vector of observed outputs, then apply that trained vector going forward. Neural nets are slightly more complicated than a straight linear regression, but only slightly. I'd call decision tree methods like GBM even more complex, but still eminently grokkable for a decent programmer.

Avalaxy9y ago

> Any place doing serious ML will require the person to have a PhD or have publications and presentations at conferences like NIPS/ICML.

Stop spouting this bullshit. You don't need a PhD, and you don't need to advance the field to be doing 'serious ML'. All you need to be able to do is know how and when to apply it to solve crucial business problems.

solipsism9y ago

This misunderstanding is what needs to go the way of the dinosaur. You don't need a PhD to know how to clean data, prepare features, or generate synthetic features. You don't need a PhD to tune hyperparameters. You don't need a PhD to know how to separate your test data from your training data. You don't even need a PhD to wire things together at a high level with something like tensorflow.

Obviously you will want a theoretical expert on your team. But if your startup is counting on having a room full of them, good luck.

forgetsusername9y ago

>Any place doing serious ML

Good thing the vast majority of businesses won't need "serious" ML, but instead will require only simple implementations to help solve business problems.

sidlls9y ago

Utter nonsense. A PhD signals two things: that a person has the same degree of mastery of core material as a person with a master's degree and that he or she has the determination to do additional original research sufficient to produce a 100 page paper.

It isn't required for any serious research effort, except by the accident of inertia. And it certainly isn't a necessary indicator of determination.

tensor9y ago

You seem to make extremely light of "doing original research" here. A PhD or equivalent is absolutely required to do serious research in the field. Sure, you can always get the depth of knowledge required without a formal program, but the reality is that few do. Most people just take Machine Learning 101 then think they are a domain expert.

That said, there is definitely a place for non-PhD level ML practitioners. I don't think the industry has stabilized in this regard, but I can definitely see a "machine learning developer" type position becoming quite common. This is not the same as someone doing original research, but would definitely meet the needs of a great many business use cases.

1 more reply

dang9y ago

> Utter nonsense.

This counts as name-calling in the sense that the HN guidelines ask you not to do it: https://news.ycombinator.com/newsguidelines.html. Your comment would be much better without that bit.

We detached this subthread from https://news.ycombinator.com/item?id=13599533 and marked it off-topic.

vidarh9y ago· 5 in thread

One of the things I've realised is that the more I'm looking around, the more I find opportunities where people "should" have seen the opportunities of basic bayesian models, simple clustering algorithms etc. and other simple mathematical/statistical methods 20 years, but didn't, and still don't. That has massively changed my perspective on how quickly the onslaught of machine learning will come.

E.g. when I was reading up on genetic algorithms etc. 20 years ago we also expected the "revolution" to be right around the corner, and that things like genetic programming would change the world in a few years time. And while various of those methods found use some places, most places that could have used at least some of the simpler ones, still don't.

In other words, I think talking about a 3 year timeline is crazy. It's getting more attention, sure, but there is so much low-hanging fruit that most developers could be busy for the next 20 years putting in place the most trivial algoriths all over the place and we still wouldn't have picked off even the low hanging fruit where the computational resources and algorithms and data to make a big impact were well within reach 20 years ago.

This certainly means there is plenty of room for a lot developers to do very cool stuff and build careers on machine learning today, but it also mean most developers will not have to learn the state of the art - or anything near it - for a very long time.

As a concrete example I give to people, consider all of the search boxes out there on various sites - product searches, location searches, site searches - that are straight keyword based searches that don't take into account any clickstream data to improve ranking. The proportion of search boxes I see that take advantage of the available data is vanishingly small, even though very basic analysis can improve the perceived relevance of the results massively.

We certainly will see more companies invest in proper machine learning as the payoff gets higher and difficulty in taking advantage of it drops. But we will also see a huge proportion of sites that could use it continue to ignore it for years to come.

There are big business opportunities in finding ways of making a dent in that portion of the market, though, and so learning this stuff can certainly be well worth it on a personal level, but I don't believe in his timeline in terms of the overall market.

forgetsusername9y ago

>But we will also see a huge proportion of sites that could use it continue to ignore it for years to come. There are big business opportunities in finding ways of making a dent in that portion of the market

I agree with this wholeheartedly. I think there are pockets of HN that are out of touch with the reality in SMBs across the country (and planet). Often these businesses are struggling to track data, let alone utilize it for decision making or analyze it any sophisticated manner. "Out-of-the-box" ML will be another tool in the toolbelt in these instances.

So, sure, if you are trying to build Massive ML Tech Co., you won't be able to hire a few devs who watched a ML series on Coursera to achieve that goal. But there's ample opportunity to apply that knowledge elsewhere.

anotheryou9y ago

If competition with basic keyword search survives, the benefit of implementing something more sophisticated might just not be worth it. With search you usually know what you are looking for and can pin it down quite well with classic keyword search, discovery happens elsewhere.

vidarh9y ago

> With search you usually know what you are looking for and can pin it down quite well with classic keyword search, discovery happens elsewhere.

Having looked at search terms users enters on a lot of client sites, I don't believe that for a second. Users are exceptionally bad at composing search terms.

1 more reply

curuinor9y ago

Algolia, Swiftype and Constructor.io (disclaimer: I worked at the latter) all do the learning for that stuff, at a price.

vidarh9y ago

I think the existence of services like that makes the point even starker: There are tools out there that can provide some of the basics, and yet the low hanging fruit still persists. Just knowing that those services exist, and understanding the pros and cons on when to recommend them likely puts you ahead of the pack.

feral9y ago· 4 in thread

Reading HN I worry that we're going to have the opposite problem - a glut of people will try and (badly?) learn ML and then realize there aren't enough ML jobs.

I've a PhD and held ML-engineer positions in a few different companies - I've good industry awareness.

Most applied ML, for most companies, right now, is actually relatively simple models (hand-coded rules! logistic regression! You'd be shocked how common these are.) The bulk of the work is data cleaning, gathering, integration, deployment, productisation, reliability, avoiding pathological cases, special-casing, Product, UX. You do need ML specialists who understand the stuff, to make it all work and come together - but the ratio of ML specialists to the wider team is low. Maybe 1 or 2 specialist on a team of 10 for an ML heavy product.

This is going to remain the case IMO. Yes, there will be small teams, in highly resourced organizations (GOOG, FB etc), academic research labs, or occasional hard-tech startups, who do new model development. Maybe if AI becomes huge, you'll see more traditional Fortune 500s spin up similar efforts.

But there'll be a much wider set of people&businesses applying and tuning well understood approaches, rather than doing new model development. And you just don't need as many ML specialists, for that approach.

Even with deep learning, the tooling will advance. I mean, even look at all the research papers describing applications at the moment - so many of them are using pre-trained models. Industry will be similar. Tooling will advance, and you'll be able to do increasingly more with off-the-shelf pieces.

I think ML is absolutely going to have a big impact - I buy at least some of the hype. But should all developers, or even a substantial minority of developers, start learning ML as a career imperative? I don't think so.

Finally, it takes serious time to learn this stuff. Its easy to dabble (and worthwhile doing - its fun; and sometimes you can do powerful things in a using tools in a very blackbox manner!). But actually thoroughly learning it takes time. It takes serious time to build statistical intuition, as just one example.

We could easily end up with a great many career developers who have a specialization in ML, frustrated they never get to use it.

ylem9y ago

This is not my field, but a serious question--I once read that part of the motivation of pharmaceutical companies in hiring researchers was not so that they would all produce ground breaking independent research, but rather because they would be capable of reading the literature (again, not my field). Is that true at all for machine learning? Would companies hire people who would be up to date with the literature so that they could implement algorithms that others have developed in an academic context and put them into production?

feral9y ago

Yes, I think that's true.

Basically, the interfaces of the models/tools/abstractions people will use will be 'leaky'.

For example, you can take a machine learning method from scikit learn, which works really well on the scikit learn example, and apply it to your problem.

Any developer can do this pretty quickly. If it works, and gets good accuracy out-of-the-box, then great.

But if it doesn't work, what do you do? How do you know where to look, what could trip it up? Are your features OK, or is it a problem with your model? Or maybe you framed the problem wrong? When you get into this sort of area, that's when you need an ML expert, who knows whats actually going on under the hood - or can at least learn the particularly model quickly - and can make progress faster. A more general developer will slow down drastically once they start reading the documentation for how the model/system actually works.

And hopefully the expert will frame the project better from the very start, solving issues before they even arise, because they know the kind of issue that can occur, or suggest easier paths to solutions.

So, agree with your point there.

But how many such experts do you need? In my experience, only a small number on a bigger team. ML folk are highly leveraged, but need a lot of support to get their product into production - to manage the data (if its worth applying ML to, there's probably a lot of data, so a big data engineering task, maybe connected to a live system), to think about the UX etc.

This will all evolve on several levels of course:

- The tooling will get better; model/data deployment/management will get easier. But also and non-experts will be able to get more done as ML becomes more robust out-of-the-box.

- We'll get better at building ML products (e.g. team structure, data infrastructure, UX (designers learning how these things work), company org (e.g. a lot of friction between Agile and ML))

- Businesses will want to do bigger things

But if I had to guess, we'll be picking up low-hanging fruit for a while, and most of the work, for most companies, will be in the support infrastructure and application, with just a minority of specialist ML roles.

ska9y ago

Absolutely they will, and should. But they'll need relatively few people on the team(s) who can do this.

ska9y ago

    I mean, even look at all the research papers describing applications at the moment - so many of them are using pre-trained models.

Which is probably going to blow up in their collective faces at some point - it's at best a problematic technique. Many of these papers have little or no serious analysis of generalization issues or validation.

I think to the broader point that generally learning how to validate a ML technique is going to be very useful to a lot of people, implementing them less so.

pjungwir9y ago· 4 in thread

Sort of a content-free article, but the headline is an interesting bold claim that conjures a lot of thoughts:

- I know enough machine learning to be dangerous, but I'm hardly ever asked to use it. I designed a Bayesian classifier for my own startup around 6 years ago, analyzing political donor networks. I've completed the Stanford ML course. Back in college I did a math minor, so I'm comfortable with linear algebra, calculus, etc. I'm pretty comfortable with statistics of both kinds. But my bread-and-butter is freelance web development . . . and I'm not really even sure how to find work doing more MLy things.

- I've read over and over that the most time-consuming part of ML work is data collection & cleanup, and that matches my own experience. It is the same thing that killed so many data warehouse projects in the 90s. You don't need a Ph.D. to do it, but it is a tough and costly prerequisite. So it seems like you'll need non-ML programmers even for specifically ML projects.

- In a similar vein, Google has written about the challenges of "operationalizing" machine learning projects.[1] Having a little experience collaborating with a team doing an ML project, where they did the ML engine and I did the user-facing application, I can say that many ML experts are not experts in building reliable, production-ready software.

- Will there ever be a Wordpress of machine learning? If there is, the author will be rich, but you won't need a Ph.D. to operate it. But because ML requires hooks into your existing systems, I don't know if this will ever happen. What will happen I think is plugins to existing e-commerce systems for product recommendation or other off-the-shelf ML-powered features. These already exist, but I assume they will become more prevalent and powerful over time. In any case, the mainstreaming of ML for business will be inversely correlated with the expense to implement it, which suggests it will be easier and easier for non-expert developers to use (and misuse).

EDIT: Added the (now-)third bullet point I forgot before.

[1] https://research.google.com/pubs/pub43146.html

vidarh9y ago

> and I'm not really even sure how to find work doing more MLy things.

I think this is key - prospective clients won't ask for it because they don't understand where it could be used, and they won't understand the heavy ML methods. An approach there would be to pitch things like improving search results using a bayesian classifier applied to analytics data as a cheap upgrade when quoting other work. Until people are used to even the basic statistical approaches they won't be ready to invest in something more drastic.

Bartweiss9y ago

> I've read over and over that the most time-consuming part of ML work is data collection & cleanup, and that matches my own experience. It is the same thing that killed so many data warehouse projects in the 90s. You don't need a Ph.D. to do it, but it is a tough and costly prerequisite. So it seems like you'll need non-ML programmers even for specifically ML projects.

Yeah, this is a very good point. I trained a simple RNN to play games a while back, and the "RNN" part was something anyone with a decent math background could self-teach pretty quickly. There are lots of tricks and speedups and complications I could have added, but the baseline product was pretty easy.

The "trained" part, though. That took huge effort, and research, and tinkering. Building up a proper dataset with usable input features was 90%+ of my effort. Most of it was too specific to standardize in any useful way, so working with an off-the-shelf system for the learner wouldn't have saved any decent amount of effort.

sdenton49y ago

Re, Off the shelf: I saw a nice presentation a while back which asserted that you get a certain boost to your solution just throwing scikit.learn at the problem. After that, you spend time/money on either better, customized ml, or on feature engineering. Today, you very quickly get to a place where feature engineering is the better place to put effort.

So there's some off the shelf ml, which is pretty great, but the problem of automatic feature selection - especially if new features need to be plumbed into a system - is still insufficiently solved. We've got systems for paring down the set of available features, but our systems have no idea how to ask for new features. That's essentially domain research, after all, and probably a 'strong ai-complete' problem.

cr0sh9y ago

> Will there ever be a Wordpress of machine learning?

It certainly isn't at the same level, but Keras (imho) comes real close to being such a thing. Perhaps it, coupled with something like an expanded version of the TensorFlow playground, might make a power but easy-to-approach tool.

That said, even with Keras you still need an understanding of what the parts are for and how to "hook them up".

anupshinde9y ago· 3 in thread

Statements like these suggest that another AI Winter is coming (sooner than 3 years I guess)

"""He thinks even programming is vulnerable to being automated and reducing the number of available programming jobs."""

I believed something similar could happen within 1-2 years of learning/writing AI programs (more than 12 years back). I believed it so much that it consumed most of my weekends as I took on the Genetic Programming approach. Yes! computers can write programs - BUT trying reading those. Eventually after spending hours or days, you will be able to read those programs and you might find a simple "hello-world" program represented by a complex mathematical equation. Good luck trying to get such program fixed by humans. Imagine an experience decoding deep-learning-neural-nets. However, that is black-box from a programmer perspective.

From a business/management personnel perspective - the code is a black box anyways. When they get NNs that can generate required software, they will replace the people-manager with a NN-manager (who is a programmer btw!)

primaryobjects9y ago

Are you referring to something like this?

Using Artificial Intelligence to Write Self-Modifying/Improving Programs http://www.primaryobjects.com/2013/01/27/using-artificial-in...

anupshinde9y ago

No, that looks like GAs with string/array representations. Similar thing worked for me when I tried randomly referencing nodes within a chromosome (say from index A to index B) - generating a graph like structure

The outputs are like this for some not-so-easy targets:

Op nodes ['ADD', 'ADD', 'MUL'] EXPR: ['ADD[ni_99](ADD[ni_49](I__7[ni_43](), ADD[ni_19](I__8[ni_66](), ADD[ni_79](GET_CONST_3[ni_25](), I__9[ni_71]()))), I__1[ni_61]())', 'ADD[ni_13](I__6[ni_17](), ADD[ni_49](I__7[ni_43](), ADD[ni_19](I__8[ni_66](), ADD[ni_79](GET_CONST_3[ni_25](), I__9[ni_71]()))))', 'MUL[ni_68](GET_CONST_8[ni_73](), FLOAT[ni_42](I__1[ni_91]()))']

With genetic programming (using an AST), it can solve complex equations:

However, this simple equation i.e. correct answer "(a + b + c - d) / e" could be evolved and will result into either of these (depends on my luck maybe)

Case1: ((int)((b+((c-d)+a))/e)&(int)((b+((c-d)+a))/e))

Case2:

((((int)a&(int)(((((((mod(e,a)c)/e)(((((int)(d-a)&(int)b)+e)/a)/e))/((((int)e&(int)c)+e)+(c+(e/a))))c)/e)e))/((((int)e&(int)c)+e)+((b/e)+(e/e))))+(((((((((mod(e,a)c)/e)(((((int)(d-e)&(int)b)+e)/a)/e))/((((int)e&(int)c)+e)+((b/e)+(e/e))))c)/e) <..........10383 characters here.........> ))))))/e))/((((int)e&(int)c)+e)+(((d-e)/e)+((d/b)/e))))+(((mod(e,a)/e)+(b/e))+(((c/b)+((d+b)/e))/e)))))))))

The GP output (case 1 and 2) above was generated with a tweaked version of https://github.com/rogeralsing/go-genetic-math

makeset9y ago

> Statements like these suggest that another AI Winter is coming (sooner than 3 years I guess)

I wouldn't bet against it. It's very risky to extrapolate progress in AI.

A DARPA project named CHI [1] succeeded in automatically producing programs from a high-level knowledge description language. And when they fed it its own description, CHI rewrote itself, with a better implementation. Do you fear for your programming job yet? Well, that was 36 years ago.

[1] www.dtic.mil/get-tr-doc/pdf?AD=ADA105661

bsaul9y ago· 3 in thread

Honest question : once the technics will settle a bit, and libraries are created, what will be needed, apart from knowing that machine learning algorithm are based on some kind of stasticial inference, with a few settings here and there ?

I mean, we don't need a phd in image compression to create a service that streams videos. We just use libraries. Same for everything in computer science, it always end up packaged in some kind of reusable code or service, and only some specialists remain working in the field to work on marginal improvements.

Why would ML be any different ?

Eridrus9y ago

Once you've plugged your data into an ML system and gives you a classifier, are you done? What if the results are not good enough, do you just move on to another problem?

If you don't just move on to another problem you will need people who know how to push these systems further.

Namrog849y ago

But at some point, there is a difference between turning dials of the systems vs implementing new novel ways of these systems. Obviously we always need new novel approaches, but how much of that should stay in phd/academic/research world and how much spreads out into engineering day to day activities?

I am rather newbish in the whole space, but most everything I've seen is quite often just knowing enough to turn the right dials the right way. And I do wonder how long will it be before some of these dials can start turning themselves with more iterations with a simple, "yes that's what I want, or no that's what I don't want"

Sure, it might take some more cycles for it to find optimal in this example.

1 more reply

pesfandiar9y ago

Unlike video streaming libraries, there's no certain way for people to verify the outcome of an ML function is correct (e.g. Would this model work on other datasets? Is the statistical inference correct?). You need at least some basic knowledge about the underlying math to avoid pitfalls.

thomasahle9y ago· 3 in thread

I wonder to what degree he means. Because if he means "really" learning it, this is akind to the "learn to program in a week" books. It takes much more than three years to learn well.

vidarh9y ago

Here's my suggestion to people worried about this: Learn about how to use bayesian models and clustering methods, and how to recognise where they may be applicable, and you will already be able to produce things that will astound executives out there, and deliver very real value for your employer / customers, and those methods are simple - you don't even need to understand the maths to be able to make use of them, though it helps.

By all means, start learning actual machine learning methods too, rather than just statistical methods, but as I pointed out elsewhere on this thread: there is low hanging fruit everywhere.

Not nearly all of those will need "proper" machine learning, and even fewer will be willing to pay what it will cost to hire people with in-depth machine learning experience or pay the development costs or computational costs anytime soon. But a lot of them will buy into the buzzwords and look for a cheaper halfway-house or be open to pitches.

fixermark9y ago

I think his target audience was investors and company owners, so "Learn ML" in this context really means "Get savvy about the market and hire the people you need for your org to have some knowledge of the topic."

Perhaps a good time to be an ML grad coming fresh out of college.

diminoten9y ago

You could join the top 5% of knowers about machine learning in 3 years, I bet. I doubt you'd innovate, but you could at least master the state of the art within 3 years.

evgen9y ago· 2 in thread

I would be far more inclined to heed this advice if Cuban have any indication of understanding ML as anything more than a magic black box; fairy dust to be sprinkled into every pitch deck to solve any objection or solve difficult problems. The bandwagon is passing through folks, jump on board with Mark or you will have fewer buzzwords with which to craft your deck...

fixermark9y ago

Was he talking to engineers or owners / executives though?

""" The Upfront Summit is LA's premier technology event, with more than 750 of the country's top investors, startups, and corporate executives """

If he was talking to executives, it may be sound advice. It's extremely likely the hot business opportunities in the short-term will be applying ML techniques to outstrip competitors trying to solve problems with traditional hand-coded solutions. In business-speak, "Learn ML" translates to "Familiarize yourself with the space and hire the people you need who know the topic," because that's how a company "learns" something.

dentemple9y ago

As a developer, I'm perfectly okay with using a magic black box.

As long as we can reasonably understand the I/O (and that use of the software can pass a security review), I don't see a problem with this.

msvan9y ago· 2 in thread

Either he's right about machine learning, or this is exactly the kind of thing bubbles are made of.

geodel9y ago

Of course he is right. I saw some dinosaur characteristics already showing up when I looked myself in mirror this morning.

Twisell9y ago

You are so lucky! I can't even look into a mirror since I learn 3 years ago that a NoSQL ninja will soon stab me in the back and steal my job because I work with that declining SQL old stuff!

1 more reply

brilliantcode9y ago· 2 in thread

Normally I'd laugh off any Mark Cuban antics but he isn't wrong. AI is going to greatly reduce white collar jobs with economies of scale.

Luddites of 18th century thought they would never be replaced and continued on their trajectory.

ploika9y ago

In three years? No way.

A lot of white-collar jobs may be automated (or otherwise changed beyond recognition due to technology) after about thirty years maybe, but not three.

brilliantcode9y ago

I agree 3 years is way too short. I'd say 15 years is even early. 30 year seems maybe, 100% in 60 years.

gremlinsinc9y ago· 1 in thread

That's as strong as a statement as Trump: Learn Machine Learning or you'll be a Dinosaur in 3 years...

Maybe if it was coming from Bill Gates, Mark Zuckerberg, or another tech titan with some actual coding experience and a deeper level of learning about what ML even is. Cuban's a business man, and most CEO's I know don't have a clue about the stacks that run their own company, let alone what's popular.

That said, I do think ML will be important, but I develop ecommerce apps and things of the such in Laravel, unless I move into AI and Neural nets I don't see needing to know a lot about ML (though I wouldn't mind moving in that direction as that space picks up) -- but there's still plenty of opportunities without it.

AJ0079y ago

Making machines learn, applying machine learning to successfully solve problems, and positioning business assets to benefit from machine learning are three very different pieces of the "learn machine learning" puzzle. I don't know how any human would be able to go from 0 to having a good grasp of even one of these pieces in three years let alone all of them.

I suppose there will be (and to some degree already are) machine learning magic wands, but they are going to be the kinds of things that suck capital out of the companies that utilize them (through loss of proprietary data and competition blocking moats.)

marricks9y ago· 1 in thread

Assuming we're all going to be deep learning programmers is quite a bit foolhardy. I think what's really relevant to consider is AI winters can and do happen[1]. I would not disagree deep learning has done some amazing things, what I would say is it does have limitations.

What causes AI winters is when an advance such as deep learning can be applied to new problems and leads to increased interest. And while this new thing is really good as a subset of problems and impresses the public, of course it can't displace humans at everything and naturally has it's limitations.

So funding pours in, everyone gets hyped, and then those natural limits are (re)discovered and everyone gets all anti-AI research. Of course many people knew the limitations all along, but the dream is gone and so is a lot of funding until the next thing comes along.

This is probably natural to a lot of fields but AI just seems more prone to these boom and bust cycles because it's really exciting stuff.

[1] https://en.wikipedia.org/wiki/AI_winter

sevensor9y ago

Exactly. 25 years later, I'm still waiting to be replaced by an Expert System. A lot of the tech that fueled the hype train (logic programming! genetic algorithms!) is still really neat, but it didn't work out the way we expected. Same thing will happen with ML. It's the Prolog of tomorrow.

mad449y ago· 1 in thread

(Pre-apology. I am not trolling, please don't get my comment below more than what I intended: another perspective to look at the strong reaction Cuban's comments incited.)

Reading through the comments, I see that Cuban's statement upset and even angered several HN commenters. That is a strong emotional reaction.

I am not saying it is the 5 stages of grief, but the first 3 fits: denial, anger, bargaining, depression and acceptance.

Also from Howard Aiken: Don't worry about people stealing your ideas. If your ideas are any good, you'll have to ram them down people's throats.

return09y ago

I 'm not even sure why it provokes such reaction. Neural networks have existed for 4 decades ; without the deep- part, but we know their potential power and we still have not been replaced. Granted, this time it's different. I think what worries most of us is that neural nets need lots of data and we don't have access to it. Still, they are easy to learn and we should be learning about them (note to self).

onmobiletemp9y ago· 1 in thread

This is so dumb. You arent going to understand or get a job in machine learning unless you have at least a masters in the subject. Its extremely difficult and complex. I see tons of college students taking machine learning classes in anticipation of becoming a dinosaur amd none of them could get a machine learning job afterwards. Programming has always been pretty easy. The ai revolution wont be like the home computer revolution. Its going to be led by a relatively small group of academics, scientists and engineers working in prestigious research positions.

cr0sh9y ago

Currently there just aren't that many ML jobs out there to apply for - but who knows what the landscape will be like in 5 years or so?

Your assertion, though:

> This is so dumb. You arent going to understand or get a job in machine learning unless you have at least a masters in the subject. Its extremely difficult and complex.

...couldn't be further from the truth. You can understand this stuff without a masters in the subject. It really isn't too difficult or complex. Sure, I will admit that understanding how to take a derivative might be useful, but despite not having that knowledge (but I'm working on obtaining it), I have still been able to implement successfully working ML solutions - at least in a classroom-type environment.

My last success was getting a virtual car to drive around a virtual track, staying on the track and negotiating the curves, using an implementation of the NVIDIA End-to-End CNN architecture and some data I generated (plus augmentation and some other fun stuff). I used Keras and Python 3, running on my workstation at home, with a 750 ti SC as my "GPU" (I really need to upgrade this). My model converged very well after 10 epochs, but after 20 the loss was pushed pretty low to sub 1%. As far as I could tell, there wasn't evidence of overfitting (I need to do more investigation on this, though).

This was all done as part of Udacity's Self-Driving Car Engineer Nanodegree, which I am taking part in. Prior to this, I also completed Udacity's CS373 course in 2012, and Andrew Ng's ML Class in 2011. My motivation for all of this has mainly been my interest in autonomous unmanned ground vehicle robotics technology. I have an ongoing side-project in developing such a platform (seemingly back-burner'd a lot, though - life getting in the way, I guess). Even so, if a job offer comes about because of it, I'm not going to complain.

As it is, I believe the knowledge has helped me land positions, since it shows my dedication to improving my skills in problem domains outside of the everyday software development tasks. When potential employers have asked about it, I can show them some code I've worked on, while mentioning how some of the more simpler ML methods could help in a business problem domain. It sets me apart somewhat from other candidates, I believe.

Especially those who think the topic isn't worth their time to learn, because it may be "difficult and complex".

bluekite20009y ago· 1 in thread

anyone knows if there is a business need for human to train/label data? I have been thinking of going to a place w/ cheap labor cost (perhaps Vietnam) and set up an operation like this.

cr0sh9y ago

In the short term, perhaps.

In the long term, possibly not - there are already efforts and approaches being done in ML/deep learning to get models to label unlabeled data (google "deep learning unlabeled data" for tons of research info).

badthingfactory9y ago

I'll place this in the same folder as the articles claiming Wix will eventually replace web developers.

dkarapetyan9y ago

No thanks. Fundamentals and not hype is what makes one not a dinosaur.

anotheryou9y ago

I think prosthetic knowledge will become deeper and more accurate in the long run. And if it scales we don't need many people building the general purpose AI.

With this prosthetic knowledge we will have to learn much more what to ask and know how much the machine knows.

One has to quickly grasp the abstract that is one level too high or detailed for the machine to find and than find the seperate answers of the level below to recombine them. You can't yet ask where to open a restaurant, but you can google for demographics and write a program to map ratios between income, foot-traffic density and restaurant density.

Once we can ask what and where is the most profitable business in to start in town, we probably still don't get a step by step guide how to do it, interior design included. Where the rubber meets the road there is still a lot of oppertunities to decide on and complex data we can grasp more easy than the machine.

xs9y ago

Just like there isn't such a thing as polymaths anymore because the world has so many specialized skills, I think there won't be the possibilities of things like "full stack developer" in the near future because of the complexities for development. Some things that contribute to the complexities will be: ubiquitous controls, advanced AI, internet of things, augmented reality, machine learning, and new technologies we don't even have yet. We are in the golden age now where a single person can sometimes create a better website or app compared to a whole development team at a fortune 500 company. I think our Internet world will become so complex in the very near future that a single person simply won't be able be capable and they'll have to become specialized in only a portion of it.

taytus9y ago

Sorry, I refuse to visit such a clickbait headline.

owaislone9y ago

I think what he meant to say was that companies should learn it in 3 years or the competition will drive them out of business. I don't think he meant individuals will be dinosaurs, but startups that fail to take advantage of ML/AI.

xamuel9y ago

Three years ago, Machine Learning on a resume meant: "Good candidate."

Currently, it means: "Jumps on bandwagons, caution."

In three years, it'll mean: "Brainless buzzwords, avoid."

JustSomeNobody9y ago

I want a nickel every time someone says programming jobs will be automated in 5 years. This goes back to the beginning of programming and it hasn't happened.

coldcode9y ago

Machine learning would imply the machine needs to learn it not me. Knowing something about it and actually using it are quite different. While it might appear "everywhere" most of programming is still not ML and probably won't me. When I started in 1981 I didn't know C yet, that did not make me a dinosaur then either.

hnmot2239y ago

"Mark also said that what happens in the next 5–10 years is just going to blow everybody away, especially in the field of automation. He thinks even programming is vulnerable to being automated and reducing the number of available programming jobs."

He's talking out of his ass here. This won't be happening anytime soon (if ever)

usgroup9y ago

I kind of read it "learn it and you'll be a dinasaur for 3 years". It made me quite excited at the prospect. I started to practice growling and running at a curious forward angle , but on second reading I find myself disappointed.

id1220159y ago

It doesn't matter how far technology gets, there are some -isms and they are the real Dinosaurs that we have to get over and we are too small.

bgdkbtv9y ago

Oh yeah? Does Mark Cuban know artificial intelligence himself or is he just asking people to study it and work for him? :)

acd9y ago

I am sysadmin/devops what machine learning tools and topics would you recommend to learn for that field?

mi100hael9y ago

That page has so much JS bloat it made my top-of-the-line MBP lag just scrolling.

cstuder9y ago

Ok, so where do I start?

1 more reply

rocky11389y ago

Why don't we just learn it in 3 years, then?

general_ai9y ago

No, you're not going to be a "dinosaur". 99% of extremely well compensated software engineering jobs do not involve ML. Using top large companies as a proxy of what things are going to be like in the world at large 3 years from now, maybe one in 200-300 engineers does anything in any way related to ML there. And that's a generous estimate. You do need to know what it is, roughly, but there's no need to drop everything you're doing and switch careers.

otikik9y ago

Yeah. I'm going to be a clickbaitsaurus.

j / k navigate · click thread line to collapse

130 comments

107 comments · 36 top-level

minimaxir9y ago· 20 in thread

hasbroslasher9y ago

willholloway9y ago

I would argue we are already inside the singularity.

As far as ML, we have seen this before. When the tools get cheaper and easier to use, adoption goes up. RoR + cheap cloud servers and easy VC financing greatly increased the number of startups.

Now TensorFlow and the Nvidia Pascal architecture have made ML accessible and fast.

There is a ton of money to be made on the ML train guys. It's the new social-local-mobile. Much will be shit, some will change all our lives for the better.

I'm old enough now to know how this works.

Edit PS.

Machines have beaten humans at chess, go and most impressively in my opinion Jeopardy!

ML is real and it's here and you should leverage it as much as you can, all the knowledge and tools are free.

1 more reply

AndrewKemendo9y ago

>Most people I know who have some ML knowledge also realize how far we are from a computer program that can read code, read specs, and then decide how to implement or fix existing code.

2 more replies

cookiecaper9y ago

Yeah, we just came across someone on HN trying to write a testing DSL with NLP. [0] He's apparently gotten funding for this.

[0] https://news.ycombinator.com/item?id=13572415 ; my reply downthread at https://news.ycombinator.com/item?id=13573978

eldavido9y ago

I've noticed this and I think it's an expression of a more general human urge to feel more important / significant than you really are.

IsaacL9y ago

I was amazed to see this comment, like a meteorite of sanity crashing into HN.

thearn49y ago

For whatever reason, statistics seems to be in the midst of a serious branding problem. Maybe traditional frameworks of inference are too conservative for non-researcher's tastes?

estro9y ago

1 more reply

digitalzombie9y ago

> For whatever reason, statistics seems to be in the midst of a serious branding problem.

Most are stuck with Tableau and SAS. Pharmacy companies uses SAS. Startup companies and tech companies uses Python and R.

I'm speaking from one data point sample, my current stat grad program. My undergrad is comp sci and I did a few years in the tech industry before coming back to grad school.

1 more reply

zebrafish9y ago

Maybe we should explain the cost/benefit of the buzzwords vs. the science?

logfromblammo9y ago

It's simple. More buzzwords equals more money. So whatever it is you're really doing, just find some way to semi-plausibly attach the latest hot buzzwords to it.

Eridrus9y ago

There have even been papers showing differentiable trees integrated into the deep learning framework. There is work on how to input graphs.

Maybe this is just my own desire for unifying frameworks shining through, but joint training seems to have real benefits and feature learning has clearly had a large impact on reinforcement learning.

Maybe it's not the best tool for day to day data science work, but that tends towards what tools are accessible and good enough, rather than what is best.

pif9y ago

Your comment sounds wise, but with all those acronyms and technical terms I can't be sure :-)

ska9y ago

CNN's are great for some image recognition/classification problems, but that's just the start of the conversation.

lottin9y ago

jackcosgrove9y ago

Interestingly, the parsimony principle is lent empirical weight by bad results obtained from overfitting.

1 more reply

randcraw9y ago

I suspect that others using ML, who are inclined to test hypotheses, induce cause, and discover unknowns (via generative ML), are far fewer.

So it's hardly surprising that the eye of discriminative MLers dominates in defining 'sexiness'.

mattnewton9y ago

wakkaflokka9y ago

I call it the Jimmy Johns method of machine learning - just keep stacking those layers.

russdill9y ago

Once it's good enough to solve as many problems as VCs claim, you won't need anyone to program it :p

deepnotderp9y ago· 9 in thread

dmreedy9y ago

deepnotderp9y ago

I'm not sure I fully buy the "explainability" argument against deep learning since our goal is human level intelligence and human intelligence isn't easily explained either.

1) Attention maps in CNNs can tell us what the net is usually looking at.

2) "Attentive Explanations" use attention mechanisms to point to the object of interest to generate an explanation for VQA tasks, check the paper (warning PDF): https://arxiv.org/pdf/1612.04757

3) A recent project used a similar explanation mechanism that forced the network to output "what it was thinking" while playing an Atari game.

4) NTMs (neural turing machines) allow weighted memory access, which alleviates the back box issue to some extent.

4 more replies

cr0sh9y ago

If only.

But we create machines that do that same - and it worries us. I sometimes wonder if that says more about us than about the machines...

sgt1019y ago

Yup, can't explain it, don't understand it.. let's let it loose on pediatric surgery what can go wrong!

feral9y ago

You can argue against Cuban's point that anyone who doesn't know machine learning will be "dinosaur in 3 years", while also being enthusiastic about deep learning.

> "traditional" machine learning methods ... may be good enough for business tasks, they are not the cutting edge of AI research

That's true - but so? Under discussion is whether every developer should drop what they are doing and learn ML/DL, not what the cutting edge of AI research is?

> older ML PhDs are angry that deep learning is so easy

deepnotderp9y ago

I agree that Cuban is being foolish here, I was just talking about the technical perspective.

reachtarunhere9y ago

On the other hand it is true that DL is not required for many (most?) companies as they don't work with the kind of problems you list (vision, robotics)

romaniv9y ago

>And let's take a look at AlphaGo, how would you do that with SVMs or decision trees?

This is a poor example. There are Go programs that work very well and use exclusively MCTS without any neural networks.

deepnotderp9y ago

This is a fallacy, AlphaGo didn't increase very much with additional hardware at all, it's elo would still have been higher than lee se-dol's elo.

1 more reply

itg9y ago· 9 in thread

This is more VC/founders who are hyping up AI and need more ML folks so they can drive down costs.

minimaxir9y ago

The notion that you can only do "serious ML" if you have a PhD/published papers is a No True Scotsman fallacy.

vidarh9y ago

This is what I keep telling people too - that the "old-fashioned" statistical methods aren't applied to more than a tiny little fraction of the problems they could be applied to yet.

jbooth9y ago

Yet, I can still apply machine learning to solve Ax=b problems. More importantly, I can use business analysis and write code to transform business problems into an Ax=b problem, and then optimize it.

Avalaxy9y ago

> Any place doing serious ML will require the person to have a PhD or have publications and presentations at conferences like NIPS/ICML.

solipsism9y ago

Obviously you will want a theoretical expert on your team. But if your startup is counting on having a room full of them, good luck.

forgetsusername9y ago

>Any place doing serious ML

Good thing the vast majority of businesses won't need "serious" ML, but instead will require only simple implementations to help solve business problems.

sidlls9y ago

It isn't required for any serious research effort, except by the accident of inertia. And it certainly isn't a necessary indicator of determination.

tensor9y ago

1 more reply

dang9y ago

> Utter nonsense.

This counts as name-calling in the sense that the HN guidelines ask you not to do it: https://news.ycombinator.com/newsguidelines.html. Your comment would be much better without that bit.

We detached this subthread from https://news.ycombinator.com/item?id=13599533 and marked it off-topic.

vidarh9y ago· 5 in thread

forgetsusername9y ago

anotheryou9y ago

vidarh9y ago

> With search you usually know what you are looking for and can pin it down quite well with classic keyword search, discovery happens elsewhere.

Having looked at search terms users enters on a lot of client sites, I don't believe that for a second. Users are exceptionally bad at composing search terms.

1 more reply

curuinor9y ago

Algolia, Swiftype and Constructor.io (disclaimer: I worked at the latter) all do the learning for that stuff, at a price.

vidarh9y ago

feral9y ago· 4 in thread

Reading HN I worry that we're going to have the opposite problem - a glut of people will try and (badly?) learn ML and then realize there aren't enough ML jobs.

I've a PhD and held ML-engineer positions in a few different companies - I've good industry awareness.

We could easily end up with a great many career developers who have a specialization in ML, frustrated they never get to use it.

ylem9y ago

feral9y ago

Yes, I think that's true.

Basically, the interfaces of the models/tools/abstractions people will use will be 'leaky'.

For example, you can take a machine learning method from scikit learn, which works really well on the scikit learn example, and apply it to your problem.

Any developer can do this pretty quickly. If it works, and gets good accuracy out-of-the-box, then great.

So, agree with your point there.

This will all evolve on several levels of course:

- The tooling will get better; model/data deployment/management will get easier. But also and non-experts will be able to get more done as ML becomes more robust out-of-the-box.

- We'll get better at building ML products (e.g. team structure, data infrastructure, UX (designers learning how these things work), company org (e.g. a lot of friction between Agile and ML))

- Businesses will want to do bigger things

ska9y ago

Absolutely they will, and should. But they'll need relatively few people on the team(s) who can do this.

ska9y ago

    I mean, even look at all the research papers describing applications at the moment - so many of them are using pre-trained models.

I think to the broader point that generally learning how to validate a ML technique is going to be very useful to a lot of people, implementing them less so.

pjungwir9y ago· 4 in thread

Sort of a content-free article, but the headline is an interesting bold claim that conjures a lot of thoughts:

EDIT: Added the (now-)third bullet point I forgot before.

[1] https://research.google.com/pubs/pub43146.html

vidarh9y ago

> and I'm not really even sure how to find work doing more MLy things.

Bartweiss9y ago

sdenton49y ago

cr0sh9y ago

> Will there ever be a Wordpress of machine learning?

That said, even with Keras you still need an understanding of what the parts are for and how to "hook them up".

anupshinde9y ago· 3 in thread

Statements like these suggest that another AI Winter is coming (sooner than 3 years I guess)

"""He thinks even programming is vulnerable to being automated and reducing the number of available programming jobs."""

primaryobjects9y ago

Are you referring to something like this?

Using Artificial Intelligence to Write Self-Modifying/Improving Programs http://www.primaryobjects.com/2013/01/27/using-artificial-in...

anupshinde9y ago

The outputs are like this for some not-so-easy targets:

With genetic programming (using an AST), it can solve complex equations:

However, this simple equation i.e. correct answer "(a + b + c - d) / e" could be evolved and will result into either of these (depends on my luck maybe)

Case1: ((int)((b+((c-d)+a))/e)&(int)((b+((c-d)+a))/e))

Case2:

The GP output (case 1 and 2) above was generated with a tweaked version of https://github.com/rogeralsing/go-genetic-math

makeset9y ago

> Statements like these suggest that another AI Winter is coming (sooner than 3 years I guess)

I wouldn't bet against it. It's very risky to extrapolate progress in AI.

[1] www.dtic.mil/get-tr-doc/pdf?AD=ADA105661

bsaul9y ago· 3 in thread

Why would ML be any different ?

Eridrus9y ago

Once you've plugged your data into an ML system and gives you a classifier, are you done? What if the results are not good enough, do you just move on to another problem?

If you don't just move on to another problem you will need people who know how to push these systems further.

Namrog849y ago

Sure, it might take some more cycles for it to find optimal in this example.

1 more reply

pesfandiar9y ago

thomasahle9y ago· 3 in thread

I wonder to what degree he means. Because if he means "really" learning it, this is akind to the "learn to program in a week" books. It takes much more than three years to learn well.

vidarh9y ago

By all means, start learning actual machine learning methods too, rather than just statistical methods, but as I pointed out elsewhere on this thread: there is low hanging fruit everywhere.

fixermark9y ago

Perhaps a good time to be an ML grad coming fresh out of college.

diminoten9y ago

You could join the top 5% of knowers about machine learning in 3 years, I bet. I doubt you'd innovate, but you could at least master the state of the art within 3 years.

evgen9y ago· 2 in thread

fixermark9y ago

Was he talking to engineers or owners / executives though?

""" The Upfront Summit is LA's premier technology event, with more than 750 of the country's top investors, startups, and corporate executives """

dentemple9y ago

As a developer, I'm perfectly okay with using a magic black box.

As long as we can reasonably understand the I/O (and that use of the software can pass a security review), I don't see a problem with this.

msvan9y ago· 2 in thread

Either he's right about machine learning, or this is exactly the kind of thing bubbles are made of.

geodel9y ago

Of course he is right. I saw some dinosaur characteristics already showing up when I looked myself in mirror this morning.

Twisell9y ago

You are so lucky! I can't even look into a mirror since I learn 3 years ago that a NoSQL ninja will soon stab me in the back and steal my job because I work with that declining SQL old stuff!

1 more reply

brilliantcode9y ago· 2 in thread

Normally I'd laugh off any Mark Cuban antics but he isn't wrong. AI is going to greatly reduce white collar jobs with economies of scale.

Luddites of 18th century thought they would never be replaced and continued on their trajectory.

ploika9y ago

In three years? No way.

A lot of white-collar jobs may be automated (or otherwise changed beyond recognition due to technology) after about thirty years maybe, but not three.

brilliantcode9y ago

I agree 3 years is way too short. I'd say 15 years is even early. 30 year seems maybe, 100% in 60 years.

gremlinsinc9y ago· 1 in thread

That's as strong as a statement as Trump: Learn Machine Learning or you'll be a Dinosaur in 3 years...

AJ0079y ago

marricks9y ago· 1 in thread

This is probably natural to a lot of fields but AI just seems more prone to these boom and bust cycles because it's really exciting stuff.

[1] https://en.wikipedia.org/wiki/AI_winter

sevensor9y ago

mad449y ago· 1 in thread

(Pre-apology. I am not trolling, please don't get my comment below more than what I intended: another perspective to look at the strong reaction Cuban's comments incited.)

Reading through the comments, I see that Cuban's statement upset and even angered several HN commenters. That is a strong emotional reaction.

I am not saying it is the 5 stages of grief, but the first 3 fits: denial, anger, bargaining, depression and acceptance.

Also from Howard Aiken: Don't worry about people stealing your ideas. If your ideas are any good, you'll have to ram them down people's throats.

return09y ago

onmobiletemp9y ago· 1 in thread

cr0sh9y ago

Currently there just aren't that many ML jobs out there to apply for - but who knows what the landscape will be like in 5 years or so?

Your assertion, though:

> This is so dumb. You arent going to understand or get a job in machine learning unless you have at least a masters in the subject. Its extremely difficult and complex.

Especially those who think the topic isn't worth their time to learn, because it may be "difficult and complex".

bluekite20009y ago· 1 in thread

anyone knows if there is a business need for human to train/label data? I have been thinking of going to a place w/ cheap labor cost (perhaps Vietnam) and set up an operation like this.

cr0sh9y ago

In the short term, perhaps.

badthingfactory9y ago

I'll place this in the same folder as the articles claiming Wix will eventually replace web developers.

dkarapetyan9y ago

No thanks. Fundamentals and not hype is what makes one not a dinosaur.

anotheryou9y ago

I think prosthetic knowledge will become deeper and more accurate in the long run. And if it scales we don't need many people building the general purpose AI.

With this prosthetic knowledge we will have to learn much more what to ask and know how much the machine knows.

xs9y ago

taytus9y ago

Sorry, I refuse to visit such a clickbait headline.

owaislone9y ago

xamuel9y ago

Three years ago, Machine Learning on a resume meant: "Good candidate."

Currently, it means: "Jumps on bandwagons, caution."

In three years, it'll mean: "Brainless buzzwords, avoid."

JustSomeNobody9y ago

I want a nickel every time someone says programming jobs will be automated in 5 years. This goes back to the beginning of programming and it hasn't happened.

coldcode9y ago

hnmot2239y ago

He's talking out of his ass here. This won't be happening anytime soon (if ever)

usgroup9y ago

id1220159y ago

It doesn't matter how far technology gets, there are some -isms and they are the real Dinosaurs that we have to get over and we are too small.

bgdkbtv9y ago

Oh yeah? Does Mark Cuban know artificial intelligence himself or is he just asking people to study it and work for him? :)

acd9y ago

I am sysadmin/devops what machine learning tools and topics would you recommend to learn for that field?

mi100hael9y ago

That page has so much JS bloat it made my top-of-the-line MBP lag just scrolling.

cstuder9y ago

Ok, so where do I start?

1 more reply

rocky11389y ago

Why don't we just learn it in 3 years, then?

general_ai9y ago

otikik9y ago

Yeah. I'm going to be a clickbaitsaurus.

j / k navigate · click thread line to collapse