(1) popularization of pytorch
(2) network effect due to the number of users
(3) uploading weights, distributing weights, running demo on GPUs is mandatory part of ML engineering
=== interlude ===
I hope it doesn't sound like I'm being argumentative; discussion is especially interesting to me because it often weighs on me how hard it is to explain HuggingFace. So I enjoy trying and improving at it.
=== longer analogy ===
Imagine if all mobile developers in 2008* needed to host demos on iPhones captive in a server farm somewhere.* Some company offered that for free. On top of it apps were 30 GB, but the company hosted downloads for free. So everyone is putting their stuff on there. Then that feedback loop continues while the field takes a historic spike in interest and it's 4 years later.
* AI developers in 2020.
** GPUs captive in a server farm somewhere.
== Musings ==
This sort of highlights a thread of discussion for startups, the unreasonable effectiveness of specialization. Data scientists in 2020 use Python because they can, they're not really familiar with GitHub as in VCS so their mental model of it is more a dropbox. All of a sudden there's an $X billion (so far) opportunity to clone GitHub, but make it marginally easier to use via hiding stuff that's necessary for all other software, and then light money on fire hosting GPUs and S3.