I think it's deeper than that; I think the problem is mobile devices. The OS has to somehow paper over the fact that there's no mouse, and that everything has to be done with finger-stabs on a 3"x5" screen. That doesn't work with the traditional desktop widgets, so a variety of OS-level widgets and JavaScripty plugins gets layered on top. But (a) they're not consistent with one another, and (b) they're not consistent with the desktop metaphor (which isn't going to go away).
Basically, I don't think a phone is suitable for user input of any complexity. It's a device for selecting content that you then consume passively. It can't be used as a replacement for a desktop. "Mobile first" sounds all very well, but nearly all mobile-first projects have the desktop portion permanently stubbed out.