Incidentally, I just complained in another post about how needing to "invoke" voice mode makes it much less fluent. You may at least want to put that one on the mental back burner and see if you can think of a solution for it. I don't know what, but anything that reduces the friction of invoking a voice app would help a lot. (I assume this is where the Amazon Echo idea comes from.)