I've already played with the vision API, so that doesn't seem all that new. But I agree it is impressive.
That said, watching back a Windows Vista speech recognition demo[1] I'm starting to wonder if this stuff won't have the same fate in a few years.