I think they have some kind of dedicated low-power chip that listens only for a specific wake phrase and wakes the rest of the device when it hears it. It seems super unlikely that they could constantly listen to everything anyone says, pick out things that can be advertised, and show ads for those things the next time the user browses, all without eating lots of power and data.
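
To put a rough number on the data side of that, here's a back-of-envelope sketch. The 16 kbit/s bitrate is purely my assumption for compressed speech, not a measured figure for any real device, and the function name is made up for illustration.

```python
# Back-of-envelope: data needed to upload audio around the clock.
# ASSUMPTION: 16 kbit/s is an illustrative compressed-speech bitrate,
# not a measured figure for any real device.
ASSUMED_BITRATE_KBPS = 16
SECONDS_PER_DAY = 24 * 60 * 60


def daily_upload_megabytes(bitrate_kbps: float, seconds: int = SECONDS_PER_DAY) -> float:
    """Return megabytes (decimal, 10**6 bytes) sent when streaming for `seconds`."""
    total_bits = bitrate_kbps * 1000 * seconds
    return total_bits / 8 / 1_000_000


if __name__ == "__main__":
    mb = daily_upload_megabytes(ASSUMED_BITRATE_KBPS)
    print(f"{mb:.1f} MB per day, about {mb * 30 / 1000:.1f} GB per month")
```

Under that assumed bitrate it works out to roughly 173 MB a day, which is the kind of traffic people would probably notice. A different bitrate scales the result linearly, so plug in whatever figure you think is more realistic.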