The general concept has a real world use, but this specific implementation probably doesn't perform as well as most of the already available ones on more powerful hardware.
Maybe it could have an IRL use for efficient wake word spotting or the like though.