See some of their example solutions here (along with their projects in 2022): https://id2223kth.github.io/assignments/project/ServerlessML...
One of the student labs was gamified language learning - say the image in the language you are learning, and Whisper tells you whether you said it in an understandable way.
This Gradio implementation is a more polished version of their early efforts.
It has a few customization features that are nice: https://github.com/Purfview/whisper-standalone-win/discussio...
Works miles better than plain faster-whisper, in my experience. Not sure if there's wildcard support but that's easily scripted.