Something else I wanted to ask about - does the voice recognition engine (either wav2letter or the new one you mention) adapt/learn according to the individual using it? I have a fairly strong Scottish accent, and would prefer to speak naturally if possible.
I view fully automatic online training as a sort of anti-pattern - Dragon does that and it will randomly forget entire words. Talon may eventually have some kind of process for self-serve model training. I do have some plans for what that might look like.
Even without automatic model training there's already a feature to automatically create a sort of "personal dataset" as you use Talon, which you can use to train speech models (Talon or otherwise) down the line, or even send me to improve the main model.
I also found that the microphone makes a big difference - with my Plantronics Voyager Legend bluetooth headset, it was basically unusable, misunderstanding almost everything I said. But if I used a cheap Logitech USB headset that I've had for a decade, alphabet accuracy was good.
Something else is that it does seem to struggle a bit with my accent. For example, with the alphabet I would say "air", and 75% of the time it would hear `oh`/`near` - however, if I said "air" in an American accent, it heard `air` correctly every time.
Will be interesting to see how your new engine fairs when it's released.
The wiki is something I currently try to introduce folks to later in the process, because it's unofficial and historically has had assumptions, inaccuracies, or very outdated information that caused me additional stress/support load. I know the community has been working on improving that.
Bluetooth mics are almost universally worse than cheap wired mics, due to bandwidth/power/compression constraints. If you make a file user/settings.talon containing "settings(): speech.record_all = 1", Talon will record successful utterances to recordings/ adjacent to user/, and you can compare what the mic sounds like to Talon. It's also very likely the mic works better with Conformer.
The alphabet is pretty easy to change. Check out the top of keys.py. There are some words that aren't really the engine's fault when it comes to accent, and some pairs like air/near are more of a configuration issue if your accent doesn't differentiate them.
I'm hoping to release v0.2 with Conformer sometime around July 1