This project started after I tried a bunch of Chrome extensions that let you speak to ChatGPT. The one I liked best was talk-to-chatgpt: https://github.com/C-Nedelcu/talk-to-chatgpt
I forked the code and refactored it quite a bit to improve the voice recognition and the quality of the text-to-speech (using WellSaid). Then I added screen capture capabilities. That's when it started feeling truly magical and useful to me.
The biggest issue is that ChatGPT is still too slow, but Sam Altman said during DevDay that improving speed is their next big priority.