Gotcha. No, if you plan to focus on audio, then by all means, don't let yourself get sidetracked. For text content, I'm already using Voice Dream Reader, which, in my mind, is the best there is. Being able to study both text and audio content in the same app, with the same workflow for highlights, would just be the icing on the cake.
And I really like your one-tap highlight functionality (or triple-clicking the headphone buttons). It's very useful when driving, or when the phone is tucked away and you're listening via headphones.