Does any voice assistant do this right now? Genuine question, I don't actually know. It sounds useful as long as it's not invasive.
Alexa+ does, but I don't use it for anything except kitchen timers and home automation triggers, so I can't speak to how well it works in a longer conversation.
Zoom's meeting notes excels at this, Google Meet is terrible at it. Meet mishears our company name about 90% of the time; various attendee names are a coin toss.
* "this" being: context consideration in speech-to-text/transcription.
And though I have the feature enabled that should cause it to ask ChatGPT about things it can't answer, that works even less frequently.
But even if all of these things were true, the stuff on your phone you would expect to be exposed to the model as available tool calls, are not. So their efficacy is very limited.
(edit: iPhone 16 Pro Max, if anyone is curious)