Browsing is not the same as using a personal assistant.
First, it takes much less compute to serve a page than to run an LLM query. LLMs are slow even if you eliminate all network.
Second, your expectations when browsing are not the same as when using a personal assistant.
Right now even when I simply ask Siri to set a timer it takes more than a couple of seconds. Add an actual GPT in the mix and it’s laughable.
In any case, even with a private relay, Apple’s phrasing does not deny sending device identifiers and allowing ClosedAI/Microsoft to build your shadow profile (without storing requests verbatim).