Basically logistically it's going to need to be in a data centre.
It's ideal for small context high throughput. Perhaps parsing huge text piles like if you had the entire Epstein files as text.
I think Claude code benefits from larger context to keep your entire project in view and deep reasoning.
What this would certainly replace is when Claude dispatched to Haiku for manual NLP tasks.