Really late to the party, but I love this concept. I feel like this would be really difficult in an open office/shared office.
I enjoy team-based offices, 7-10 person rooms. Even there, this would probably be a nightmare unless you had this tech running in real time so you don't get microphone crosstalk/echo.
Nonetheless, I really like the spirit of the system.
Apparently IBM tested a system where participants' faces were projected onto dummies' faces in a real room, with voices relayed through speakers on each dummy, and then the whole thing was recorded and broadcast to participants.