The reason why we are barely getting started into VLBI in the IR on the ground (and nothing that I have heard of yet in space) is that the different apertures need to be stable relative to each other with a precision better than a small fraction of a wavelength. (One tenth, one twelves and one twenties are often used, depending basically on which performance drop relative to the theoretical optimum you are willing to live with.)
For radio astronomy, where we do VLBI everyday, we have to handle waves of wavelength 1 cm and position antennas to a precision better than a millimeter. Not easy when the antennas are scattered across the country, but something we can pull off.
For IR astronomy we are talking wavelength in the range of 1000 nanometers to 30 microns. So at the easiest end of the spectrum you would have to position satellites to a precision better than 3 microns relative to each other, while flying on orbit and being pulled and pushed by tidal forces, gradients in the graviational fields and solar wind pressure (which contains turbulent fluctuations). For it to actually work in near IR you would have to get the positioning right to within 100nm.
For comparison: The mirrors of JWST itself are flat to within about 25nm. And in some sense we ARE doing IR VLBI with JWST since we have separate mirror segments that we all position correctly relatively to each other. But doing so we separate freeflying satellites is something we just aren't capable of yet.
PS: Yes, LISA Pathfinder has demonstrated measurements of spacecraft separation down to a few picometer, so we are slowly getting there.