Good question! :-)
Back then, we were very proud to offer perfect alignment between Main Mic and the direct signal from virtual spot mic (i.e. MIR's "dry" signal component). As a matter of fact, we carefully cut any remnant of the recorded direct signal from all IRs and replace it in real-time with the readily positioned dry signal, to avoid any hint of phasing and/or timing issues. So there's nothing you could compensate. :-)
That said, I know very well that sometimes the delay between spot mic and main mic adds to the sense of perceived depth and enveloping. You can achieve this (in VE Pro only!) by splitting the wet from the dry MIR signal and adding some milliseconds of delay to the former.
HTH,