I am actually working on a special Kinect mode for VC. So far it seems to let you speak from further away from the microphone, but the results are not spectacular (at least not with the Kinect I have), and I have yet to really test it with music playing, etc. Also, using the Kinect language packs has some serious drawbacks. You have to use SP, which means no dictation and no training, and for some reason the Kinect language packs also do not support numbers!
If you need someone to help with testing, I am up for it.
I was thinking of using VB to develop some basic executables that give some feedback to VC. Maybe a gesture to trigger VC modes. The depth cam could also be used to determine how far away the speaker is, and that distance could be used to adjust the mic volume, as a kind of AGC.
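Here is a rough sketch of the distance-to-volume idea, in Python rather than VB just to keep it short. It does not touch the Kinect SDK or VC at all: the distance is assumed to come from the depth camera by some other means, and the function names, the 1 m reference distance, and the gain limits are all made up for illustration. It assumes the voice level drops roughly in proportion to distance, which is only approximately true in a real room.

```python
def distance_to_gain(distance_m, ref_distance_m=1.0, min_gain=0.5, max_gain=4.0):
    """Map speaker distance (metres) to a mic gain multiplier.

    Assumption: level falls off roughly as 1/distance, so the
    compensating gain is distance / reference distance, clamped
    to a safe range.
    """
    if distance_m <= 0:
        # Treat zero or negative as "no valid depth reading": leave gain alone.
        return 1.0
    gain = distance_m / ref_distance_m
    return max(min_gain, min(max_gain, gain))


def smooth(previous_gain, target_gain, alpha=0.2):
    """Move part of the way toward the target so the volume does not jump."""
    return previous_gain + alpha * (target_gain - previous_gain)


if __name__ == "__main__":
    gain = 1.0
    # Hypothetical distance readings in metres (0.0 = no valid reading).
    for reading in [1.0, 1.5, 2.0, 0.0, 3.0]:
        gain = smooth(gain, distance_to_gain(reading))
        print(f"distance={reading:.1f} m  gain={gain:.2f}")
```

The smoothing step is there because depth readings tend to be jittery, and changing the mic volume on every frame would probably hurt recognition more than it helps.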