I don't think it's phasing, since it's clearly two different pitches and not a minute time (phase) difference. In fact, any blobs I haven't moved play back phase-accurately with the original audio. I figured it was the original clip being played, but unlike in older Sonar versions with V-Vocal, there is no actual original clip "behind" the Melodyned one; otherwise I'd start looking there.
It's not input echo. I'm on X3e right now, but it has happened at least as far back as X3c.