I repeated those tests recently and someone suggested I try Reaper, so I downloaded it a few days ago and ran the tests again.
Reaper without any meddling gave me results similar to Live - Reaper chose to put my two ensembles in the same physical core and I got sound glitches. Sonar does not do that, it put them in separate physical cores. But the overhead for Sonar was much greater than Reaper or Live and that extra overhead defeated the gains of using the second physical core.
But when I disabled hyperthreading I saw this in Reaper:

I was able to load 4 instances without any problems. I did not test this using Live or Sonar (that is run the test with HT OFF), but I believe that would have worked just as well.
Telling Reaper to use Cores 1,3,5,7 with hyperthreading on also worked fine:
There's no way to set affinity for DLLs, just executables.
Of the three (Sonar Live Reaper) only Reaper with the cpu core locking capability was able to load balance the 4 instances of Reaktor that I used with HT ON.
Note that my CPU was running at 4.25 GHz to get that to happen.
My next audio system build may have 64 cores, but they may be locked at 2.8 GHz per core and will never get close to running this.
The most powerful supercomputer today uses a cluster of core running at near that speed - it will perform poorly on this test.