Where tests like that are helpful in that you may be interested in 1) seeing how efficient Sonar is, 2) such as seeing how your DAW's load balancing compares to other DAWs (the dreaded core 1 is spiking the others are nowhere near 100%), 3) determining if there's another DAW that will allow you to squeeze a few more plugins/tracks into your mixes (this would be helpful if you had a fast PC but were still maxing out your CPU during large mixes). It could also help companies like Cakewalk in identifying where there can be room for improvement, by someone else doing free performance benchmarking for them.
It's completely legit, and if you read about their testing methods and how they distribute the plugins...I think it's a fairly unimpeachable method of comparison. If you also read his criticism of Noel's testing method, his points seem entirely valid. I'd love to see Noel's response to that.
What a test like that doesn't show is how well you interact with your DAW, it's workflow, and it's features. These are clearly the most important aspects in picking a DAW. Sonar still some has some things that drive me nuts (limited routing options, the terribly huge, non-resizable mixer, no varispeed, horrible notation) but overall, it's still the best DAW choice for me. I've yet to find the perfect DAW, and every one out there seems like it's good at some things and not so good at others, and overall Sonar has become my main choice. So to those who point out that the benchmarks don't address this aspect, I agree with you 100%.
But that doesn't mean anyone should just blindly ignore the test results because Sonar didn't come out on top. At that point, people are turning DAW preference into a psuedo-religion by ignoring the science. I think we can all be a little more rational than that.
I honestly don't think the people saying the test is completely irrelevant would be too upset if Cakewalk came out and said, "hey, we improved Sonar's efficiency by up to 25% in large projects by making adjustments to how we handle load balancing" or something like that as a result of benchmarks like these.