This is not exactly true.
http://ixbtlabs.com/articles3/cpu/ar...2009-7-p1.html
Up to 18% gain in some desktop apps from the third mem channel. For more cores and higher frequency the gain may be much larger. Now if you consider real multitasking (running many apps in parallel) the diff can be even higher. This is why 6-core opteron dosn't show perfect scalability in SpecINT/SpecFP.
16GB/s current agregate HT BW is equal to BW of
one PCIe x16 slot. Considering upcoming SATA3, USB3, PCIe3 (and all this with CF config) this probably going to be a bottleneck.