I can believe K10 would run rings around K8 at the same clock. The mere doubling of SSE throughput means that a lot of HPC apps will run nearly twice as fast. Core 2 is in many cases nearly twice as fast as the P4 clock for clock, and in some HPC cases up to 4 times as fast.
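To put rough numbers on the "doubling of SSE throughput" point, here is a back-of-the-envelope sketch. It assumes K8 executes a 128-bit packed SSE op as two 64-bit halves, K10 executes it at full 128-bit width, and each core has one FP add pipe plus one FP multiply pipe. The function name and parameters are made up for illustration, and this is a theoretical peak only; real code is usually limited by memory and the rest of the pipeline.

```python
def peak_flops_per_cycle(datapath_bits, element_bits=64, pipes=2):
    """Peak FLOPs per cycle: SIMD lanes per pipe times the number of FP pipes.

    pipes=2 assumes one add pipe and one multiply pipe that can both
    issue every cycle (an assumption for this sketch, not a measurement).
    """
    lanes = datapath_bits // element_bits
    return lanes * pipes


# Assumed datapath widths for double-precision packed SSE ops.
k8 = peak_flops_per_cycle(64)    # 128-bit op split into two 64-bit halves
k10 = peak_flops_per_cycle(128)  # full-width 128-bit execution

speedup = k10 / k8
print(k8, k10, speedup)
```

Under those assumptions the ceiling for SSE-bound code doubles at the same clock, which is where the "near twice as fast" estimate comes from.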
The 32-byte fetch means K10 should be a floating-point monster (HPC). The load forwarding capabilities of K8 are quite deficient (none!) compared to Core 2 (load forwarding was already in the Pentium Pro), which means that their inclusion in K10 will give an even bigger boost than Core 2 got from it. Too bad the clock rate is low and the cache is relatively small.