Originally Posted by
savantu
Gosh/Duby229/Kassler - you're clueless now as you have always been , sorry to be so blunt.
The way Nehalem's and K10s L3 work is completely different , one is inclusive , the other is exclusive.Get the point ? While both a Ford Model T and a Mercedes S600 have 4 wheels , that's where the similarities end.
Nehalem scales better , you know why ? Because data from all the L1s and L2s on chip are also found in the L3.When there is a cache miss in the L1 or L2, data is searched in the L3 , if it's not there , a memory request is sent.
On K10 , when a cache miss occurs in the L1 or L2 , the L3 is searched and a request is send to the remaining cores for searching in their L1, L2s.Only after the reply comes from all cores , data is requested from RAM.
That's extra latency => imagine with multiple threads and multiple misses.Nehalem has no such problems.Intel did its homework right.
Intel was 1st with DC , AMD did it right.
Amd was 1st with single die QC , Intel did it right.
Too bad zealots cannot give credit where it's due.