Quote Originally Posted by demonkevy666 View Post
they believe their is a problem with B1's L3 cache.
if you look you will see it's write cache that is the slowest part here not just L3 cache all the way threw write cache is too slow.
It's already mentioned in optimization manual that was released in April. Also AMD promises to fix this in BD version 2.
http://support.amd.com/us/Processor_TechDocs/47414.pdf
The following performance caveats apply when using streaming stores on AMD Family 15h cores.
• When writing out a single stream of data sequentially, performance of AMD Family 15h
processors is comparable to previous generations of AMD processors.
• When writing out two streams of data, AMD Family 15h version 1 processors can be up to three
times slower than previous-generation AMD processors. AMD Family 15h version 2 processor
performance is approximately 1.5 times slower than previous AMD processors.
• When writing out four non-temporal streams, AMD Family 15h version 1 can be up to three
times slower than previous AMD processors. AMD Family 15h version 2 processor performance
is comparable to previous AMD processors.
• Using non-temporal stores but not writing out an entire cacheline may cause performance to be up
to six times slower than previous AMD processors.