It is SPECint, which is more or less single-threaded. Auto-parallelization cannot offer significant speedups.
Why? AVX and SSE have the same throughput on BD, since it did not spend any transistors on optimizing for AVX. The only question is whether using FMA would have made a difference. And comparing SSE3 code for AMD with AVX code for Intel is totally irrelevant.
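
To put rough numbers on the throughput point, here is a back-of-the-envelope sketch. It assumes each BD module has two 128-bit FMAC pipes and that a 256-bit AVX op is split into two 128-bit halves. The helper name and constants are mine, for illustration only, and these are theoretical peaks, not measurements.

```python
# Peak single-precision FLOPs per cycle for one Bulldozer module.
# Assumptions (not measured): two 128-bit FMAC pipes per module, and a
# 256-bit AVX operation occupies two 128-bit pipe slots.
PIPES = 2
PIPE_WIDTH_BITS = 128
FLOAT_BITS = 32


def peak_flops_per_cycle(vector_bits, flops_per_lane=1):
    """Hypothetical helper: theoretical peak for a given vector width."""
    lanes = vector_bits // FLOAT_BITS
    # A vector wider than one pipe takes several pipe slots to issue.
    slots_per_op = max(1, vector_bits // PIPE_WIDTH_BITS)
    ops_per_cycle = PIPES / slots_per_op
    return ops_per_cycle * lanes * flops_per_lane


sse = peak_flops_per_cycle(128)                    # 128-bit add or mul
avx = peak_flops_per_cycle(256)                    # 256-bit add or mul
fma = peak_flops_per_cycle(128, flops_per_lane=2)  # fused multiply-add

print(sse, avx, fma)
```

Under these assumptions SSE and AVX come out at the same peak, and only FMA raises it, which is why FMA is the one open question here.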



