Quote Originally Posted by drfedja View Post
Why do you think that AVX is so much powerful than SSE? Thuban Core and BD module can execute same number of raw FLOPS. AVX and SSE are vectorised packed FP instructions. BD module can execute one 256-bit AVX which contain 4DP FP operations, same as two 128-bit AVX or SSE. In some cases 256 AVX can be faster, but how much? Two times...
Yes, AVX would do nothing, but FMA could be the big difference. SiSoft normally always programs special code for each CPU, thus on Bd, it should use XOP&FMA.