Jack, it's rather simple. If you run a single SIMD-heavy thread on a module, all of that module's FPU resources (2x FMAC) are dedicated to one core. If you then run the same test multithreaded across all 4 modules, all 8 cores share 4 FlexFPs (4 256-bit units in total). Scaling from ST to MT will be akin to scaling from a single core to a quad core, only this time your single-thread results SHOULD be very high, since you are using one double-sized, very powerful FPU (the whole FlexFP within a module). Scaling is better than a pure 4x because SMT now works within the FlexFP: the 2 cores per module share the 2 128-bit FMACs, and this improves performance further.
The problem, however, is that one FlexFP is somehow slower than one Thuban core.
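To put rough numbers on the sharing argument, here is a minimal back-of-the-envelope sketch. It only counts FMAC width per thread using the figures above (4 modules, 2 cores and 2x 128-bit FMACs per module). The function names and the two utilization parameters are my own assumptions for illustration, not measurements or anything from AMD.

```python
# Back-of-the-envelope model of FlexFP sharing (illustrative assumptions only).
MODULES = 4
CORES_PER_MODULE = 2
FMACS_PER_MODULE = 2
FMAC_WIDTH_BITS = 128

# Total FMAC width of one FlexFP (one module's shared FPU).
MODULE_FPU_BITS = FMACS_PER_MODULE * FMAC_WIDTH_BITS


def fpu_bits_per_thread(active_threads_in_module: int) -> float:
    """FMAC width available to each thread when a module's FlexFP is shared evenly."""
    if not 1 <= active_threads_in_module <= CORES_PER_MODULE:
        raise ValueError("a module runs 1 or 2 threads")
    return MODULE_FPU_BITS / active_threads_in_module


def st_to_mt_scaling(st_utilization: float = 1.0, mt_utilization: float = 1.0) -> float:
    """Ratio of usable FPU width: all modules loaded vs. one thread on one module.

    Both utilization factors are hypothetical knobs: st_utilization is how well
    a lone thread keeps both FMACs busy, mt_utilization is how well two threads
    sharing a FlexFP keep it busy.
    """
    st_bits = MODULE_FPU_BITS * st_utilization
    mt_bits = MODULES * MODULE_FPU_BITS * mt_utilization
    return mt_bits / st_bits


if __name__ == "__main__":
    print("bits per thread, 1 thread in module: ", fpu_bits_per_thread(1))
    print("bits per thread, 2 threads in module:", fpu_bits_per_thread(2))
    print("ideal ST -> MT scaling:", st_to_mt_scaling())
    # Made-up example: a lone thread fills the FlexFP only 80% of the time,
    # while two threads together keep it full.
    print("scaling with better fill under sharing:", st_to_mt_scaling(0.8, 1.0))
```

With both utilizations at 1.0 the ratio is exactly 4x, the plain "single core to QC" case. It only goes above 4x if two threads fill the shared FMACs better than one thread does alone, which is the SMT-within-FlexFP effect I mean.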
@xsecret
Well, VR-Zone just ran the same chip on 2 different motherboards, xsecret. Guess what: on different boards, the same chip performed with an 80% delta... So it seems to be a firmware problem.




