This answer is not for that question, because he was speaking about 4CU/8C vs. 4CU/4C, which is irrelevant in case of games with 4 threads or less.
So, let's we go back to 4CU/4C vs. 2CU/4C.
You're searcing for the answer in the right direction, floating-point wise. Although, let's not forget this FPU is 2-way SMT, with double the resources of K10's FPU: it's 2xFMAC vs. 1xFADD+1xFMUL, FMAC being 1xFADD+1xFMUL combined. The thing is that all of this computing power can only be fully utilized with FMA code, because you can't have an FADD and FMUL independently started in the same cycle on an FMAC unit, only if it's an FMA instruction.If you use 2CU/4C, there are 2 floating point units available, in 4CU/4C mode 4. So a task using 4 cores, gets access to 4 vs. 2 fpus. The performance gain in 50% int and 50 % fpu scenarios, is up to 25 %. Xbox ported games, mostly use 2 cores. So they get access to 2 fpus in both scenarios and you see performance gain of about 5 % only, from doubled cache per core.
Thus, for legacy code 4CU/4C mode means double the resources utilizable all the time (not shared with the second thread), per hw thread, so it can have independent FADD's and FMUL's started in the same cycle on the two FMAC units. (Extra bonus is that it can have also 2xFADD or 2xFMUL, not just 1xFADD+1xFMUL, like on K10.) In 2CU/4C mode, it can have usually only 1xFADD OR 1xFMUL started in the same cycle (mostly one FMAC unit available because of the second thread engage the other one), per hw thread.
(And so, I think there won't be such a significant gain in performance for FMA heavy code, going 4CU/4C from 2CU/4C. EDIT: Or, even if there be so, both cases will be signicicantly faster than the same algorithm with legacy code.)
(I think it's not the best wording, because less than quad-threaded apps can also be "using 4 cores" upon the constant core-variation of Windows. So, it's about how many threads an application have.)...So a task using 4 cores...
Definitely worth a testing.The problem is Dirt, F1, Civ5, Crysis 2,... . Northbridge overclocking and 4CU/4C can be the difference between playable or not.
So, you're saying if we enable the 2nd core here, then we can't disable ony other cores, alone in a CU?
Wow, I've forgot about this little utility. Perhaps it could be enhanced and adapted to BD, so all of it could work fully automatic! Would be even more useful!![]()






Reply With Quote


Bookmarks