As I posted in the other thread... Note that all the numbers assume the 4+1 -> 4D transition actually happens, with only a 1.5 % loss in performance (i.e. 98.5 % retained).
Barts Pro will have some 960 SPs. At 850 MHz core clock it can reach a theoretical maximum of ~2 TFLOPS, or something along those lines.
Barts XT will have some 1280 SPs. At 850 MHz core clock it can reach a theoretical maximum of ~2.67 TFLOPS, or something along those lines.
Barts XT would need 863 MHz core clock to surpass Cypress XT's 2.72 TFLOPS shader performance.
Barts XT would need 952 MHz core clock to achieve 3 TFLOPS.
Cayman with 1920 SPs at 850 MHz will hit ~4 TFLOPS.

Give it a 952 MHz core clock and it will hit 4.5 TFLOPS; at 1058 MHz it will hit 5 TFLOPS. Single chip. At 40 nm.
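For anyone who wants to check the arithmetic, here is a rough sketch of how the numbers above can be reproduced. It rests on my assumptions, not on anything confirmed: each 4D unit is counted as equivalent to one old 4+1 (5D) unit at 98.5 % efficiency, and each SP does 2 FLOPs per clock (one multiply-add). The function names are just made up for illustration.

```python
# Assumption: a 4D unit performs like a 4+1 (5D) unit at 98.5 % efficiency.
EFFICIENCY = 0.985
# Assumption: one multiply-add (2 FLOPs) per SP per clock.
FLOPS_PER_SP_PER_CLOCK = 2


def equivalent_tflops(sps_4d, clock_mhz, efficiency=EFFICIENCY):
    """5D-equivalent shader throughput in TFLOPS for a 4D chip."""
    sps_5d_equiv = sps_4d / 4 * 5  # same unit count, 5 lanes each
    flops = sps_5d_equiv * FLOPS_PER_SP_PER_CLOCK * clock_mhz * 1e6 * efficiency
    return flops / 1e12


def clock_for_tflops(sps_4d, target_tflops, efficiency=EFFICIENCY):
    """Core clock in MHz needed to reach target_tflops (5D-equivalent)."""
    sps_5d_equiv = sps_4d / 4 * 5
    hz = target_tflops * 1e12 / (sps_5d_equiv * FLOPS_PER_SP_PER_CLOCK * efficiency)
    return hz / 1e6


if __name__ == "__main__":
    for name, sps in (("Barts Pro", 960), ("Barts XT", 1280), ("Cayman", 1920)):
        print(f"{name}: {equivalent_tflops(sps, 850):.2f} TFLOPS at 850 MHz")
    print(f"Barts XT needs {clock_for_tflops(1280, 3.0):.0f} MHz for 3 TFLOPS")
    print(f"Cayman needs {clock_for_tflops(1920, 5.0):.0f} MHz for 5 TFLOPS")
```

Keep in mind these are 5D-equivalent figures under the stated assumption; the plain SPs x 2 x clock number for a 4D chip would come out lower.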
But again, note that real-world performance does not scale linearly with shader power, so the gains won't be as big as the raw numbers would suggest. Or, at least, any performance boost won't come from raw shader performance alone. We do not know if they've made changes to the tessellation unit. Since people say that it's the weak spot in DX11, I'd guess they would focus some effort there too, bringing the performance up even more.
Barts XT with 3 TFLOPS (should be able to get it to ~950 MHz, no?) + more advanced tessellation unit = GTX 480 will cry.
Oh, and chip sizes should go up a bit, but Barts XT should be smaller than Cypress XT etc. AMD's already unbeatable perf/mm² should be up by some 20-25 %.