I don't know where your numbers from or your math is seriously skewed, but that is just not right what you are saying there. It has also been know for a long long time that you just can't compare shader/pipeline (and whatever you want to call it) count between ATI and NVIDIA, if you could then the X850 cards from ATI would have been way faster than NVIDIA's 6800 cards for example and this was clearly not the case.
A GTX280 has 240 stream processors while a 4870 has 800, these SPs on the GTX280 can do 1 MADD and 1 MUL per cycle (1 MADD+1 MUL/clock cycle) while those SPs on the 4870 can do 1 MADD per cycle. So that means the 4870 can do 1600 FLOPs/clock cycle and the GTX280 can do 360 FLOPs/clock cycle, that's not to hard to see now is it? It's just that the SPs on the GTX280 run at a way higher clock and are a little more flexible because there is less dependency between the SPs, but if you would write code that is optimized specifically for each architecture then you can truly get 1200 GigaFLOPs out of the 4870 and only 933 GigaFLOPs out of the GTX280. But we by now know that these numbers mean nothing for in game results though.






Bookmarks