Originally Posted by DudeMiester
I saw some talk about Cell, and I wanted to make some corrections:
--The Cell is not dual core, it has 9 cores.
--The Cell is not rated at 4.5Ghz, it actually runs at 4.5Ghz, and up to 5.2Ghz with enough voltage and a big PSU. That's on air mind you, on cascade you would probably get close to 10Ghz.
--The Cell theoretically can get 256GFLOPs performance, whereas a P4 even at 7.2Ghz would probably only get about 15.
--The reason Cell can achieve such speeds is because it exchanges control logic for processing logic. The actual processing units of P4s and AMDs make up a small portion of the overall chip (excluding cache), the rest is eaten up by control logic. Of course, by removing all that control logic it means the chip's performance is far more sensitive to how you program it. Now you have to control branch prediction, pre-fetching for caches, etc all manually. Of course this is primarily with the SPUs, the PPC core retains brach prediction, auto-prefetch, etc however it doesn't have out of order execution, so even then you have to be more careful. Even so, many of these problems, while seemingly very difficult, are imho actually quite manageable with the advancement of compilers and thinking differently about how you program.
Essentially the Cell takes the best from modern CPUs and GPUs and puts them together for awesome performance, and imho a better programming model. When you program the Cell, you really are programming the hardware directly. You are not hidden behind a monolith of control logic that totally re-compiles and changes your code to what the CPU sees fit. Rather such responsiblity is left to the programmer, where imho it should lie. With Cell you simply can't get away with being a bad programmer.
Of course realisitcally you won't get 256GFLOPs of performance, maybe 150GFLOPs sustained, which is still about 10x what a 7.2Ghz P4 can do. Then put the Cell on cascade, and that very same P4 will look like ENIAC in compairison, lol. Personally I can't wait to get one, high quality real time raytracing might be possible on such a processor. Does Cell mean the end of the GPU then, far from it imho. I believe it will allow a much more seamless integration of the GPU, and any other co-processor, within the computer system. They will be treated not as disparate devices, but rather extensions of the CPU itself. Sure, x86 processor will last a long time thanks to their extensive codebase, but Cell and Cell-like processors are the future. Especially so, when it's certainly powerful enough to run a virtual x86 processor (i.e. emulate it) and still beat the doors of any P4, even the overclocked ones.