JumpingJack: Do you know what the latency is when an Intel CPU has to send data over the FSB?
Say a game has two threads: one writes data to memory and the other sends data to the GPU, and some of those requests are issued at the same time. Is the latency the same as when the FSB isn't already occupied by another request?

Compare this with the situation on AMD: HyperTransport carries the I/O (GPU traffic) and the IMC handles memory.
Do you know the latency for AMD when it needs to read from or write to external hardware?
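
To make the question concrete, here is a toy model, not a measurement. It assumes each link serves one request at a time, first come first served, and compares two simultaneous requests on one shared bus against the same requests on two separate links. The function name `latencies`, the link names and the 4-unit transfer time are all made up for illustration. They are not real FSB, HyperTransport or IMC figures.

```python
from collections import defaultdict

def latencies(requests, link_of):
    """Return per-request latency (wait + transfer) in arbitrary time units.

    requests: list of (name, kind, arrival, duration), in arrival order.
    link_of:  maps a request kind to the link that carries it.
    Assumption: each link handles one request at a time, first come first served.
    """
    free_at = defaultdict(int)  # time at which each link becomes idle
    result = {}
    for name, kind, arrival, duration in requests:
        link = link_of[kind]
        start = max(arrival, free_at[link])  # wait if the link is still busy
        free_at[link] = start + duration
        result[name] = free_at[link] - arrival
    return result

# Hypothetical workload: a memory write and a GPU transfer issued at the same time.
requests = [
    ("mem_write", "memory", 0, 4),
    ("gpu_send", "io", 0, 4),
]

# One shared bus for everything versus separate memory and I/O links.
shared = latencies(requests, {"memory": "bus", "io": "bus"})
split = latencies(requests, {"memory": "imc", "io": "ht"})

print(shared)  # {'mem_write': 4, 'gpu_send': 8}
print(split)   # {'mem_write': 4, 'gpu_send': 4}
```

In this model the second request on the shared bus waits for the first to finish, while on separate links neither waits. Whether real hardware behaves like this, and by how much, is exactly what I am asking.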

You also know that if a game has two rendering threads, it has to synchronize the commands it sends to the GPU. On C2D this should work well, but on C2Q, what happens if one thread runs on one C2D die and the other thread runs on the other? (As you know, a C2Q is two C2D dies that communicate through the FSB.)
Phenom has an L3 cache shared by all four cores, which they can synchronize through.