Quote Originally Posted by madcho View Post
it's 3 ALU/AGU for K10, they are general pipes. ( the third pipes give 5% more performance said by AMD ).

it's 2 specialised ALU, and 2 specialised AGU. so it's more faster on integer on Bulldozer. And i think this is a more efficient disign about power comsuption.
Well, first the above slide is a little bit misleading for someone that dont knows the DENEB (Magny Cours) architecture.

Secondly i believe the slide wants to show the difference of the CMP vs Bulldozers Cluster-Based Multithreading design and so the slide is not an accurate representation of Denebs (Magny Cours) Integer/fp Execution Units.

Denebs integer execution unit has 6x pipelines, 3x ALUs (Integer) and 3x AGUs (Load/Store) and the Integer Scheduler can issue 6x MicroOP (uops) to it.

Bulldozer integer execution unit has 4x pipelines, 2x ALUs (Integer) and 2x AGen(Address Generators) Plus a Load/Store unit (40 Load/24 Store) and the Integer Scheduler can issue 4 uops to it.

The DENEB don’t have FP FMACs but 3x Pipelines, FADD, FMUL,FMISC
Bulldozer FP has 4x pipelines, 2x 128-bit FMACs and 2x 128-bit MMX and one FP shared Scheduler that can issue 4x uops to the FP execution unit.