One small addition, a picture is worth a thousand words

Let me try again using Informal's pic.
In the pic there are 2 FMAC units. The article says each FMAC unit can process two 64-bit double-precision or four 32-bit single-precision operations simultaneously. I stated earlier that I thought WUs were 32-bit single-precision. Is that right? If so, does that mean each FMAC unit would do 4 WUs? In the pic it looks like each core has one FMAC unit. I do not care about modules or threads, and I do not care about games, benchmark programs or anything else. Just WCG WUs.
How many WUs from WCG will run on one core? 1 core = 1 FMAC unit. The pic shows core 1 and core 2 and 2 FMAC units, one per core. 1 FMAC unit will run four 32-bit single-precision operations.
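To spell out the arithmetic I am working from, here is a rough sketch. It assumes each FMAC unit is 128 bits wide, which is my own inference from the article's numbers (2 x 64 = 4 x 32 = 128), and the names are made up for illustration. It only counts operations per FMAC unit; whether that carries over to WUs is exactly what I am asking.

```python
# Assumption: each FMAC unit is 128 bits wide, inferred from the article's
# figures (two 64-bit or four 32-bit operations at once). Hypothetical names.
FMAC_WIDTH_BITS = 128


def ops_per_fmac(operand_bits: int) -> int:
    """Return how many operands of the given width fit in one FMAC unit."""
    return FMAC_WIDTH_BITS // operand_bits


for label, bits in (("double-precision", 64), ("single-precision", 32)):
    print(f"{label} ({bits}-bit): {ops_per_fmac(bits)} operations per FMAC unit")
```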
I dunno if it is just the change in architecture, the terminology, or people not reading the first post. I think Informal and Enoc may have answered this already, but I am not understanding, so I appreciate their patience.
