That less depends on ISA but more on actual hardware implementation but depends even more on software code quality. There are many effective techniques to optimize code for OOO architectures (such as loop unroling e.t.c).
It seems they counted only arithmetic instructions (IPC < 1 does not make sence). Also in a different part of a code the ALU consumption may greatly vary.Originally Posted by informal
Bookmarks