Shintai ,there are about ~100 or less new instructions in AVX(new as not being previously supported by intel hw). The rest ,more than 300 legacy SSE instr. are updated for better performance on future hw(and less than 100 from those 300 are widened to 256bits to support those fp vector instructions).
BTW,SSE4a is 4(in words:four) instructions so the die space that was "wasted" is really huge(I can see AMD pulling their hair over this "wasted" space). Two of those four(LZCNT and POPCNT) doesn't even need to be included in a SSE4 in order to be available and used.
Bookmarks