The move to 4-VLIW is this: instead of 4 simple + 1 complex, there's 4 moderately complex shaders. Each can still do MADD. However, to do a transcendental, I'm not sure how AMD is going to handle it - it might have one of the 4 shaders iteratively calculate a value, or it might use all 4 to do it at the same time so it requires fewer clock cycles.
Transcendentals will be calculated with 3 out of 4 Shaders inside the SP of the VLIW-4 architecture in Cayman. The problem is that calculating Trancendentals in this way could be slower by 10% vs the old VLIW-5 in Cypress/Barts.
Intel Core i7 920@4GHz, ASUS GENE II, 3 x 4GB DDR-3 1333MHz Kingston, 2x ASUS HD6950 1G CU II, Intel SSD 320 120GB, Windows 7 Ultimate 64bit, DELL 2311HM
Bookmarks