The TMUs are setup differently on RV770 than on R600/RV670. On R600/RV670 a TMU block was assigned to 4 SP blocks, each block from a different SIMD 'core'. This approach has its benefits, but it is a bit tough to scale it and that's why they abandoned it for RV770. RV770 has 4 TA units and 4 TF units per SIMD 'core'. Just check here: http://www.rage3d.com/reviews/video/.../index.php?p=5
There's one odd part though, and that is that there are 'only' 32 interpolators. But that will only become a bottleneck if you INT8 bilinear filtering, you won't notice it once you apply some more AF for example and then the bottleneck will shift to the filtering units. It's still weird though, as you'd expect a texture related unit to reside with the TF/TA units and thus also scale in capability along with those.
I don't think they will increase the number of SPs per SIMD 'core', not with their first 40 nm. chips and probably not even with RV870. To me it seems like it should be fairly easy for them to increase the amount of SIMD 'cores' on a chip. They increased the amount of SIMD 'cores' that RV770 would get at the last minute when they needed to fill up some extra space, so to me it seems like you don't need to do a lot of redesigning/work to increase the SIMD 'core' count. I'd say 16 SIMD 'cores' for RV790 shouldn't be too hard to do for them.
I'd like to know what they would to with their RBEs if they do put 16 SIMD 'cores' on there though, although I'm not really sure if yet if 16 RBEs would become a bottleneck.
It will simply be guesswork for us until it's launched I guess, still interesting to think about though.![]()
Bookmarks