Not what I was saying. What I was saying is that one thread is assigned to each group of five ALUs, unlike Nvidia's architecture where each ALU gets it's own thread. Because of this RV870 can only run a maximum of 1600 / 5 = 320 threads, which is way below the 1024-thread limit of DirectCompute11.
Bookmarks