Urgh purleeze. The version that Anandtech used 1) Didn't have any vectorization optimizations/experimentation 2) Was not done with the -local switch. nVidia's OCL compiler probably adds it automatically by now, you need to specify it on the Radeons.
But again you don't seem to know much more about GPGPU either, with the cherrypicking of benchmarks that suit your agenda.







Bookmarks