the directx gpgpu approach seems to be very promising, a united api that drivers are already optimized to work with and that is already somewhat familiar to developers makes more sense than cuda or ctm imo.
yep, even opengl gpgpu is quite easy to do. but ctm/cuda, especially ctm give you much more options to improve performance and flexibility
Bookmarks