Just wanted to share my 8800 GT screen:
Whatever happened to superpi - cuda?
what about this program?
what happened is this going out at all ?
CPU : Q9550 / Board : Asus P5E64 WS Evolution / Ram : 2x1 OCZ D9GTR DDR3 / Vga : HD 4870 / PSU : PPC&C 750W / SSD Ocz Vertex 30Gb / All under Water & Tec's
Overclockers Wannabe Athens Dept...
Yea what did happen to this?
My Rig can do EpicFLOPs, Can yours?
Once this baby hits 88 TeraFLOPs, You're going to see some serious $@#%....
Build XT7 is currently active.
Current OS Systems: Windows 10 64bit
Probably noticed the poor performance gains due to poorly threadable algorithm used. CUDA is just like finding a needle from haystack, a burden that no one really wants to stick to unless they really have to.
@warboy:
As far as I know, they stopped because it wasn't an easy task and proved to be time consuming.
@calmatory
Not exactly.
But proper parallel coding isn't the easiest thing on earth...
Coding 24/7... Limited forums/PMs time.
-Justice isn't blind, Justice is ashamed.
Many thanks to: Sue Wu, Yiwen Lin, Steven Kuo, Crystal Chen, Vivian Lien, Joe Chan, Sascha Krohn, Joe James, Dan Snyder, Amy Deng, Jack Peterson, Hank Peng, Mafalda Cogliani, Olivia Lee, Marta Piccoli, Mike Clements, Alex Ruedinger, Oliver Baltuch, Korinna Dieck, Steffen Eisentein, Francois Piednoel, Tanja Markovic, Cyril Pelupessy (R.I.P. ), Juan J. Guerrero
that cuda-z program looks pritty sweet.. but why did that drop the whole superpi cuda thing...
i5 3570k 4.2ghz 1.28v H100 cooled stable 24/7 | Asus Sabertooth Z77 TUF | 2x4GB 8GB Mushkin Blackline 998954 | GTX 680 2gb | 120GB OCZ Vertex 3/3TB Seagate | Enermax Platimax 850w | NZXT Switch 810 White | Windows 7 Ultimate | Samsung 40" LCD HDTV
A+ NET+ Linux+ MCP MCTS MCSA MCITP
World of Tanks WGLNA
CSGO ESEA
Heatware: MarcusFoX
And who's idea was this? Because somebody didn't do their homework...
Pi was never an easy thing to multi-thread... And only now do we finally have a multi-threaded pi program for multiple cores - let alone CUDA...
http://www.xtremesystems.org/forums/...d.php?t=221773
p.s.
more like 50m in 20 seconds
i7 920 C0 @ 3.5Ghz
Asus P6T Deluxe
3x2GB G.Skill DDR3 1600 9-9-9-24
HD4870x2
Intel x25-m G2 80GB
2x WD Caviar Black 640GB
Samsung Spinpoint F1 1TB
PC P&C 750W
Interesting old thread... I can only laugh at the idea... because the math needed for such a thing isn't quite there yet...
I don't think that's what he meant. The power of the GPU is that they have MANY cores... and that you'll need to use them to get a 20s 32M time.
And yes... This new multithreaded pi program IS able to compute 50 million digits in 20 seconds.
50,000,000 digits:
19.3864 - Movieman - Dual 2.93GHz Xeon Gainestown
20.553 - spdycpu - Core i7 920 @ 4.3GHz (21 * 205)
20.7997 - poke349 - Dual 3.2GHz Xeon Harpertown
21.7047 - tet5uo - 3.20GHz Core i7 @ 4.00 GHz
25.774 - Serotoninn - 2.66GHz Core i7 @ 3.20GHz
I'll re-post the link to the thread:
http://www.xtremesystems.org/forums/...d.php?t=221773
Main Machine:
AMD FX8350 @ stock --- 16 GB DDR3 @ 1333 MHz --- Asus M5A99FX Pro R2.0 --- 2.0 TB Seagate
Miscellaneous Workstations for Code-Testing:
Intel Core i7 4770K @ 4.0 GHz --- 32 GB DDR3 @ 1866 MHz --- Asus Z87-Plus --- 1.5 TB (boot) --- 4 x 1 TB + 4 x 2 TB (swap)
Sorry, I lazy to read whole thread.
Is there any cuda/openCL programmers? Can you port SuperPi to gpgpu, without adding multithreading? Using only 1 shader. Just imagine: very low-cost 8-sp Graphic card making WR!! I want to see OpenCL-PI among HWBOT benchmarks!
Sorry for my bad English
Why should single threaded PI on gpu beat a cpu?
3570K @ 4.5Ghz | Gigabyte GA-Z77-D3H | 7970 Ghz 1100/6000 | 256GB Samsung 830 SSD (Win 7) | 256GB Samsung 840 Pro SSD (OSX 10.8.3) | 16GB Vengeance 1600 | 24'' Dell U2412M | Corsair Carbide 300R
What would be the point running a serial program on a GPU? The problem isn't cuda or opencl. The problem is that either the math isn't parallelizable or people aren't smart enough (yet) to make it so.
It isn't good to mix CPU & GPU results. I meant that low-cost GPU beat hi-end one. And of course, it is very difficult to parallelize PI calculation, so I proposed idea of single-threaded PI program.
kromosto, good luck, i want to use this program
Last edited by ShtopoRrr; 04-27-2009 at 04:59 AM.
This is correct.
The math IS parallelizable. And the proof is here (already posted):
http://www.xtremesystems.org/forums/...d.php?t=221773
HOWEVER,
It is very difficult to do. It was hard enough to parallel it into several threads for several cores. So it will be MUCH harder to parallel it into hundreds of threads for a GPU...
Also GPUs right now don't have the right instruction set to be efficient with this kind of computation. (They are too specific for graphics.) So I won't be surprised if even the best of GPU implementations have trouble beating a CPU implementation.
Main Machine:
AMD FX8350 @ stock --- 16 GB DDR3 @ 1333 MHz --- Asus M5A99FX Pro R2.0 --- 2.0 TB Seagate
Miscellaneous Workstations for Code-Testing:
Intel Core i7 4770K @ 4.0 GHz --- 32 GB DDR3 @ 1866 MHz --- Asus Z87-Plus --- 1.5 TB (boot) --- 4 x 1 TB + 4 x 2 TB (swap)
And what about wPrime on GPU? Multithreaded of course
starting value for benching Pi is ?
512M Pi? 1024M Pi?
Main Rig:
Processor & Motherboard:AMD Ryzen5 1400 ' Gigabyte B450M-DS3H
Random Access Memory Module:Adata XPG DDR4 3000 MHz 2x8GB
Graphic Card:XFX RX 580 4GB
Power Supply Unit:FSP AURUM 92+ Series PT-650M
Storage Unit:Crucial MX 500 240GB SATA III SSD
Processor Heatsink Fan:AMD Wraith Spire RGB
Chasis:Thermaltake Level 10GTS Black
It is good to be like CPU SuperPi, for single-threaded GPU SuperPi, I think
I dont think so. SuperPi is a main and simple calculation benchmark, a Standard. imho
But only benchers care about it. For CUDA to develop it has to have real-world uses that developers can score funding to code for and turn heads to CUDA that way.
Bookmarks