New Multi-Threaded Pi Program - Faster than SuperPi and PiFast

**bonis62** · 02-10-2010, 01:40 PM

Originally Posted by poke349

I'm already pushing "double" to its limit of precision.
"float" has less than half the precision of "double" so that would require more than 4x the work.

Actually, because of the way the algorithm works, using type "float" would require MUCH more than 4x the work. It would actually fail above a certain (small) size.

The run-time complexity is this:

where "n" is the # of digits.
and "w" is the # of bits of precision in the floating-point.

When the denominator goes to zero, the run-time (and memory) blows up to infinity - in other words, the algorithm fails.

This is the reason why I can't use GPU.

If there was a 128-bit floating-point type that was supported by hardware, the program would actually be MUCH faster.

EDIT: That complexity is just a reasonable approximation to the true complexity.
The true complexity (ignoring round-off error), has special functions in it... so it's unreadable to normal people. (even myself)

yeah

if
n=100.5
w=64.0

o= n/(w-Log(n))Log(n/(w-Log(n))) = 1.6922085689143893

it's your formula ?

but if you make this :

o= n/(w-Log(n))Log(n/(w-Log(n))) = 1.6922085689143893
o2= sqrt(o) = 1.3008491722388069

you have a large control predictor...

this is only my theory to bypass error

read this :

http://en.wikipedia.org/wiki/Floating_point

Thread: New Multi-Threaded Pi Program - Faster than SuperPi and PiFast

Thread Tools

Search Thread

Rate This Thread

Display

Threaded View

Tags for this Thread

Bookmarks

Bookmarks

Posting Permissions