Search:

Type: Posts; User: poke349; Keyword(s):

Page 1 of 20 1 2 3 4

Search: Search took 0.03 seconds.

  1. Replies
    20
    Views
    18,245

    SuperPi compute all (decimal) digits from start...

    SuperPi compute all (decimal) digits from start to N. The algorithm used in SuperPi is (for simplicity) not parallelizable. That's why the initial SuperPi on GPU project didn't get anywhere.

    As...
  2. So I posted an answer here earlier, but it got...

    So I posted an answer here earlier, but it got lost through the server downtime...



    Yeah, that's an easy enough feature request. Expect to see it in the next version. (Thanks for the...
  3. No idea. And there's no real way to tell what it...

    No idea. And there's no real way to tell what it breaks on. If none of the other tests reveal the problem, then I guess it's trial-and-error with the OC settings.
  4. Nice! Try the latest y-cruncher (v0.6.5). It...

    Nice!

    Try the latest y-cruncher (v0.6.5). It will use AVX2 and FMA3. :)
  5. Neat! Glad to hear that it worked. And thanks for...

    Neat! Glad to hear that it worked. And thanks for going out of the way to test this for me. I really appreciate it!
  6. Just a small heads up. I've just released version...

    Just a small heads up. I've just released version v0.6.5.

    This version adds optimizations for Haswell processors. It will use AVX2, FMA3, and BMI2 instructions.
    I've also noticed that because of...
  7. Yep, you got it. Once the computation is done,...

    Yep, you got it. Once the computation is done, you'll need to use the Digit Viewer to read the last few 50 - 100 hexadecimal digits.
    The program lets you disable the hexadecimal digits. I just...
  8. Yeah. It'll take more than a week to run all the...

    Yeah. It'll take more than a week to run all the stuff after the series anyway. So you can wait until after the series is done.
    Make sure you have hexadecimal digits enabled. Since that's needed for...
  9. I've updated the program to detect if the OS has...

    I've updated the program to detect if the OS has AVX enabled. And if it isn't, it will spit out a warning and fall back to SSE3 or SSE4.1.
    I haven't been able to test it since all my OS's are fully...
  10. Yeah, that's an easter egg. :) I should probably...

    Yeah, that's an easter egg. :) I should probably raise the threshold for that. I set it about 2 years ago and it's clearly becoming too easy to hit 2GB/s.
    Back then, it was only possible via ram...
  11. No problem. :D The app will allow up to 256...

    No problem. :D The app will allow up to 256 threads. But that's an arbitrary limit that I can increase at any time.
  12. Win7 SP1 supports AVX. So my program is properly...

    Win7 SP1 supports AVX. So my program is properly detecting it.
    But according to this: http://superuser.com/questions/244213/how-can-i-check-whether-intels-avx-is-enabled-on-my-computer
    It looks...
  13. An answered most of this in the email reply. But...

    An answered most of this in the email reply. But yes, it's an amazing system. :)



    In the first case with the illegal instruction, it appears that you don't have proper operating system support...
  14. Some updates on v0.6.4... Back in November...

    Some updates on v0.6.4...

    Back in November when I "plugged in" my pre-written FMA4 instruction macros, the performance gain on AMD Piledriver actually negative. Some 10 - 20% slower. This was...
  15. You can get the older versions here:...

    You can get the older versions here: http://www.numberworld.org/y-cruncher/versions.html



    That should not happen. And I haven't received any other reports of this issue. Do you have a...
  16. Yeah it kinda does. At least enough to match the...

    Yeah it kinda does. At least enough to match the internal bandwidth or the socket <-> socket connection. That's the problem when you try to use distributed memory like shared memory. Latencies can be...
  17. It would probably depend on how fast the...

    It would probably depend on how fast the Infiniband is. For a system of this calibur you'd gonna need at least 20 GB/s of sustained bandwidth to have any hope of being able to use it efficiently as...
  18. That will be interesting. Especially since the...

    That will be interesting. Especially since the NUMA affect will be extreme.

    There's one NUMA friendly algorithm in the program. But it's activated only above 50 billion digits since it is slow. So...
  19. Wow... Is that an 8-socket I'm seeing? :p:

    Wow... Is that an 8-socket I'm seeing? :p:
  20. A bump after a long time... Here's a...

    A bump after a long time...

    Here's a screenshot from a binary tuned for AMD Bulldozer using FMA4 and XOP instructions.
    AMD FX-8350 @ 4.0 GHz (stock) with 16 GB @ 1333 MHz:
    ...
  21. Version v0.6.2 is out! Now with the long...

    Version v0.6.2 is out!

    Now with the long overdue swap modes...

    Ridiculous times on a ridiculous computer: 100 billion digits in under 7 hours. :eek:
    ...
  22. Replies
    44
    Views
    27,886

    So that's what the baby looks like... :D When...

    So that's what the baby looks like... :D

    When v0.6.2 comes out, all you need to do is pop a bunch of high-end hard drives in there and you'll be good to take on Shigeru Kondo on the larger sizes...
  23. It's an old habit I've had since I first started...

    It's an old habit I've had since I first started programming - which got reinforced by the fact that C89/90 doesn't allow declarations inside the for-loop statement.

    I'm slowing trying to break...
  24. BKT is pure integer work. No memory, almost...

    BKT is pure integer work. No memory, almost entirely L1 cache.
    HNT is a mix of everything from integer, floating-point, cache, memory...


    I can't say how well each test stresses the integer...
  25. Yes it should drop between tests. But only for a...

    Yes it should drop between tests. But only for a split-second - most of the time it isn't even noticeable.
    That's because it kills off the old threads for the old task and recreates them for the...
Results 1 to 25 of 500
Page 1 of 20 1 2 3 4