Thread: New Multi-Threaded Pi Program - Faster than SuperPi and PiFast

  1. #9
    Xtreme Enthusiast
    Join Date
    Mar 2009
    Location
    Bay Area, California
    Posts
    705
    Originally posted by 2good4you:
    But why do the K10 CPUs run the SSE3 instruction set and not the supported SSE4? The Intel CPUs run on their supported SSE4.1. Thanks for the program, and an answer would be appreciated. Happy New Year from Sweden, too.
    SSE4.1 and SSE4a are different instruction sets.

    Intel has SSE4.1.
    AMD has SSE4a.

    In my opinion, there's nothing in the SSE4a instruction set that is useful for this program.
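To illustrate why a K10 ends up on the SSE3 path, here is a rough sketch (not code from the program itself) of how the two extensions show up as separate CPUID feature bits. SSE3 is bit 0 and SSE4.1 is bit 19 of ECX in leaf 1, while SSE4a is bit 6 of ECX in extended leaf 0x80000001. The names `SimdFeatures`, `decode_features`, `query_features`, `pick_code_path` and the `CodePath` enum are made up for this example, and the selection order is an assumption about how a dispatcher might work, not a description of the real one.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* <cpuid.h> is a GCC/Clang header and only makes sense on x86 targets. */
#if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
#include <cpuid.h>
#define HAVE_CPUID_H 1
#endif

/* Hypothetical feature record: only the three extensions discussed here. */
typedef struct {
    bool sse3;
    bool sse4_1;
    bool sse4a;
} SimdFeatures;

/* Hypothetical code paths a dispatcher could choose between. */
typedef enum { PATH_BASELINE, PATH_SSE3, PATH_SSE4_1 } CodePath;

/* Decode the feature bits from raw ECX values of CPUID leaf 1
   and CPUID extended leaf 0x80000001. */
static SimdFeatures decode_features(uint32_t leaf1_ecx, uint32_t ext1_ecx) {
    SimdFeatures f;
    f.sse3   = ((leaf1_ecx >> 0)  & 1u) != 0; /* leaf 1, ECX bit 0  */
    f.sse4_1 = ((leaf1_ecx >> 19) & 1u) != 0; /* leaf 1, ECX bit 19 */
    f.sse4a  = ((ext1_ecx  >> 6)  & 1u) != 0; /* leaf 0x80000001, ECX bit 6 */
    return f;
}

/* Query the running CPU. Where <cpuid.h> is unavailable, this reports
   no features at all instead of guessing. */
static SimdFeatures query_features(void) {
    uint32_t leaf1_ecx = 0, ext1_ecx = 0;
#ifdef HAVE_CPUID_H
    unsigned int a, b, c, d;
    if (__get_cpuid(1, &a, &b, &c, &d)) leaf1_ecx = c;
    if (__get_cpuid(0x80000001u, &a, &b, &c, &d)) ext1_ecx = c;
#endif
    return decode_features(leaf1_ecx, ext1_ecx);
}

/* Assumed selection order: SSE4a is never consulted because no code path
   uses it, so a CPU with SSE3 + SSE4a but no SSE4.1 gets the SSE3 path. */
static CodePath pick_code_path(const SimdFeatures *f) {
    assert(f != NULL);
    if (f->sse4_1) return PATH_SSE4_1;
    if (f->sse3)   return PATH_SSE3;
    return PATH_BASELINE;
}
```

Feeding `decode_features` a K10-like pattern (SSE3 and SSE4a set, SSE4.1 clear) yields `PATH_SSE3`, and setting the SSE4.1 bit yields `PATH_SSE4_1`. Having SSE4a does not imply having SSE4.1, or the other way around.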

    EDIT:
    And even if there were, I don't have access to a K10 machine with enough RAM to properly test it, since an SSE4a version won't run on my Xeon workstation.

    EDIT 2:
    The AMD-optimized (Kasumi) binary was also tested for correctness on my Xeon workstation, along with all the other x64 binaries.
    For correctness testing, it doesn't matter that I'm using an Intel machine to test an AMD binary.
    Only the performance tuning had to be done on an AMD machine, which was a Phenom II X3 unlocked to 4 cores.

    And yes, Happy New Year!!!
    Last edited by poke349; 01-03-2010 at 02:14 AM. Reason: rephrasing
    Main Machine:
    AMD FX8350 @ stock --- 16 GB DDR3 @ 1333 MHz --- Asus M5A99FX Pro R2.0 --- 2.0 TB Seagate

    Miscellaneous Workstations for Code-Testing:
    Intel Core i7 4770K @ 4.0 GHz --- 32 GB DDR3 @ 1866 MHz --- Asus Z87-Plus --- 1.5 TB (boot) --- 4 x 1 TB + 4 x 2 TB (swap)
