Page 3 of 5 FirstFirst 12345 LastLast
Results 51 to 75 of 101

Thread: hwbot multipi - multithreaded superpi (open beta)

  1. #51
    Admin
    Join Date
    Dec 2003
    Location
    Hillsboro, OR
    Posts
    5,225
    Frederik and Kunaak-

    This isn't an issue about people being upset over extra stuff to install. It's a question of which language will work best to create a taxing benchmark. Java is inherently too inefficient to be optimal for the task at hand. This current version fails to even stress the CPU 100% at realtime. (Check task manager) Java is bloated and a little clunky with memory allocation. It has a lot of nice uses, but IMHO, it is less than optimal to be used in such an application. I too feel that C/C++ would be better choices, with seperate versions compiled for Windows, Linux and MacOS. The loss of portability is unfortunate, but I highly doubt that even a Java version would result in identical performance across all operating systems given the same hardware.

  2. #52
    Xtreme Member
    Join Date
    Aug 2006
    Posts
    100
    The 100% cpu thing is odd, but it uses 98% on my (single core) pc. The problem with having the c versions for every platform is maintaining it. RichBa5tard has already said its too much hassle, and if it works well enough in java, which i think it does, then its no problem.

  3. #53
    Xtreme Member
    Join Date
    Nov 2004
    Posts
    239
    2:32.938 - AXP-M @ 2420MHz

    These times are still too quick IMO. If this program is to test the next generation of cpus... its going to need to hold conroes to 2min+
    | Intel 2500K @ 5GHz | Noctua NH-D14 | Asus P67 Sabertooth | Samsung 2x4GB DDR3 @ 2133 MHz | Crucial M4 256GB | Asus GTX 770 OC | OCZ Fatal1ty 750W | Qnix QX2710

    History: AXP-2400M, AXP-2500M, Core2 E6600 - all minimum 50% overclock

  4. #54
    Admin
    Join Date
    Dec 2003
    Location
    Hillsboro, OR
    Posts
    5,225
    I think the duration is all right...takes me around 40 secs. A bit slower would be nice though to take into account quad scores. Here's my score for what its worth.


  5. #55
    hwbot crew
    Join Date
    Jun 2002
    Location
    Belgium!
    Posts
    880
    Quote Originally Posted by Gautam
    Frederik and Kunaak-

    This isn't an issue about people being upset over extra stuff to install. It's a question of which language will work best to create a taxing benchmark. Java is inherently too inefficient to be optimal for the task at hand. This current version fails to even stress the CPU 100% at realtime. (Check task manager) Java is bloated and a little clunky with memory allocation. It has a lot of nice uses, but IMHO, it is less than optimal to be used in such an application. I too feel that C/C++ would be better choices, with seperate versions compiled for Windows, Linux and MacOS. The loss of portability is unfortunate, but I highly doubt that even a Java version would result in identical performance across all operating systems given the same hardware.
    Gautam,

    The goal of superpi/multipi is not to be the most efficient Pi calculation app. The algoritm is filled with checks for bad memory / calculation errors to dedect unstable overclocks, that's why it's so 'slow'.

    As for the not stressing the processors 100%, that's intentional too. The first version (0.1) stressed all the cores at 100%, but that gave such a huge performance gain on a quad core, it just wouldn't be fun. A mediocre 2ghz A64 needed 4 minutes, a quad core 30 seconds... those differences are just way too high to make it fun for everyone. v0.2 taxes the memory a lot more, which causes a 100% cpuload with one core, and ~80% with 2 cores. I think this is much more fun as now your memory speed matters too.

    I am, in no way, claiming that I've used the most effecient algoritm to calculate pi, nor that I've used the most efficient language to do it.

    My goals are:
    - fun to benchmark
    - advantage with multiple cores, but single core should still be 'fun' and quad core should not obliterate all at stock speed
    - cross platform
    - easy integration with hwbot.org

    My goals are not:
    - calculate pi in an efficient way

    SuperPi doesn't uses the most efficient algorithm either... i haven't seen anyone complaining about that either.


    One last thing Gautam, do you know how much time it takes to develop it in C++ for 3 different platforms properly? I'm with Kunaak on this, i've seen too many great benchmark apps go down the pooper because the community was demanding too much. I can write it in java in my spare time, I can not do it in C++ without taking a break from work.
    Last edited by RichBa5tard; 08-16-2006 at 10:47 PM.
    HTPC (win xp): Turion MT-30 @ 2Ghz | NF4 | XFX 7900GT | 26" TFT
    Development (mac osx): Macbook Pro | Core Duo 1.86Ghz | 1.5GB DDR2


  6. #56
    Xtreme Member
    Join Date
    May 2006
    Posts
    115
    54 sec on dual dual 265 opterons

  7. #57
    Admin
    Join Date
    Dec 2003
    Location
    Hillsboro, OR
    Posts
    5,225
    I didn't realize that that was deliberate...so you ended up "cheating" ultimately. But you know that I'm in support for leveling the playing field, so I retract what I said earlier in that case. Good concept.

    I wasn't expecting this to be such a sore subject. You know I appreciate all of your hard work, but I always try to share my thoughts as well. The efficiency isn't a big deal like I said...just the stressing of both the CPU and memory subsystem. If you feel Java is the best path to take then no worries.

  8. #58
    hwbot crew
    Join Date
    Jun 2002
    Location
    Belgium!
    Posts
    880
    Yeah, cross platform is a sore subject for me. I don't want to install windows just for benching! Give us mac/linux users a break!
    HTPC (win xp): Turion MT-30 @ 2Ghz | NF4 | XFX 7900GT | 26" TFT
    Development (mac osx): Macbook Pro | Core Duo 1.86Ghz | 1.5GB DDR2


  9. #59
    Admin
    Join Date
    Dec 2003
    Location
    Hillsboro, OR
    Posts
    5,225
    Riddle me this though, do you think that *nix results will be comparable with Windows results?

    Anyone with a Core-based Mac run this yet?

  10. #60
    Aussie God
    Join Date
    Feb 2005
    Location
    Copenhagen, Denmark
    Posts
    4,596
    RichBa5tard, you have PM.

    EDIT:
    Got a 53 sec time on my 3.6ghz conroe.
    Competition ranking;
    2005; Netbyte, Karise/Denmark #1 @ PiFast
    2008; AOCM II, Minfeld/Germany #2 @ 01SE/AM3/8M (w. Oliver)
    2009; AMD-OC, Viborg/Denmark #2 @ max freq Gigabyte TweaKING, Paris/France #4 @ 32M/01SE (w. Vanovich)
    2010: Gigabyte P55, Hamburg/Germany #6 @ wprime 1024/SPI 1M (w. THC) AOCM III, Minfeld/Germany #6 @ 01SE/AM3/1M/8M (w. NeoForce)

    Spectating;
    2010; GOOC 2010 Many thanks to Gigabyte!


  11. #61
    hwbot crew
    Join Date
    Jun 2002
    Location
    Belgium!
    Posts
    880
    Quote Originally Posted by Gautam
    Riddle me this though, do you think that *nix results will be comparable with Windows results?

    Anyone with a Core-based Mac run this yet?
    I honestly think so, yes.

    I've got a core based mac, with dual boot windows. I'll compare it tonight.
    HTPC (win xp): Turion MT-30 @ 2Ghz | NF4 | XFX 7900GT | 26" TFT
    Development (mac osx): Macbook Pro | Core Duo 1.86Ghz | 1.5GB DDR2


  12. #62
    hwbot crew
    Join Date
    Jun 2002
    Location
    Belgium!
    Posts
    880
    Well, I've tested 0.2 thoroughly and there seems to be a difference between platforms I think it has mainly to do with the amount of open processors primarely caused by the window manager. The heavier/fancier your GUI, the slower multipi gets.

    Ranked by speed:
    linux terminal > linux KDE > windows XP > MacOSX Tiger

    The gain for a linux terminal was too much to call it fair, so I started tweaking the algorithm again.

    The new algorithm (0.3, to be released early next week), will tax memory less, but will be comparable between platforms. I haven't had the chance to test it on Mac OSX yet, but results on a linux terminal and windows gui are nearly identical (I did 5 runs on each and fastest run on linux was not more than 200 milliseconds faster than slowest run on windows xp). If I get the same, identical results on macosx i think we have a winner.

    It doesn't tax the memory as much anymore though, but I find consistency between platforms much more important than taxing the memory.
    HTPC (win xp): Turion MT-30 @ 2Ghz | NF4 | XFX 7900GT | 26" TFT
    Development (mac osx): Macbook Pro | Core Duo 1.86Ghz | 1.5GB DDR2


  13. #63
    Admin
    Join Date
    Dec 2003
    Location
    Hillsboro, OR
    Posts
    5,225
    Yeah, I was expecting mostly the same. In case you haven't noticed, all experienced Pi benchers use Windows Server 2003. Despite using the same kernel and otherwise being virtually identical to XP, its much leaner and as a result will nearly almost win, albeit by a tiny amount. Linux would be one step further.

    Once again this is quite a feat- a multithreaded benchmark comparable across virtually any platform. Quite unprecedented I believe.

  14. #64
    hwbot crew
    Join Date
    Jun 2002
    Location
    Belgium!
    Posts
    880
    w00t

    I'll see if I can get hold of a Win2003 somewhere.
    HTPC (win xp): Turion MT-30 @ 2Ghz | NF4 | XFX 7900GT | 26" TFT
    Development (mac osx): Macbook Pro | Core Duo 1.86Ghz | 1.5GB DDR2


  15. #65
    Admin
    Join Date
    Dec 2003
    Location
    Hillsboro, OR
    Posts
    5,225
    180 day eval free from Microsoft.

  16. #66
    hwbot crew
    Join Date
    Jun 2002
    Location
    Belgium!
    Posts
    880
    Thanks!
    HTPC (win xp): Turion MT-30 @ 2Ghz | NF4 | XFX 7900GT | 26" TFT
    Development (mac osx): Macbook Pro | Core Duo 1.86Ghz | 1.5GB DDR2


  17. #67
    xtreme energy
    Join Date
    Oct 2004
    Location
    Europe, Latvia
    Posts
    4,145
    Sorry what do you mean by "taxing" memory?

    I can see there is a switch to VM how much ram to use but not sure what is optimal
    ...

  18. #68
    Xtreme Legend
    Join Date
    Mar 2005
    Location
    Australia
    Posts
    17,242
    Quote Originally Posted by RichBa5tard
    Well, I've tested 0.2 thoroughly and there seems to be a difference between platforms I think it has mainly to do with the amount of open processors primarely caused by the window manager. The heavier/fancier your GUI, the slower multipi gets.

    Ranked by speed:
    linux terminal > linux KDE > windows XP > MacOSX Tiger

    The gain for a linux terminal was too much to call it fair, so I started tweaking the algorithm again.

    The new algorithm (0.3, to be released early next week), will tax memory less, but will be comparable between platforms. I haven't had the chance to test it on Mac OSX yet, but results on a linux terminal and windows gui are nearly identical (I did 5 runs on each and fastest run on linux was not more than 200 milliseconds faster than slowest run on windows xp). If I get the same, identical results on macosx i think we have a winner.

    It doesn't tax the memory as much anymore though, but I find consistency between platforms much more important than taxing the memory.
    lol you just gave me a good idea

    *searches for my copy of linux that installs in RAM heh
    Team.AU
    Got tube?
    GIGABYTE Australia
    Need a GIGABYTE bios or support?



  19. #69
    Xtreme Member
    Join Date
    Aug 2006
    Posts
    100
    Quote Originally Posted by dinos22
    lol you just gave me a good idea

    *searches for my copy of linux that installs in RAM heh
    What you need is a live cd, they dont use the hard drive at all :P
    Try something like SLAX, its pretty easy to use, and it can copy itself entirely to ram.
    Last edited by dig412; 08-19-2006 at 02:03 AM.
    Check out my benchmarking program here.

  20. #70
    Registered User
    Join Date
    Aug 2003
    Posts
    78
    1:54.254 sec's. On my Newark at 3060 ghz.
    AMD Phenom II X4 965 Black Edition C3 0945FPA
    BIOSTAR TA890FXE Bios 89FAD927
    OCZ OCZ3BE1600C8LV4GK
    Corsair H50
    XFX 8800 GTS 320
    PCPOWER&COOLING SILENCER 750 QUAD
    Coolermaster HAF-132 case

  21. #71
    2.4C killer
    Join Date
    May 2003
    Location
    San Diego, CA
    Posts
    1,924
    I havent posted here in like a year but decided to try this out

    P4 Prescott w/HT 3.6ghz - 2m 31s

    P4 Prescott w/o HT 3.6ghz - 3m 11s

    Nice benchmark!
    Last edited by Peen; 08-20-2006 at 01:17 PM.

  22. #72
    Xtreme Member
    Join Date
    May 2005
    Posts
    226


    conroe really doesnt dominate here for the mhz. i wonder what an x2 at like 3.6ghz would do.


    i no longer use a laptop.



    Do You Owe Me Heatware??

  23. #73
    I am Xtreme
    Join Date
    Jan 2005
    Posts
    4,714
    Quote Originally Posted by CCUABIDExORxDIE
    conroe really doesnt dominate here for the mhz. i wonder what an x2 at like 3.6ghz would do.
    certainly not 40s. I guess about 50s
    Where courage, motivation and ignorance meet, a persistent idiot awakens.

  24. #74
    Registered User
    Join Date
    Sep 2005
    Location
    Lisboa, Portugal
    Posts
    30
    DC Opty 175 @ 2.25GHz - 1m 39.266s
    24/7 OS, a lot of apps running in background
    i5 2500k / asus p8p67 pro b3 (1305) / 2x4gb gskill 1600 / evga gtx 560 ti fpb / seasonic s12 600w
    x25-m g2 80gb / 2*wd 320gb / 1*f3 1tb / x-fi / stacker / cuplex xt / bix 360 / eheim s600

    retired: opteron 175 @ 2.65ghz / asus a8n32-sli deluxe / 2x1gb ocz eb plat / 7800gtx 256mb

  25. #75
    Registered User
    Join Date
    Jul 2006
    Posts
    6
    This doesn't work right in OS X 10.4.7... it's really screwed up. ran as sh file.sh

    Code:
    [01:53:54] kurosaki-ichigo:~/Desktop/multipi michaeldesilva$ sh multipi.sh 
    **** MULTIPI 0.2: a multithreaded pi calculator ****
    
    USE: /MultiPi <numberofiterations> <workunitsize>
    <numberofiterations>: total number of iterations to find pi - DEFAULT: 100000000
    <workunitsize>: divise iterations into blocks of this size to spread work over cores - DEFAULT: 5000000
    
    WARNING: this is a beta version, do not distribute unless you're a betatester. Results may vary in future versions.
    
    Starting Pi calculation
    Available cores dedected: 2
    Calculation on core 1 started.
    Calculation on core 2 started.
    30m 08.295s - Loop 1 of 20 finished
    30m 16.048s - Loop 3 of 20 finished
    ^Z
    [1]+  Stopped                 sh multipi.sh
    it goes towards 60secs, and ends up with a 32min end time..it's messed up
    Last edited by bsodmike; 09-11-2006 at 12:24 PM.

Page 3 of 5 FirstFirst 12345 LastLast

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •