Page 1 of 2 12 LastLast
Results 1 to 25 of 40

Thread: Mandelbrot Benchmark

  1. #1
    Registered User
    Join Date
    May 2006
    Location
    Germany
    Posts
    28

    Mandelbrot Benchmark

    Hi there,

    after lots of optimization work I finished now an almost final version of my Mandelbrot Benchmark. It focuses on FPU and SSE2 units of current processors, while neglecting almost all influence of cache and memory subsystem and graphics card. I used a lot of time implementing the whole thing completely in assembler and used lots of techniques to make it really fast with loop unrolling, independent instruction chains, multi core support using the benefits of all the available CPU's.

    I think it really shows how over the years theses calculation units where improved or not...I have a column on my webpage for efficiency per core per MHz.

    Any comments and test results welcome...

    @@@ New Version released 01-Jun-2008 @@@

    Direct link:
    http://www.mikusite.de/x86/KMB_V0.53H-32b-MT.zip
    Homepage with results:
    http://www.mikusite.de/pages/x86.htm
    Last edited by Kuemmel; 06-07-2008 at 12:07 AM.

  2. #2
    Xtreme Enthusiast
    Join Date
    Jan 2007
    Location
    Santa Barbara, CA
    Posts
    794
    Mandelbrots are awesome, I'll have to check this out later.
    Computer: Intel i7-4770k | Asus Z87 Maximus VI Impact | 240GB Corsair Force GT | 240GB OCZ Vertex 3 | 16GB G.Skill Trident X 2400MHz | EVGA GTX690 Hydro Copper 4GB | Corsair AX850 | Steelseries 7G | Logitech G500 | 27" 2560x1440 Overlord Tempest X270OC 120Hz S-IPS
    Cooling: Koolance 380i CPU block | Swiftech Hydro Copper GPU block | EK X3 150 Reservoir | EK X-TOP D5 PWM drive | PrimoChill PrimoFlex PRO LRT Bloodshed red | Alphacool NexXxos 240mm XT45 | Alphacool NexXxos 280mm UT60 | Bitspower Fittings | 4x Bitfenix Spectre Pro 120mm | 2x Bitfenix Spectre Pro 140mm | Lamptron FC-5V2
    Case: Custom modded matte black Caselabs Mercury S3

  3. #3
    Xtreme Enthusiast
    Join Date
    May 2007
    Posts
    831
    SSE2:
    3365.647

    FPU:
    1095.311

    Fix it so that if you minimize the benchmark window, it doesn't come back a big black square that won't go away.
    Gigabyte P35-DQ6 | Intel Core 2 Quad Q6700 | 2x1GB Crucial Ballistix DDR2-1066 5-5-5-15 | MSI nVIDIA GeForce 7300LE

  4. #4
    Xtreme Member
    Join Date
    Jun 2005
    Location
    Bulgaria, Varna
    Posts
    447
    Sse2: 2645.455

    Fpu: 843.686
    Last edited by fellix_bg; 04-13-2008 at 08:39 PM.

  5. #5
    Xtreme Enthusiast
    Join Date
    Jul 2007
    Posts
    730
    Q6600 (400x9.0)
    SSE2: 4711.906
    FPU: 1504.404
    [ 3770K @ 4.2 : H100i : ASRock Z77E-ITX : GTX560 Ti : 16GB DDR3 1800 : +4TB : Bitfenix Prodigy : 2x Dell S2340M : Filco Majestouch-2 [Cherry Brown] : BX8a Deluxe]

  6. #6
    Xtreme Enthusiast
    Join Date
    Oct 2007
    Location
    Mid UK. Lift a few rocks, eventually you will find me.
    Posts
    665
    FPU = 680.446
    SSE2 = 2182.936

    C2D E6400@8x400=3.2GHz. 1:1 FSB.

    Kuemmel, that's a quick algorithm.


    ChaosPro is a good fractal proggy to play with.
    Fun Box: Asus P8Z68-V GEN3++Corsair AX850++i5 2500k@4.5Ghz-1.272v++Corsair A50++2x8Gb Corsair Vengeance++MSI R7970 Lightning++Audigy2 Plat-EX++TBS 6280 DVB-T2 tuner++256Gb OCZ Vertex 4.500Gb Caviar Black.500Gb Seagate Barracuda++Sony AD7240s++Lian-Li PC-60++Linux Mint/Win 7++Asus P238Q

    Work Box: Gigabyte H61MA-DV3++Corsair HX620++i5 3450@stock++2x8Gb Corsair Vengeance++120Gb OCZ Agility 3++Linux Mint

    Quantum theory in a nutshell: It's so small we don't know where it is, it could be here, it could be there.

    Just 'cos it's legal don't make it right.

  7. #7
    Xtreme Addict
    Join Date
    Apr 2005
    Location
    Houston, TX
    Posts
    1,196
    My kind of benchmark... quick.

    C2Q Q9550 @ 4.03Ghz/4GB OCZ DDR 1066 @ 902Mhz /BFG GTX 260/Gigabyte GA-EP45-UD3P/2 X Samsung F1 1TB/Samsung F1 750GB/Samsung SH-S223Q/Corsair TX650/X-Fi Titanium Fata1ity/Swiftech H20-220/Logitech Z5300 Speakers/Samsung SyncMaster 2253BW Monitor/Win 7 Ultimate x64

  8. #8
    Registered User
    Join Date
    May 2006
    Location
    Germany
    Posts
    28
    Thanks for all the results !

    I'm trying hard to find somebody with a P4 or Xeon based P4 CPU with Hyperthreading...anybody out there ? Or can you pass it to somebody with that kind of CPU ??? Pleeeeeeeaaaaase... ;-)

  9. #9
    Xtreme Enthusiast
    Join Date
    Jan 2007
    Location
    Santa Barbara, CA
    Posts
    794
    Oh, we have a bunch of 3.2ghz P4's with HT in my lab on campus. Ill give it a shot next Tuesday.
    Computer: Intel i7-4770k | Asus Z87 Maximus VI Impact | 240GB Corsair Force GT | 240GB OCZ Vertex 3 | 16GB G.Skill Trident X 2400MHz | EVGA GTX690 Hydro Copper 4GB | Corsair AX850 | Steelseries 7G | Logitech G500 | 27" 2560x1440 Overlord Tempest X270OC 120Hz S-IPS
    Cooling: Koolance 380i CPU block | Swiftech Hydro Copper GPU block | EK X3 150 Reservoir | EK X-TOP D5 PWM drive | PrimoChill PrimoFlex PRO LRT Bloodshed red | Alphacool NexXxos 240mm XT45 | Alphacool NexXxos 280mm UT60 | Bitspower Fittings | 4x Bitfenix Spectre Pro 120mm | 2x Bitfenix Spectre Pro 140mm | Lamptron FC-5V2
    Case: Custom modded matte black Caselabs Mercury S3

  10. #10
    Moral Police
    Join Date
    Nov 2004
    Location
    Belgium
    Posts
    2,099
    Dual Xeon 2.4Ghz with hyperthreading here

    Will Test tomorrow. To tired right now. if I forget just shoot me a PM

  11. #11
    Xtremely Hot Sauce
    Join Date
    Sep 2007
    Location
    New York
    Posts
    3,586
    I was trying to get you results from an old K6 Chomper Ext at 550MHz, but it didn't work.

    My toys:
    Asus Sabertooth X58 | Core i7-950 (D0) | CM Hyper 212+ | G.Skill Sniper LV 12GB DDR3-1600 CL9 | GeForce GTX 670-2048MB | OCZ Agility 4 512GB, WD Raptor 150GB x 3 (RAID0), WD Black 1TB x 2 (RAID0) | XFX 650W CAH9 | Lian-Li PC-9F | Win 7 Pro x86-64
    Gigabyte EX58-UD3R | Core i7-920 (D0) | Stock HSF | G.Skill Sniper LV 4GB DDR3-1600 CL9 | Radeon HD 2600 Pro 512MB | WD Caviar 80GB IDE, 4TB x 2 (RAID5) | Corsair TX750 | XClio 188AF | Win 7 Pro x86-64
    Dell Dimension 8400 | Pentium 4 530 HT (E0) | Stock HSF | 1.5GB DDR2-400 CL3 | GeForce 8800 GT 256MB | WD Caviar 160GB SATA | Stock PSU | (Broken) Stock Case | Win Vista HP x86
    Little Dot DAC_I | Little Dot MK IV | Beyerdynamic DT-880 Premium (600 Ω) | TEAC AG-H300 MkIII | Polk Audio Monitor 5 Series 2's

  12. #12
    Registered User
    Join Date
    May 2006
    Location
    Germany
    Posts
    28
    Quote Originally Posted by Bobsama View Post
    I was trying to get you results from an old K6 Chomper Ext at 550MHz, but it didn't work.
    Hi, I think that's possible and okay, becuase the K6 neither supports the additional FPU instructions back in time invented with the incarnation of "Pentium Pro" and "Pentium II" and of course no SSE2 also at that time...sorry...may be should have implemented a kind of check and warning...

    ...by the way...you can send just any results, also AMD FX and X2 and Phenoms are very interesting !!!

  13. #13
    Registered User
    Join Date
    Apr 2008
    Location
    Florida, USA
    Posts
    41
    cool
    Attached Thumbnails Attached Thumbnails Click image for larger version. 

Name:	mandelbot.JPG 
Views:	1456 
Size:	89.2 KB 
ID:	76877  
    CPU: QX9770 C1 | Striker II Extreme, Bios ? | GPU: 2x MSI 8800Ultra SLI | MEM: 4GB Corsair DDR3 1800
    HDD: 4x Raptors 150GB w/ARC-1210 Controller Raid0, 1x Seagate ST3500320NS 500GB
    PSU: PC P&C 1KW Single Rail | WATER BLOCKS: CPU= D-Tek fusion, GPUs = EK | RADIATORS: 2x ThermoChill PA120.3 | PUMPS: 2x Laing D5
    CASE: Mountain Mods U2-UFO Horizon | Logitech G15 & G9 | MONITOR: Dell 3007WFP | XP Pro x64


  14. #14
    Xtreme Enthusiast
    Join Date
    Jun 2007
    Location
    Victoria, Australia
    Posts
    948
    Quote Originally Posted by Qwik View Post
    cool
    Just wondering why you score so low at 4ghz?
    Using vista?

    Got Q6600 @ 3.645 mem at 996 4-4-4 and score of:

    1533.97
    4858.58

    Very nice benchmark though, love quick ones.
    Last edited by The0men; 04-18-2008 at 10:36 PM.

  15. #15
    Registered User
    Join Date
    May 2006
    Location
    Germany
    Posts
    28
    Quote Originally Posted by The0men View Post
    Just wondering why you score so low at 4ghz?
    Using vista?...
    ...Very nice benchmark though, love quick ones.
    Hm, yep, your result is like expected, his result is a bit slow...I had one result where Vista 32 bit was slow and Vista 64 bit was okay, but also another Vista 32 bit result where everything was okay...no conclusion yet...may be other programs running or something ?

    Because normally the benchmark really scales totally in line with the clock speed of a processor generation, so all Core 2 scale the same.

    @EDIT: in other words, at 4050 MHz with a Core 2 Quad I think a result up to 5350-5400 shoule be possible independant of the memory setup of the system or motherboard...
    Last edited by Kuemmel; 04-18-2008 at 11:59 PM.

  16. #16
    Registered User
    Join Date
    Apr 2008
    Location
    Florida, USA
    Posts
    41
    Quote Originally Posted by The0men View Post
    Just wondering why you score so low at 4ghz?
    Using vista?

    Got Q6600 @ 3.645 mem at 996 4-4-4 and score of:

    1533.97
    4858.58

    Very nice benchmark though, love quick ones.
    Nope I am using XP32 .. and I ran bench @ everyday settings with other programs running like normal(virus scan, skype, EI7, etc..). I'll rerun it
    by it self see what a difference it makes.

    it did.
    Attached Thumbnails Attached Thumbnails Click image for larger version. 

Name:	mandelbot.JPG 
Views:	1405 
Size:	100.9 KB 
ID:	76909  
    Last edited by Qwik; 04-19-2008 at 11:52 AM.
    CPU: QX9770 C1 | Striker II Extreme, Bios ? | GPU: 2x MSI 8800Ultra SLI | MEM: 4GB Corsair DDR3 1800
    HDD: 4x Raptors 150GB w/ARC-1210 Controller Raid0, 1x Seagate ST3500320NS 500GB
    PSU: PC P&C 1KW Single Rail | WATER BLOCKS: CPU= D-Tek fusion, GPUs = EK | RADIATORS: 2x ThermoChill PA120.3 | PUMPS: 2x Laing D5
    CASE: Mountain Mods U2-UFO Horizon | Logitech G15 & G9 | MONITOR: Dell 3007WFP | XP Pro x64


  17. #17
    Xtreme Cruncher
    Join Date
    Jun 2006
    Location
    On top of a mountain
    Posts
    4,163
    Interesting result



    I score about 300 points higher if I have all 8 cores at 100% load Folding

    Turn off Folding...score drops.
    20 Logs on the fire for WCG: i7 920@2.8 X3220@3.0 X3220@2.4 E8400@4.05 E6600@2.4

  18. #18
    Xtreme Cruncher
    Join Date
    Jun 2006
    Location
    Land o' 10,000 lakes
    Posts
    836
    Is there a reason I'm getting TR/Crypt.XPACK.Gen warnings?

    I'm almost always available on Steam to chat. Same username.

  19. #19
    Xtreme Enthusiast
    Join Date
    Jul 2007
    Posts
    730
    Quote Originally Posted by jspace View Post
    Is there a reason I'm getting TR/Crypt.XPACK.Gen warnings?
    False positive. From the thread in Benchmarking:...

    Quote Originally Posted by Kuemmel
    Yep, known problem, it's a false alarm. I loaded my program up on "www.virustotal.com" and only got 3 kind of positive or warning from 32...it's a common problem within the users of FlatAssembler for coding...plenty false alarams there...
    [ 3770K @ 4.2 : H100i : ASRock Z77E-ITX : GTX560 Ti : 16GB DDR3 1800 : +4TB : Bitfenix Prodigy : 2x Dell S2340M : Filco Majestouch-2 [Cherry Brown] : BX8a Deluxe]

  20. #20
    I am Xtreme
    Join Date
    Feb 2005
    Location
    SiliCORN Valley
    Posts
    5,543
    Q6600 G0 @ 3.835

    i wanna see someone do Mandel's that are created or benchmarked as 100% GPU based,, thats G P U
    see what the power of the GPU can do vs CPU
    Attached Thumbnails Attached Thumbnails Click image for larger version. 

Name:	KM SSE2.jpg 
Views:	1400 
Size:	12.7 KB 
ID:	76961   Click image for larger version. 

Name:	KM.FPUjpg.jpg 
Views:	1373 
Size:	13.4 KB 
ID:	76962   Click image for larger version. 

Name:	2008-04-20_00014.jpg 
Views:	1395 
Size:	65.2 KB 
ID:	76963  
    Last edited by Lestat; 04-19-2008 at 10:43 PM.
    "These are the rules. Everybody fights, nobody quits. If you don't do your job I'll kill you myself.
    Welcome to the Roughnecks"

    "Anytime you think I'm being too rough, anytime you think I'm being too tough, anytime you miss-your-mommy, QUIT!
    You sign your 1248, you get your gear, and you take a stroll down washout lane. Do you get me?"

    Heat Ebay Feedback

  21. #21
    Registered User
    Join Date
    May 2006
    Location
    Germany
    Posts
    28
    Quote Originally Posted by Lestat View Post
    Q6600 G0 @ 3.835
    i wanna see someone do Mandel's that are created or benchmarked as 100% GPU based,
    see what the power of the GPU can do vs CPU
    Yep, interesting topic...the problem is that at the moment the floating point unit on a GPU is only single precision and when you go 'deeper' into the Mandelbrot 'land' you need double precision...I read that the GPU producer might change this may be in the future...here's a refernece:

    http://www.theinquirer.net/gb/inquir...gpgpu-monsters

    ...but I don't know what's the actual state...so it's more for the future, once these things are spread...

  22. #22
    Registered User
    Join Date
    May 2006
    Location
    Germany
    Posts
    28
    Quote Originally Posted by CyberDruid View Post
    I score about 300 points higher if I have all 8 cores at 100% load Folding Turn off Folding...score drops.
    Hm, what is acutally "load Folding" ? I didn't find anything directly usefull while googleing it...where do you set this thing and is it normally supposed to be turned on for everyday use or something ?
    Last edited by Kuemmel; 04-19-2008 at 11:41 PM.

  23. #23
    Xtreme Enthusiast
    Join Date
    Jun 2007
    Location
    Victoria, Australia
    Posts
    948
    He means when his processors are being used at %100 load whilst he is folding, which is just a program using your CPU to do equations for various tasks like scientific research, or medical research, or biological etc.

    Just like running a stress test like prime on all 8 of his cores.

  24. #24
    Xtreme Enthusiast
    Join Date
    Feb 2008
    Location
    Northeast Ohio, Where the weather changes every 30 min...
    Posts
    598
    Laptop: 1.73ghz Pentium M

    FPU: 171.636
    SSE2: 190.112

    Desktop: Phenom 9600 (TLB fix disabled) 258x11.5=2967

    FPU: 1036.228
    SSE2: 2628.747

    It seemed kind of short, taking only like 10 seconds to run on the phenom. I'm not completely familiar with mandelbrots, but maybe you could make the graphs large by changing the functions some so the cpu has to calculate alot more. Very nice job so far though.
    Not much to say right now.

  25. #25
    Registered User
    Join Date
    May 2006
    Location
    Germany
    Posts
    28
    Quote Originally Posted by Oldguy932 View Post
    Desktop: Phenom 9600 (TLB fix disabled) 258x11.5=2967

    FPU: 1036.228
    SSE2: 2628.747

    It seemed kind of short, taking only like 10 seconds to run on the phenom. I'm not completely familiar with mandelbrots, but maybe you could make the graphs large by changing the functions some so the cpu has to calculate alot more. Very nice job so far though.
    Thanks for the first Phenom result for my list I just think you could have scored almost 2800 on SSE2...any applications running on the side ?

    Yeah, I already made it run a bit 'slower', but even with under 10 seconds or the Core 2's running it at may be 5 seconds it delivers still pretty stable results...I could also do some zooming thing, but for the moment I focused more on the speed of the algoritm, so that all these new CPU's get something to do ;-) I'll try to keep up your ideas in the next releases...

Page 1 of 2 12 LastLast

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •