Page 3 of 11 FirstFirst 123456 ... LastLast
Results 51 to 75 of 263

Thread: Nehalem,One cpu to rule them all!

  1. #51
    Registered User
    Join Date
    Aug 2008
    Posts
    26
    I've been following your replies to threads in the news forum waiting for your report and you don't disappoint very nice

  2. #52
    Xtreme Owner Charles Wirth's Avatar
    Join Date
    Jun 2002
    Location
    Las Vegas
    Posts
    11,656
    Good read Dave,
    Intel 9990XE @ 5.1Ghz
    ASUS Rampage VI Extreme Omega
    GTX 2080 ti Galax Hall of Fame
    64GB Galax Hall of Fame
    Intel Optane
    Platimax 1245W

    Intel 3175X
    Asus Dominus Extreme
    GRX 1080ti Galax Hall of Fame
    96GB Patriot Steel
    Intel Optane 900P RAID

  3. #53
    Xtreme Cruncher
    Join Date
    Dec 2006
    Location
    Stavanger , Norway
    Posts
    383
    Good lord,Movieman, nice result, i understand why have fallen in love with it.
    I have ordered a core i7 920 myself,the cheapest, but gee, what prices on those boards, hope the ram will be cheaper.

  4. #54
    Xtreme Addict
    Join Date
    Mar 2008
    Location
    川崎市
    Posts
    2,076
    The Emperor has landed.


    I go over I7 and will it to produce some numbers...

    (in a few weeks when it actually arrives in the uk).

    Nice numbers Dave

  5. #55
    XS WCG Hamster Herder
    Join Date
    Mar 2007
    Location
    Pacific Northwest, USA
    Posts
    2,389
    Quote Originally Posted by FUGGER View Post
    Good read Dave,
    ..And your threads were not slackers either, Sir.... I made the mistake of finally downloading our own XS super pi, in order to see where I stacked up....Big mistake, now I have to call my therapist, Dr. SiG and report a severe case of pi inadequacy.. My best time, on my fastest, and only, harper rig @3.6g (crunching stable), at 1M digits was 12.844. A far cry from MM's results and very far away from yours.

    Did you happen to do any power draw type analysis on your rigs? Particularly at clocks that us mere mortal crunchers (Read as, air or water cooled.) can run 24/7 on? I am particularly interested in this since I have to amortize/justify the total investment over a couple of years when I'm planning for the farm. If these are more cost effective when I pay the power bill, I can spend a bit more upfront, get more crunching done, and contribute my cash more smartly to our mission here.

    We are also interested in your impressions on how "baked" you think the mobos you ran on are. Dave's board seemed pretty solid. You seem to imply a preference for your gigabyte board, at least I saw more screenies from it.

    From what I'm seeing from all of you testing is that we are ready for the release of this beast on the computing world. Yes?

    Thanks much, hope to meet you in Vegas this Jan.

    Regards,
    Bob
    If You ain't Crunching, you ain't Xtreme enough. Go Here
    Help cure CANCER, MS, AIDS, and other diseases.
    ....and don't let your GPU sit there bored...Crunch or Fold on it!!
    Go Here, Or Here

  6. #56
    Banned
    Join Date
    Aug 2007
    Location
    C:\Philippines\TPC
    Posts
    1,525
    Thanks Dave for the info!

  7. #57
    Xtreme Addict
    Join Date
    Mar 2008
    Location
    川崎市
    Posts
    2,076
    Madshrimps has some power consumption numbers in their review, I7 uses only roughly the same as a E8600
    linky

  8. #58
    Banned
    Join Date
    Aug 2007
    Location
    C:\Philippines\TPC
    Posts
    1,525
    Quote Originally Posted by naokaji View Post
    Madshrimps has some power consumption numbers in their review, I7 uses only roughly the same as a E8600
    linky
    LegionHardware on the other hand shows these i7s are way better at idle


  9. #59
    Xtreme Member
    Join Date
    May 2008
    Location
    Sydney, Australia
    Posts
    242
    Great stuff, MM. I've been wondering & waiting for this info ever since reading your post on WCG re. disguising the real ID of a CPU. You said then it was "a beast", but I was skeptical because Intel were only claiming about 10-15% over Penryn, clock-for-clock, and I thought that may have been mainly on apps with high data thruput, such as servers, where the 3 mem channels would kill the old FSB. I hadn't figured on HT being so effective.

    If you really want to see the WCG points roll in, you might try crunching some FAAH WUs, or DDDT when it recovers from Hurricane Ike. With "single redundancy", points awarded are based on the benchmark internal to each WU, and they are about 13% higher than claimed by my Intels, and 10% below the greedy claims of my A64.
    - BF, aka rickjb on WCG -

    [Edit]: Re WCG points and FAAH, I just remembered that I'm running Xp-32 vs your XP-64. 64-bit BOINC inflates its points claims, because the 64-bit BOINC integer benchmarks are much higher than the 32-bit values. Your FAAH points may be scaled down, not up.
    You might also be interested in my post #63, below.
    Last edited by BlindFreddie; 11-03-2008 at 05:30 AM. Reason: 32/64-bit issues

  10. #60
    Xtreme crazy bastid
    Join Date
    Apr 2007
    Location
    On mah murder-sickle!
    Posts
    5,878
    Quote Originally Posted by naokaji View Post
    ... I7 uses only roughly the same as a E8600
    ... and is HOW many times faster?

    [SIGPIC][/SIGPIC]

  11. #61
    Xtreme Member
    Join Date
    May 2008
    Location
    Sydney, Australia
    Posts
    242

    Done but not disgraced

    MovieMan & All: I've been doing some thinking about those Nelly WCG benchmarks and have figured out some speed comparisons against my Yorkfields.

    Caveats:
    Both of my Yorkies are running XP-32 Pro SP3, so they are at a disadvantage to MM's XP-64. I believe that there is some difference in CPU-time per WorkUnit between XP-32 and XP-64, but I have not seen credible numbers on this ***See edit below. (For those not familiar with WCG, all the science applications are 32-bit, so run-times should be similar for both OSes, even if the Grid Computing Manager (BOINC) is 64-bit). There is a known large difference in benchmark scores between 32-bit and 64-bit BOINC clients in the integer (Dhrystone) scores, though not the floating-point (Whetstone) scores****See edit below. These scores are used by the BOINC client to determine the Points Claimed from WCG, and they affect the Points Awarded by WCG by different degrees on different WCG projects. The project most affected is HCC, the one that MM has crunched. His WCG Points Awarded will be inflated compared to mine, so we can't use them in real crunching-speed comparisons. However, it's interesting to have a look at the BOINC benchmarks.
    My machines are:
    • Q9650 @ 467 x 9 = 4203MHz = 1.114 x 3773Mhz (i7 speed). BOINC scores (fp/int) 4442/9101. Scaling this back to 3773Mhz gives 3945/8083. I don't have any HCC times for it as it's currently running 100% FAAH. *****See edit below.
    • QX9650 @ 392 x 10 = 3920MHz = 1.050 x 3733MHz. BOINC scores 4146/8683. Scaling to 3773MHz gives 3948/7983. This machine is currently running HCC.

    The scaled BOINC scores for my 2 machines are very similar, but they are not comapable to those for the i7 because of the 32-bit/64-bit issue.

    Now let's examine the HCC execution times from my QX, which will have a disadvantage of 1% due to XP-32. Average CPU time for the last 6 WUs was 2.66 hrs. Scaling up by the 1.05 clock speed ratio gives 2.79 hrs. The i7's times running without HT are close to 2.5 hrs, giving a speed ratio 11.6% faster than the Yorkie (10.6% after 32/64bit). This is right in the range of Intel's claim of 10-15% clock-to-clock advantage to Nehalem.

    Finally, the Hyperthreaded scores. MM's i7 does HCC WUs in about 4h24m, ie 4.4 hrs, times 8 virtual cores, equivalent to 4 cores in 2.2 hrs. The speed ratio is now a 27% advantage to Nehalem (26% after 32/64bit).

    Yorkie is done but not disgraced by Nelly.

    [Edit]:
    *** I did some testing of XP-32 vs XP-64: I made a snapshot copy of my BOINC data directory, under XP-32, while it contained a mix of WUs from FAAH, HCC, HPF2. By selectively suspending processing of some WUs, I forced 1 WU of each type to start simultaneously, and ran, with "network activity suspended", until the longest of the 3 finished, and recorded the CPU times. I swapped my boot drive for one with XP-64 on it, copied in the BOINC data snapshot, and repeated the run. CPU times showed about 1% gain for XP-64 over XP-32 for all 3 projects.
    [Before making my snapshot, I suspended crunching activity, suspended network activity, did Advanced >> Shutdown connected client, then exited BOINC manager. That removes any lock files.]
    **** I found that the FP scores were much lower on XP-64, while the in scores were higher. Sorry, but I have misplaced the actual values.
    *****I ran some HCC on the Q9650 @ 467 x 9. CPU Times scale with clock speed to the QX.
    Last edited by BlindFreddie; 12-02-2008 at 06:32 AM.

  12. #62
    Xtreme Cruncher
    Join Date
    Dec 2002
    Location
    NW Ohio
    Posts
    3,334
    Nice review MM!

    I need 2 of these, now!
    Donate to XtremeSystems!

    Workstation: Intel Core i7 4770, Asus Maximus VI Gene, 32GB Corsair Dominator Platinum DDR3-1866, eVGA SC GTX Titan, 256GB Crucial M4, Corsair HX850, Corsair H100i. Corsair Carbide 350D
    Fileserver: 2x AMD Opteron 2425HE, Supermicro H8DME-2, 24GB DDR2-667, Supermicro 846TQ 24bay Chassis, Redundant 920w, 256 Crucial M4 boot, 20TB Storage
    Notebook Asus Zenbook UX32VD-DH71, Intel Core i7 3517u, 10GB DDR3-1600, 256GB Crucial M4, Geforce GT 620M

  13. #63
    Xtreme Enthusiast
    Join Date
    Dec 2007
    Posts
    816
    I am Francois, and I approve this message.
    (Private joke for American's )

    Core i7 on seti here : http://setiathome.berkeley.edu/top_h...edit&offset=20 the Who? machine! going up, in top 40 right now, I hope to get to top 5 with a single processor.
    Last edited by Drwho?; 11-03-2008 at 07:42 AM.
    DrWho, The last of the time lords, setting up the Clock.

  14. #64
    version 2.0
    Join Date
    Feb 2005
    Location
    Flanders
    Posts
    3,862
    I'd like to see the impact of memory timings with the on-die memory controller.

    Tony hinted the best performance is with low latency timings like 6-5-5 or 6-6-6

  15. #65
    Mr. Boardburner
    Join Date
    Jun 2005
    Location
    the Netherlands
    Posts
    5,340
    That is simply amazing... Dave, will we see 7M+ in the stats tomorrow?
    Main rig:
    CPU: I7 920C0 @ 3.6Ghz (180*20)
    Mobo: DFI UT X58 T3eH8
    RAM: 12GB OCZ DDR3-1600 Platinum
    GPU/LCD: GeForce GTX280 + GeForce 8600GTS (Quad LCDs)
    Intel X25-M G2 80GB, 12TB storage
    PSU/Case: Corsair AX850, Silverstone TJ07

  16. #66
    Xtremely Hot Sauce
    Join Date
    Sep 2007
    Location
    New York
    Posts
    3,586
    Wow Dave. Awesome rig--but do get out your KillAWatt! I want to see load and idle draw.

    My toys:
    Asus Sabertooth X58 | Core i7-950 (D0) | CM Hyper 212+ | G.Skill Sniper LV 12GB DDR3-1600 CL9 | GeForce GTX 670-2048MB | OCZ Agility 4 512GB, WD Raptor 150GB x 3 (RAID0), WD Black 1TB x 2 (RAID0) | XFX 650W CAH9 | Lian-Li PC-9F | Win 7 Pro x86-64
    Gigabyte EX58-UD3R | Core i7-920 (D0) | Stock HSF | G.Skill Sniper LV 4GB DDR3-1600 CL9 | Radeon HD 2600 Pro 512MB | WD Caviar 80GB IDE, 4TB x 2 (RAID5) | Corsair TX750 | XClio 188AF | Win 7 Pro x86-64
    Dell Dimension 8400 | Pentium 4 530 HT (E0) | Stock HSF | 1.5GB DDR2-400 CL3 | GeForce 8800 GT 256MB | WD Caviar 160GB SATA | Stock PSU | (Broken) Stock Case | Win Vista HP x86
    Little Dot DAC_I | Little Dot MK IV | Beyerdynamic DT-880 Premium (600 Ω) | TEAC AG-H300 MkIII | Polk Audio Monitor 5 Series 2's

  17. #67
    Xtreme Cruncher
    Join Date
    Nov 2005
    Location
    Netherlands
    Posts
    1,778
    Quote Originally Posted by Movieman View Post
    The first 8 WU I ran in 2 groups of 4 with HT off and as you see they finished in app 2 hours 28 minutes.
    The last group of 8 WU I ran at one time with HT enabled and was happily surprised to see that they finished in less than twice the time of running 4 units with HT off.
    4 hours 25 mins app .This shows the effect of HT and that it is a positive one.
    From the brief numbers I can see it looks like this machine at 3733mhz will pull just over 20,800 points a day with HT off and well over 30,000 a day with HT enabled.
    Wait..

    2:28 (148 minutes) for 4 WU's
    4:25 (265 minutes) for 8 WU's

    Is that what you are saying?

    So basically 37 minutes per WU without HT and 33.125 minutes per WU with HT. That's a difference of 11%

    How is the output 20800 versus 30000 then? That's almost 30% more?
    Last edited by v0dka; 11-03-2008 at 08:58 AM.

  18. #68
    Chasing After Diety
    Join Date
    Jan 2007
    Location
    Absolutely Speachless :O
    Posts
    11,930
    Wait i just got the title now

    dave the title one nehalem to rule them all is a joke at hera huh?

    Wait til hera gets setup!!! GRRRRRR!~!
    Nadeshiko: i7 990 12GB DDR3 eVGA Classified *In Testing... Jealous? *
    Miyuki: W3580 6GB DDR3 P6T-Dlx
    Lind: Dual Gainestown 3.07
    Sammy: Dual Yonah Sossoman cheerleader. *Sammy-> Lind.*

    [12:37] skinnee: quit helping me procrastinate block reviews, you asshat. :p
    [12:38] Naekuh: i love watching u get the firing squad on XS
    Its my fault.. and no im not sorry about it either.

  19. #69
    Xtreme Cruncher
    Join Date
    Jun 2006
    Location
    On top of a mountain
    Posts
    4,163
    Thanks Dave.


    Go get him Naekuh.
    20 Logs on the fire for WCG: i7 920@2.8 X3220@3.0 X3220@2.4 E8400@4.05 E6600@2.4

  20. #70
    V3 Xeons coming soon!
    Join Date
    Nov 2005
    Location
    New Hampshire
    Posts
    36,363
    Quote Originally Posted by v0dka View Post
    Wait..

    2:28 (148 minutes) for 4 WU's
    4:25 (265 minutes) for 8 WU's

    Is that what you are saying?

    So basically 37 minutes per WU without HT and 33.125 minutes per WU with HT. That's a difference of 11%

    How is the output 20800 versus 30000 then? That's almost 30% more?
    No, You misread. With HT off it does 4 WU at a time and completes the 4 in 2 hours 28 minutes.
    With HT turned on it does 8 WU at a time and completes the 8 in 4 hours 25 mins.
    Output was based on claims made and credit given.
    Quote Originally Posted by NaeKuh View Post
    Wait i just got the title now

    dave the title one nehalem to rule them all is a joke at hera huh?

    Wait til hera gets setup!!! GRRRRRR!~!
    No, not at all my friend. Was based on the Nehalem being the best cpu in terms of computational power that you can buy.
    Quote Originally Posted by CyberDruid View Post
    Thanks Dave.


    Go get him Naekuh.
    I'm sure he will.. He has more $$ than I do!

    To Blindfreddie:
    Many thanks for the data.
    I agree, Yorkies aren't dead.
    With all the data what app I'm really glad I ran was Folding at home.
    No benchmarks that could interfere with the numbers just turn it loose and see how fast it completes.
    When I saw the machine capable of making almost 5000 points a day at 3733mhz vs RoadRunners 4GHz Yorkie making 3300 that told me what kind of power was in this thing.
    BOINC and it's benchmarks have always been flawed but if you ignore those benchmarks in BOINC and just look at times to complete you see the true power.
    8 HCC WU completeing in 4.5 hours is inline with my 8 core clover at 3000mhz but done at 3733mhz on just 4 cores. Perhaps that is a better way to understand the power that resides in these Nehalem cpu's. That's not scientific but more of a real world way of seeing into what it's truly capable of.
    Last edited by Movieman; 11-03-2008 at 10:30 AM.
    Crunch with us, the XS WCG team
    The XS WCG team needs your support.
    A good project with good goals.
    Come join us,get that warm fuzzy feeling that you've done something good for mankind.

    Quote Originally Posted by Frisch View Post
    If you have lost faith in humanity, then hold a newborn in your hands.

  21. #71
    Xtreme Cruncher
    Join Date
    Nov 2005
    Location
    Netherlands
    Posts
    1,778
    Quote Originally Posted by Movieman View Post
    No, You misread. With HT off it does 4 WU at a time and completes the 4 in 2 hours 28 minutes.
    With HT turned on it does 8 WU at a time and completes the 8 in 4 hours 25 mins.
    Output was based on claims made and credit given.
    That's what I said. 4 WU in 2:28 and 8 in 4:25.

    So 148 minutes for 4 WU's (or 296 for 8 it's the same) and 265 minutes for 8 WU's. Difference is 11%.

    Based on the fact that there was 11% more WU's crunched or 11% less time needed to complete the same number of WU's I'd say that the difference in credit granted can never be larger than 11%. What you witnessed was probably quorum related.

    Not that this isn't great, I mean, for a CPU that has only 4 real cores any bonus from using these four emulated cores is pretty much mindblowing. And if someone could explain that would be great. But 30.000 (or two Q6600) is just way too much.

  22. #72
    Xtreme crazy bastid
    Join Date
    Apr 2007
    Location
    On mah murder-sickle!
    Posts
    5,878
    Dave, could you grab us a screenshot from your WCG page showing the stats for this machine? I don't think I saw one before.

    [SIGPIC][/SIGPIC]

  23. #73
    Xtreme crazy bastid
    Join Date
    Apr 2007
    Location
    On mah murder-sickle!
    Posts
    5,878
    vOdka, based on the numbers we've seen so far that thing (with HT running) should be claiming over 43k (WCG that is) per day. It won't get it due to the quorum, but that's the expected claim.

    [SIGPIC][/SIGPIC]

  24. #74
    V3 Xeons coming soon!
    Join Date
    Nov 2005
    Location
    New Hampshire
    Posts
    36,363
    Quote Originally Posted by Drwho? View Post
    I am Francois, and I approve this message.
    (Private joke for American's )

    Core i7 on seti here : http://setiathome.berkeley.edu/top_h...edit&offset=20 the Who? machine! going up, in top 40 right now, I hope to get to top 5 with a single processor.
    Francois: We really need to get you onto WCG. SETI is Ok I guess but if ET wants to find us I'm sure he will.
    Finding cures for Cancer and Aids to me is all based on how much computational power we can throw at the problem.
    With your personal and corporate resources we could cut the timeframe.
    Think about it please.
    Quote Originally Posted by [XC] Jaco View Post
    I'd like to see the impact of memory timings with the on-die memory controller.

    Tony hinted the best performance is with low latency timings like 6-5-5 or 6-6-6
    Memory has always been my weakness so perhaps with some sharp guy like Tony I could get better performance out of this.
    I wonder what his telephone number is?
    Quote Originally Posted by D_A View Post
    Dave, could you grab us a screenshot from your WCG page showing the stats for this machine? I don't think I saw one before.
    They don't mean anything yet, wait a week and then we'll have good info.
    Quote Originally Posted by D_A View Post
    vOdka, based on the numbers we've seen so far that thing (with HT running) should be claiming over 43k (WCG that is) per day. It won't get it due to the quorum, but that's the expected claim.
    Pretty close, I saw app 40,800 as the claim but remember, BOINC NEEDS 5 solid days history with a machine to see that the claims are consistant OR it automatically takes the other machine in the quorum as the norm and you as the outliar.
    Now for today, the first "full" day just running WCG I see this for the 1/2 day update..

    Statistics
    Date.... Total Run Time Points Generated Results Returned
    11/3/08 0:005:00:06:49 16,356...................... 28


    based on the above being credit granted for essentially 5/8 of a full 24 hour day those numbers extrapolate to a granted credit of 26,170 points per day but remember, the machine is being killed in the quorum so I only expect those numbers to go up.
    Even if they don't it is by far the best performing quad that we've ever seen.
    Crunch with us, the XS WCG team
    The XS WCG team needs your support.
    A good project with good goals.
    Come join us,get that warm fuzzy feeling that you've done something good for mankind.

    Quote Originally Posted by Frisch View Post
    If you have lost faith in humanity, then hold a newborn in your hands.

  25. #75
    Xtreme crazy bastid
    Join Date
    Apr 2007
    Location
    On mah murder-sickle!
    Posts
    5,878
    There is another little thing about my projected claim. It assumes 24hrs of work. Every time the system completes a unit and grabs another it loses a couple of seconds, so unless the units take longer than 24hrs each to process there's some lost time. More units processed means more lost time but faster subsystems (using SATA2 instead of old IDE HDDs for example, but also memory speed etc) reduce that amount of lost time.
    I'm only talking seconds here, maybe one or two between units, but it adds up. Mind you, even if you doubled the total speed of all your subsystems you'd only gain a matter of a few seconds, maybe a minute, worth of actual work time ie stuff all over a day. I think (personal opinion) this is why there's no perceivable difference between single and multi-channel RAM *for this specific purpose*. Even if you could cut that delay down to a couple of milliseconds, you'd only gain a couple of minutes over a whole day.

    [SIGPIC][/SIGPIC]

Page 3 of 11 FirstFirst 123456 ... LastLast

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •