MMM
Page 1 of 3 123 LastLast
Results 1 to 25 of 70

Thread: HP2 computation errors

  1. #1
    Xtreme Legend
    Join Date
    Dec 2003
    Location
    Northern California
    Posts
    204,166

    HP2 computation errors

    Just a warning for those of you who've just started crunching. The Human Proteome 2 project has been known to cause errors. There are numerous threads at the WCG forum on this problem.

    http://www.worldcommunitygrid.org/fo...d_thread,26706

    Hypernova says he's seen fewer errors on this project using an i7-980X.

    http://www.worldcommunitygrid.org/fo...fset,80#277196

    Still, my advice to people is that unless you feel a strong need to run this project, uncheck it in your Device Profiles section.
    Attached Images Attached Images
    Audentes fortuna iuvat

    "Fortune favors the bold"

  2. #2
    Xtreme Legend
    Join Date
    Mar 2008
    Location
    Plymouth (UK)
    Posts
    5,279
    Thanks Sierra (this works too)


    My Biggest Fear Is When I die, My Wife Sells All My Stuff For What I Told Her I Paid For It.
    79 SB threads and 32 IB Threads across 4 rigs 111 threads Crunching!!

  3. #3
    Xtreme Cruncher
    Join Date
    Jun 2007
    Location
    SK, Canada
    Posts
    836
    Yeah this project has many issues, especially with Vista and Win 7. Older OS's were OK but you'll still get the occasional WU that will "hang" at a certain % and stay there until it times out which is annoying as heck, not to mention wasteful.
    i7 3970X @ 4500MHz 1.28v
    Asus Rampage IV Extreme
    4x4GB Corsair Dominator GT 2133MHz 9-11-10-27
    Gigabyte Windforce 7970 OC 3-way Crossfire
    Windows 7 Ultimate x64
    HK 3.0-MCP655-Phobya 400mm rad
    Corsair AX1200i
    Sandisk Exrtreme 240GB
    3x2TB WD Greens for storage
    TT Armor VA8003SWA





  4. #4
    Crunching For The Points! NKrader's Avatar
    Join Date
    Dec 2005
    Location
    Renton WA, USA
    Posts
    2,891
    yikes hope i havnt wasted much time..

  5. #5
    Xtreme Legend
    Join Date
    Dec 2003
    Location
    Northern California
    Posts
    204,166
    You may be one of the lucky ones who isn't getting errors. If that's the case, there's no reason to stop running HP2.

    You can check how many errors you're getting by going to your My Grid page. On the left-hand side, click Results Status. Then, on the second drop-down menu, go to Error and then click Filter.
    Attached Images Attached Images
    Audentes fortuna iuvat

    "Fortune favors the bold"

  6. #6
    Crunching For The Points! NKrader's Avatar
    Join Date
    Dec 2005
    Location
    Renton WA, USA
    Posts
    2,891
    Quote Originally Posted by sierra_bound View Post
    You may be one of the lucky ones who isn't getting errors. If that's the case, there's no reason to stop running HP2.

    You can check how many errors you're getting by going to your My Grid. On the left-hand side, click Results Status. Then, on to second drop-down menu, go to Error and then click Filter.
    ah i saw i had like 253 returned results..


    hmmm no errors on QX6700 @ 3.5
    and i have like P4 and a Dualcore all running differant OS yay for me time to check that box again

  7. #7
    Xtreme Legend
    Join Date
    Dec 2003
    Location
    Northern California
    Posts
    204,166
    253 returned results or 253 work units with errors?
    Audentes fortuna iuvat

    "Fortune favors the bold"

  8. #8
    Crunching For The Points! NKrader's Avatar
    Join Date
    Dec 2005
    Location
    Renton WA, USA
    Posts
    2,891
    Quote Originally Posted by sierra_bound View Post
    253 returned results or 253 work units with errors?
    returned results out of the ammount of points in my siggy i only have 3 errors total from all projects.

  9. #9
    Xtreme Mentor
    Join Date
    Apr 2004
    Location
    CT, USA!!!
    Posts
    2,821
    120 Errors since 4/12 works out to roughly 1400-1500 BOINC ppd.
    Last edited by =[PULSAR]=; 04-27-2010 at 12:09 PM.

  10. #10
    Xtreme Addict
    Join Date
    Feb 2005
    Location
    Maine, USA
    Posts
    1,029
    My HP2 project participation has returned 119 errors, 25 pending WUs, 257 verified WUs, and 0 invalid WUs in the last 13 days since I've joined the team. That's about a 30% error rate. All of them have returned an error within 0.05 CPU time hours except for one WU which returned an error after 32.64 hours. The following is the claimed/granted BOINC credits for WUs that returned errors: 836.25/47.75 (take out that one bad 32.64 hour WU and it would have been 55.75/47.75).

    I may be wrong, but I don't think the amount of wasted time I have had yet justifies discontinuing my participation in the HP2 project. Although, I'll have to keep a close eye on this because if I get another WU error that wastes 32+ hours again, I'll have to reevaluate my decision.

    This is on two Windows 7 machines w/ i7-860 and i7-720QM.

  11. #11
    Xtreme crazy bastid
    Join Date
    Apr 2007
    Location
    On mah murder-sickle!
    Posts
    5,878
    HPF2 is ... rather error prone on a lot of machines. I refuse to run it to prevent wasting my time as well as a form of protest. A lot of people, I suspect, never check their results status and so never realize they're getting errors on this project.

    [SIGPIC][/SIGPIC]

  12. #12
    Xtreme Cruncher
    Join Date
    Feb 2007
    Location
    Benicia, CA
    Posts
    283
    Wow, ugly.
    Two pages of HP2 errors for me.
    Removed from my projects.

    Thanks for the info.
    Currently messing with:
    Intel DP55SB Sharpsberg
    i7-860 (stock) under a Zalman CNPS-8700NT cooler
    4x4GB Corsair Vengence @ 1600
    XFX Radeon 5850 Black Edition (765, 1125 stock)
    Mushkin Chronos DX 240GB and 1.5 TB WD Black
    Powered by a Seasonic X-650 and stuffed into a Silverstone GD05 case



  13. #13
    Xtreme Legend
    Join Date
    Dec 2003
    Location
    Northern California
    Posts
    204,166
    I will sticky this thread for a week or two until everyone has a chance to see it.
    Audentes fortuna iuvat

    "Fortune favors the bold"

  14. #14
    Xtreme Cruncher
    Join Date
    May 2008
    Location
    Roswell
    Posts
    479
    The guys at WCG made some adjustments to the science application (version 6.17) and were able to significantly reduce the errors with that in the last beta test. The results are currently reviewed by the scientists to ensure everything is OK.

    Here are two quotes from uplinger:

    http://www.worldcommunitygrid.org/fo...d_thread,28940
    Sek, the slow part of the link was me on the beta test. The results on the beta test were not 100% complete so I was waiting on that before sending them the results. They recently have received the results and I have asked them to evaluate them and let us know if the results look good. We need to give them time to evaluate the results to make sure everything science wise is still good.
    http://www.worldcommunitygrid.org/fo...8872_offset,20
    The beta so far has been a good increase over what we saw in production for the same batch.
    There were 57 exit 1's for BETA compared to 609 for production.
    We are going to send this information to the researchers later today when a few more results come back in foor them to review.
    I can't wait to turn it on again
    Last edited by mreuter80; 04-27-2010 at 02:38 PM.

  15. #15
    Xtremely High Voltage Sparky's Avatar
    Join Date
    Mar 2006
    Location
    Ohio, USA
    Posts
    16,040
    What gets me is I currently have 10 errored units listed, all from my main rig, none HP2 as that is unchecked. And my system is stock
    The Cardboard Master
    Crunch with us, the XS WCG team
    Intel Core i7 2600k @ 4.5GHz, 16GB DDR3-1600, Radeon 7950 @ 1000/1250, Win 10 Pro x64

  16. #16
    Xtreme Mentor
    Join Date
    Apr 2004
    Location
    CT, USA!!!
    Posts
    2,821
    Which project are they from? No random crashes or system reboots?

  17. #17
    Xtremely High Voltage Sparky's Avatar
    Join Date
    Mar 2006
    Location
    Ohio, USA
    Posts
    16,040
    No system appears rock solid. Never had it do anything strange. They are from random projects that I'm running, cancer and FAAH and such.
    The Cardboard Master
    Crunch with us, the XS WCG team
    Intel Core i7 2600k @ 4.5GHz, 16GB DDR3-1600, Radeon 7950 @ 1000/1250, Win 10 Pro x64

  18. #18
    Xtreme Cruncher
    Join Date
    Jun 2007
    Location
    SK, Canada
    Posts
    836
    You might want to do some stability testing just to be sure there's no issues with the rig. An hour of OCCT Linpack will usually turn up anything that's amiss. Also, if you use antivirus software, go in and exclude the BOINC folders(there's 2 of them) from scanning. This has been known to corrupt WU's too.
    i7 3970X @ 4500MHz 1.28v
    Asus Rampage IV Extreme
    4x4GB Corsair Dominator GT 2133MHz 9-11-10-27
    Gigabyte Windforce 7970 OC 3-way Crossfire
    Windows 7 Ultimate x64
    HK 3.0-MCP655-Phobya 400mm rad
    Corsair AX1200i
    Sandisk Exrtreme 240GB
    3x2TB WD Greens for storage
    TT Armor VA8003SWA





  19. #19
    Xtreme Addict
    Join Date
    Feb 2005
    Location
    Maine, USA
    Posts
    1,029
    My understanding is this:

    error = most likely WCG WU fault (if they're uncommon or have low reproducibility user system fault could be possible cause)
    invalid = 100% user system fault (usually due to system instability)

  20. #20
    Xtreme Legend
    Join Date
    Dec 2003
    Location
    Northern California
    Posts
    204,166
    Invalid means the work unit was completed and returned to WCG, but was not found to be similar to the other identical work units that were turned in. So the result gets tossed out.

    Errors can be caused by various things - the user aborted the work unit, the work unit itself is bad, or system instability.

    A result could also be called "inconclusive". What that means is that the validation program could not determine which of the results returned are valid. When that happens, the same work unit is sent out to more computers and a second validation will be attempted when they're turned in.
    Audentes fortuna iuvat

    "Fortune favors the bold"

  21. #21
    Xtreme Mentor
    Join Date
    Jan 2009
    Location
    Oslo - Norway
    Posts
    2,879
    Good to hear it. I've got totally 8 errors, but 7 of them belongs this project. I thought something on my system didn't like them :noob:. It is good to know it wasn't only me.

    Last edited by Sam_oslo; 04-27-2010 at 11:06 PM.

    ASUS P8P67 Deluxe (BIOS 1305)
    2600K @4.5GHz 1.27v , 1 hour Prime
    Silver Arrow , push/pull
    2x2GB Crucial 1066MHz CL7 ECC @1600MHz CL9 1.51v
    GTX560 GB OC @910/2400 0.987v
    Crucial C300 v006 64GB OS-disk + F3 1TB + 400MB RAMDisk
    CM Storm Scout + Corsair HX 1000W
    +
    EVGA SR-2 , A50
    2 x Xeon X5650 @3.86GHz(203x19) 1.20v
    Megahalem + Silver Arrow , push/pull
    3x2GB Corsair XMS3 1600 CL7 + 3x4GB G.SKILL Trident 1600 CL7 = 18GB @1624 7-8-7-20 1.65v
    XFX GTX 295 @650/1200/1402
    Crucial C300 v006 64GB OS-disk + F3 1TB + 2GB RAMDisk
    SilverStone Fortress FT01 + Corsair AX 1200W

  22. #22
    Xtreme Enthusiast
    Join Date
    Apr 2008
    Location
    France
    Posts
    950
    I had lot of errors on my main rig, found out by searching they were related to that project and not to my overclocking (which should be stabilized by now at 4.15Ghz stage).

    24/7 running quiet and nice

  23. #23
    Xtreme crazy bastid
    Join Date
    Apr 2007
    Location
    On mah murder-sickle!
    Posts
    5,878
    A rig can seem stable for general use and still return errors with WCG. If all your trouble was HPF2, then yes, drop that project. If you were getting errors across the board, drop your clock slightly.

    [SIGPIC][/SIGPIC]

  24. #24
    Xtreme Enthusiast
    Join Date
    Apr 2008
    Location
    France
    Posts
    950
    All errors were on HPF2 (0.02 cpu time on units etc.) and were the errors reported on the WCG thread. And my rig has gone under some tough stability tests (not general use by far) like Linpack, etc.. BOINC actually produces a BSOD if something's not right, as i found out
    I double checked by going to stock clocks, and still getting the errors on that project... Not disabling it for now as units that produce errors only run for a few seconds, not much of a loss.

    24/7 running quiet and nice

  25. #25
    Xtreme Cruncher
    Join Date
    Dec 2008
    Location
    The Netherlands
    Posts
    896
    I've been crunching almost a week now I think. I only got 1 error so far, which errored out in less than a minute, so I'm leaving it enabled now. Q9650 with Windows 7 x65 pro

Page 1 of 3 123 LastLast

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •