Page 2 of 3
Results 26 to 50 of 70

Thread: HP2 computation errors

  1. #26
    Xtreme Cruncher
    Join Date
    Feb 2007
    Location
    Benicia, CA
    Posts
    283
    Looking through my errors in more detail, I have come to a similar conclusion as Xenolith.
    Almost all of them errored out in .02 or similar.
    Longest was 2 1/2 hours and even that one granted near full points.

    I have re-enabled HP2 trusting that they are "on it" but will keep a closer eye on this going forward.
    Currently messing with:
    Intel DP55SB Sharpsberg
    i7-860 (stock) under a Zalman CNPS-8700NT cooler
    4x4GB Corsair Vengeance @ 1600
    XFX Radeon 5850 Black Edition (765, 1125 stock)
    Mushkin Chronos DX 240GB and 1.5 TB WD Black
    Powered by a Seasonic X-650 and stuffed into a Silverstone GD05 case



  2. #27
    Xtreme Mentor
    Join Date
    Apr 2004
    Location
    CT, USA!!!
    Posts
    2,821
    Hmmm, I wonder if my other errors have something to do with not installing Windows updates?

  3. #28
    Xtreme Legend
    Join Date
    Dec 2003
    Location
    Northern California
    Posts
    204,166
    I don't think Windows updates have any impact on errors. What I do know, however, is that shutting down or rebooting your computer too quickly can cause errors. This was a problem with Vista which has a tendency to shut down very fast. I have one computer that runs Vista. I always make a point of exiting BOINC first, then waiting ten seconds before rebooting.
    Audentes fortuna iuvat

    "Fortune favors the bold"

  4. #29
    Xtremely High Voltage Sparky's Avatar
    Join Date
    Mar 2006
    Location
    Ohio, USA
    Posts
    16,040
    Originally posted by sierra_bound:
    I don't think Windows updates have any impact on errors. What I do know, however, is that shutting down or rebooting your computer too quickly can cause errors. This was a problem with Vista which has a tendency to shut down very fast. I have one computer that runs Vista. I always make a point of exiting BOINC first, then waiting ten seconds before rebooting.
    Yeah Vista's fast shutdown would trip up F@H on my GPU all the time as well. At least it usually would restart where it left off but not always.
    The Cardboard Master
    Crunch with us, the XS WCG team
    Intel Core i7 2600k @ 4.5GHz, 16GB DDR3-1600, Radeon 7950 @ 1000/1250, Win 10 Pro x64

  5. #30
    Xtreme Mentor
    Join Date
    Apr 2004
    Location
    CT, USA!!!
    Posts
    2,821
    Is it typical to have a long list of pending validations for a new host?

  6. #31
    Xtreme Legend
    Join Date
    Dec 2003
    Location
    Northern California
    Posts
    204,166
    Yes, that's typical. Until you build up a queue you won't get a steady flow of validations. The reason for that is some work units get validated in a day, while others take a week or more. If you have only a few work units awaiting validation, it may be days before they are approved. You're somewhat at the mercy of the other person who's crunching the identical work unit. If he or she has a very slow machine, it will take longer for them to crunch and return that work unit. Worse yet, if that person has an error, then another identical work unit has to be sent out to another user.

  7. #32
    Xtreme Mentor
    Join Date
    Apr 2004
    Location
    CT, USA!!!
    Posts
    2,821
    Thanks for the help, S_B. You're always so helpful; no wonder you're the king.

  8. #33
    Xtreme Legend
    Join Date
    Dec 2003
    Location
    Northern California
    Posts
    204,166
    You're welcome. I just checked. I have 29 pages of work units awaiting validation. If you have a lot of crunchers, that's typical.

    Ideally, if you have a fast computer, you want the other guy or guys crunching the same work unit to have fast computers too. Unfortunately you're sometimes matched up with a person who's running a slow machine.

  9. #34
    Xtreme Legend
    Join Date
    Dec 2003
    Location
    Northern California
    Posts
    204,166
    One other thing about validations. Completed work units are validated in the order they're received. It's like standing in line outside a movie theater. You start out at the back of the line and gradually make your way to the front.

    On weekends, the validation process slows down for some reason and then picks up again when the work week starts. That's one reason why our team numbers tend to jump on Mondays.
    Last edited by sierra_bound; 04-29-2010 at 10:11 AM.

  10. #35
    Xtreme Mentor
    Join Date
    Apr 2004
    Location
    CT, USA!!!
    Posts
    2,821
    If you're waiting on slower computers to complete units, doesn't it make sense to keep a larger work buffer, so that by the time you start computing a unit, someone with a slower computer has already completed it?

  11. #36
    Xtreme Legend
    Join Date
    Dec 2003
    Location
    Northern California
    Posts
    204,166
    I'm not sure about that. There is one advantage to being paired with someone with a slow machine. Sometimes that other person will get a higher claimed credit than you do. If both work units are validated, BOINC awards credit based on an average of the two scores. That will boost the granted credit you're awarded.
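A minimal sketch of the averaging described above, assuming granted credit is simply the mean of the claimed credits of the validated results in a quorum. The function name and the sample claims are hypothetical, and the project's real scoring rules may differ from this simplification.

```python
from statistics import mean


def granted_credit(claimed_credits):
    """Average the claimed credits of the validated results in one quorum.

    Assumption: a plain mean, as described in the post. The real
    validator may apply other rules.
    """
    if not claimed_credits:
        raise ValueError("need at least one validated result")
    return mean(claimed_credits)


# Hypothetical claims: a fast machine claims 40, a slower partner claims 52.
print(granted_credit([40.0, 52.0]))  # 46.0
```

With these made-up numbers, the faster machine is granted 46 instead of the 40 it claimed, which is the boost the post refers to.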

    The quorum process is sort of like going on a blind date. Sometimes the other person turns out to be drop-dead gorgeous, while other times he or she may be the date from hell.

  12. #37
    Xtreme Mentor
    Join Date
    Apr 2004
    Location
    CT, USA!!!
    Posts
    2,821
    I'm going to try the 10-day buffer and see how it works out. Plus, it's been really windy the last couple of days, and I would hate for the cable to go out on me.

  13. #38
    Xtreme Legend
    Join Date
    Dec 2003
    Location
    Northern California
    Posts
    204,166
    I think 10 days is a little too much. 4-5 days should be enough unless you're anticipating a prolonged cable outage. The wind may end up knocking out your power as well.

  14. #39
    Xtreme Cruncher
    Join Date
    Jun 2007
    Location
    SK, Canada
    Posts
    836
    I haven't found much benefit in running a large cache in terms of reducing my pending validation queue. Once your machine's average turnaround time goes over 2 days, it is disqualified from receiving make-up or repair WUs, i.e. WUs that were sent out earlier but were not returned, or were returned with an error or as invalid. These WUs are validated and points awarded immediately when you return them, which helps reduce your PV queue. Another way to reduce PV is to run a zero-redundancy project like FAAH. There you will never have a PV queue, only a few results that are marked "inconclusive" and sent out to a second computer as a safety check on the work your machine sent in.
    i7 3970X @ 4500MHz 1.28v
    Asus Rampage IV Extreme
    4x4GB Corsair Dominator GT 2133MHz 9-11-10-27
    Gigabyte Windforce 7970 OC 3-way Crossfire
    Windows 7 Ultimate x64
    HK 3.0-MCP655-Phobya 400mm rad
    Corsair AX1200i
    SanDisk Extreme 240GB
    3x2TB WD Greens for storage
    TT Armor VA8003SWA





  15. #40
    Xtreme Addict
    Join Date
    Feb 2005
    Location
    Maine, USA
    Posts
    1,029
    Originally posted by sierra_bound:
    I think 10 days is a little too much. 4-5 days should be enough unless you're anticipating a prolonged cable outage. The wind may end up knocking out your power as well.
    Who lost their power today?

    <:::raises hand:::

    I had no crunching for five hours due to 60mph winds.

  16. #41
    Turkey Man
    Join Date
    Mar 2005
    Location
    Jakarta (ex-Australia)
    Posts
    2,560
    30 errors for this project since the 20th
    All failed very quickly, though, so I'm not sure I will bother dropping it.

  17. #42
    Xtreme Enthusiast
    Join Date
    Jun 2008
    Posts
    526
    I haven't been paying much attention recently and only just noticed this. I had a quick look at my stats over on WCG, and Human Proteome Folding has been churning out errors.

    Pretty sure it's not my system, as it's not OC'd at all.

  18. #43
    Xtreme Legend
    Join Date
    Mar 2008
    Location
    Plymouth (UK)
    Posts
    5,279
    I believe this is an OS problem rather than an OC problem...Vista and 7 are prone to errors


    My Biggest Fear Is When I die, My Wife Sells All My Stuff For What I Told Her I Paid For It.
    79 SB threads and 32 IB Threads across 4 rigs 111 threads Crunching!!

  19. #44
    Xtreme Member
    Join Date
    Dec 2006
    Location
    Denver, Co
    Posts
    233
    Thanks for the information. I just looked at my stats:
    63 Tasks Run
    9 in Progress
    4 Valid
    50 Error (Wow)

    In some cases the error tasks took 15 or more hours of CPU time. Roughly 2k of Wasted CPU time.
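A quick sketch of what those counts work out to as error rates. The function name and dictionary keys are made up for illustration; only the three counts (4 valid, 50 error, 9 in progress) come from the post.

```python
def error_rates(valid, error, in_progress):
    """Summarise task counts as error shares.

    share_of_all counts in-progress tasks in the denominator;
    share_of_finished only counts tasks that already ended (valid or error).
    """
    total = valid + error + in_progress
    finished = valid + error
    return {
        "total": total,
        "share_of_all": error / total,
        "share_of_finished": error / finished,
    }


stats = error_rates(valid=4, error=50, in_progress=9)
print(stats["total"])                       # 63
print(f"{stats['share_of_all']:.1%}")       # 79.4%
print(f"{stats['share_of_finished']:.1%}")  # 92.6%
```

The three counts add up to the 63 tasks run, and more than nine out of ten finished tasks ended in an error.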
    Crunchers up and running


  20. #45
    Xtreme Member
    Join Date
    Dec 2006
    Location
    Denver, Co
    Posts
    233
    Originally posted by OldChap:
    I believe this is an OS problem rather than an OC problem...Vista and 7 are prone to errors
    I would counter this because I am running Linux on 3 crunchers that have generated errors. Only my primary computer is running Vista 64 and results from it have about the same mixture of being valid vs error status.


  21. #46
    Xtreme Addict
    Join Date
    Feb 2005
    Location
    Maine, USA
    Posts
    1,029
    Originally posted by Wolfsburg18:
    I would counter this because I am running Linux on 3 crunchers that have generated errors. Only my primary computer is running Vista 64 and results from it have about the same mixture of being valid vs error status.
    By what you're describing, I would say the version of Linux you're using is by far worse than any other OS.

    50 returned error WUs totaling 2000 CPU hours? That can't be right.

  22. #47
    Admin
    Join Date
    Feb 2005
    Location
    Ann Arbor, MI
    Posts
    12,338
    If you're getting any errors, it's definitely worth dropping! The erroneous units may only report using a few minutes of CPU time, but I know my CPU hours on a fully dedicated quad core were around 56 hrs/day when it was enabled. I disabled it and am averaging 95-96 hrs/day now.
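For scale, a rough sketch of the arithmetic behind those figures, assuming a quad core can report at most 4 x 24 = 96 CPU hours per day. The function names are made up for illustration.

```python
HOURS_PER_DAY = 24


def max_cpu_hours(cores):
    """Upper bound on CPU hours a machine can report in one day."""
    return cores * HOURS_PER_DAY


def utilization(reported_hours, cores):
    """Fraction of the daily CPU-hour ceiling that was actually reported."""
    return reported_hours / max_cpu_hours(cores)


print(max_cpu_hours(4))              # 96
print(f"{utilization(56, 4):.0%}")   # 58%
print(f"{utilization(95, 4):.0%}")   # 99%
```

Under that assumption, roughly 40 CPU hours a day went unreported while the project was enabled, even though each failed unit only showed a few minutes of CPU time.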

  23. #48
    Xtreme crazy bastid
    Join Date
    Apr 2007
    Location
    On mah murder-sickle!
    Posts
    5,878
    Originally posted by xenolith:
    By what you're describing, I would say the version of Linux you're using is by far worse than any other OS.

    50 returned error WUs totaling 2000 CPU hours? That can't be right.
    It's the units, not the OS. If you re-read his post, he says the Vista machine is getting the same mixture of errors and valids.


  24. #49
    Xtreme Addict
    Join Date
    Feb 2005
    Location
    Maine, USA
    Posts
    1,029
    Originally posted by D_A:
    It's the units, not the OS. If you re-read his post, he says the Vista machine is getting the same mixture of errors and valids.
    I understand it's the WUs not the OS. But by his initial description of 50 WU errors wasting 2000 CPU hours, that by far is the worst I've seen, regardless of OS. (I've been looking around at a few places concerning this common error.)

    Yesterday, I dropped HP2 just to be done with it...

  25. #50
    Xtreme crazy bastid
    Join Date
    Apr 2007
    Location
    On mah murder-sickle!
    Posts
    5,878
    I suffered through the errors long enough to get my badge, but I won't touch it after that. Too many errors, many, many reports, and the HPF2 team still won't do anything about it. By comparison, FAAH seems almost bulletproof. Can't kill it with a stick.

