World Community Grid Forums
Thread Status: Active Total posts in this thread: 27
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
With all the eager beavers on the board, I too think that aborting this one and letting the techs/scientists deal with the cause analysis is the best action... unless -Uplinger protests.
----------------------------------------
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
TBirdTheYuri
Advanced Cruncher France Joined: Mar 5, 2006 Post Count: 115 Status: Offline |
I have cancelled unit erlc_e088_ps0000 because its progress was stuck at 1.6% after 14 hours.
Another unit, erlc_b101_ps0000, terminated with an error after 29 hours of computing. |
||
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline |
mweisensee,
Feel free to abort. -Uplinger |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
> mweisensee, Feel free to abort. -Uplinger

Hmmm, my assumption was wrong. Meanwhile the third repair WU (erlc_b124_ps0000_4) succeeded. So I resumed this WU, and at 47% it is now well past the 42.32% mark where all the other WUs errored. Maybe it was a resource- or computer-dependent issue (here it runs on an Athlon 7750). I will continue making backups every 15%, but if there is no error/restore it will be complete before the deadline applies. |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Normally, if BOINC finds that the running tasks' total memory needs exceed the allowed or available amount, it will start pausing tasks. The task's Status column should highlight that with a "waiting for memory" kind of warning, which is also logged in the Messages tab, but I guess that was not the issue.
----------------------------------------
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
> Normally, if BOINC finds that the running tasks' total memory needs exceed the allowed or available amount, it will start pausing tasks. The task's Status column should highlight that with a "waiting for memory" kind of warning, which is also logged in the Messages tab, but I guess that was not the issue.

Sorry, I can't return it in time. Last night it had an error at 91% after running 21% without a backup (yes, I have to sleep sometimes...). So I restarted from the last backup and now save it every 4-8 percent. It has now passed the 91% mark but will finish after the deadline, so one more replacement WU will be sent. :-( But I still hope to finish it today. It seems to be a really resource-hungry WU: I tried it on another system with 1.5 GB RAM and a 5 GB swap file, and it had an error every 8%... but without any message log entry or warning line. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Phew, it finally completed and got validated. That was really hard work! Running it without interruption wouldn't have been possible on my computers! I guess that was the reason for those three WUs with errors. Maybe with restarts they could have finished it as well...
I hope that this remains the only WU with such a challenging behaviour - at least for me. ;-) |
||
|