World Community Grid Forums
Thread Status: Active Total posts in this thread: 6
cjslman
Master Cruncher Mexico Joined: Nov 23, 2004 Post Count: 2082 Status: Offline |
I had a CEP2 WU that ended in error (error log listed below). Any idea why it happened? Now all my WUs are in PV jail
----------------------------------------
Thanks, CJSL

Result Log
Result Name: E220635_ 764_ K.24.C18FH8NOS2Se.00222203.4.set1d06_ 0--
<core_client_version>7.2.47</core_client_version>
<![CDATA[
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[14:42:21] Number of jobs = 16
[14:42:21] Starting job 0,CPU time has been restored to 0.000000.
[14:44:34] Finished Job #0
[14:44:34] Starting job 1,CPU time has been restored to 107.063486.
[14:51:56] Finished Job #1
[14:51:56] Starting job 2,CPU time has been restored to 483.478299.
[16:37:48] Finished Job #2
[16:37:48] Starting job 3,CPU time has been restored to 5974.526298.
[16:46:08] Finished Job #3
[16:46:08] Starting job 4,CPU time has been restored to 6388.396951.
[16:51:55] Finished Job #4
[16:51:55] Starting job 5,CPU time has been restored to 6690.368087.
[16:57:53] Finished Job #5
[16:57:53] Starting job 6,CPU time has been restored to 6998.064459.
[17:03:41] Finished Job #6
[17:03:41] Starting job 7,CPU time has been restored to 7295.683167.
[17:11:02] Finished Job #7
[17:11:02] Starting job 8,CPU time has been restored to 7678.260019.
[17:16:55] Finished Job #8
[17:16:55] Starting job 9,CPU time has been restored to 7986.658396.
[17:22:45] Finished Job #9
[17:22:45] Starting job 10,CPU time has been restored to 8285.010309.
[17:34:25] Finished Job #10
[17:34:25] Starting job 11,CPU time has been restored to 8900.387453.
[17:42:08] Finished Job #11
[17:42:08] Starting job 12,CPU time has been restored to 9294.898382.
[18:29:27] Finished Job #12
[18:29:27] Starting job 13,CPU time has been restored to 11770.088249.
[20:04:08] Finished Job #13
[20:04:08] Starting job 14,CPU time has been restored to 16745.833744.
[21:35:37] Finished Job #14
[21:35:37] Starting job 15,CPU time has been restored to 21552.630157.
[23:25:01] Finished Job #15
23:25:10 (29856): called boinc_finish
</stderr_txt>
]]>
----------------------------------------
[Edit 1 times, last edit by cjslman at Apr 8, 2014 11:53:17 AM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hello cjslman,
I do not see any error in the part that you pasted. What is the error? Lawrence |
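For anyone who wants to double-check a result log like the one above without reading it line by line, here is a rough Python sketch. It only assumes the line formats visible in the pasted log ("Number of jobs = N", "Starting job N", "Finished Job #N", "called boinc_finish"); the function name `summarize_stderr` is made up for this example and is not part of BOINC or the CEP2 application. The sample text is an abbreviated excerpt of the pasted log (jobs 0, 1 and 15 only).

```python
import re

def summarize_stderr(text):
    """Summarize a CEP2-style stderr log (hypothetical helper, not a BOINC API)."""
    declared = re.search(r"Number of jobs = (\d+)", text)
    started = {int(n) for n in re.findall(r"Starting job (\d+)", text)}
    finished = {int(n) for n in re.findall(r"Finished Job #(\d+)", text)}
    return {
        # Number of jobs the log says it will run, if that line is present
        "declared": int(declared.group(1)) if declared else None,
        # Jobs that have a "Starting" line but no matching "Finished" line
        "unfinished": sorted(started - finished),
        "finished_count": len(finished),
        # True when the application reached its normal exit call
        "boinc_finish_called": "called boinc_finish" in text,
    }

# Abbreviated excerpt of the log pasted in this thread (not the full log)
sample = """\
[14:42:21] Number of jobs = 16
[14:42:21] Starting job 0,CPU time has been restored to 0.000000.
[14:44:34] Finished Job #0
[14:44:34] Starting job 1,CPU time has been restored to 107.063486.
[14:51:56] Finished Job #1
[21:35:37] Starting job 15,CPU time has been restored to 21552.630157.
[23:25:01] Finished Job #15
23:25:10 (29856): called boinc_finish
"""

summary = summarize_stderr(sample)
print(summary)
```

An empty "unfinished" list together with "boinc_finish_called" being true suggests the task itself ran to completion, so the cause of an "Error" status would have to be looked for elsewhere, for example in the client's Event Log.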
||
|
cjslman
Master Cruncher Mexico Joined: Nov 23, 2004 Post Count: 2082 Status: Offline |
You mean that there is no error in the results log? What I pasted is all that there is in the result log of the following WU (in blue):
----------------------------------------
E220635_ 764_ K.24.C18FH8NOS2Se.00222203.4.set1d06_ 0-- R8XZ4P5 Error 4/4/14 17:06:58 4/6/14 08:01:20 7.58 / 7.86 131.0 / 66.5
Any further ideas? It would be really bad if a good WU was marked as an errored WU.
Thanks, CJSL
Crunching for a better future.... |
||
|
Paul Schlaffer
Senior Cruncher USA Joined: Jun 12, 2005 Post Count: 244 Status: Offline |
I had 8 WUs validate with "error" across two machines. I'm guessing it had to do with the server issue, as this occurred during the same time frame, and "errors" on these machines have otherwise been non-existent.
----------------------------------------
4/5/2014 9:49:34 PM | World Community Grid | [error] Error reported by file upload server: can't write file /usr/local/boinc/data/upload/39b/E220657_667_K.25.C18FH8N3S3.00498633.0.set1d06_0_1: No space left on server
4/5/2014 10:05:33 PM | World Community Grid | [error] Error reported by file upload server: Server is out of disk space
“Where an excess of power prevails, property of no sort is duly respected. No man is safe in his opinions, his person, his faculties, or his possessions.” – James Madison (1792) |
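If you have copied the Event Log out of the client as text, something like the following Python sketch could pull out just the error lines. It assumes each line has the "timestamp | project | message" shape seen in the two lines quoted above; the function name `error_lines` and the non-error line in the sample are made up for illustration.

```python
def error_lines(event_log_text, project=None):
    """Return (timestamp, project, message) for lines whose message has "[error]".

    Hypothetical helper: assumes "timestamp | project | message" lines as
    quoted in this thread. Lines that do not fit that shape are skipped.
    """
    hits = []
    for line in event_log_text.splitlines():
        # Split on the first two "|" only, so a "|" inside the message survives
        parts = [p.strip() for p in line.split("|", 2)]
        if len(parts) != 3:
            continue
        timestamp, proj, message = parts
        if "[error]" not in message:
            continue
        if project is not None and proj != project:
            continue
        hits.append((timestamp, proj, message))
    return hits

# Two lines quoted from this thread, plus one invented non-error line
sample_log = """\
4/5/2014 9:40:00 PM | World Community Grid | (hypothetical non-error message, for illustration only)
4/5/2014 9:49:34 PM | World Community Grid | [error] Error reported by file upload server: can't write file /usr/local/boinc/data/upload/39b/E220657_667_K.25.C18FH8N3S3.00498633.0.set1d06_0_1: No space left on server
4/5/2014 10:05:33 PM | World Community Grid | [error] Error reported by file upload server: Server is out of disk space
"""

errors = error_lines(sample_log, project="World Community Grid")
for timestamp, proj, message in errors:
    print(timestamp, "->", message)
```

Comparing the timestamps of these lines with the return time of the errored result is one way to see whether the two fall in the same window.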
||
|
cjslman
Master Cruncher Mexico Joined: Nov 23, 2004 Post Count: 2082 Status: Offline |
Ah.. I see... I should have looked to find out what failure/error was listed in the Event Log. I didn't think about that (duh)
----------------------------------------
Thanks, CJSL
Crunching for a better world... |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I had one like that too. While it may be correct to say that it happened when the server issues started, I don't think that that is a proper explanation.
Personally, I suspect that there is some bug in the validation process that has been triggered as a result of the server problems. However, as the number of WUs experiencing the problem seems to be small, it will not be worthwhile to look for and correct it.
Crunching on (assuming that the techs can keep the server issues under control -- but what a great problem to have!) |
||