Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Completed Research Forum: The Clean Energy Project - Phase 2 Forum Thread: Odd short result |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 9
|
Author |
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7219 Status: Offline Project Badges: |
This was an exceptionally short CEP2 unit which skipped most of the jobs but was still valid. Has anyone else seen something like this ? I see it exited with the RC=0x100 after Job 0. The log for the second job is the same.
----------------------------------------Result Log Result Name: E236237_ 95_ S.208.C18H11N1S3Se1.SKFRCANOXKSRJE-UHFFFAOYSA-N.19_ s1_ 14_ 1-- <core_client_version>7.2.7</core_client_version> <![CDATA[ <stderr_txt> INFO: No state to restore. Start from the beginning. [12:04:15] Number of jobs = 8 [12:04:15] Starting job 0,CPU time has been restored to 0.000000. [12:04:15] Starting new Job [12:04:15] Qink name = fldman [12:04:16] Qink name = gesman [12:04:16] Qink name = scfman Application exited with RC = 0x100 [12:42:39] Finished Job #0 [12:42:39] Starting job 1,CPU time has been restored to 2129.169075. [12:42:39] Skipping Job #1 [12:42:39] Starting job 2,CPU time has been restored to 2129.169075. [12:42:39] Skipping Job #2 [12:42:39] Starting job 3,CPU time has been restored to 2129.169075. [12:42:39] Skipping Job #3 [12:42:39] Starting job 4,CPU time has been restored to 2129.169075. [12:42:39] Skipping Job #4 [12:42:39] Starting job 5,CPU time has been restored to 2129.169075. [12:42:39] Skipping Job #5 [12:42:39] Starting job 6,CPU time has been restored to 2129.169075. [12:42:39] Skipping Job #6 [12:42:39] Starting job 7,CPU time has been restored to 2129.169075. [12:42:39] Skipping Job #7 12:42:40 (3652): called boinc_finish Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
SekeRob
Master Cruncher Joined: Jan 7, 2013 Post Count: 2741 Status: Offline |
Not seen, but did the C18 not stand for a small mol?
The E236237 is a later batch number than the first [baddish] February beta, with 8 jobs, where the current beta has 5 in them. Could we be getting 2 different task composites in production? |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7219 Status: Offline Project Badges: |
Not seen, but did the C18 not stand for a small mol? The E236237 is a later batch number than the first [baddish] February beta, with 8 jobs, where the current beta has 5 in them. Could we be getting 2 different task composites in production? Even if it is a small molecule as indicated by the C18, it still seems strange it would not want to complete the remaining jobs. Just weird I guess. Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
BobCat13
Senior Cruncher Joined: Oct 29, 2005 Post Count: 295 Status: Offline Project Badges: |
I have had two like that on linux.
Result Name: E236243_ 150_ S.262.C23H17N3O1S1Se1Si2.MFCNGWKPOVCPJW-UHFFFAOYSA-N.8_ s1_ 14_ 1-- <core_client_version>7.4.41</core_client_version> <![CDATA[ <stderr_txt> INFO: No state to restore. Start from the beginning. [20:09:13] Number of jobs = 8 [20:09:13] Starting job 0,CPU time has been restored to 0.000000. [20:09:13] Starting new Job [20:09:13] Qink name = fldman [20:09:13] Qink name = gesman [20:09:14] Qink name = scfman Application exited with RC = 0x100 [21:23:00] Finished Job #0 [21:23:00] Starting job 1,CPU time has been restored to 4183.085217. [21:23:00] Skipping Job #1 [21:23:00] Starting job 2,CPU time has been restored to 4183.085217. [21:23:00] Skipping Job #2 [21:23:00] Starting job 3,CPU time has been restored to 4183.085217. [21:23:00] Skipping Job #3 [21:23:00] Starting job 4,CPU time has been restored to 4183.085217. [21:23:00] Skipping Job #4 [21:23:00] Starting job 5,CPU time has been restored to 4183.085217. [21:23:00] Skipping Job #5 [21:23:00] Starting job 6,CPU time has been restored to 4183.085217. [21:23:00] Skipping Job #6 [21:23:00] Starting job 7,CPU time has been restored to 4183.085217. [21:23:00] Skipping Job #7 21:23:01 (6038): called boinc_finish </stderr_txt> ]]> Result Name: E236237_ 805_ S.222.C20H14N2O1S2Se1.SOIYNPOHNVVXDQ-UHFFFAOYSA-N.10_ s1_ 14_ 1-- <core_client_version>7.4.41</core_client_version> <![CDATA[ <stderr_txt> INFO: No state to restore. Start from the beginning. [12:01:29] Number of jobs = 8 [12:01:29] Starting job 0,CPU time has been restored to 0.000000. [12:01:30] Starting new Job [12:01:30] Qink name = fldman [12:01:30] Qink name = gesman [12:01:30] Qink name = scfman Application exited with RC = 0x100 [12:37:51] Finished Job #0 [12:37:51] Starting job 1,CPU time has been restored to 2026.935196. [12:37:51] Skipping Job #1 [12:37:51] Starting job 2,CPU time has been restored to 2026.935196. [12:37:51] Skipping Job #2 [12:37:51] Starting job 3,CPU time has been restored to 2026.935196. [12:37:51] Skipping Job #3 [12:37:51] Starting job 4,CPU time has been restored to 2026.935196. [12:37:51] Skipping Job #4 [12:37:51] Starting job 5,CPU time has been restored to 2026.935196. [12:37:51] Skipping Job #5 [12:37:51] Starting job 6,CPU time has been restored to 2026.935196. [12:37:51] Skipping Job #6 [12:37:51] Starting job 7,CPU time has been restored to 2026.935196. [12:37:51] Skipping Job #7 12:37:51 (1950): called boinc_finish </stderr_txt> ]]> |
||
|
SekeRob
Master Cruncher Joined: Jan 7, 2013 Post Count: 2741 Status: Offline |
Is there truly a checkpoint logged? (asking because the exited entry is recorded before the Finished)
Application exited with RC = 0x100 [12:37:51] Finished Job #0 |
||
|
adriverhoef
Master Cruncher The Netherlands Joined: Apr 3, 2009 Post Count: 1978 Status: Offline Project Badges: |
Here also seen, this is from the Workunit Status page:
E236238_403_S.222.C18H11N3O2S2Se1.LQFCAXLFPCDMRZ-UHFFFAOYSA-N.5_s1_14_1-- Linux 4.4.1-1-default and this is from both Result Logs: Result Name: E236238_ 403_ S.222.C18H11N3O2S2Se1.LQFCAXLFPCDMRZ-UHFFFAOYSA-N.5_ s1_ 14_ 1-- <core_client_version>7.6.21</core_client_version> <![CDATA[ <stderr_txt> INFO: No state to restore. Start from the beginning. [07:56:42] Number of jobs = 8 [07:56:42] Starting job 0,CPU time has been restored to 0.000000. [07:56:42] Starting new Job [07:56:42] Qink name = fldman [07:56:42] Qink name = gesman [07:56:42] Qink name = scfman Application exited with RC = 0x100 [08:45:19] Finished Job #0 [08:45:19] Starting job 1,CPU time has been restored to 2725.324000. [08:45:19] Skipping Job #1 etc. Result Name: E236238_ 403_ S.222.C18H11N3O2S2Se1.LQFCAXLFPCDMRZ-UHFFFAOYSA-N.5_ s1_ 14_ 0-- <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> INFO: No state to restore. Start from the beginning. [15:42:35] Number of jobs = 8 [15:42:35] Starting job 0,CPU time has been restored to 0.000000. [15:42:38] Starting new Job [15:42:39] Qink name = fldman [15:42:40] Qink name = gesman [15:42:40] Qink name = scfman Application exited with RC = 0x100 [16:34:59] Finished Job #0 [16:34:59] Starting job 1,CPU time has been restored to 1972.730555. [16:34:59] Skipping Job #1 etc. |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7219 Status: Offline Project Badges: |
Is there truly a checkpoint logged? (asking because the exited entry is recorded before the Finished) Application exited with RC = 0x100 [12:37:51] Finished Job #0 I am thinking the answer to that question would be "no," but the WU being so short, I never checked. Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
BobCat13
Senior Cruncher Joined: Oct 29, 2005 Post Count: 295 Status: Offline Project Badges: |
Is there truly a checkpoint logged? (asking because the exited entry is recorded before the Finished) Application exited with RC = 0x100 [12:37:51] Finished Job #0 I don't think there was a checkpoint. I believe [12:37:51] Finished Job #0 happens immediately after Application exited with RC = 0x100. |
||
|
deltavee
Ace Cruncher Texas Hill Country Joined: Nov 17, 2004 Post Count: 4835 Status: Offline Project Badges: |
I'm getting lots of very short runtime CEP2 WUs which exit with the RC=0x100 after Job 0, and with exited entry before the finish. All Valid. Can't catch one that has checkpointed. They all look like this:
----------------------------------------Result Name: E236254_ 443_ S.170.C11H5N7S1Se1.QTDDOEFUWSAYBM-UHFFFAOYSA-N.11_ s1_ 14_ 0-- <core_client_version>7.6.22</core_client_version> <![CDATA[ <stderr_txt> INFO: No state to restore. Start from the beginning. [18:35:32] Number of jobs = 8 [18:35:32] Starting job 0,CPU time has been restored to 0.000000. Application exited with RC = 0x1 [19:19:12] Finished Job #0 [19:19:12] Starting job 1,CPU time has been restored to 2572.343750. [19:19:12] Skipping Job #1 [19:19:12] Starting job 2,CPU time has been restored to 2572.343750. [19:19:12] Skipping Job #2 [19:19:12] Starting job 3,CPU time has been restored to 2572.343750. [19:19:12] Skipping Job #3 [19:19:12] Starting job 4,CPU time has been restored to 2572.343750. [19:19:12] Skipping Job #4 [19:19:12] Starting job 5,CPU time has been restored to 2572.343750. [19:19:12] Skipping Job #5 [19:19:12] Starting job 6,CPU time has been restored to 2572.343750. [19:19:12] Skipping Job #6 [19:19:12] Starting job 7,CPU time has been restored to 2572.343750. [19:19:12] Skipping Job #7 19:19:13 (3300): called boinc_finish |
||
|
|