World Community Grid Forums
Thread Status: Active Total posts in this thread: 90
SekeRob
Master Cruncher Joined: Jan 7, 2013 Post Count: 2741 Status: Offline |
Plz post event log from before you suspended the task to after resume.
(If you have the cpu_sched log flag set, you'd see lines like these:)
14863 3/17/2016 1:35:08 PM Suspending computation - CPU is busy
14864 World Community Grid 3/17/2016 1:35:08 PM [cpu_sched] Preempting FAH2_000098_avx38789_000012_0005_004_wcgfahb00020000_0 (left in memory)
14865 World Community Grid 3/17/2016 1:35:08 PM [cpu_sched] Preempting FAH2_000068_avx17556_000083_0035_026_0 (left in memory)
14866 World Community Grid 3/17/2016 1:35:08 PM [cpu_sched] Preempting FAH2_000075_avx17680_000070_0054_015_0 (left in memory)
14867 World Community Grid 3/17/2016 1:35:08 PM [cpu_sched] Preempting FAH2_000084_avx38747_000019_0035_005_0 (left in memory)
14868 World Community Grid 3/17/2016 1:35:08 PM [cpu_sched] Preempting FAH2_000104_gl5243104_000054_0038_005_0 (left in memory)
14869 World Community Grid 3/17/2016 1:35:08 PM [cpu_sched] Preempting FAH2_000104_gl5243104_000067_0041_005_0 (left in memory)
14870 World Community Grid 3/17/2016 1:35:08 PM [cpu_sched] Preempting FAH2_000104_gl5243104_000091_0056_004_0 (left in memory)
14871 World Community Grid 3/17/2016 1:35:08 PM [cpu_sched] Preempting ugm1_ugm1_23848_2192_1 (left in memory)
14872 World Community Grid 3/17/2016 1:35:08 PM [cpu_sched] Preempting BETA_E236439_6_S.422.C44H18N4O2S6.BRHMYCYOWFFHGT-UHFFFAOYSA-N.6_s1_14_1 (left in memory)
14873 3/17/2016 1:35:18 PM Resuming computation
14874 World Community Grid 3/17/2016 1:35:18 PM [cpu_sched] Resuming FAH2_000098_avx38789_000012_0005_004_wcgfahb00020000_0
14875 World Community Grid 3/17/2016 1:35:18 PM [cpu_sched] Resuming FAH2_000068_avx17556_000083_0035_026_0
14876 World Community Grid 3/17/2016 1:35:18 PM [cpu_sched] Resuming FAH2_000075_avx17680_000070_0054_015_0
14877 World Community Grid 3/17/2016 1:35:18 PM [cpu_sched] Resuming FAH2_000084_avx38747_000019_0035_005_0
14878 World Community Grid 3/17/2016 1:35:18 PM [cpu_sched] Resuming FAH2_000104_gl5243104_000054_0038_005_0
14879 World Community Grid 3/17/2016 1:35:18 PM [cpu_sched] Resuming FAH2_000104_gl5243104_000067_0041_005_0
14880 World Community Grid 3/17/2016 1:35:18 PM [cpu_sched] Resuming ugm1_ugm1_23848_2192_1
14881 World Community Grid 3/17/2016 1:35:18 PM [cpu_sched] Resuming BETA_E236439_6_S.422.C44H18N4O2S6.BRHMYCYOWFFHGT-UHFFFAOYSA-N.6_s1_14_1
I have it set to pause at 50% CPU load, which it is doing with LAIM (leave applications in memory) 'on', and I see nothing indicating a regression.
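For anyone who wants to check a longer event log than the excerpt above, here is a rough Python sketch that pulls the task names out of the [cpu_sched] lines and reports which tasks were preempted with "(left in memory)" and which were preempted but not resumed. The function name `summarize` and the regular expression are my own assumptions, based only on the line layout shown in this post; other client versions may format these lines differently.

```python
import re

# Assumed layout, taken from the log excerpt in this thread:
#   "... [cpu_sched] Preempting <task> (left in memory)"
#   "... [cpu_sched] Resuming <task>"
CPU_SCHED = re.compile(
    r"\[cpu_sched\] (Preempting|Resuming) (\S+)( \(left in memory\))?"
)

def summarize(log_lines):
    """Return (preempted, resumed, kept_in_memory) as sets of task names."""
    preempted, resumed, kept = set(), set(), set()
    for line in log_lines:
        m = CPU_SCHED.search(line)
        if not m:
            continue  # not a cpu_sched preempt/resume line
        action, task, left_in_memory = m.groups()
        if action == "Preempting":
            preempted.add(task)
            if left_in_memory:
                kept.add(task)
        else:
            resumed.add(task)
    return preempted, resumed, kept

# Four lines copied from the log quoted in this thread.
sample = [
    "14863 3/17/2016 1:35:08 PM Suspending computation - CPU is busy",
    "14871 World Community Grid 3/17/2016 1:35:08 PM [cpu_sched] Preempting ugm1_ugm1_23848_2192_1 (left in memory)",
    "14873 3/17/2016 1:35:18 PM Resuming computation",
    "14880 World Community Grid 3/17/2016 1:35:18 PM [cpu_sched] Resuming ugm1_ugm1_23848_2192_1",
]
preempted, resumed, kept = summarize(sample)
print("left in memory:", sorted(kept))
print("preempted but not resumed:", sorted(preempted - resumed))
```

If a preempted task shows up without "(left in memory)", that would be a hint that LAIM is not actually in effect on that client.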
||
|
adriverhoef
Master Cruncher The Netherlands Joined: Apr 3, 2009 Post Count: 2160 Status: Offline |
SekeRob wrote: "Plz post event log from before you suspended the task to after resume."
Unfortunately(?) - This is all I have, Rob:
2016-03-17T11:26:53 CET | World Community Grid | task BETA_E236437_655_S.364.C42H20N4S4.MZVSYBDUMBXWRY-UHFFFAOYSA-N.19_s1_14_1 suspended by user
[Edit 1 times, last edit by adriverhoef at Mar 17, 2016 1:49:56 PM]
||
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1320 Status: Offline |
I don't see what went wrong with this task ending as an error result:
Result Name: BETA_ E236437_ 816_ S.372.C52H28S2.HAEGOSUBGPDZCC-UHFFFAOYSA-N.9_ s1_ 14_ 0--
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[22:55:38] Number of jobs = 5
[22:55:38] Starting job 0,CPU time has been restored to 0.000000.
[22:55:38] Starting new Job
[22:55:38] Qink name = fldman
[22:55:41] Qink name = gesman
[22:55:43] Qink name = scfman
[01:58:42] Qink name = anlman
[01:58:42] Qink name = drvman
[02:02:34] Qink name = optman
[02:02:35] Qink name = fldman
[02:02:35] Qink name = gesman
[02:02:38] Qink name = scfman
[02:29:02] Qink name = anlman
[02:29:02] Qink name = drvman
[02:32:47] Qink name = optman
[02:32:48] Qink name = fldman
[02:32:48] Qink name = gesman
[02:32:51] Qink name = scfman
[02:57:51] Qink name = anlman
[02:57:51] Qink name = drvman
[03:01:33] Qink name = optman
[03:01:34] Qink name = fldman
[03:01:34] Qink name = gesman
[03:01:37] Qink name = scfman
[03:26:05] Qink name = anlman
[03:26:06] Qink name = drvman
[03:29:45] Qink name = optman
[03:29:45] Qink name = fldman
[03:29:45] Qink name = gesman
[03:29:49] Qink name = scfman
[03:55:26] Qink name = anlman
[03:55:26] Qink name = drvman
[03:59:14] Qink name = optman
[03:59:14] Qink name = fldman
[03:59:14] Qink name = gesman
[03:59:18] Qink name = scfman
[04:24:41] Qink name = anlman
[04:24:41] Qink name = drvman
[04:28:29] Qink name = optman
[04:28:30] Qink name = fldman
[04:28:30] Qink name = gesman
[04:28:33] Qink name = scfman
[04:53:36] Qink name = anlman
[04:53:36] Qink name = drvman
[04:57:21] Qink name = optman
[04:57:21] Qink name = fldman
[04:57:21] Qink name = gesman
[04:57:25] Qink name = scfman
[05:23:45] Qink name = anlman
[05:23:45] Qink name = drvman
[05:27:32] Qink name = optman
[05:27:33] Qink name = fldman
[05:27:33] Qink name = gesman
[05:27:36] Qink name = scfman
[05:54:19] Qink name = anlman
[05:54:19] Qink name = drvman
[05:58:10] Qink name = optman
[05:58:11] Qink name = fldman
[05:58:11] Qink name = gesman
[05:58:14] Qink name = scfman
[06:25:29] Qink name = anlman
[06:25:29] Qink name = drvman
[06:29:21] Qink name = optman
[06:29:21] Qink name = fldman
[06:29:21] Qink name = gesman
[06:29:25] Qink name = scfman
[06:54:22] Qink name = anlman
[06:54:23] Qink name = drvman
[06:58:10] Qink name = optman
[06:58:10] Qink name = fldman
[06:58:10] Qink name = gesman
[06:58:14] Qink name = scfman
[07:24:13] Qink name = anlman
[07:24:13] Qink name = drvman
[07:28:00] Qink name = optman
[07:28:01] Qink name = fldman
[07:28:01] Qink name = gesman
[07:28:04] Qink name = scfman
[07:50:58] Qink name = anlman
[07:50:58] Qink name = drvman
[07:54:40] Qink name = optman
[07:54:41] Qink name = fldman
[07:54:41] Qink name = gesman
[07:54:44] Qink name = scfman
[08:15:37] Qink name = anlman
[08:15:37] Qink name = drvman
[08:19:15] Qink name = optman
[08:19:16] Qink name = fldman
[08:19:16] Qink name = gesman
[08:19:19] Qink name = scfman
[08:40:03] Qink name = anlman
[08:40:03] Qink name = drvman
[08:43:48] Qink name = optman
[08:43:49] Qink name = fldman
[08:43:49] Qink name = gesman
[08:43:52] Qink name = scfman
[09:10:23] Qink name = anlman
[09:10:24] Qink name = drvman
[09:14:12] Qink name = optman
[09:14:13] Qink name = fldman
[09:14:13] Qink name = gesman
[09:14:16] Qink name = scfman
[09:38:43] Qink name = anlman
[09:38:43] Qink name = drvman
[09:42:23] Qink name = optman
[09:42:23] Qink name = fldman
[09:42:23] Qink name = gesman
[09:42:27] Qink name = scfman
[10:03:19] Qink name = anlman
[10:03:19] Qink name = drvman
[10:07:12] Qink name = optman
[10:07:13] Qink name = fldman
[10:07:13] Qink name = gesman
[10:07:16] Qink name = scfman
[10:30:36] Qink name = anlman
[10:30:36] Qink name = drvman
[10:34:23] Qink name = optman
[10:34:24] Qink name = fldman
[10:34:24] Qink name = gesman
[10:34:27] Qink name = scfman
[10:55:27] Qink name = anlman
[10:55:27] Qink name = drvman
[10:59:07] Qink name = optman
[10:59:08] Qink name = fldman
[10:59:08] Qink name = gesman
[10:59:11] Qink name = scfman
[11:17:50] Qink name = anlman
[11:17:50] Qink name = drvman
[11:21:35] Qink name = optman
[11:21:36] Qink name = anlman
[11:48:10] End of Job
[11:48:12] Finished Job #0
[11:48:12] Starting job 1,CPU time has been restored to 44094.579739.
[11:48:12] Starting new Job
[11:48:12] Qink name = fldman
[11:48:15] Qink name = gesman
[11:48:15] Qink name = scfman
[12:24:45] Qink name = anlman
[12:50:42] End of Job
[12:50:44] Finished Job #1
[12:50:44] Starting job 2,CPU time has been restored to 47811.672042.
[12:50:44] Starting new Job
[12:50:44] Qink name = fldman
[12:50:47] Qink name = gesman
[12:50:47] Qink name = scfman
[13:14:06] Qink name = anlman
[13:40:27] End of Job
[13:40:29] Finished Job #2
[13:40:29] Starting job 3,CPU time has been restored to 50729.406389.
[13:40:29] Starting new Job
[13:40:30] Qink name = fldman
[13:41:00] Qink name = gesman
[13:41:03] Qink name = scfman
Application exited with RC = 0x100
[14:39:11] Finished Job #3
[14:39:11] Starting job 4,CPU time has been restored to 54199.215238.
[14:39:11] Skipping Job #4
14:39:15 (6241): called boinc_finish
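In a stderr this long the one line that matters is easy to miss. A small Python sketch like the following can point at the job in which an "Application exited with RC" line appears. The function name `jobs_with_rc_exit` is hypothetical, and the two patterns it looks for are assumptions taken only from the output quoted in this post.

```python
import re

# Two markers, as they appear in the stderr quoted in this thread:
#   "Starting job <n>,CPU time has been restored to ..."
#   "Application exited with RC = 0x..."
MARKERS = re.compile(
    r"Starting job (\d+)|Application exited with RC = (0x[0-9a-fA-F]+)"
)

def jobs_with_rc_exit(stderr_text):
    """Map job number -> RC string for every job that logged an RC exit line."""
    current_job = None
    exits = {}
    for m in MARKERS.finditer(stderr_text):
        if m.group(1) is not None:
            current_job = int(m.group(1))      # a new job has started
        elif current_job is not None:
            exits[current_job] = m.group(2)    # RC line belongs to that job
    return exits

# Excerpt copied from the stderr quoted in this thread.
excerpt = (
    "[13:40:29] Starting job 3,CPU time has been restored to 50729.406389. "
    "[13:40:29] Starting new Job [13:40:30] Qink name = fldman "
    "[13:41:00] Qink name = gesman [13:41:03] Qink name = scfman "
    "Application exited with RC = 0x100 [14:39:11] Finished Job #3 "
    "[14:39:11] Starting job 4,CPU time has been restored to 54199.215238. "
    "[14:39:11] Skipping Job #4"
)
print(jobs_with_rc_exit(excerpt))  # {3: '0x100'}
```

For the result above this singles out Job #3, after which Job #4 was skipped.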
||
|
nanoprobe
Master Cruncher Classified Joined: Aug 29, 2008 Post Count: 2998 Status: Offline |
@Crystal Pellet
I have several of those too. As Uplinger explained to me during the first beta, these tasks are returning results outside the parameters preset by the scientists (nothing to do with WCG). Keith said they should really be labeled as invalid rather than as errors, but the last batch was never reclassified and these new ones may not be either. You'll still get credit for the time.
----------------------------------------
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.
||
|
SekeRob
Master Cruncher Joined: Jan 7, 2013 Post Count: 2741 Status: Offline |
SekeRob wrote: "Plz post event log from before you suspended the task to after resume."
adriverhoef wrote: "Unfortunately(?) - This is all I have, Rob: 2016-03-17T11:26:53 CET | World Community Grid | task BETA_E236437_655_S.364.C42H20N4S4.MZVSYBDUMBXWRY-UHFFFAOYSA-N.19_s1_14_1 suspended by user"
I'd reconfirm that LAIM is truly on. One gotcha: if you ever opened the local preferences and saved them, the website device profiles are permanently superseded by the local prefs [except project selections]. I mention it just in case you were changing settings via the website, which must always be followed by hitting Update on the affected clients.
||
|
adriverhoef
Master Cruncher The Netherlands Joined: Apr 3, 2009 Post Count: 2160 Status: Offline |
SekeRob wrote: "Plz post event log from before you suspended the task to after resume."
adriverhoef wrote: "Unfortunately(?) - This is all I have, Rob: 2016-03-17T11:26:53 CET | World Community Grid | task BETA_E236437_655_S.364.C42H20N4S4.MZVSYBDUMBXWRY-UHFFFAOYSA-N.19_s1_14_1 suspended by user"
SekeRob wrote: "I'd reconfirm LAIM is truly on. A gotcha, if you ever opened local preferences and saved them, the use of the website device profiles will be permanently superseded by the local prefs [except project selections]. Just in case you were doing settings via the website, which must always be followed by hitting update on the affected clients."
You're right, Rob, I see now that LAIM is off. Sorry for the confusion. I should have checked before posting; I was convinced it was on, while it wasn't. Sorry!
||
|
KerSamson
Master Cruncher Switzerland Joined: Jan 29, 2007 Post Count: 1673 Status: Offline |
A couple of hours ago, one of my hosts (Windows 7 Pro x64) received a high-priority WU - BETA_ E236439_ 388_ S.420.C44F2H16N6S5.SIQVRHDHDDANTL-UHFFFAOYSA-N.10_ s1_ 14_ 4-- - with a 1.5-day deadline and a forecasted duration of 1 day and 20 hours.
The WU is currently being computed, but the deadline will probably not be met.
Cheers, Yves
----------------------------------------
||
|
SekeRob
Master Cruncher Joined: Jan 7, 2013 Post Count: 2741 Status: Offline |
It so happens that I have asked for a feature that makes CEP2 never show a TTC (time to completion) higher than the 18-hour cap. Your 1.5-day deadline is plenty of time given what we know, but the client currently can't be made to wise up to that, at least AFAIK.
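To put numbers on that, here is a small Python sketch comparing the figures quoted in the two posts above. It assumes the 18-hour cap can be compared directly with the deadline, i.e. that the task runs more or less without interruption; on a host that crunches only part of the day the margin would be smaller. The variable and function names are mine.

```python
from datetime import timedelta

deadline = timedelta(days=1.5)          # deadline quoted by KerSamson (36 hours)
forecast = timedelta(days=1, hours=20)  # client's forecast quoted by KerSamson (44 hours)
cep2_cap = timedelta(hours=18)          # CEP2 cap mentioned by SekeRob

def fits(duration, limit):
    """True if a task of this duration finishes within the limit."""
    return duration <= limit

print(fits(forecast, deadline))  # False: the forecast overshoots the deadline
print(fits(cep2_cap, deadline))  # True: a capped task fits
print(deadline - cep2_cap)       # 18:00:00 of slack under the stated assumption
```

So the client's 44-hour forecast misses the 36-hour deadline, while a task that stops at the 18-hour cap would leave about 18 hours to spare.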
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I was surprised by this result.
The units with shorter runtimes (_0 and my _2) both exited with RC = 0x1 in Job #0 and became Valid, and the unit with longer runtime (_1) exited with RC = 0x1 in Job #3 but became Invalid. Is that the optimal/desired outcome? It looks like luck of the draw whether a _2 matches a _0 or a _1.
BETA_ E236439_ 762_ S.426.C52H22N4S4.ZUSOKAFTCOBIAO-UHFFFAOYSA-N.4_ s1_ 14_ 2-- | Microsoft Windows 7 Professional x64 Edition, Service Pack 1, (06.01.7601.00) | 700 | Valid | 17/03/16 12:00:03 | 18/03/16 01:43:32 | 1.65 | 69.7 / 76.7
BETA_ E236439_ 762_ S.426.C52H22N4S4.ZUSOKAFTCOBIAO-UHFFFAOYSA-N.4_ s1_ 14_ 1-- | Microsoft Windows 7 Home Premium x64 Edition, Service Pack 1, (06.01.7601.00) | 700 | Invalid | 16/03/16 22:42:26 | 17/03/16 11:57:56 | 8.58 | 261.6 / 261.6
BETA_ E236439_ 762_ S.426.C52H22N4S4.ZUSOKAFTCOBIAO-UHFFFAOYSA-N.4_ s1_ 14_ 0-- | Microsoft Windows 7 Professional x64 Edition, Service Pack 1, (06.01.7601.00) | 700 | Valid | 16/03/16 22:41:26 | 17/03/16 05:08:08 | 3.86 | 83.7 / 76.7
||
|
SekeRob
Master Cruncher Joined: Jan 7, 2013 Post Count: 2741 Status: Offline |
As the saying goes, two wrongs do not make a right [but this seems to be the validator logic in place]. The Invalid result is still credited, and then the results get moved to the scientists. The question, of course, is whether the Invalid status is also communicated back to Harvard. The design, as far as I know, is to move the canonical copy into the master db and upload it, but in CEP2's case the _4 part in production goes directly to them. Do they still ditch it because of the presumably incorrect label?
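To illustrate the "luck of the draw" effect, here is a toy Python sketch of a majority comparison. This is not the actual WCG validator, whose rules I don't know; it only assumes that results which agree with each other are marked Valid and the odd one out is marked Invalid. The function name, the "Pending" label and the outcome strings are all made up for the illustration.

```python
from collections import Counter

def label_by_majority(outcomes):
    """outcomes: dict of result suffix -> any comparable outcome signature.

    Hypothetical rule: the signature shared by at least two results wins.
    """
    counts = Counter(outcomes.values())
    winner, votes = counts.most_common(1)[0]
    if votes < 2:
        # No two results agree, so nothing can be decided yet.
        return {name: "Pending" for name in outcomes}
    return {
        name: ("Valid" if signature == winner else "Invalid")
        for name, signature in outcomes.items()
    }

# The three results from the post above, reduced to where each one stopped.
outcomes = {
    "_0": "stopped in Job #0",
    "_1": "stopped in Job #3",
    "_2": "stopped in Job #0",
}
print(label_by_majority(outcomes))
# {'_0': 'Valid', '_1': 'Invalid', '_2': 'Valid'}
```

Under such a rule, had the _2 result stopped in Job #3 instead, _1 and _2 would have agreed and _0 would have been the Invalid one, which is exactly the coin toss described above.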
|
||