Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 120
|
![]() |
Author |
|
DSL Freak
Advanced Cruncher USA Joined: Feb 12, 2013 Post Count: 62 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Is the Result.out = 1627661.000000 different to the two valid of that last result that went invalid for you? Yes... Here's the status for that WU:BETA_ MCM1_ 0000144_ 4956_ 3-- (Valid) Result.out = 1627372.000000 BETA_ MCM1_ 0000144_ 4956_ 2-- (Invalid) Result.out = 1627661.000000 <--- Mine BETA_ MCM1_ 0000144_ 4956_ 0-- (Invalid) Result.out = 1627434.000000 BETA_ MCM1_ 0000144_ 4956_ 1-- (Valid) Result.out = 1627372.000000
Crunchin' for a cure!
|
||
|
slakin
Advanced Cruncher Joined: Jul 4, 2008 Post Count: 79 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I successfully processed a number of WU's that had been restarted, some multiple times. I did have 3 WU's that were deemed INVALID, the only thing that jumped out is that they were all from the BETA_MCM1_0000144 range ..jumped out as the others reported as invalid in this thread also seemed to be in that batch. The result out is different, appears that my wingmen did not do a restart. Let me know if there is any data you would like me to supply.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Here's an interesting one: BETA_MCM1_0000144_4267
Suffix _0 restarted 2 times and gave Result.out = 1627470.000000 Suffix _1 restarted 2 times and gave Result.out = 1627033.000000 Suffix _2 restarted 0 times and gave Result.out = 1627011.000000 Suffix _3 restarted 2 times and gave Result.out = 1627263.000000 Suffix _4 is still out there. Nothing consistent here then! |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
We've seen those in production [Sgt.Joe had one] All 5 copies a different RO value.
|
||
|
yoro42
Ace Cruncher United States Joined: Feb 19, 2011 Post Count: 8979 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I have two Beta 7.27 WUs running on two PCs. Both have been running for over 22 hours and both have a 24 hour deadline. Last night and again at noon today the estimated completion times were easily possible. Now however it looks unlikely. Time left values do not seem to be going down as one would expect. I'll post when they finish.
----------------------------------------Detail Below: WCGDAWS 1.3.3.0 Device Project Name Name Status Sent LastUpdate Returned OSName CPU Model ElapsedTime Thelonious BETA BETA_ MCM1_ 0000218_ 8668_ 3-- In Progress 12/11/2013 3:29:43 AM 12/11/2013 5:15:00 PM 12/12/2013 3:29:43 AM W7U XPS 8300 Jamal BETA BETA_ MCM1_ 0000218_ 9618_ 3-- In Progress 12/11/2013 3:35:18 AM 12/11/2013 5:15:00 PM 12/12/2013 3:35:18 AM XPPro Dimension 8400 BoincTasks 1.58 Application Name Status Rcvd Elapsed Time Time Left Prog % CPU % CK Point Deadline Memory Virtual Thelonious 7.27 beta17 BETA_MCM1_0000218_8668_3 Running High P. 12/10/2013 8:29:50 PM 20:30:26 (18:10:59) 02:10:28 87.94 88.666 [3] 00:06:24 12/11/2013 8:29:43 PM 30.96 MB 90.21 MB Jamal 7.27 beta17 BETA_MCM1_0000218_9618_3 Running High P. 12/10/2013 8:35:14 PM 20:56:32 (20:22:47) 05:13:47 51.64 97.315 [3] 00:03:10 12/11/2013 8:35:18 PM 43.38 MB 81.22 MB ![]() |
||
|
gomeyer
Senior Cruncher USA Joined: Jul 11, 2008 Post Count: 161 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I forced 32 WUs from this latest BETA release to do a restart. Of those 32, 3 have been marked invalid and 3 more are still PVer and may well also be invalid.
----------------------------------------EDIT: Don't know if it matters, but those invalid/pver WUs were from both Window 32 and Linux 64 machines. ![]() [Edit 1 times, last edit by gomeyer at Dec 12, 2013 4:17:29 AM] |
||
|
yoro42
Ace Cruncher United States Joined: Feb 19, 2011 Post Count: 8979 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I have two Beta 7.27 WUs running on two PCs. Both have been running for over 22 hours and both have a 24 hour deadline. Last night and again at noon today the estimated completion times were easily possible. Now however it looks unlikely. Time left values do not seem to be going down as one would expect. I'll post when they finish. Detail Below: WCGDAWS 1.3.3.0 Device Project Name Name Status Sent LastUpdate Returned OSName CPU Model ElapsedTime Thelonious BETA BETA_ MCM1_ 0000218_ 8668_ 3-- In Progress 12/11/2013 3:29:43 AM 12/11/2013 5:15:00 PM 12/12/2013 3:29:43 AM W7U XPS 8300 BoincTasks 1.58 Application Name Status Rcvd Elapsed Time Time Left Prog % CPU % CK Point Deadline Memory Virtual Thelonious 7.27 beta17 BETA_MCM1_0000218_8668_3 Running High P. 12/10/2013 8:29:50 PM 20:30:26 (18:10:59) 02:10:28 87.94 88.666 [3] 00:06:24 12/11/2013 8:29:43 PM 30.96 MB 90.21 MB The log of first to finish follows and the one still running shows 5:57:53 to go. Result Log Result Name: BETA_ MCM1_ 0000218_ 8668_ 3-- <core_client_version>7.2.33</core_client_version> <![CDATA[ <stderr_txt> Commandline = projects/www.worldcommunitygrid.org/wcgrid_beta17_7.27_windows_x86_64 -SettingsFile MCM1_0000218_8668.txt -DatabaseFile dataset-17_72_SDG_v1.txt Settings File DateOfDesign = 11/08/2013 Designer = PMCC_OCI WorkOrderID = 0000218_8668 DatasetID = 17_72_SDG_v1 NumberOfGenesInStartingSignature = 20 NumberOfGenesInSignatureMin = 10 NumberOfGenesInSignatureMax = 20 GroupVectorValues = {A}{B}{C}{D}{E}{F} ExplicitStartingGeneSignatures = A B D F StartingGeneSignatureAlgorithm = randomFixedLengthSearch SearchAlgorithmNumberToCreate = 1 SearchAlgorithmSequentialStartPosition = 5 RunPermutationAlgorithm = 1 PermutationGroups = A PermutationGroupsForReplacement = G PermutationAlgorithm = replaceFromRandomlyToRandomlyGreedy PermutationsNumIterations = 67911 OptimizationAlgorithmFrequency = 0 0 1 FBeta = 1.5 SimAnnealIMax = 20000 SimAnnealAlpha = 0.9996 NReps = 10 TrainFrac = 0.7 NFolds = 10 VMethod = LOO ModelType = SVM FitnessFn = 0 MinFitness = 0.61 SvmArgs = "-v 0 -c 0.1 -t 1 -d 2 -r 0" SvmLearnLimit = 500000 RSeed = 538668 [20:29:53] Initializing wcg_learn_limit = 500000 [20:29:59] Running [20:29:59] EvaluateFitnessOfStartingGeneSignatures 1 [20:30:00]: Computing pass 0 Commandline = projects/www.worldcommunitygrid.org/wcgrid_beta17_7.27_windows_x86_64 -SettingsFile MCM1_0000218_8668.txt -DatabaseFile dataset-17_72_SDG_v1.txt [03:30:49] Initializing wcg_learn_limit = 500000 [03:31:28] Running [03:31:28] EvaluateFitnessOfStartingGeneSignatures 1 [03:31:29]: Computing pass 0 Commandline = projects/www.worldcommunitygrid.org/wcgrid_beta17_7.27_windows_x86_64 -SettingsFile MCM1_0000218_8668.txt -DatabaseFile dataset-17_72_SDG_v1.txt [11:12:53] Initializing wcg_learn_limit = 500000 [11:13:19] Running [11:13:19] EvaluateFitnessOfStartingGeneSignatures 1 [11:13:20]: Computing pass 0 [20:45:56] Exiting PermutateGeneSignature [20:45:56] Writing final output [20:45:56] Closing Output Stream [20:45:56] Cleaning up Result.out = 222.000000 Run complete, CPU time: 74693.509894 20:45:56 (6760): called boinc_finish </stderr_txt> ]]> ![]() |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7697 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
All of the WU in batches named 218 seem to be quite lengthy. I have three which will go probably 24 hrs or more, but they all seem to be progressing normally - the percentage done continues to increment.
----------------------------------------Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
armstrdj
Former World Community Grid Tech Joined: Oct 21, 2004 Post Count: 695 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
We were able to track down a bug in the restart code that seems to be causing the majority of the invalids. We are currently running that fix in our alpha environment and would expect it to enter beta as early as tomorrow.
Thanks, armstrdj |
||
|
CandymanWCG
Senior Cruncher Romania Joined: Dec 20, 2010 Post Count: 421 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Many thanks for the update! Looking forward to the new Beta. We will make sure we will stress test those WUs. Hope this one nails it.
----------------------------------------Cheers! Knowledge is limited. Imagination encircles the world! - Albert Einstein ![]() ![]() |
||
|
|
![]() |