Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 603
|
![]() |
Author |
|
KWSN - A Shrubbery
Master Cruncher Joined: Jan 8, 2006 Post Count: 1585 Status: Offline |
Just another break in the parents. Mine went up to 2388; even with your results, we're still missing 2390-2397.
----------------------------------------![]() Distributed computing volunteer since September 27, 2000 [Edit 1 times, last edit by KWSN - A Shrubbery at Feb 7, 2012 9:36:34 PM] |
||
|
coolstream
Senior Cruncher SCOTLAND Joined: Nov 8, 2005 Post Count: 475 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Doing a quick inventory of results from my machines and I see that yesterday, one of my machines racked up 6 pages of errors in a 90-minute period. The errors were from 2458-2459. Each errored out in approx 1 minute. I haven't checked them all, but here's an example
----------------------------------------Result Log Result Name: CMD2_ 2459-1AVF_ A.clustersOccur-2DAE_ A.clustersOccur_ 0_ 4496_ 5178_ 1-- <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> WU download error: couldn't get input files: <file_xfer_error> <file_name>hcmd2.1AVF_A.clustersOccur.pdb.gzb</file_name> <error_code>-200</error_code> </file_xfer_error> </message> ]]> Normally, I would tend to want to do a restart of the machine after so many errors, but I see that things appear to have run smoothly again and have been doing so for the past 12 hours. Some of the WUs are PV status for 1 wingman but I haven't found one that has been validated (not checked all but have done random checks). Has anyone else noticed this? Or did that one machine throw a 90-minute wobble? ![]() Crunching in memory of my Mum PEGGY, cousin ROPPA and Aunt AUDREY. |
||
|
KWSN - A Shrubbery
Master Cruncher Joined: Jan 8, 2006 Post Count: 1585 Status: Offline |
I had one machine toss out 15 or so errors in a row the same way. The "couldn't get input files:" is typically server side. I don't pretend to know why it only happens to certain machines and only for limited times, but I wouldn't worry about it.
----------------------------------------![]() Distributed computing volunteer since September 27, 2000 |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I haven't seen errors on either of my machines. But they are still crunching on the parents 2388 batch that where send out.
|
||
|
coolstream
Senior Cruncher SCOTLAND Joined: Nov 8, 2005 Post Count: 475 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Thanks, KWSN - A Shrubbery. It certainly was a new phenomenon for me
----------------------------------------![]() ![]() Crunching in memory of my Mum PEGGY, cousin ROPPA and Aunt AUDREY. |
||
|
lomieheard
Cruncher Joined: Sep 20, 2011 Post Count: 8 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I am sorry in advance for my ignorance, but I'm still trying to learn all the lingo. What is meant by parents, children, grandchildren, etc?
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Try this post by Sekerob [Sep 30, 2009 11:34:38 AM]. It describes the naming standard if an initial workunit (WU), called a "parent", does not complete within its 6 hour or 12 hour time limit, with the remaining calculations being allocated to one or more "child" WUs. If a child WU fails to complete in time, then the (fewer) remaining calculations are allocated to one or more "grandchild" WUs, and so on.
There's also an FAQ entitled Help Cure Muscular Dystrophy, Phase 2 Parents... task sizes and durations with even more detail. |
||
|
Falconet
Master Cruncher Portugal Joined: Mar 9, 2009 Post Count: 3295 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I just downloaded 30 tasks and only got several children from batches 2466 and 2467.
----------------------------------------AMD Ryzen 5 1600AF 6C/12T 3.2 GHz - 85W AMD Ryzen 5 2500U 4C/8T 2.0 GHz - 28W AMD Ryzen 7 7730U 8C/16T 3.0 GHz [Edit 1 times, last edit by Falconet at Feb 9, 2012 2:36:34 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
ah yes. true blue just before the finishline.
You sure it are parents? Not something like CMD2 2467 1A2B A clusters occur 12345 67890 1 |
||
|
Falconet
Master Cruncher Portugal Joined: Mar 9, 2009 Post Count: 3295 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Lol not parents but children.
----------------------------------------Part of the number was hidden and I was sleepy :D AMD Ryzen 5 1600AF 6C/12T 3.2 GHz - 85W AMD Ryzen 5 2500U 4C/8T 2.0 GHz - 28W AMD Ryzen 7 7730U 8C/16T 3.0 GHz |
||
|
|
![]() |