Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 46
|
![]() |
Author |
|
kateiacy
Veteran Cruncher USA Joined: Jan 23, 2010 Post Count: 1027 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
People have made their caches so large to get lots of DDDT2 WUs. It could take days for a wingman to even start one of these and identify a problem.
----------------------------------------![]() |
||
|
sk..
Master Cruncher http://s17.rimg.info/ccb5d62bd3e856cc0d1df9b0ee2f7f6a.gif Joined: Mar 22, 2007 Post Count: 2324 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Still 5 days to go. I increased my cache so as not to crunch one ;)
|
||
|
Somervillejudson@netscape.net
Veteran Cruncher USA Joined: May 16, 2008 Post Count: 1065 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Had one task run for 17 hours + then computational error. Shame I did not catch it sooner.
|
||
|
GB033533
Senior Cruncher UK Joined: Dec 8, 2004 Post Count: 201 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Sek, you were right that there would be no more iterations after the fourth, so I would have been safe to abort then. But I wanted to wait till the fourth wingman had errored, which he just has. So I've aborted mine at last! I think I got off lightly, only losing 1 hour.
----------------------------------------ts05_ d424_ sr45b0_ 4-- 617 Error 9/7/10 09:48:33 9/8/10 09:49:55 13.50 203.8 / 0.0 ts05_ d424_ sr45b0_ 3-- 617 Error 9/6/10 07:33:41 9/7/10 09:33:28 12.14 200.9 / 0.0 ts05_ d424_ sr45b0_ 2-- 617 Error 9/4/10 18:42:56 9/6/10 06:50:33 9.02 255.5 / 0.0 ts05_ d424_ sr45b0_ 1-- 617 Error 9/2/10 18:46:21 9/4/10 18:27:27 8.11 251.3 / 0.0 ts05_ d424_ sr45b0_ 0-- 617 User Aborted 9/2/10 18:46:17 9/8/10 10:32:31 1.00 15.4 / 0.0 ![]() |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Okay, so in effect, 3rd error sends out 1 more ''in progress'' and unless there's a valid result or one that at least goes into PV state, nothing more. At least I'd expect for the logic to allow that a 4th or 5th return going into PV state pushes out an additional wingman to crunch even when it's the 6th. Good is that the safeties were tightened as before it was 5th errors causing suspend. Think the FAQ needs fixing... not sure if it's science specific or across the board for any (with maybe HPF2 as an exception). It's somewhere in a knreed or uplinger post.
----------------------------------------crunching on... now right midway to Ruby and still getting backfill.
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
nanoprobe
Master Cruncher Classified Joined: Aug 29, 2008 Post Count: 2998 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I have another problem DDDT2 WU. It's been running for 8:27:16 with 0% progress.
----------------------------------------WU name is ts05_c229_sbq005_0 I have suspended it for now. What should i do with it?
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.
![]() ![]() |
||
|
gb009761
Master Cruncher Scotland Joined: Apr 6, 2005 Post Count: 2982 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I have suspended it for now. What should i do with it? When did you last restart your system?, what about your wingman - have they reported in yet? ![]() |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Okay, so in effect, 3rd error sends out 1 more ''in progress'' and unless there's a valid result or one that at least goes into PV state, nothing more. At least I'd expect for the logic to allow that a 4th or 5th return going into PV state pushes out an additional wingman to crunch even when it's the 6th. Good is that the safeties were tightened as before it was 5th errors causing suspend. Think the FAQ needs fixing... not sure if it's science specific or across the board for any (with maybe HPF2 as an exception). It's somewhere in a knreed or uplinger post. crunching on... now right midway to Ruby and still getting backfill. Found the original post by uplinger on the reduction of repair copies: http://www.worldcommunitygrid.org/forums/wcg/...ead,28881_offset,0#276016 The reduced repair job circulation only applies to DDDT2 as it reads.
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
nanoprobe
Master Cruncher Classified Joined: Aug 29, 2008 Post Count: 2998 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I have suspended it for now. What should i do with it? When did you last restart your system?, what about your wingman - have they reported in yet? System restarted yesterday. Am I correct in assuming the wingman's WU name ends in _1? If so it is PV.
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.
![]() ![]() |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Wingman numbers increment. _0 sets the base for which platform the wingman will run which gets _1. If there are repairs the suffix increments... _2 _3 etc.
----------------------------------------If the wingman is in PV, then I'd not hesitate to abort the task if after a client restart nothing changes, certainly if the run time is just a normal number for the PV. Any of the C type s branch run about 1.5 hours on my quad.
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
![]() |