Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 46
Posts: 46   Pages: 5   [ Previous Page | 1 2 3 4 5 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 120253 times and has 45 replies Next Thread
kateiacy
Veteran Cruncher
USA
Joined: Jan 23, 2010
Post Count: 1027
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: 50 hour WU?

People have made their caches so large to get lots of DDDT2 WUs. It could take days for a wingman to even start one of these and identify a problem.
----------------------------------------

[Sep 7, 2010 12:29:42 PM]   Link   Report threatening or abusive post: please login first  Go to top 
sk..
Master Cruncher
http://s17.rimg.info/ccb5d62bd3e856cc0d1df9b0ee2f7f6a.gif
Joined: Mar 22, 2007
Post Count: 2324
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: 50 hour WU?

Still 5 days to go. I increased my cache so as not to crunch one ;)
[Sep 7, 2010 7:06:56 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Somervillejudson@netscape.net
Veteran Cruncher
USA
Joined: May 16, 2008
Post Count: 1065
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: 50 hour WU?

Had one task run for 17 hours + then computational error. Shame I did not catch it sooner.
[Sep 7, 2010 10:26:57 PM]   Link   Report threatening or abusive post: please login first  Go to top 
GB033533
Senior Cruncher
UK
Joined: Dec 8, 2004
Post Count: 201
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: 50 hour WU?

Sek, you were right that there would be no more iterations after the fourth, so I would have been safe to abort then. But I wanted to wait till the fourth wingman had errored, which he just has. So I've aborted mine at last! I think I got off lightly, only losing 1 hour.

ts05_ d424_ sr45b0_ 4-- 617 Error 9/7/10 09:48:33 9/8/10 09:49:55 13.50 203.8 / 0.0
ts05_ d424_ sr45b0_ 3-- 617 Error 9/6/10 07:33:41 9/7/10 09:33:28 12.14 200.9 / 0.0
ts05_ d424_ sr45b0_ 2-- 617 Error 9/4/10 18:42:56 9/6/10 06:50:33 9.02 255.5 / 0.0
ts05_ d424_ sr45b0_ 1-- 617 Error 9/2/10 18:46:21 9/4/10 18:27:27 8.11 251.3 / 0.0
ts05_ d424_ sr45b0_ 0-- 617 User Aborted 9/2/10 18:46:17 9/8/10 10:32:31 1.00 15.4 / 0.0
----------------------------------------

[Sep 8, 2010 10:43:08 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: 50 hour WU?

Okay, so in effect, 3rd error sends out 1 more ''in progress'' and unless there's a valid result or one that at least goes into PV state, nothing more. At least I'd expect for the logic to allow that a 4th or 5th return going into PV state pushes out an additional wingman to crunch even when it's the 6th. Good is that the safeties were tightened as before it was 5th errors causing suspend. Think the FAQ needs fixing... not sure if it's science specific or across the board for any (with maybe HPF2 as an exception). It's somewhere in a knreed or uplinger post.

crunching on... now right midway to Ruby and still getting backfill.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Sep 8, 2010 11:01:36 AM]   Link   Report threatening or abusive post: please login first  Go to top 
nanoprobe
Master Cruncher
Classified
Joined: Aug 29, 2008
Post Count: 2998
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: 50 hour WU?

I have another problem DDDT2 WU. It's been running for 8:27:16 with 0% progress.
WU name is ts05_c229_sbq005_0
I have suspended it for now. What should i do with it?
----------------------------------------
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.


[Sep 12, 2010 4:24:27 PM]   Link   Report threatening or abusive post: please login first  Go to top 
gb009761
Master Cruncher
Scotland
Joined: Apr 6, 2005
Post Count: 2982
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: 50 hour WU?

I have suspended it for now. What should i do with it?


When did you last restart your system?, what about your wingman - have they reported in yet?
----------------------------------------

[Sep 12, 2010 4:28:20 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: 50 hour WU?

Okay, so in effect, 3rd error sends out 1 more ''in progress'' and unless there's a valid result or one that at least goes into PV state, nothing more. At least I'd expect for the logic to allow that a 4th or 5th return going into PV state pushes out an additional wingman to crunch even when it's the 6th. Good is that the safeties were tightened as before it was 5th errors causing suspend. Think the FAQ needs fixing... not sure if it's science specific or across the board for any (with maybe HPF2 as an exception). It's somewhere in a knreed or uplinger post.

crunching on... now right midway to Ruby and still getting backfill.

Found the original post by uplinger on the reduction of repair copies:

http://www.worldcommunitygrid.org/forums/wcg/...ead,28881_offset,0#276016

The reduced repair job circulation only applies to DDDT2 as it reads.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Sep 12, 2010 4:33:46 PM]   Link   Report threatening or abusive post: please login first  Go to top 
nanoprobe
Master Cruncher
Classified
Joined: Aug 29, 2008
Post Count: 2998
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: 50 hour WU?

I have suspended it for now. What should i do with it?


When did you last restart your system?, what about your wingman - have they reported in yet?


System restarted yesterday. Am I correct in assuming the wingman's WU name ends in _1? If so it is PV.
----------------------------------------
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.


[Sep 12, 2010 5:41:14 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: 50 hour WU?

Wingman numbers increment. _0 sets the base for which platform the wingman will run which gets _1. If there are repairs the suffix increments... _2 _3 etc.

If the wingman is in PV, then I'd not hesitate to abort the task if after a client restart nothing changes, certainly if the run time is just a normal number for the PV. Any of the C type s branch run about 1.5 hours on my quad.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Sep 12, 2010 6:04:01 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 46   Pages: 5   [ Previous Page | 1 2 3 4 5 | Next Page ]
[ Jump to Last Post ]
Post new Thread