Thread Status: Active
Total posts in this thread: 8
This topic has been viewed 2547 times and has 7 replies
anhhai
Veteran Cruncher
Joined: Mar 22, 2005
Post Count: 839
More errors lately -- not just me, but my wingmen too

I am not sure what is going on, but I have had 3 WUs error out lately. I doubt it is my setup, because everyone else who has worked on these 3 WUs also errored out. I am glad the server was able to figure out that the WUs were bad and didn't keep sending them to more machines (well, more than 5), but I am wondering: is this really normal? I mean, all 3 died in about a 24-hour period. Did we get a bad batch or something?

WU 1:
Result Name           App Ver  Status          Sent Time         Due / Return Time  CPU (h)  Claimed / Granted credit
ts05_d277_pqa000_4--  -        In Progress     8/21/10 18:01:45  8/25/10 18:01:45   0.00     0.0 / 0.0
ts05_d277_pqa000_3--  617      Error           8/21/10 13:53:37  8/21/10 17:56:18   0.20     3.6 / 0.0
ts05_d277_pqa000_2--  617      Error           8/20/10 22:18:33  8/21/10 13:30:32   0.27     5.0 / 0.0
ts05_d277_pqa000_1--  617      Error           8/20/10 21:51:46  8/20/10 22:16:48   0.20     3.4 / 3.4
ts05_d277_pqa000_0--  -        In Progress     8/20/10 21:51:41  8/30/10 21:51:41   0.00     0.0 / 0.0



WU 2:
Result Name           App Ver  Status          Sent Time         Due / Return Time  CPU (h)  Claimed / Granted credit
ts05_d277_pqa003_4--  617      Error           8/21/10 11:49:07  8/21/10 16:00:45   0.19     3.2 / 3.2
ts05_d277_pqa003_3--  617      Error           8/21/10 08:19:46  8/21/10 13:50:56   0.36     5.4 / 5.4
ts05_d277_pqa003_2--  617      Error           8/21/10 00:55:55  8/21/10 11:44:57   0.25     3.8 / 3.8
ts05_d277_pqa003_1--  617      Error           8/20/10 21:52:04  8/21/10 00:48:06   0.26     4.7 / 4.7
ts05_d277_pqa003_0--  617      Error           8/20/10 21:51:56  8/21/10 08:08:27   0.17     3.2 / 3.2



WU 3:
Result Name           App Ver  Status          Sent Time         Due / Return Time  CPU (h)  Claimed / Granted credit
ts05_a239_pr78b1_4--  617      Error           8/21/10 05:53:43  8/21/10 10:50:25   0.14     2.2 / 2.2
ts05_a239_pr78b1_3--  617      Error           8/20/10 12:24:37  8/21/10 04:20:55   0.11     2.1 / 2.1
ts05_a239_pr78b1_2--  617      Error           8/20/10 05:41:46  8/20/10 12:19:39   0.18     3.2 / 3.2
ts05_a239_pr78b1_0--  617      Server Aborted  8/18/10 14:56:40  8/21/10 11:05:03   0.00     0.0 / 0.0
ts05_a239_pr78b1_1--  617      Error           8/18/10 14:56:40  8/20/10 04:43:38   0.18     2.7 / 2.7
----------------------------------------

[Aug 21, 2010 10:26:01 PM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Re: More errors lately -- not just me, but my wingmen too

The system stops sending out repair copies after 5 errors have been reported; the last 2 copies still out are then allowed to process, making the final total 7 at most.

The result status log (click on the error link) will tell you what the error(s) were. They are probably all identical. We see the occasional WU that just goes... a computational model condition that the app can't handle. Nothing much can be done about these, which is why WCG changed its policy so that results with known errors still get credit for computing time and points.
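
For anyone who wants the cutoff rule spelled out, here is a rough Python sketch of it as stated in this post. It only illustrates the arithmetic and is not WCG's actual server code; the function names and the `in_flight` parameter are made up for the example.

```python
def should_send_repair(errors_reported: int, error_limit: int = 5) -> bool:
    """Return True while another repair copy would still be issued.

    error_limit=5 is the figure quoted in this post. The function name is
    hypothetical and does not come from the WCG or BOINC code.
    """
    return errors_reported < error_limit


def max_total_results(error_limit: int = 5, in_flight: int = 2) -> int:
    """Upper bound on results for one WU: the errors that trigger the
    cutoff plus the copies already out that are allowed to finish."""
    return error_limit + in_flight


if __name__ == "__main__":
    print(should_send_repair(4))   # still below the limit
    print(should_send_repair(5))   # limit reached, no more repair copies
    print(max_total_results())     # 5 errors plus 2 copies still processing
```

Both numbers are left as parameters because the limits are settings and may not be the same for every project.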
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Aug 22, 2010 7:30:24 AM]
seippel
Former World Community Grid Tech
Joined: Apr 16, 2009
Post Count: 392
Re: More errors lately -- not just me, but my wingmen too

Sek,

DDDT2's settings are slightly different from those of the other projects. For this project, we stop sending out a work unit after 3 errors have been reported against it (the remaining 2 are allowed to process, for a maximum of 5).
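
As a quick sanity check, the three listings from the first post can be tallied against these DDDT2 numbers. The sketch below is only an illustration: the statuses are copied by hand from those listings, the short work unit labels and the helper name `summarize` are made up, and nothing here reflects how the server itself counts.

```python
from collections import Counter

# (work unit, status) pairs copied from the listings in the first post.
RESULTS = [
    ("pqa000", "In Progress"), ("pqa000", "Error"), ("pqa000", "Error"),
    ("pqa000", "Error"), ("pqa000", "In Progress"),
    ("pqa003", "Error"), ("pqa003", "Error"), ("pqa003", "Error"),
    ("pqa003", "Error"), ("pqa003", "Error"),
    ("pr78b1", "Error"), ("pr78b1", "Error"), ("pr78b1", "Error"),
    ("pr78b1", "Server Aborted"), ("pr78b1", "Error"),
]

ERROR_LIMIT = 3  # DDDT2 error cutoff quoted in this post
MAX_RESULTS = 5  # DDDT2 maximum quoted in this post


def summarize(results):
    """Return {work unit: (total results, results with status Error)}."""
    totals, errors = Counter(), Counter()
    for wu, status in results:
        totals[wu] += 1
        if status == "Error":
            errors[wu] += 1
    return {wu: (totals[wu], errors[wu]) for wu in totals}


if __name__ == "__main__":
    for wu, (total, errs) in summarize(RESULTS).items():
        print(wu, "results:", total, "errors:", errs,
              "cutoff reached:", errs >= ERROR_LIMIT,
              "within maximum:", total <= MAX_RESULTS)
```

Each of the three work units shows 5 results in total and at least 3 errors, which matches the maximum of 5 described here.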

Seippel
[Aug 23, 2010 9:58:58 PM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Re: More errors lately -- not just me, but my wingmen too

Maybe I should have remembered that something was said about that, because of the very long run times?... Fortunately for many, they don't do that any longer. This 3 and 5 is fine with me too, and with most, I suppose.
----------------------------------------
[Aug 23, 2010 10:05:01 PM]
gb009761
Master Cruncher
Scotland
Joined: Apr 6, 2005
Post Count: 2982
Re: More errors lately -- not just me, but my wingmen too

Seippel, as I haven't seen your name against the WCG Techs before, welcome :D Does this mean that one of your colleagues (as mentioned in this WCG wiki page Forum titles) has moved on to another assignment?
----------------------------------------

[Aug 23, 2010 10:34:28 PM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Re: More errors lately -- not just me, but my wingmen too

Hi,

Paul Simon had a song with Chevy Chase about a guy going by the name http://www.youtube.com/watch?v=jqrKejQTynk

nelsoc, the man of the famous X rated word list (that of course could not be published), left some months ago for a change of venue.

cheers
----------------------------------------
[Aug 23, 2010 10:45:29 PM]
gb009761
Master Cruncher
Scotland
Joined: Apr 6, 2005
Post Count: 2982
Re: More errors lately -- not just me, but my wingmen too

Thanks, Sek, for letting me know. The page in question has now duly been updated :)
----------------------------------------

[Aug 23, 2010 11:22:00 PM]
seippel
Former World Community Grid Tech
Joined: Apr 16, 2009
Post Count: 392
Re: More errors lately -- not just me, but my wingmen too

Yup - I'm the new guy here and thanks for the welcome. It's exciting to be involved with such a great project (WCG) and I'm impressed with the involvement of the community. I'll be involved with some of the work unit management and screen savers, so you should see more posts from me in those areas in the future.

[Aug 24, 2010 4:55:09 PM]