Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 30
Posts: 30   Pages: 3   [ 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 44371 times and has 29 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
work units not finishing

I had these messages :

08/02/2011 17:55:39 World Community Grid Task E201163_267_A.28.C23H13NOS2Se.322.1.set1d06_0 exited with zero status but no 'finished' file
08/02/2011 17:55:39 World Community Grid If this happens repeatedly you may need to reset the project.
08/02/2011 17:55:39 World Community Grid Task E201163_693_A.29.C21H13N5S2Si.176.4.set1d06_0 exited with zero status but no 'finished' file
08/02/2011 17:55:39 World Community Grid If this happens repeatedly you may need to reset the project.
08/02/2011 17:55:39 World Community Grid Restarting task E201163_267_A.28.C23H13NOS2Se.322.1.set1d06_0 using cep2 version 635
08/02/2011 17:55:39 World Community Grid Restarting task E201163_693_A.29.C21H13N5S2Si.176.4.set1d06_0 using cep2 version 635

The Wu's then restart to zero.

What does it mean and do I have to do something ?

It never happened before ...
[Feb 8, 2011 5:36:29 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: work units not finishing

Check for when it checkpointed most rercently

In propoties
It may be in a loop

Edit: Sorry for bad spelling
----------------------------------------
[Edit 1 times, last edit by Former Member at Feb 8, 2011 6:10:05 PM]
[Feb 8, 2011 6:08:19 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: work units not finishing

I do not know when these Wu's take checkpoints. it seems to be randomly....
[Feb 8, 2011 6:13:48 PM]   Link   Report threatening or abusive post: please login first  Go to top 
kateiacy
Veteran Cruncher
USA
Joined: Jan 23, 2010
Post Count: 1027
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: work units not finishing

I get that error in WUs for some projects when my computer loses its Internet connection and BOINC tries unsuccessfully to connect through the Internet.
----------------------------------------

[Feb 8, 2011 7:06:38 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: work units not finishing

@ Legrandpiou



For the checkpoint reading details see this FAQ: http://www.worldcommunitygrid.org/forums/wcg/viewthread?thread=11332 . The instructions are right after the first screenshot and read on through the second paragraph else you miss it... adding a <logflag> in the cc_config.xml, which for Windows does not by default exist (will for the upcoming WCG skinned 6.10.58 I think)



from Sekerob


(did that in a balmy mood a while ago)... RLOL

biggrin
----------------------------------------
[Edit 2 times, last edit by Former Member at Feb 8, 2011 11:53:56 PM]
[Feb 8, 2011 11:13:45 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Jim1348
Veteran Cruncher
USA
Joined: Jul 13, 2009
Post Count: 1066
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: work units not finishing

08/02/2011 17:55:39 World Community Grid Task E201163_267_A.28.C23H13NOS2Se.322.1.set1d06_0 exited with zero status but no 'finished' file
08/02/2011 17:55:39 World Community Grid If this happens repeatedly you may need to reset the project.
....................................
08/02/2011 17:55:39 World Community Grid Restarting task E201163_267_A.28.C23H13NOS2Se.322.1.set1d06_0 using cep2 version 635

The Wu's then restart to zero.

Are you really losing work because of this? It happened to me when I was using a Gen. 1 SSD that couldn't keep up with the CEP2 writes, but I never lost any work since it started back up immediately. (And the error message went away with a newer SSD).
[Feb 9, 2011 4:00:27 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: work units not finishing

I don't really know because when a Work unit restarts at zero for the third time, I ususally cancel it...
[Feb 12, 2011 4:07:32 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: work units not finishing

I get that error in WUs for some projects when my computer loses its Internet connection and BOINC tries unsuccessfully to connect through the Internet.

Sadly, to this day to include version 6.10.58, so true, last experienced multiple times when the WiFi was failing on Linux. Old never completely squashed bug.

It's an absolute abomination that this error still exists due to failing or absent internet connection. Sometimes it's instantaneous, sometimes it's with zero status reset, sometimes with a heartbeat time-out. Whole series of results can be eaten up by this, thus is one of the reasons why I often have "Network based on preferences" on with a 1 hour slot just before midnight UTC + a cache of 1 day so the results upload and new work fetched in that frame. More sadly, the 5 minute auto-connect function was removed between 6.2 and 6.10. That was brilliant as then always crunching off-line and now and then hitting the update button to open the line briefly for fetching and reporting and it then closing the line again.

Ce la BOINC
[Feb 12, 2011 4:24:38 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: work units not finishing

coffee I wanted to see how the WUs for CEP2 were completing so I went to Results Status of Valid, and found that those with a short run time (5 to 7 hours) would execute normally through job 11. On job 12 I would see a completion of

[23:38:16] Starting job 12,CPU time has been restored to 12451.110614.
Application exited with RC = 0xc0000005
[00:24:56] Finished Job #12

or

[03:30:55] Starting job 12,CPU time has been restored to 14482.212434.
Application exited with RC = 0x1
[05:27:41] Finished Job #12

The jobs following job 12 would then be skipped such as the one which follows:

[05:27:41] Starting job 13,CPU time has been restored to 21461.931176.
[05:27:41] Skipping Job #13

What is happening in job 12 which results in the errors?
[Feb 20, 2011 3:06:16 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: work units not finishing

Dear Legrandpiou,
It’s not random but due to technical difficulties currently not as regular as we would like. We recommend the ‘Leave Application in Memory’ option which bypasses the use of checkpoints as long as you don’t have to reboot.

Dear dkt,
The jobs in a wu are stringed together and we use the result of one job as an input for the next. It sometimes happens that this scheme fails, and then all subsequent jobs will also fail. We have carefully designed our wus to minimize this issue, but it will happen from time to time. That’s the nature of science – things fail from time to time.

Best wishes from

Your Harvard CEP team
[Feb 21, 2011 5:55:37 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 30   Pages: 3   [ 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread