Total posts in this thread: 52
This topic has been viewed 9732 times and has 51 replies
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
ERROR: exit code 95 (0x5f)

Sorry if this has been addressed previously. This is for WU ach1_1_4_3. 3 others in the quorum also have Error status (some are valid; 2 others with Error status):


Result Log

<core_client_version>5.8.16</core_client_version>
<![CDATA[
<message>
- exit code 95 (0x5f)
</message>
<stderr_txt>
Failed to get VersionInfo size: 1812
World Community Grid AutoDock (projects/www.worldcommunitygrid.org/wcg_acah_wrf_5.09_windows_intelx86) version
INFO: No state to restore. Start from the beginning.
ERROR: Restoring checkpoint failed. Unable to restore state!
Start_year/Start_Month/Start_Day::Start_Hour:Start_Minute:Start_Second Restart2002/12/18::0:0:0 0
Exception: Access Violation
At line 296 of file wrf_io.f
Traceback: not available, compile with -ftrace=frame or -ftrace=full

</stderr_txt>
]]>
----------------------------------------
[Edit 3 times, last edit by Former Member at Sep 4, 2007 9:19:40 PM]
[Sep 3, 2007 10:51:18 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: ERROR: Restoring checkpoint failed. Unable to restore state

Hello esoteric17,
This is a new error to investigate. Could you paste the relevant messages from stdoutdae.txt for the staff to look at?

Lawrence
[Sep 3, 2007 11:00:45 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: ERROR: Restoring checkpoint failed. Unable to restore state

2007-09-01 22:06:46 [World Community Grid] Starting ach1_1_4_3
2007-09-01 22:06:47 [World Community Grid] Starting task ach1_1_4_3 using acah version 509
2007-09-01 23:25:01 [World Community Grid] Starting dddt0101a0030_ZINC00637125-0001_01_4
2007-09-01 23:25:03 [World Community Grid] Starting task dddt0101a0030_ZINC00637125-0001_01_4 using dddt version 508
2007-09-01 23:30:52 [World Community Grid] Aborting task dddt0101a0030_ZINC00637125-0001_01_4: exceeded disk limit: 72.52MB > 71.53MB
2007-09-01 23:30:52 [World Community Grid] Deferring communication for 1 min 0 sec
2007-09-01 23:30:52 [World Community Grid] Reason: Unrecoverable error for result dddt0101a0030_ZINC00637125-0001_01_4 (Maximum disk usage exceeded)
2007-09-01 23:30:58 [World Community Grid] Computation for task dddt0101a0030_ZINC00637125-0001_01_4 finished
2007-09-01 23:30:58 [World Community Grid] Resuming task ach1_1_4_3 using acah version 509
2007-09-02 09:42:54 [---] Running CPU benchmarks
2007-09-02 09:42:54 [---] Suspending computation - running CPU benchmarks
2007-09-02 09:43:55 [---] Benchmark results:
2007-09-02 09:43:55 [---] Number of CPUs: 2
2007-09-02 09:43:55 [---] 1678 floating point MIPS (Whetstone) per CPU
2007-09-02 09:43:55 [---] 3022 integer MIPS (Dhrystone) per CPU
2007-09-02 09:43:56 [---] Resuming computation
2007-09-02 10:21:34 [World Community Grid] Deferring communication for 1 min 0 sec
2007-09-02 10:21:34 [World Community Grid] Reason: Unrecoverable error for result ach1_1_4_3 ( - exit code 95 (0x5f))
2007-09-02 10:21:34 [World Community Grid] Computation for task ach1_1_4_3 finished
2007-09-02 10:21:34 [World Community Grid] Output file ach1_1_4_3_0 for task ach1_1_4_3 absent
2007-09-02 10:21:34 [World Community Grid] Output file ach1_1_4_3_1 for task ach1_1_4_3 absent
2007-09-02 10:21:34 [World Community Grid] Output file ach1_1_4_3_2 for task ach1_1_4_3 absent
2007-09-02 10:21:34 [World Community Grid] Output file ach1_1_4_3_3 for task ach1_1_4_3 absent
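
For anyone who wants to pull the failing tasks out of a long stdoutdae.txt rather than reading it by eye, here is a minimal Python sketch. It assumes the "Reason: Unrecoverable error for result ..." wording shown in the log above. The function name `find_unrecoverable` and the regular expressions are made up for illustration and are not part of BOINC, so treat this as a starting point, not a tested tool.

```python
import re

# Pattern for the client's "Unrecoverable error" lines, modelled on the
# log excerpt in this thread (an assumption, not an official format spec).
ERROR_RE = re.compile(
    r"Reason: Unrecoverable error for result (?P<task>\S+) \((?P<reason>.*)\)\s*$"
)
# Matches both "exit code 95 (0x5f)" and "exited with code 131 (0x83)".
EXIT_RE = re.compile(r"exit(?:ed with)? code (?P<dec>\d+) \(0x(?P<hex>[0-9a-fA-F]+)\)")


def find_unrecoverable(lines):
    """Return (task, exit_code_or_None, reason) for each unrecoverable error line."""
    found = []
    for line in lines:
        match = ERROR_RE.search(line)
        if not match:
            continue
        reason = match.group("reason").strip()
        exit_match = EXIT_RE.search(reason)
        # Not every reason carries an exit code (e.g. disk limit aborts).
        code = int(exit_match.group("dec")) if exit_match else None
        found.append((match.group("task"), code, reason))
    return found


SAMPLE = [
    "2007-09-01 23:30:52 [World Community Grid] Reason: Unrecoverable error for result dddt0101a0030_ZINC00637125-0001_01_4 (Maximum disk usage exceeded)",
    "2007-09-02 09:42:56 [---] Resuming computation",
    "2007-09-02 10:21:34 [World Community Grid] Reason: Unrecoverable error for result ach1_1_4_3 ( - exit code 95 (0x5f))",
]

if __name__ == "__main__":
    for task, code, reason in find_unrecoverable(SAMPLE):
        print(task, code, reason)
```

To run it against a real log, pass the open file instead of `SAMPLE`, for example `find_unrecoverable(open("stdoutdae.txt"))`.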

----------------------------------------
[Edit 1 times, last edit by Former Member at Sep 3, 2007 11:11:50 PM]
[Sep 3, 2007 11:09:44 PM]
jal2
Senior Cruncher
USA
Joined: Apr 28, 2007
Post Count: 422
Status: Offline
Re: ERROR: Restoring checkpoint failed. Unable to restore state

My AC@H work-unit also encountered this error before finally abending on a different error.

workunitId=8641730

This WU had to wait nearly two days before all of the files were available for processing (from 2007-09-01 12:11:44 until 2007-09-03 10:10:25) and then abended after processing for 6.71 hours.
I skipped most of the "file not found" errors. Let me know if you need the middle section.

2007-09-01 12:11:15 [World Community Grid] [file_xfer] Started download of file wcg_acah_wrf_5.09_i686-pc-linux-gnu
2007-09-01 12:11:15 [World Community Grid] [file_xfer] Started download of file acah_image01_5.09.tga
2007-09-01 12:11:18 [World Community Grid] [file_xfer] Finished download of file acah_image01_5.09.tga
2007-09-01 12:11:18 [World Community Grid] [file_xfer] Throughput 118441 bytes/sec
2007-09-01 12:11:18 [World Community Grid] [file_xfer] Started download of file acah_image02_5.09.tga
2007-09-01 12:11:19 [World Community Grid] [file_xfer] Finished download of file acah_image02_5.09.tga
2007-09-01 12:11:19 [World Community Grid] [file_xfer] Throughput 37714 bytes/sec
2007-09-01 12:11:19 [World Community Grid] [file_xfer] Started download of file acah_image03_5.09.tga
2007-09-01 12:11:20 [World Community Grid] [file_xfer] Finished download of file acah_image03_5.09.tga
2007-09-01 12:11:20 [World Community Grid] [file_xfer] Throughput 38652 bytes/sec
2007-09-01 12:11:20 [World Community Grid] [file_xfer] Started download of file acah_image04_5.09.tga
2007-09-01 12:11:21 [World Community Grid] [file_xfer] Finished download of file acah_image04_5.09.tga
2007-09-01 12:11:21 [World Community Grid] [file_xfer] Throughput 12205 bytes/sec
2007-09-01 12:11:21 [World Community Grid] [file_xfer] Started download of file acah_image05_5.09.tga
2007-09-01 12:11:22 [World Community Grid] [file_xfer] Finished download of file acah_image05_5.09.tga
2007-09-01 12:11:22 [World Community Grid] [file_xfer] Throughput 19885 bytes/sec
2007-09-01 12:11:22 [World Community Grid] [file_xfer] Started download of file acah_image06_5.09.tga
2007-09-01 12:11:23 [World Community Grid] [file_xfer] Finished download of file acah_image06_5.09.tga
2007-09-01 12:11:23 [World Community Grid] [file_xfer] Throughput 2755 bytes/sec
2007-09-01 12:11:23 [World Community Grid] [file_xfer] Started download of file acah_image07_5.09.tga
2007-09-01 12:11:24 [World Community Grid] [file_xfer] Finished download of file acah_image07_5.09.tga
2007-09-01 12:11:24 [World Community Grid] [file_xfer] Throughput 9352 bytes/sec
2007-09-01 12:11:24 [World Community Grid] [file_xfer] Started download of file acah_image08_5.09.tga
2007-09-01 12:11:37 [World Community Grid] [file_xfer] Finished download of file acah_image08_5.09.tga
2007-09-01 12:11:37 [World Community Grid] [file_xfer] Throughput 91474 bytes/sec
2007-09-01 12:11:37 [World Community Grid] [file_xfer] Started download of file acah_image09_5.09.tga
2007-09-01 12:11:38 [World Community Grid] [file_xfer] Finished download of file acah_image09_5.09.tga
2007-09-01 12:11:38 [World Community Grid] [file_xfer] Throughput 14608 bytes/sec
2007-09-01 12:11:38 [World Community Grid] [file_xfer] Started download of file wcg_acah_wrf_5.09_i686-pc-linux-gnu.so
2007-09-01 12:11:41 [World Community Grid] [file_xfer] Finished download of file wcg_acah_wrf_5.09_i686-pc-linux-gnu.so
2007-09-01 12:11:41 [World Community Grid] [file_xfer] Throughput 76074 bytes/sec
2007-09-01 12:11:41 [World Community Grid] [file_xfer] Started download of file acah.GENPARM.TBL
2007-09-01 12:11:42 [World Community Grid] [file_xfer] Finished download of file acah.GENPARM.TBL
2007-09-01 12:11:42 [World Community Grid] [file_xfer] Throughput 1251 bytes/sec
2007-09-01 12:11:42 [World Community Grid] [file_xfer] Started download of file acah.LANDUSE.TBL
2007-09-01 12:11:44 [World Community Grid] [file_xfer] Finished download of file wcg_acah_wrf_5.09_i686-pc-linux-gnu
2007-09-01 12:11:44 [World Community Grid] [file_xfer] Throughput 134934 bytes/sec
2007-09-01 12:11:44 [World Community Grid] [file_xfer] Finished download of file acah.LANDUSE.TBL
2007-09-01 12:11:44 [World Community Grid] [file_xfer] Throughput 7698 bytes/sec
2007-09-01 12:11:44 [World Community Grid] [file_xfer] Started download of file acah.RRTM_DATA
2007-09-01 12:11:44 [World Community Grid] [file_xfer] Started download of file ach1_1__VEGPARM_01.TBL
2007-09-01 12:11:46 [World Community Grid] [file_xfer] Temporarily failed download of ach1_1__VEGPARM_01.TBL: file not found
2007-09-01 12:11:46 [World Community Grid] Backing off 1 min 0 sec on download of file ach1_1__VEGPARM_01.TBL

:
:
:
2007-09-03 03:41:48 [World Community Grid] [file_xfer] Started download of file ach1_1__SOILPARM_01.TBL
2007-09-03 03:41:51 [World Community Grid] [file_xfer] Temporarily failed download of ach1_1__SOILPARM_01.TBL: file not found
2007-09-03 03:41:51 [World Community Grid] Backing off 2 hr 39 min 56 sec on download of file ach1_1__SOILPARM_01.TBL
2007-09-03 06:21:49 [World Community Grid] [file_xfer] Started download of file ach1_1__SOILPARM_01.TBL
2007-09-03 06:21:51 [World Community Grid] [file_xfer] Temporarily failed download of ach1_1__SOILPARM_01.TBL: file not found
2007-09-03 06:21:51 [World Community Grid] Backing off 3 hr 48 min 32 sec on download of file ach1_1__SOILPARM_01.TBL
2007-09-03 10:10:25 [World Community Grid] [file_xfer] Started download of file ach1_1__SOILPARM_01.TBL
2007-09-03 10:10:29 [World Community Grid] [file_xfer] Finished download of file ach1_1__SOILPARM_01.TBL
2007-09-03 10:10:29 [World Community Grid] [file_xfer] Throughput 513 bytes/sec
2007-09-03 10:10:30 [World Community Grid] Starting ach1_1_1_11
2007-09-03 10:10:36 [World Community Grid] Starting task ach1_1_1_11 using acah version 509
2007-09-03 18:46:51 [World Community Grid] Deferring communication for 1 min 0 sec
2007-09-03 18:46:51 [World Community Grid] Reason: Unrecoverable error for result ach1_1_1_11 (process exited with code 131 (0x83))
2007-09-03 18:46:53 [World Community Grid] Computation for task ach1_1_1_11 finished
2007-09-03 18:46:53 [World Community Grid] Output file ach1_1_1_11_0 for task ach1_1_1_11 absent
2007-09-03 18:46:53 [World Community Grid] Output file ach1_1_1_11_1 for task ach1_1_1_11 absent
2007-09-03 18:46:53 [World Community Grid] Output file ach1_1_1_11_2 for task ach1_1_1_11 absent
2007-09-03 18:46:53 [World Community Grid] Output file ach1_1_1_11_3 for task ach1_1_1_11 absent
2007-09-03 18:47:54 [World Community Grid] Sending scheduler request: To report completed tasks
----------------------------------------
[Sep 3, 2007 11:33:24 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: ERROR: Restoring checkpoint failed. Unable to restore state

Btw I have the same error on the following WUs (others in the quorum have Error or are pending validation):

ach1_1_4_3
ach1_ 1_ 21_ 8--
ach1_ 1_ 29_ 1--
ach1_1_6_9
ach1_ 1_ 66_ 0--
ach1_ 1_ 59_ 2--
ach1_ 1_ 13_ 10--
ach1_ 1_ 7_ 10--
[Sep 4, 2007 1:51:13 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: ERROR: Restoring checkpoint failed. Unable to restore state

Has anyone seen any valid ones in the quorum? I checked some of mine that were "pending validation" and they all have this same message (although they're not listed as "error"... yet).
[Sep 4, 2007 5:35:03 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: ERROR: Restoring checkpoint failed. Unable to restore state

Hi esoteric17,
My first ACH work unit has sent out 2 make-up copies for 2 'errors'. One has returned. We are still waiting for the last make-up unit.
Lawrence
[Sep 4, 2007 6:09:28 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: ERROR: Restoring checkpoint failed. Unable to restore state

I have one AC-wu valid today. The error messages in the result log are still the same, as expected.
[Sep 5, 2007 9:54:20 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: ERROR: Restoring checkpoint failed. Unable to restore state

I got 43 of these WUs before I saw the errors and unchecked the project.

13 of them have error status (others in the quorum are error, pending validation, or inconclusive). All have at least one other in the quorum with error status, and a few have all error or inconclusive:

ach1_ 1_ 29_ 1--
ach1_ 1_ 21_ 8--
ach1_ 1_ 17_ 8--
ach1_1_4_3
ach1_ 1_ 13_ 10-- (10 copies, all error/inconclusive)
ach1_ 1_ 59_ 2-- (13 copies, all error/inconclusive)
ach1_ 1_ 66_ 0--
ach1_1_6_9
ach1_ 1_ 55_ 8--
ach1_ 1_ 68_ 0--
ach1_ 1_ 7_ 10--
ach1_ 1_ 78_ 3--
ach1_ 1_ 74_ 3--


2 of them are inconclusive:
ach1_ 1_ 24_ 11-- (11 copies, most inconclusive, some error)
ach1_ 1_ 27_ 7-- (11 copies, most inconclusive, some error)


The other 28 WUs I got are still pending validation.
[Sep 6, 2007 2:00:33 PM]
jal2
Senior Cruncher
USA
Joined: Apr 28, 2007
Post Count: 422
Status: Offline
Re: ERROR: Restoring checkpoint failed. Unable to restore state

10 WU on 5 different crunchers.

5 error
3 pending
1 inconclusive
1 in progress

78.29 CPU hours and counting

622.6 Boinc credits claimed
0.0 Boinc credits granted

Still crunching in the hope this unofficial beta testing will prove useful.
----------------------------------------
[Sep 6, 2007 10:14:48 PM]