Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 52
|
![]() |
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Sorry if this has been addressed previously. This is for WU ach1_1_4_3. 3 Others in the quorum have Error status also (some are vaild; 2 others with Error status):
----------------------------------------Result Log <core_client_version>5.8.16</core_client_version> <![CDATA[ <message> - exit code 95 (0x5f) </message> <stderr_txt> Failed to get VersionInfo size: 1812 World Community Grid AutoDock (projects/www.worldcommunitygrid.org/wcg_acah_wrf_5.09_windows_intelx86) version INFO: No state to restore. Start from the beginning. ERROR: Restoring checkpoint failed. Unable to restore state! Start_year/Start_Month/Start_Day::Start_Hour:Start_Minute:Start_Second Restart2002/12/18::0:0:0 0 Exception: Access Violation At line 296 of file wrf_io.f Traceback: not available, compile with -ftrace=frame or -ftrace=full </stderr_txt> ]]> [Edit 3 times, last edit by Former Member at Sep 4, 2007 9:19:40 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hello esoteric17,
This is a new error to investigate. Could you paste the relevant messages from stdoutdae.txt for the staff to look at? Lawrence |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
2007-09-01 22:06:46 [World Community Grid] Starting ach1_1_4_3
----------------------------------------2007-09-01 22:06:47 [World Community Grid] Starting task ach1_1_4_3 using acah version 509 2007-09-01 23:25:01 [World Community Grid] Starting dddt0101a0030_ZINC00637125-0001_01_4 2007-09-01 23:25:03 [World Community Grid] Starting task dddt0101a0030_ZINC00637125-0001_01_4 using dddt version 508 2007-09-01 23:30:52 [World Community Grid] Aborting task dddt0101a0030_ZINC00637125-0001_01_4: exceeded disk limit: 72.52MB > 71.53MB 2007-09-01 23:30:52 [World Community Grid] Deferring communication for 1 min 0 sec 2007-09-01 23:30:52 [World Community Grid] Reason: Unrecoverable error for result dddt0101a0030_ZINC00637125-0001_01_4 (Maximum disk usage exceeded) 2007-09-01 23:30:58 [World Community Grid] Computation for task dddt0101a0030_ZINC00637125-0001_01_4 finished 2007-09-01 23:30:58 [World Community Grid] Resuming task ach1_1_4_3 using acah version 509 2007-09-02 09:42:54 [---] Running CPU benchmarks 2007-09-02 09:42:54 [---] Suspending computation - running CPU benchmarks 2007-09-02 09:43:55 [---] Benchmark results: 2007-09-02 09:43:55 [---] Number of CPUs: 2 2007-09-02 09:43:55 [---] 1678 floating point MIPS (Whetstone) per CPU 2007-09-02 09:43:55 [---] 3022 integer MIPS (Dhrystone) per CPU 2007-09-02 09:43:56 [---] Resuming computation 2007-09-02 10:21:34 [World Community Grid] Deferring communication for 1 min 0 sec 2007-09-02 10:21:34 [World Community Grid] Reason: Unrecoverable error for result ach1_1_4_3 ( - exit code 95 (0x5f)) 2007-09-02 10:21:34 [World Community Grid] Computation for task ach1_1_4_3 finished 2007-09-02 10:21:34 [World Community Grid] Output file ach1_1_4_3_0 for task ach1_1_4_3 absent 2007-09-02 10:21:34 [World Community Grid] Output file ach1_1_4_3_1 for task ach1_1_4_3 absent 2007-09-02 10:21:34 [World Community Grid] Output file ach1_1_4_3_2 for task ach1_1_4_3 absent 2007-09-02 10:21:34 [World Community Grid] Output file ach1_1_4_3_3 for task ach1_1_4_3 absent [Edit 1 times, last edit by Former Member at Sep 3, 2007 11:11:50 PM] |
||
|
jal2
Senior Cruncher USA Joined: Apr 28, 2007 Post Count: 422 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
my AC@H work-unit also encountered this error before finally abending on a different error.
----------------------------------------workunitId=8641730 This WU had to wait for several hours before all of the files were available for processing (from 2007-09-01 12:11:44 until 2007-09-03 10:10:25) and then abended after processing 6.71 hours. I skipped most of the "file not found" errors. Let me know if you need the middle section. 2007-09-01 12:11:15 [World Community Grid] [file_xfer] Started download of file wcg_acah_wrf_5.09_i686-pc-linux-gnu 2007-09-01 12:11:15 [World Community Grid] [file_xfer] Started download of file acah_image01_5.09.tga 2007-09-01 12:11:18 [World Community Grid] [file_xfer] Finished download of file acah_image01_5.09.tga 2007-09-01 12:11:18 [World Community Grid] [file_xfer] Throughput 118441 bytes/sec 2007-09-01 12:11:18 [World Community Grid] [file_xfer] Started download of file acah_image02_5.09.tga 2007-09-01 12:11:19 [World Community Grid] [file_xfer] Finished download of file acah_image02_5.09.tga 2007-09-01 12:11:19 [World Community Grid] [file_xfer] Throughput 37714 bytes/sec 2007-09-01 12:11:19 [World Community Grid] [file_xfer] Started download of file acah_image03_5.09.tga 2007-09-01 12:11:20 [World Community Grid] [file_xfer] Finished download of file acah_image03_5.09.tga 2007-09-01 12:11:20 [World Community Grid] [file_xfer] Throughput 38652 bytes/sec 2007-09-01 12:11:20 [World Community Grid] [file_xfer] Started download of file acah_image04_5.09.tga 2007-09-01 12:11:21 [World Community Grid] [file_xfer] Finished download of file acah_image04_5.09.tga 2007-09-01 12:11:21 [World Community Grid] [file_xfer] Throughput 12205 bytes/sec 2007-09-01 12:11:21 [World Community Grid] [file_xfer] Started download of file acah_image05_5.09.tga 2007-09-01 12:11:22 [World Community Grid] [file_xfer] Finished download of file acah_image05_5.09.tga 2007-09-01 12:11:22 [World Community Grid] [file_xfer] Throughput 19885 bytes/sec 2007-09-01 12:11:22 [World Community Grid] [file_xfer] Started download of file acah_image06_5.09.tga 2007-09-01 12:11:23 [World Community Grid] [file_xfer] Finished download of file acah_image06_5.09.tga 2007-09-01 12:11:23 [World Community Grid] [file_xfer] Throughput 2755 bytes/sec 2007-09-01 12:11:23 [World Community Grid] [file_xfer] Started download of file acah_image07_5.09.tga 2007-09-01 12:11:24 [World Community Grid] [file_xfer] Finished download of file acah_image07_5.09.tga 2007-09-01 12:11:24 [World Community Grid] [file_xfer] Throughput 9352 bytes/sec 2007-09-01 12:11:24 [World Community Grid] [file_xfer] Started download of file acah_image08_5.09.tga 2007-09-01 12:11:37 [World Community Grid] [file_xfer] Finished download of file acah_image08_5.09.tga 2007-09-01 12:11:37 [World Community Grid] [file_xfer] Throughput 91474 bytes/sec 2007-09-01 12:11:37 [World Community Grid] [file_xfer] Started download of file acah_image09_5.09.tga 2007-09-01 12:11:38 [World Community Grid] [file_xfer] Finished download of file acah_image09_5.09.tga 2007-09-01 12:11:38 [World Community Grid] [file_xfer] Throughput 14608 bytes/sec 2007-09-01 12:11:38 [World Community Grid] [file_xfer] Started download of file wcg_acah_wrf_5.09_i686-pc-linux-gnu.so 2007-09-01 12:11:41 [World Community Grid] [file_xfer] Finished download of file wcg_acah_wrf_5.09_i686-pc-linux-gnu.so 2007-09-01 12:11:41 [World Community Grid] [file_xfer] Throughput 76074 bytes/sec 2007-09-01 12:11:41 [World Community Grid] [file_xfer] Started download of file acah.GENPARM.TBL 2007-09-01 12:11:42 [World Community Grid] [file_xfer] Finished download of file acah.GENPARM.TBL 2007-09-01 12:11:42 [World Community Grid] [file_xfer] Throughput 1251 bytes/sec 2007-09-01 12:11:42 [World Community Grid] [file_xfer] Started download of file acah.LANDUSE.TBL 2007-09-01 12:11:44 [World Community Grid] [file_xfer] Finished download of file wcg_acah_wrf_5.09_i686-pc-linux-gnu 2007-09-01 12:11:44 [World Community Grid] [file_xfer] Throughput 134934 bytes/sec 2007-09-01 12:11:44 [World Community Grid] [file_xfer] Finished download of file acah.LANDUSE.TBL 2007-09-01 12:11:44 [World Community Grid] [file_xfer] Throughput 7698 bytes/sec 2007-09-01 12:11:44 [World Community Grid] [file_xfer] Started download of file acah.RRTM_DATA 2007-09-01 12:11:44 [World Community Grid] [file_xfer] Started download of file ach1_1__VEGPARM_01.TBL 2007-09-01 12:11:46 [World Community Grid] [file_xfer] Temporarily failed download of ach1_1__VEGPARM_01.TBL: file not found 2007-09-01 12:11:46 [World Community Grid] Backing off 1 min 0 sec on download of file ach1_1__VEGPARM_01.TBL : : : 2007-09-03 03:41:48 [World Community Grid] [file_xfer] Started download of file ach1_1__SOILPARM_01.TBL 2007-09-03 03:41:51 [World Community Grid] [file_xfer] Temporarily failed download of ach1_1__SOILPARM_01.TBL: file not found 2007-09-03 03:41:51 [World Community Grid] Backing off 2 hr 39 min 56 sec on download of file ach1_1__SOILPARM_01.TBL 2007-09-03 06:21:49 [World Community Grid] [file_xfer] Started download of file ach1_1__SOILPARM_01.TBL 2007-09-03 06:21:51 [World Community Grid] [file_xfer] Temporarily failed download of ach1_1__SOILPARM_01.TBL: file not found 2007-09-03 06:21:51 [World Community Grid] Backing off 3 hr 48 min 32 sec on download of file ach1_1__SOILPARM_01.TBL 2007-09-03 10:10:25 [World Community Grid] [file_xfer] Started download of file ach1_1__SOILPARM_01.TBL 2007-09-03 10:10:29 [World Community Grid] [file_xfer] Finished download of file ach1_1__SOILPARM_01.TBL 2007-09-03 10:10:29 [World Community Grid] [file_xfer] Throughput 513 bytes/sec 2007-09-03 10:10:30 [World Community Grid] Starting ach1_1_1_11 2007-09-03 10:10:36 [World Community Grid] Starting task ach1_1_1_11 using acah version 509 2007-09-03 18:46:51 [World Community Grid] Deferring communication for 1 min 0 sec 2007-09-03 18:46:51 [World Community Grid] Reason: Unrecoverable error for result ach1_1_1_11 (process exited with code 131 (0 x83)) 2007-09-03 18:46:53 [World Community Grid] Computation for task ach1_1_1_11 finished 2007-09-03 18:46:53 [World Community Grid] Output file ach1_1_1_11_0 for task ach1_1_1_11 absent 2007-09-03 18:46:53 [World Community Grid] Output file ach1_1_1_11_1 for task ach1_1_1_11 absent 2007-09-03 18:46:53 [World Community Grid] Output file ach1_1_1_11_2 for task ach1_1_1_11 absent 2007-09-03 18:46:53 [World Community Grid] Output file ach1_1_1_11_3 for task ach1_1_1_11 absent 2007-09-03 18:47:54 [World Community Grid] Sending scheduler request: To report completed tasks |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Btw I have the same error on the following WUs (others in the quorum have Error or are pending validation):
ach1_1_4_3 ach1_ 1_ 21_ 8-- ach1_ 1_ 29_ 1-- ach1_1_6_9 ach1_ 1_ 66_ 0-- ach1_ 1_ 59_ 2-- ach1_ 1_ 13_ 10-- ach1_ 1_ 7_ 10-- |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Has anyone seen any valid ones in the quorum? I checked some of mine that were "pending validation" and they all have this same message (although they're not listed as "error"... yet).
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hi esoteric17,
My first ACH work unit has sent out 2 make-up copies for 2 'errors'. One has returned. We are still waiting for the last make-up unit. Lawrence |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I have one AC-wu valid today. The error messages in the result log still the same as expected.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I got 43 of these WUs before I saw the errors & unchecked it.
13 of them have error status (others in the quorum are error, pending validation, or inconclusive). All have at least one other in the quorum with error status, and a few have all error or inconclusive: ach1_ 1_ 29_ 1-- ach1_ 1_ 21_ 8-- ach1_ 1_ 17_ 8-- ach1_1_4_3 ach1_ 1_ 13_ 10-- (10 copies, all error/inconclusive) ach1_ 1_ 59_ 2-- (13 copies, all error/inconclusive) ach1_ 1_ 66_ 0-- ach1_1_6_9 ach1_ 1_ 55_ 8-- ach1_ 1_ 68_ 0-- ach1_ 1_ 7_ 10-- ach1_ 1_ 78_ 3-- ach1_ 1_ 74_ 3-- 2 of them are inconclusive: ach1_ 1_ 24_ 11-- (11 copies, most inconclusive some error) ach1_ 1_ 27_ 7-- (11 copies, most inconclusive some error) The other 28 WUs I got are still pending validation. |
||
|
jal2
Senior Cruncher USA Joined: Apr 28, 2007 Post Count: 422 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
10 WU on 5 different crunchers.
----------------------------------------5 error 3 pending 1 inconclusive 1 in progress 78.29 CPU hours and counting 622.6 Boinc credits claimed 0.0 Boinc credits granted still crunching in the hope this unofficial beta testing will prove useful. |
||
|
|
![]() |