Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 7
|
![]() |
Author |
|
wplachy
Senior Cruncher Joined: Sep 4, 2007 Post Count: 423 Status: Offline |
I have three ts01 WUs that errored out on three different PCs, after 1.21, 1.33 & .47 hrs. These devices normally do not throw errors.
----------------------------------------Two are running Win 7 64-bit (I7 920 HT on & C2D quad) other is running WinXP 32 (AMD4800+ duo). The result logs are the same with the exception of the WU name. Result Name: ts01_ c236_ se0000_ 1-- Result Name: ts01_ c248_ se0000_ 1-- Result Name: ts01_ d177_ se0000_ 1-- <core_client_version>6.2.28</core_client_version> <![CDATA[ <message> The system cannot write to the specified device. (0x1d) - exit code 29 (0x1d) </message> <stderr_txt> INFO: No state to restore. Start from the beginning. ENERGY CHANGE TOLERANCE EXCEEDED Encountered error. Exiting. </stderr_txt> BOINC messages are: ------------------- World Community Grid 4/10/2010 2:17:49 AM Starting ts01_c236_se0000_1 World Community Grid 4/10/2010 2:17:49 AM Starting task ts01_c236_se0000_1 using dddt2 version 617 .... World Community Grid 4/10/2010 2:53:16 AM Computation for task ts01_c236_se0000_1 finished World Community Grid 4/10/2010 2:53:16 AM Output file ts01_c236_se0000_1_0 for task ts01_c236_se0000_1 absent World Community Grid 4/10/2010 2:53:16 AM Output file ts01_c236_se0000_1_1 for task ts01_c236_se0000_1 absent World Community Grid 4/10/2010 2:53:16 AM Starting ts01_c235_se0000_0 World Community Grid 4/10/2010 2:53:16 AM Starting task ts01_c235_se0000_0 using dddt2 version 617 World Community Grid 4/10/2010 2:53:19 AM Started upload of ts01_c236_se0000_1_2 World Community Grid 4/10/2010 2:53:21 AM Finished upload of ts01_c236_se0000_1_2 ------------------- World Community Grid 4/9/2010 10:16:00 PM Starting ts01_c248_se0000_1 World Community Grid 4/9/2010 10:16:01 PM Starting task ts01_c248_se0000_1 using dddt2 version 617 .... World Community Grid 4/9/2010 11:36:14 PM Computation for task ts01_c248_se0000_1 finished World Community Grid 4/9/2010 11:36:14 PM Output file ts01_c248_se0000_1_0 for task ts01_c248_se0000_1 absent World Community Grid 4/9/2010 11:36:14 PM Output file ts01_c248_se0000_1_1 for task ts01_c248_se0000_1 absent ------------------- World Community Grid 4/9/2010 5:19:59 PM Starting ts01_d177_se0000_1 World Community Grid 4/9/2010 5:20:00 PM Starting task ts01_d177_se0000_1 using dddt2 version 617 .... World Community Grid 4/9/2010 6:33:05 PM Computation for task ts01_d177_se0000_1 finished World Community Grid 4/9/2010 6:33:05 PM Output file ts01_d177_se0000_1_0 for task ts01_d177_se0000_1 absent World Community Grid 4/9/2010 6:33:05 PM Output file ts01_d177_se0000_1_1 for task ts01_d177_se0000_1 absent World Community Grid 4/9/2010 6:33:05 PM Starting erlc_d060_pcb004_1 World Community Grid 4/9/2010 6:33:06 PM Starting task erlc_d060_pcb004_1 using dddt2 version 617 World Community Grid 4/9/2010 6:33:09 PM Started upload of ts01_d177_se0000_1_2 World Community Grid 4/9/2010 6:33:11 PM Finished upload of ts01_d177_se0000_1_2 ------------------- Initial wingmen and repair WUs are In Progress for all 3 WUs. Edit: Added 3rd error
Bill P
----------------------------------------![]() [Edit 2 times, last edit by wplachy at Apr 10, 2010 3:40:06 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I rememder have this myself a few mouhts back, Also with a windows7 system.
![]() Maybe a little too much, but i saw my cashe running empty. The WU's ran only a few minutes. I did reboot. ![]() ![]() ![]() A low tech solution but it help, i blame windows. |
||
|
wplachy
Senior Cruncher Joined: Sep 4, 2007 Post Count: 423 Status: Offline |
Hi Arnolddepater! Thanks for the reply. I have also seen this on HPF2 and Winxx 64 bit. Since it happened on 32 bit WinXP as well as Win 7, on three diferent machines and I have DDDT2 WUs in progress I'm going to hold off on the re-boots for now.
----------------------------------------It really looks to me like a WU rather than machine problem. Happy Crunching ![]()
Bill P
![]() |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
In that case i would suspend them so Uplinger can have a look. One of the recent threads mention how.
----------------------------------------To me it happenen to a type of WU that worked ok before, but crashed suddenly. ![]() Keep up that good work. [Edit 2 times, last edit by Former Member at Apr 10, 2010 10:32:04 AM] |
||
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
"ENERGY CHANGE TOLERANCE EXCEEDED"
This is a good exit by the program. It shows up as error because the work unit technically errored out. There is nothing wrong with your machine. The calculations for the experiment you were running got out of reasonable range and thus did not seem feasible. This is supposed to catch it before it causes a memory access violation as some have seen. You will get credit for the time calculated as others should get the same "error 29". Thanks, -Uplinger |
||
|
wplachy
Senior Cruncher Joined: Sep 4, 2007 Post Count: 423 Status: Offline |
Thank you for the reply uplinger! When I saw the message I suspected as much but thought it reasonable to let you know in case it was a WU problem.
----------------------------------------
Bill P
![]() |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Just looking through the laptop log and see 2 se types that have near double the normal time. I recollect them having restarted after a power out that 0.000% when already near 5 hours into the job. They did not loose the run time to go with that which I consider as good, for the full time contributed gets honored. Both tasks ended in valid
----------------------------------------ts01_ d118_ se0000_ 1-- 95711 Valid 9-4-10 08:45:34 10-4-10 01:07:13 9.65 97.8 / 58.0 ts01_ d119_ se0000_ 0-- 95711 Valid 9-4-10 08:45:34 10-4-10 01:04:12 9.40 95.3 / 59.1 The log shows 2 starts, the wingmen just one. Result Log
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
![]() |