World Community Grid Forums
Thread Status: Active Total posts in this thread: 5
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
On two of my boxes all WUs are failing with the following error:

<core_client_version>6.12.6</core_client_version>
<![CDATA[
<message>
- exit code 195 (0xc3)
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[22:23:29] Number of jobs = 16
[22:23:29] Starting job 0,CPU time has been restored to 0.000000.
Application exited with RC = 0x502
[22:27:10] Finished Job #0
22:27:10 (3324): called boinc_finish
</stderr_txt>
]]>

The BOINC version on the other box is 6.4, so I don't think that's the reason. I've not experienced any problems on other projects. Also, these WUs are crunching fine on a third box. Can anybody shed any light on this error? Thanks |
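For anyone comparing failures across several boxes, here is a minimal Python sketch that pulls the two codes out of a pasted result like the one above and shows them in decimal and hex. The function name `extract_codes` and the regular expressions are illustrative assumptions based only on the text format shown in this post; they are not part of BOINC or the CEP2 application, and the sketch does not say what 0x502 means.

```python
import re

# Text copied from the failed result in this post.
SAMPLE_STDERR = """\
<message>
- exit code 195 (0xc3)
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[22:23:29] Number of jobs = 16
[22:23:29] Starting job 0,CPU time has been restored to 0.000000.
Application exited with RC = 0x502
[22:27:10] Finished Job #0
22:27:10 (3324): called boinc_finish
</stderr_txt>
"""


def extract_codes(text):
    """Return the exit code and application RC found in the text.

    Hypothetical helper: the patterns only match the wording seen
    in this thread and may need adjusting for other output.
    """
    codes = {}
    exit_match = re.search(r"exit code (\d+) \((0x[0-9a-fA-F]+)\)", text)
    if exit_match:
        codes["exit_code"] = int(exit_match.group(1))
        # The hex form in parentheses should decode to the same number.
        codes["exit_code_from_hex"] = int(exit_match.group(2), 16)
    rc_match = re.search(r"RC = (0x[0-9a-fA-F]+)", text)
    if rc_match:
        codes["app_rc"] = int(rc_match.group(1), 16)
    return codes


if __name__ == "__main__":
    for name, value in extract_codes(SAMPLE_STDERR).items():
        print(f"{name}: {value} ({value:#x})")
```

Running the same helper over results from the failing and the working boxes makes it easy to check whether the application RC is always the same value.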
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hello Ram Raider,
That is a new error to me so . . . no idea. Perhaps knowing your system and profile would give some clues. To start, try posting the startup messages in BOINC for a system and describe the profile that it is running under. How many CEP2 jobs are running at the same time?
Lawrence |
||
|
Jim1348
Veteran Cruncher USA Joined: Jul 13, 2009 Post Count: 1066 Status: Offline |
[22:23:29] Number of jobs = 16
Are you running 8 cores hyper-threaded? It could be that your disk drive can't keep up with all the writes. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
One WU has 16 jobs. I think Lawrence was referring to the number of WUs running simultaneously on one machine.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Thanks for the suggestions.
Here are the startup messages for BOINC:

26-Nov-2010 12:15:19 [---] Starting BOINC client version 6.12.6 for windows_intelx86
26-Nov-2010 12:15:19 [---] Config: GUI RPC allowed from:
26-Nov-2010 12:15:19 [---] Config: 192.168.0.2
26-Nov-2010 12:15:19 [---] log flags: file_xfer, sched_ops, task, cpu_sched
26-Nov-2010 12:15:19 [---] Libraries: libcurl/7.19.7 OpenSSL/0.9.8l zlib/1.2.5
26-Nov-2010 12:15:19 [---] Running as a daemon
26-Nov-2010 12:15:19 [---] Data directory: C:\BOINC
26-Nov-2010 12:15:19 [---] Running under account boinc_master
26-Nov-2010 12:15:19 [---] Processor: 2 GenuineIntel Intel(R) Core(TM)2 Duo CPU T7300 @ 2.00GHz [Family 6 Model 15 Stepping 11]
26-Nov-2010 12:15:19 [---] Processor: 4.00 MB cache
26-Nov-2010 12:15:19 [---] Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 nx lm vmx tm2 pbe
26-Nov-2010 12:15:19 [---] OS: Microsoft Windows XP: Professional x86 Edition, Service Pack 3, (05.01.2600.00)
26-Nov-2010 12:15:19 [---] Memory: 1.96 GB physical, 5.81 GB virtual
26-Nov-2010 12:15:19 [---] Disk: 93.16 GB total, 8.21 GB free
26-Nov-2010 12:15:19 [---] Local time is UTC +0 hours
26-Nov-2010 12:15:19 [---] No usable GPUs found
26-Nov-2010 12:15:22 [climateprediction.net] URL http://climateprediction.net/; Computer ID 921587; resource share 1000
26-Nov-2010 12:15:22 [Quake-Catcher Network] URL http://qcn.stanford.edu/sensor/; Computer ID 4379; resource share 100
26-Nov-2010 12:15:22 [WUProp@Home] URL http://wuprop.boinc-af.org/; Computer ID 6625; resource share 100
26-Nov-2010 12:15:22 [malariacontrol.net] URL http://www.malariacontrol.net/; Computer ID 116030; resource share 670
26-Nov-2010 12:15:22 [World Community Grid] URL http://www.worldcommunitygrid.org/; Computer ID 746613; resource share 660
26-Nov-2010 12:15:22 [malariacontrol.net] General prefs: from malariacontrol.net (last modified 05-Jul-2010 17:50:00)
26-Nov-2010 12:15:22 [malariacontrol.net] Computer location: home
26-Nov-2010 12:15:22 [malariacontrol.net] General prefs: no separate prefs for home; using your defaults
26-Nov-2010 12:15:22 [---] Reading preferences override file
26-Nov-2010 12:15:22 [---] Preferences:
26-Nov-2010 12:15:22 [---] max memory usage when active: 2006.22MB
26-Nov-2010 12:15:22 [---] max memory usage when idle: 2006.22MB
26-Nov-2010 12:15:22 [---] max disk usage: 8.83GB
26-Nov-2010 12:15:22 [---] (to change preferences, visit the web site of an attached project, or select Preferences in the Manager)
26-Nov-2010 12:15:22 [---] Using proxy info from GUI
26-Nov-2010 12:15:22 [---] Not using a proxy

And here are the messages when the WU fails (after about 5 mins):

26-Nov-2010 21:21:42 [World Community Grid] Starting E200639_674_A.27.C20H11N3OS3.219.0.set1d06_1
26-Nov-2010 21:21:42 [World Community Grid] [cpu_sched] Starting E200639_674_A.27.C20H11N3OS3.219.0.set1d06_1 (initial)
26-Nov-2010 21:21:42 [World Community Grid] Starting task E200639_674_A.27.C20H11N3OS3.219.0.set1d06_1 using cep2 version 635
26-Nov-2010 21:27:46 [World Community Grid] Computation for task E200639_674_A.27.C20H11N3OS3.219.0.set1d06_1 finished
26-Nov-2010 21:27:54 [World Community Grid] Started upload of E200639_674_A.27.C20H11N3OS3.219.0.set1d06_1_0
26-Nov-2010 21:27:54 [World Community Grid] Started upload of E200639_674_A.27.C20H11N3OS3.219.0.set1d06_1_1
26-Nov-2010 21:27:56 [World Community Grid] Finished upload of E200639_674_A.27.C20H11N3OS3.219.0.set1d06_1_0
26-Nov-2010 21:27:56 [World Community Grid] Started upload of E200639_674_A.27.C20H11N3OS3.219.0.set1d06_1_2
26-Nov-2010 21:27:59 [World Community Grid] Finished upload of E200639_674_A.27.C20H11N3OS3.219.0.set1d06_1_2
26-Nov-2010 21:27:59 [World Community Grid] Started upload of E200639_674_A.27.C20H11N3OS3.219.0.set1d06_1_3
26-Nov-2010 21:28:03 [World Community Grid] Finished upload of E200639_674_A.27.C20H11N3OS3.219.0.set1d06_1_1
26-Nov-2010 21:28:03 [World Community Grid] Finished upload of E200639_674_A.27.C20H11N3OS3.219.0.set1d06_1_3
26-Nov-2010 21:28:03 [World Community Grid] Started upload of E200639_674_A.27.C20H11N3OS3.219.0.set1d06_1_4
26-Nov-2010 21:28:11 [World Community Grid] Finished upload of E200639_674_A.27.C20H11N3OS3.219.0.set1d06_1_4

Also, I only run one CEP2 WU at a time, and I've had no problems running any other project or other WCG sub-projects. So does anybody know what error 0x502 means? Thanks |
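To put a number on "after about 5 mins", here is a rough Python sketch that reads message-log lines in the format pasted above and reports how long each task ran between its "Starting task" and "Computation for task ... finished" messages. The helper name `task_runtimes` and the patterns are assumptions drawn only from the lines in this post, not from any BOINC tool, and the month-name parsing assumes an English locale.

```python
import re
from datetime import datetime

# Two lines copied from the failure log in this post.
SAMPLE_LOG = """\
26-Nov-2010 21:21:42 [World Community Grid] Starting task E200639_674_A.27.C20H11N3OS3.219.0.set1d06_1 using cep2 version 635
26-Nov-2010 21:27:46 [World Community Grid] Computation for task E200639_674_A.27.C20H11N3OS3.219.0.set1d06_1 finished
"""

# Timestamp, then [project], then the message text.
LINE_RE = re.compile(r"^(\d{2}-\w{3}-\d{4} \d{2}:\d{2}:\d{2}) \[(.*?)\] (.*)$")


def task_runtimes(log_text):
    """Map task name to seconds between its start and finish messages.

    Hypothetical helper: tasks without both messages are skipped.
    """
    started = {}
    runtimes = {}
    for line in log_text.splitlines():
        match = LINE_RE.match(line.strip())
        if not match:
            continue
        # %b expects English month abbreviations such as "Nov".
        stamp = datetime.strptime(match.group(1), "%d-%b-%Y %H:%M:%S")
        message = match.group(3)
        start = re.match(r"Starting task (\S+)", message)
        if start:
            started[start.group(1)] = stamp
            continue
        finish = re.match(r"Computation for task (\S+) finished", message)
        if finish and finish.group(1) in started:
            task = finish.group(1)
            runtimes[task] = (stamp - started[task]).total_seconds()
    return runtimes


if __name__ == "__main__":
    for task, seconds in task_runtimes(SAMPLE_LOG).items():
        print(f"{task}: {seconds:.0f} s")
```

For the task in this post the two timestamps are 21:21:42 and 21:27:46, which is 364 seconds, so a run that short on every CEP2 WU is a quick way to spot the failing ones in a long log.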
||
|