Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 13
|
![]() |
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I was checking the processing today and I am seeing a lot of errors - over 20 pages - occurring. It looks like they were all received on 4/3 and the errors occurred on 4/7. I checked the process information and am seeing the following:
----------------------------------------Result Log Result Name: X0900127880685201107011218_ 1-- <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> No process is on the other end of the pipe. (0xe9) - exit code 233 (0xe9) </message> <stderr_txt> </stderr_txt> ]]> [Edit 1 times, last edit by Former Member at Apr 7, 2013 5:57:17 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I have one machine that gets a lot of these errors as well. It has 3 GPU's and If I try to run more that ~18 WU's simultaneously it starts producing these errors. At 21 WU's, I see 5-15 errors per day and the number goes up exponentially from there. If I try to run 36, almost all of them error out. I haven't been able to trace the source of these errors. I know others who run 24+ WU's at a time with success so I'm guessing it's hardware related or possibly a problem with some non-BOINC software.
----------------------------------------[Edit 1 times, last edit by Former Member at Apr 8, 2013 2:20:28 AM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I just checked again, and I am getting more of these errors. The latest were received on 4/8 and the error occurred on 4/9. I have not had the error, that first occurred on 4/7, previously. The machine receiving the error has only one GPU card.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
The latest results status information for the last error received is
Workunit Status Project Name: Help Conquer Cancer Created: 04/07/2013 16:11:26 Name: X0930129771353201108241026 Minimum Quorum: 2 Replication: 2 Result Name App Version Number Status Sent Time Time Due / Return Time CPU Time / Elapsed Time (hours) Claimed/ Granted BOINC Credit X0930129771353201108241026_ 2-- - In Progress 4/9/13 16:22:08 4/16/13 16:22:08 0.00 0.0 / 0.0 X0930129771353201108241026_ 0-- 705 Pending Validation 4/8/13 04:49:13 4/9/13 03:06:12 0.12 63.3 / 0.0 X0930129771353201108241026_ 1-- 705 Error 4/8/13 04:49:10 4/9/13 16:22:00 0.00 0.1 / 0.0 |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I have not received any errors on 4/9 or 4/10 (yet).
|
||
|
armstrdj
Former World Community Grid Tech Joined: Oct 21, 2004 Post Count: 695 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
dkt,
Error code 233 occurs when the application does a performance check at the beginning of execution to make sure the video card can run the application. However with this error you should get information in stderr about the card and the time for the performance check, not sure why this is not showing up in your stderr. The error can occur if your gpu is loaded with other activity or if you try and run to many workunits at a time. Thanks, armstrdj |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Thanks armstrdj -
----------------------------------------My son uses that computer for video games. Would this cause the errors as you describe? [Edit 1 times, last edit by Former Member at Apr 11, 2013 4:43:55 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
My error messages have changed, but I will have to wait a day or two to see if I am still getting the previous errors. Now, toward the end of the stream for the Error, I am getting
----------------------------------------Result Log Result Name: X0930130721179201111171053_ 0-- <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> No process is on the other end of the pipe. (0xe9) - exit code 233 (0xe9) </message> <stderr_txt> Commandline: projects/www.worldcommunitygrid.org ... [and a lot of lines followed by...] Estimated kernel execution time = 3.21459 [sec] ERROR: Kernel execution time estimate too high, exiting. 19:45:12 (10156): called boinc_finish </stderr_txt> ]]> I will give an update on Monday with the weekend processing results. The last error was (sorted by return time) X0930130720623201111171101_ 1-- SSTsComp Error 4/10/13 14:46:02 4/11/13 16:06:17 0.00 / 0.00 0.1 / 0.0 [Edit 2 times, last edit by Former Member at Apr 12, 2013 5:17:13 PM] |
||
|
nanoprobe
Master Cruncher Classified Joined: Aug 29, 2008 Post Count: 2998 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
quote]Commandline: projects/www.worldcommunitygrid.org ... [and a lot of lines followed by...] Estimated kernel execution time = 3.21459 [sec] ERROR: Kernel execution time estimate too high, exiting. 19:45:12 (10156): called boinc_finish When I saw this error the only cure was to lower the amount of tasks running. Luckily you don't lose any runtime because this error shows up in the first 2-4 seconds. ----------------------------------------
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.
![]() ![]() |
||
|
Paul Schlaffer
Senior Cruncher USA Joined: Jun 12, 2005 Post Count: 244 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
During the beta testing I remember reading that running more than 1 workunit per CPU would cause errors. Lowering the WU number to the actual # of CPU cores should clear the errors.
----------------------------------------![]() “Where an excess of power prevails, property of no sort is duly respected. No man is safe in his opinions, his person, his faculties, or his possessions.” – James Madison (1792) |
||
|
|
![]() |