Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 24
Posts: 24   Pages: 3   [ 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 20682 times and has 23 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Random errors

Hello

My rigs are working on HCC, there are random errors popping up on different rigs, an odd one here and there, now I have 4 showing up today, on 4 different rigs and all of them looking like they completed? I am _1

Result Name: X0000100480101200806201639_ 1--



<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
In ExtractGlcmFeatures: End of 0 iteration of outer loop.
In ExtractGlcmFeatures: End of 1 iteration of outer loop.
In ExtractGlcmFeatures: End of 2 iteration of outer loop.
In ExtractGlcmFeatures: End of 3 iteration of outer loop.
In ExtractGlcmFeatures: End of 4 iteration of outer loop.
In ExtractGlcmFeatures: End of 5 iteration of outer loop.
In ExtractGlcmFeatures: End of 6 iteration of outer loop.
In ExtractGlcmFeatures: End of 7 iteration of outer loop.
In ExtractGlcmFeatures: End of 8 iteration of outer loop.
In ExtractGlcmFeatures: End of 9 iteration of outer loop.
In ExtractGlcmFeatures: End of 10 iteration of outer loop.
In ExtractGlcmFeatures: End of 11 iteration of outer loop.
In ExtractGlcmFeatures: End of 12 iteration of outer loop.
In ExtractGlcmFeatures: End of 13 iteration of outer loop.
In ExtractGlcmFeatures: End of 14 iteration of outer loop.
In ExtractGlcmFeatures: End of 15 iteration of outer loop.
In ExtractGlcmFeatures: End of 16 iteration of outer loop.
In ExtractGlcmFeatures: End of 17 iteration of outer loop.
In ExtractGlcmFeatures: End of 18 iteration of outer loop.
In ExtractGlcmFeatures: End of 19 iteration of outer loop.
In ExtractGlcmFeatures: End of 20 iteration of outer loop.
In ExtractGlcmFeatures: End of 21 iteration of outer loop.
In ExtractGlcmFeatures: End of 22 iteration of outer loop.
In ExtractGlcmFeatures: End of 23 iteration of outer loop.
In ExtractGlcmFeatures: End of 24 iteration of outer loop.
21:22:08 (1024): called boinc_finish


X0000100480101200806201639_ 3-- 642 Pending Validation 20/05/11 00:19:34 20/05/11 05:47:15 1.34 20.1 / 0.0
X0000100480101200806201639_ 2-- - In Progress 20/05/11 00:19:30 22/05/11 19:31:30 0.00 0.0 / 0.0
X0000100480101200806201639_ 0-- 642 Error 18/05/11 13:30:24 20/05/11 00:18:59 1.74 26.5 / 0.0
X0000100480101200806201639_ 1-- 642 Error 18/05/11 13:29:20 18/05/11 20:23:53 1.14 31.3 / 0.0



==============================================


Result Name: X0000100571045200806241011_ 1--



<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
In ExtractGlcmFeatures: End of 0 iteration of outer loop.
In ExtractGlcmFeatures: End of 1 iteration of outer loop.
In ExtractGlcmFeatures: End of 2 iteration of outer loop.
In ExtractGlcmFeatures: End of 3 iteration of outer loop.
In ExtractGlcmFeatures: End of 4 iteration of outer loop.
In ExtractGlcmFeatures: End of 5 iteration of outer loop.
In ExtractGlcmFeatures: End of 6 iteration of outer loop.
In ExtractGlcmFeatures: End of 7 iteration of outer loop.
In ExtractGlcmFeatures: End of 8 iteration of outer loop.
In ExtractGlcmFeatures: End of 9 iteration of outer loop.
In ExtractGlcmFeatures: End of 10 iteration of outer loop.
In ExtractGlcmFeatures: End of 11 iteration of outer loop.
In ExtractGlcmFeatures: End of 12 iteration of outer loop.
In ExtractGlcmFeatures: End of 13 iteration of outer loop.
In ExtractGlcmFeatures: End of 14 iteration of outer loop.
In ExtractGlcmFeatures: End of 15 iteration of outer loop.
In ExtractGlcmFeatures: End of 16 iteration of outer loop.
In ExtractGlcmFeatures: End of 17 iteration of outer loop.
In ExtractGlcmFeatures: End of 18 iteration of outer loop.
In ExtractGlcmFeatures: End of 19 iteration of outer loop.
In ExtractGlcmFeatures: End of 20 iteration of outer loop.
In ExtractGlcmFeatures: End of 21 iteration of outer loop.
In ExtractGlcmFeatures: End of 22 iteration of outer loop.
In ExtractGlcmFeatures: End of 23 iteration of outer loop.
In ExtractGlcmFeatures: End of 24 iteration of outer loop.
01:19:10 (4516): called boinc_finish

</stderr_txt>

Project Name: Help Conquer Cancer
Created: 18/05/11
Name: X0000100571045200806241011
Minimum Quorum: 2
Replication: 2


Result Name App Version Number Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
X0000100571045200806241011_ 2-- 642 Valid 20/05/11 00:21:09 20/05/11 07:14:15 2.29 26.9 / 30.0
X0000100571045200806241011_ 3-- 642 Valid 20/05/11 00:21:09 20/05/11 02:59:28 1.50 33.1 / 30.0
X0000100571045200806241011_ 1-- 642 Error 19/05/11 01:28:40 20/05/11 00:20:27 1.34 30.3 / 0.0
X0000100571045200806241011_ 0-- 642 Error 19/05/11 01:28:38 19/05/11 08:34:25 0.90 21.2 / 0.0


=============================================


Result Name: X0000100661268200806181314_ 1--



<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
In ExtractGlcmFeatures: End of 0 iteration of outer loop.
In ExtractGlcmFeatures: End of 1 iteration of outer loop.
In ExtractGlcmFeatures: End of 2 iteration of outer loop.
In ExtractGlcmFeatures: End of 3 iteration of outer loop.
In ExtractGlcmFeatures: End of 4 iteration of outer loop.
In ExtractGlcmFeatures: End of 5 iteration of outer loop.
In ExtractGlcmFeatures: End of 6 iteration of outer loop.
In ExtractGlcmFeatures: End of 7 iteration of outer loop.
In ExtractGlcmFeatures: End of 8 iteration of outer loop.
In ExtractGlcmFeatures: End of 9 iteration of outer loop.
In ExtractGlcmFeatures: End of 10 iteration of outer loop.
In ExtractGlcmFeatures: End of 11 iteration of outer loop.
In ExtractGlcmFeatures: End of 12 iteration of outer loop.
In ExtractGlcmFeatures: End of 13 iteration of outer loop.
In ExtractGlcmFeatures: End of 14 iteration of outer loop.
In ExtractGlcmFeatures: End of 15 iteration of outer loop.
In ExtractGlcmFeatures: End of 16 iteration of outer loop.
In ExtractGlcmFeatures: End of 17 iteration of outer loop.
In ExtractGlcmFeatures: End of 18 iteration of outer loop.
In ExtractGlcmFeatures: End of 19 iteration of outer loop.
In ExtractGlcmFeatures: End of 20 iteration of outer loop.
In ExtractGlcmFeatures: End of 21 iteration of outer loop.
In ExtractGlcmFeatures: End of 22 iteration of outer loop.
In ExtractGlcmFeatures: End of 23 iteration of outer loop.
In ExtractGlcmFeatures: End of 24 iteration of outer loop.
20:30:34 (2460): called boinc_finish




Project Name: Help Conquer Cancer
Created: 18/05/11
Name: X0000100661268200806181314
Minimum Quorum: 2
Replication: 2


Result Name App Version Number Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
X0000100661268200806181314_ 2-- - In Progress 20/05/11 00:43:49 22/05/11 19:55:49 0.00 0.0 / 0.0
X0000100661268200806181314_ 3-- - In Progress 20/05/11 00:43:49 22/05/11 19:55:49 0.00 0.0 / 0.0
X0000100661268200806181314_ 0-- 642 Error 19/05/11 12:26:45 20/05/11 00:41:09 1.47 36.8 / 0.0
X0000100661268200806181314_ 1-- 642 Error 19/05/11 12:26:41 19/05/11 19:44:56 1.16 32.5 / 0.0


================================================


Result Name: X0000100681001200807021726_ 1--



<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
In ExtractGlcmFeatures: End of 0 iteration of outer loop.
In ExtractGlcmFeatures: End of 1 iteration of outer loop.
In ExtractGlcmFeatures: End of 2 iteration of outer loop.
In ExtractGlcmFeatures: End of 3 iteration of outer loop.
In ExtractGlcmFeatures: End of 4 iteration of outer loop.
In ExtractGlcmFeatures: End of 5 iteration of outer loop.
In ExtractGlcmFeatures: End of 6 iteration of outer loop.
In ExtractGlcmFeatures: End of 7 iteration of outer loop.
In ExtractGlcmFeatures: End of 8 iteration of outer loop.
In ExtractGlcmFeatures: End of 9 iteration of outer loop.
In ExtractGlcmFeatures: End of 10 iteration of outer loop.
In ExtractGlcmFeatures: End of 11 iteration of outer loop.
In ExtractGlcmFeatures: End of 12 iteration of outer loop.
In ExtractGlcmFeatures: End of 13 iteration of outer loop.
In ExtractGlcmFeatures: End of 14 iteration of outer loop.
In ExtractGlcmFeatures: End of 15 iteration of outer loop.
In ExtractGlcmFeatures: End of 16 iteration of outer loop.
In ExtractGlcmFeatures: End of 17 iteration of outer loop.
In ExtractGlcmFeatures: End of 18 iteration of outer loop.
In ExtractGlcmFeatures: End of 19 iteration of outer loop.
In ExtractGlcmFeatures: End of 20 iteration of outer loop.
In ExtractGlcmFeatures: End of 21 iteration of outer loop.
In ExtractGlcmFeatures: End of 22 iteration of outer loop.
In ExtractGlcmFeatures: End of 23 iteration of outer loop.
In ExtractGlcmFeatures: End of 24 iteration of outer loop.
23:51:48 (2040): called boinc_finish



Project Name: Help Conquer Cancer
Created: 18/05/11
Name: X0000100681001200807021726
Minimum Quorum: 2
Replication: 2


Result Name App Version Number Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
X0000100681001200807021726_ 2-- 642 Pending Validation 20/05/11 00:22:05 20/05/11 10:06:35 1.25 20.2 / 0.0
X0000100681001200807021726_ 3-- - In Progress 20/05/11 00:22:05 22/05/11 19:34:05 0.00 0.0 / 0.0
X0000100681001200807021726_ 0-- 642 Error 19/05/11 15:42:15 20/05/11 00:21:22 1.70 34.3 / 0.0
X0000100681001200807021726_ 1-- 642 Error 19/05/11 15:42:09 19/05/11 22:52:25 1.11 30.0 / 0.0




======================================
[May 20, 2011 11:05:59 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Random errors

Suspect this was/is an unexpected relation following the Server stop on Thursday, the 19th. Notice how _0 and _1, the original 2 copies both error and then the newly generated _3 and _4 copies validating for the most, still waiting on others in progress.

Calling Techs, Calling Techs

--//--
[May 20, 2011 12:05:59 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Random errors

This is 5 hours of wasted cpu time for me, when there was nothing wrong with my wu's. As in my OP, there are random errors on various machines over a period since we switched to the new HCC. I can run clean for weeks, then bump an error wu. I intend to use this thread to log them


cheers
[May 20, 2011 2:05:08 PM]   Link   Report threatening or abusive post: please login first  Go to top 
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Random errors

I am working to discover what might have happened. I have set two of the workunits to attempt validation again and they appear to have worked. I am sifting through logs right now to hopefully find the reason.

Thanks,
-Uplinger
[May 20, 2011 2:58:28 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
HCC: Random errors

Yes, by all means log them in this thread. Please do share OS/Hardware/Security software info... any exemptions set for BOINC? As one member noted a few days ago, the best McAfee software is the un-installed McAfee software, but there are a few other brands I'd not touch with a long pole (a private opinion) being source of unfathomable errors.

The result logs don't show anything but the ordinary, so maybe the system event log would reveal something during the failed result runs. We're those of past an uninterrupted crunching session? Was there some form of network timeout (saw one last nite, saying server were down for maint).

This is science at a very large scale, about 325,000 tasks per day validating, so the one here or there lost or a few in a blob could happen... always pains to loose crunching hours, but running clean for weeks means 4 million results in between that came out the hopper. As I noted, was looking for a tech response, as both original results failing, then the make-up copies send out after work distribution resume succeeding is a bit too coincidental.

--//--

edit: Put science moniker in post title.
----------------------------------------
[Edit 1 times, last edit by Former Member at May 20, 2011 3:17:19 PM]
[May 20, 2011 3:01:42 PM]   Link   Report threatening or abusive post: please login first  Go to top 
BobCat13
Senior Cruncher
Joined: Oct 29, 2005
Post Count: 295
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Random errors

Got one error on C4CW under Linux_x64 (know it's not HCC, but didn't want to start a new thread for one task)

Project Name: Computing for Clean Water
Created: 5/18/11
Name: c4cw_target03_129718938


c4cw_ target03_ 129718938_ 1-- - In Progress 5/20/11 00:20:47 5/24/11 00:20:47 0.00 0.0 / 0.0
c4cw_ target03_ 129718938_ 0-- 641 Error 5/19/11 02:55:48 5/20/11 00:20:00 3.18 81.4 / 0.0

Here is a snippet with the start and end of stderr.txt:

Result Log

Result Name: c4cw_ target03_ 129718938_ 0--
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Commandline = ../../projects/www.worldcommunitygrid.org/wcg_c4cw_lmps_6.41_x86_64-pc-linux-gnu -screen none -in in.wcg.acc -var wcgsteps1 10000 -var wcgsteps2 10000 -var loop 0 -var restart 0 -var rinterval 100 -var ifile in.wcg.acc -var wcgseed 129718938
[17:00:32] Percent complete = 0.499975
[17:01:28] Percent complete = 0.999950
[17:02:25] Percent complete = 1.499925
[17:03:22] Percent complete = 1.999900
[17:04:18] Percent complete = 2.499875
[17:05:15] Percent complete = 2.999850
[17:06:11] Percent complete = 3.499825
[17:07:08] Percent complete = 3.999800
...
snipped
...
[20:07:54] Percent complete = 98.495075
[20:08:51] Percent complete = 98.995050
[20:09:49] Percent complete = 99.495025
[20:10:46] Percent complete = 99.995000
20:10:47 (2450): called boinc_finish

</stderr_txt>
]]>
[May 20, 2011 3:21:00 PM]   Link   Report threatening or abusive post: please login first  Go to top 
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Random errors

vandiesel and BobCat, please check your work units again. I have reset the results with validation error to attempt again. It appears to have been related to the server outage yesterday.

Thanks,
-Uplinger
[May 20, 2011 5:02:14 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Random errors

thanks SekeRob/uplinger errors now gone

I will add to thread when/if errors occur


cheers
[May 20, 2011 8:56:55 PM]   Link   Report threatening or abusive post: please login first  Go to top 
BobCat13
Senior Cruncher
Joined: Oct 29, 2005
Post Count: 295
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Random errors

That one C4CW task now shows as Valid.

Thanks for fixing, Uplinger.
[May 21, 2011 1:54:50 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Random errors

Couple of errors, wingman showing errors also, I had a few last week, but I had a power outage and put some of them down to that.

Result Name: X0000109820872200905191623_ 0--



<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
One or more arguments are invalid (0x80000003) - exit code -2147483645 (0x80000003)
</message>
<stderr_txt>
In ExtractGlcmFeatures: End of 0 iteration of outer loop.
In ExtractGlcmFeatures: End of 1 iteration of outer loop.
In ExtractGlcmFeatures: End of 2 iteration of outer loop.
In ExtractGlcmFeatures: End of 3 iteration of outer loop.
In ExtractGlcmFeatures: End of 4 iteration of outer loop.
In ExtractGlcmFeatures: End of 5 iteration of outer loop.
In ExtractGlcmFeatures: End of 6 iteration of outer loop.
In ExtractGlcmFeatures: End of 7 iteration of outer loop.
In ExtractGlcmFeatures: End of 8 iteration of outer loop.
In ExtractGlcmFeatures: End of 9 iteration of outer loop.
In ExtractGlcmFeatures: End of 10 iteration of outer loop.
In ExtractGlcmFeatures: End of 11 iteration of outer loop.
In ExtractGlcmFeatures: End of 12 iteration of outer loop.
In ExtractGlcmFeatures: End of 13 iteration of outer loop.
In ExtractGlcmFeatures: End of 14 iteration of outer loop.
In ExtractGlcmFeatures: End of 15 iteration of outer loop.
In ExtractGlcmFeatures: End of 16 iteration of outer loop.
In ExtractGlcmFeatures: End of 17 iteration of outer loop.
In ExtractGlcmFeatures: End of 18 iteration of outer loop.
In ExtractGlcmFeatures: End of 19 iteration of outer loop.
In ExtractGlcmFeatures: End of 20 iteration of outer loop.
In ExtractGlcmFeatures: End of 21 iteration of outer loop.
In ExtractGlcmFeatures: End of 22 iteration of outer loop.
In ExtractGlcmFeatures: End of 23 iteration of outer loop.
In ExtractGlcmFeatures: End of 24 iteration of outer loop.
ERROR: Invalid parameter detected in function (null). File: (null) Line: 0
ERROR: Expression: (null)


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Breakpoint Encountered (0x80000003) at address 0x7C901230

Engaging BOINC Windows Runtime Debugger...



Project Name: Help Conquer Cancer
Created: 31/05/11
Name: X0000109820872200905191623
Minimum Quorum: 2
Replication: 2


Result Name App Version Number Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
X0000109820872200905191623_ 4-- 642 Valid 05/06/11 12:53:52 05/06/11 21:31:59 1.53 25.0 / 24.1
X0000109820872200905191623_ 3-- 642 Valid 02/06/11 17:22:38 06/06/11 16:48:00 1.53 23.2 / 24.1
X0000109820872200905191623_ 2-- 642 Error 02/06/11 15:08:06 02/06/11 17:20:42 0.00 0.0 / 0.0
X0000109820872200905191623_ 1-- 642 Valid 02/06/11 02:22:22 08/06/11 22:45:15 4.36 32.4 / 24.1
X0000109820872200905191623_ 0-- 642 Error 02/06/11 02:21:31 02/06/11 15:06:40 1.23 33.1 / 0.0


I am _0



====================================================





Result Name: X0000110840246200907011528_ 1--



<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
In ExtractGlcmFeatures: End of 0 iteration of outer loop.
In ExtractGlcmFeatures: End of 1 iteration of outer loop.
In ExtractGlcmFeatures: End of 2 iteration of outer loop.
In ExtractGlcmFeatures: End of 3 iteration of outer loop.
In ExtractGlcmFeatures: End of 4 iteration of outer loop.
In ExtractGlcmFeatures: End of 5 iteration of outer loop.
In ExtractGlcmFeatures: End of 6 iteration of outer loop.
In ExtractGlcmFeatures: End of 7 iteration of outer loop.
In ExtractGlcmFeatures: End of 8 iteration of outer loop.
In ExtractGlcmFeatures: End of 9 iteration of outer loop.
In ExtractGlcmFeatures: End of 10 iteration of outer loop.
In ExtractGlcmFeatures: End of 11 iteration of outer loop.
In ExtractGlcmFeatures: End of 12 iteration of outer loop.
In ExtractGlcmFeatures: End of 13 iteration of outer loop.
In ExtractGlcmFeatures: End of 14 iteration of outer loop.
In ExtractGlcmFeatures: End of 15 iteration of outer loop.
In ExtractGlcmFeatures: End of 16 iteration of outer loop.
In ExtractGlcmFeatures: End of 17 iteration of outer loop.
In ExtractGlcmFeatures: End of 18 iteration of outer loop.
In ExtractGlcmFeatures: End of 19 iteration of outer loop.
In ExtractGlcmFeatures: End of 20 iteration of outer loop.
In ExtractGlcmFeatures: End of 21 iteration of outer loop.
In ExtractGlcmFeatures: End of 22 iteration of outer loop.
In ExtractGlcmFeatures: End of 23 iteration of outer loop.
In ExtractGlcmFeatures: End of 24 iteration of outer loop.
19:27:27 (4596): called boinc_finish

</stderr_txt>



roject Name: Help Conquer Cancer
Created: 07/06/11
Name: X0000110840246200907011528
Minimum Quorum: 2
Replication: 2


Result Name App Version Number Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
X0000110840246200907011528_ 2-- - In Progress 08/06/11 21:52:23 11/06/11 17:04:23 0.00 0.0 / 0.0
X0000110840246200907011528_ 3-- 642 Pending Validation 08/06/11 21:52:22 09/06/11 03:52:27 1.54 37.0 / 0.0
X0000110840246200907011528_ 1-- 642 Error 08/06/11 11:58:53 08/06/11 19:04:05 1.24 28.8 / 0.0
X0000110840246200907011528_ 0-- 642 Error 08/06/11 11:58:52 08/06/11 21:49:02 1.27 18.4 / 0.0




I am _1
[Jun 9, 2011 10:14:21 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 24   Pages: 3   [ 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread