World Community Grid - View Thread - unknown error exit code 195?

World Community Grid Forums

Category: Completed Research

Forum: The Clean Energy Project - Phase 2 Forum

Thread: unknown error exit code 195?

Quick Go »

No member browsing this thread

Thread Status: Active
Total posts in this thread: 11

[ ]

Author

This topic has been viewed 3128 times and has 10 replies

Composer
Cruncher
Joined: May 28, 2014
Post Count: 29
Status: Offline
Project Badges:

2 year badge for The Clean Energy Project - Phase 2

180 day badge for Mapping Cancer Markers

1 year badge for Uncovering Genome Mysteries

90 day badge for Outsmart Ebola Together

45 day badge for FightAIDS@Home - Phase 2

180 day badge for Microbiome Immunity Project

180 day badge for Africa Rainfall Project

2 year badge for OpenPandemics - COVID-19


unknown error exit code 195?

Im seeing a lot of WU's that are throwing computation errors, on my computer it is within seconds of the task starting, and judging by the stdrr logs the same is going for others as well.

Result Log

Result Name: E236522_ 982_ S.212.C20H10N6O3S1.HODSJZLVGMNSMW-UHFFFAOYSA-N.13_ s1_ 14_ 2--
<core_client_version>7.2.47</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code 195 (0xc3)
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[11:27:09] Number of jobs = 8
[11:27:09] Starting job 0,CPU time has been restored to 0.000000.
Application exited with RC = 0x1
[11:27:10] Finished Job #0
11:27:10 (16708): called boinc_finish

</stderr_txt>
]]>

anybody know what exit code 195 indicates, or what it means when the application exits with RC = 0x1?

It is not doing this on all of them, I currently have 8 that are crunching happily with no signs of any problems. Is this at all related to the recent issue with people not getting work units?

[Apr 17, 2016 10:35:54 PM]

Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline


Re: unknown error exit code 195?

I had the same problem a few years ago (and also saw my share of them recently -- just before the workunit well dried up). I got reassurance from a fellow cruncher here:
https://secure.worldcommunitygrid.org/forums/...ead,33490_offset,0#385626

and here:
https://secure.worldcommunitygrid.org/forums/...ead,34196_offset,0#399663

[Apr 18, 2016 6:33:27 PM]

Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7668
Status: Offline
Project Badges:

2 year badge for Human Proteome Folding - Phase 2

14 day badge for Help Cure Muscular Dystrophy

2 year badge for Discovering Dengue Drugs - Together

2 year badge for Nutritious Rice for the World

14 day badge for The Clean Energy Project

10 year badge for Help Fight Childhood Cancer

90 day badge for Influenza Antiviral Drug Search

2 year badge for Help Cure Muscular Dystrophy - Phase 2

45 day badge for Discovering Dengue Drugs - Together - Phase 2

2 year badge for Computing for Clean Water

5 year badge for Drug Search for Leishmaniasis

5 year badge for GO Fight Against Malaria

2 year badge for Computing for Sustainable Water

200 year badge for Mapping Cancer Markers

5 year badge for Uncovering Genome Mysteries

20 year badge for Outsmart Ebola Together

10 year badge for FightAIDS@Home - Phase 2

100 year badge for Smash Childhood Cancer

10 year badge for Microbiome Immunity Project

2 year badge for Africa Rainfall Project

100 year badge for OpenPandemics - COVID-19


Re: unknown error exit code 195?

Still happening:

Result Log

Result Name: E236523_ 381_ S.216.C18F2H8N6O1S2.NXTRQRZZGYDYOX-UHFFFAOYSA-N.21_ s1_ 14_ 1--
<core_client_version>7.2.7</core_client_version>
<![CDATA[
<message>
process exited with code 195 (0xc3, -61)
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[17:20:30] Number of jobs = 8
[17:20:30] Starting job 0,CPU time has been restored to 0.000000.
[17:20:33] Starting new Job
[17:20:33] Qink name = fldman
[17:20:34] Qink name = gesman
Error reading in TMP file 53/0 (1479200): No such file or directory
Application exited with RC = 0x100
[17:20:34] Finished Job #0
17:20:35 (4159): called boinc_finish

Cheers

----------------------------------------

Sgt. Joe
*Minnesota Crunchers*

[Apr 29, 2016 10:26:15 PM]

Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline


Re: unknown error exit code 195?

About Error reading in TMP file 53/0 (1479200): No such file or directory message, as I recently said in another thread, I have seen it appearing while crunching too much WUs simultaneously (or too much large WUs simultaneously) on a RAM disk thus the RAM size became insufficient to satisfy the request. Maybe this is your case too.

[Apr 30, 2016 8:33:29 AM]

Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7668
Status: Offline
Project Badges:


Re: unknown error exit code 195?

I don't think so. This is an dual xeon e5410 with 8gb of ram. It used to crunch up to three CEP2 units simultaneaously while also doing 5 units for other projects, mainly MCM1 without a problem. This is the first and only CEP2 unit I have had for about 3 to 4 weeks. I am thinking defective workunit as two other machines also choked on this unit with errors in the same fashin in about the same time.
Cheers

----------------------------------------

Sgt. Joe
*Minnesota Crunchers*

[Apr 30, 2016 12:03:45 PM]

SekeRob
Master Cruncher
Joined: Jan 7, 2013
Post Count: 2741
Status: Offline


Re: unknown error exit code 195?

Think, defective units is the only thing left in circulation for CEP2.

[Apr 30, 2016 12:10:18 PM]

Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline


Re: unknown error exit code 195?

It sounds convincing (especially if that kind of error didn't show up in the past under the same conditions), however I still see some similarities between your case and mine: when I experienced the above error I was crunching 7 WUs at a time on an 8 GB RAM disk so the load on my CPU was similar to yours. If none of the other crunchers got the Error reading in TMP file message but all of them got the process exited with code 195 (0xc3, -61) message then it may be possible that your WU failed (error code 195) because it was defective but only you received the other message (error readinf in TMP file) because of scarce memory.

[Apr 30, 2016 1:12:16 PM]

Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7668
Status: Offline
Project Badges:


Re: unknown error exit code 195?

Three out of the four errors got the "Error reading in TMP file 53/0 (1479200): No such file or directory" entry. All got the "process exited with code 195" entry. The fifth unit was "server aborted." The one that did not get the TMP error was Windows. Mine was Linux and the other two errors were Darwin, all with the TMP error.
I am sticking with a problem with the work unit. Thanks for the input, your thoughts were appreciated.
Cheers
Edit: spelling

----------------------------------------

Sgt. Joe
*Minnesota Crunchers*

----------------------------------------
[Edit 2 times, last edit by Sgt.Joe at Apr 30, 2016 6:35:32 PM]

[Apr 30, 2016 6:33:25 PM]

Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline


Re: unknown error exit code 195?

I was on Linux when I encountered the error too. Hence I guess it's a Linux related problem which occurs with faulty WUs, contrary to what I thounght until now. Thanks for taking me to the (likely) right answer.

[Apr 30, 2016 7:07:13 PM]

RTorpey
Advanced Cruncher
Joined: Aug 24, 2005
Post Count: 67
Status: Offline
Project Badges:

10 year badge for Human Proteome Folding - Phase 2

2 year badge for Help Cure Muscular Dystrophy

45 day badge for Discovering Dengue Drugs - Together

1 year badge for Nutritious Rice for the World

2 year badge for Help Fight Childhood Cancer

45 day badge for Influenza Antiviral Drug Search

5 year badge for The Clean Energy Project - Phase 2

1 year badge for Computing for Clean Water

2 year badge for Drug Search for Leishmaniasis

1 year badge for GO Fight Against Malaria

14 day badge for Computing for Sustainable Water

50 year badge for Mapping Cancer Markers

10 year badge for Uncovering Genome Mysteries

100 year badge for Outsmart Ebola Together

20 year badge for FightAIDS@Home - Phase 2

20 year badge for Smash Childhood Cancer

100 year badge for Microbiome Immunity Project

20 year badge for Africa Rainfall Project


Re: unknown error exit code 195?

Just had another Windows job exit with:

Result Log

Result Name: E236523_ 493_ S.222.C21H13N5S3.MGWOEAQCIPWOBG-UHFFFAOYSA-N.22_ s1_ 14_ 0--
<core_client_version>7.6.22</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code 195 (0xc3)
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[07:51:57] Number of jobs = 8
[07:51:57] Starting job 0,CPU time has been restored to 0.000000.
Application exited with RC = 0x1
[07:52:43] Finished Job #0
07:52:43 (10572): called boinc_finish

</stderr_txt>
]]>

[May 13, 2016 12:11:15 PM]

[ ]