World Community Grid Forums
Thread Status: Active Total posts in this thread: 84
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hmm.. don't have access to the system right now...
Thanks anyway for your help, Sekerob. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Yep! I reached the system (was not easy from home..)
CPU is a Pentium 3 ... I know.. an old one .. but still okay for crunching... |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
OK, that suggests there's a pattern (if memory serves me right, on at least one other user's system)... maybe something going on with older CPU families.
----------------------------------------
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Anything I could do to help with troubleshooting? |
||
|
martin64
Senior Cruncher Germany Joined: May 11, 2009 Post Count: 445 Status: Offline |
I got several "Application exited with RC = 0x4" errors on one of my Linux systems when running CEP2 Beta WUs. Now this error appears again with the normal CEP2 WUs on this system. On one of my computers it's similar, but with exit code 195. Production WUs run for between half a minute and 3 minutes, then they all abort.

<core_client_version>6.10.56</core_client_version>
<![CDATA[
<message>
process exited with code 195 (0xc3, -61)
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[08:11:16] Number of jobs = 16
[08:11:16] Starting job 0,CPU time has been restored to 0.000000.
[08:11:16] Starting new Job
[08:11:16] Qink name = fldman
[08:11:16] Qink name = gesman
[08:11:16] Qink name = scfman
Application exited with RC = 0xb
[08:11:18] Finished Job #0
called boinc_finish
Exiting 195

Starting BOINC client version 6.10.56 for i686-pc-linux-gnu
Processor: 2 GenuineIntel Intel(R) Core(TM)2 Duo CPU P8400 @ 2.26GHz [Family 6 Model 23 Stepping 10]
OS: Linux: 2.6.31.6 (KNOPPIX 6.2.1)
Memory: 1.85 GB physical, 0 bytes virtual
Disk: 7.33 GB total, 5.25 GB free

Regards, Martin |
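For anyone puzzled by the three numbers in "process exited with code 195 (0xc3, -61)": they are the same value written three ways (decimal, hex, and the low byte read as a signed 8-bit number). A minimal Python sketch of that arithmetic follows; the function name `describe_exit_code` is made up for illustration and is not part of BOINC.

```python
def describe_exit_code(code: int) -> str:
    """Format an exit status as: decimal (hex, signed 8-bit).

    Hypothetical helper for illustration only; it just mirrors the
    layout of the client's log line quoted in the post.
    """
    low_byte = code & 0xFF  # exit statuses fit in one byte
    # Reinterpret the byte as a signed 8-bit value (two's complement).
    signed = low_byte - 256 if low_byte >= 128 else low_byte
    return f"{code} ({hex(low_byte)}, {signed})"


print(describe_exit_code(195))

# The "RC" values in the posts are plain hex numbers as well.
print(int("0xb", 16), int("0x4", 16))
```

This only decodes the notation. It says nothing about why the worker application returned those codes.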
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Found only one previous report, by myself, from a log of a wingman:
http://www.worldcommunitygrid.org/forums/wcg/...ad,29123_offset,40#281139
Seems your description pretty much reflects what the error code text says... failing of the worker app to start properly. See in your report only the VM being zero as suspicious when 2 concurrent may not fit into memory.
----------------------------------------
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
martin64
Senior Cruncher Germany Joined: May 11, 2009 Post Count: 445 Status: Offline |
"See in your report only the VM being zero as suspicious when 2 concurrent may not fit into memory."
That's what I thought at first as well, and I was sure you would bring this up. However, my other machine (E6300 dual core) with the same Linux configuration and 2 GB memory hasn't produced any errors on CEP2 yet. I also tried switching to one core, and even opted out of "leave in memory", but it didn't help.
Regards, Martin |
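To double-check the "0 bytes virtual" figure outside of BOINC, one option on Linux is to look at the `SwapTotal` line in `/proc/meminfo`. The sketch below assumes that the client's "virtual" number corresponds to configured swap, which is not confirmed anywhere in this thread. The sample text is invented to resemble a machine with no swap, and `swap_total_kib` is a hypothetical helper name.

```python
def swap_total_kib(meminfo_text: str) -> int:
    """Return the SwapTotal value (in kB) from /proc/meminfo-style text.

    Returns 0 when the line is missing. Hypothetical helper, not a
    BOINC function.
    """
    for line in meminfo_text.splitlines():
        if line.startswith("SwapTotal:"):
            # Line layout: "SwapTotal:       <number> kB"
            return int(line.split()[1])
    return 0


# Invented sample resembling a live-CD system without a swap partition.
SAMPLE_MEMINFO = """MemTotal:        1939456 kB
MemFree:          512000 kB
SwapTotal:             0 kB
SwapFree:              0 kB
"""

print(swap_total_kib(SAMPLE_MEMINFO))
```

On a real system the text would come from `open("/proc/meminfo").read()`. A zero here would only confirm that no swap is configured; as the E6300 comparison shows, that alone does not explain the aborts.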
||
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1323 Status: Offline |
I got two repair jobs, at 22:41 UTC and 06:05 UTC.
Will they be used by the scientists even though CEP2 has already launched? Or are they just some extra runtime hours for me on the way to the next BETA badge? |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Is it a repair due to an error or a make-up for a no-reply?
Last time the techs wanted the test cycle to complete. One way to see it could be that a problem would be highlighted before we get to that stage in a production batch... would those last quorums turn valid or inconclusive or error?
----------------------------------------
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1323 Status: Offline |
"Is it a repair due to an error or a make-up for a no-reply?"
Both jobs were because of 'No Reply'. Meanwhile I returned the first job, and both my result and the wingman's are valid.
btw: Those long-running Betas do have a negative influence on the DCF, so the estimates for the HCCs are ~70% too high now. |
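A rough illustration of the DCF remark, assuming DCF means the BOINC client's duration correction factor and that it acts as a single per-project multiplier on runtime estimates (so a jump caused by long Betas also inflates HCC estimates). All numbers below are hypothetical, picked only to reproduce the "~70% too high" figure.

```python
def corrected_estimate(raw_estimate_hours: float, dcf: float) -> float:
    """Runtime estimate after applying the correction factor (assumed model)."""
    return raw_estimate_hours * dcf


def overestimate_percent(dcf_settled: float, dcf_inflated: float) -> float:
    """How much higher estimates become when the DCF rises, in percent."""
    return (dcf_inflated / dcf_settled - 1.0) * 100.0


raw_hcc_hours = 2.0  # hypothetical uncorrected HCC estimate
dcf_settled = 1.0    # hypothetical DCF before the long Betas
dcf_inflated = 1.7   # hypothetical DCF after them

print(f"{corrected_estimate(raw_hcc_hours, dcf_settled):.1f} h before")
print(f"{corrected_estimate(raw_hcc_hours, dcf_inflated):.1f} h after")
print(f"{overestimate_percent(dcf_settled, dcf_inflated):.0f}% too high")
```

Under this model the estimates would drift back as shorter tasks complete and the factor settles again.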
||