Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Completed Research Forum: The Clean Energy Project - Phase 2 Forum Thread: [Resolved] CEP2 Validation Issues |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 33
|
Author |
|
AgrFan
Senior Cruncher USA Joined: Apr 17, 2008 Post Count: 358 Status: Offline Project Badges: |
This unit exceeded the deadline in Job #0 and was marked as Error. This should not happen. The validator still has issues. 18 hours wasted
----------------------------------------E225614_ 783_ S.310.C38H31N5O2.LRTXBBMRZJMOBG-UHFFFAOYSA-N.13_ s1_ 14_ 0-- 700 Error 10/2/14 21:52:35 10/3/14 16:02:44 18.00 336.9 / 0.0 <core_client_version>7.0.27</core_client_version> <![CDATA[ <stderr_txt> INFO: No state to restore. Start from the beginning. [17:54:59] Number of jobs = 8 [17:54:59] Starting job 0,CPU time has been restored to 0.000000. [17:54:59] Starting new Job [17:54:59] Qink name = fldman [17:55:01] Qink name = gesman ... [11:37:30] Qink name = gesman [11:37:32] Qink name = scfman Killing job because cpu time limit has been exceeded. 0.000000||64800.053749||0.000000 [12:04:21] Finished Job #0 12:04:22 (1583): called boinc_finish </stderr_txt> ]]> [Edit 4 times, last edit by AgrFan at Oct 3, 2014 11:46:06 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Aggravating as this may sound and rather presumptuous on my part to speak of behalf of, but eventually choices had to be made over which volunteer devices to target and not compromise the object of this research, to also compute bigger molecules, real or synthesized. If 18 hours is not enough to get through even 1 of 8 jobs, the heaviest at start, than maybe consider your device was not made for this 'opt-in' science. If the validator were to be set to credit for even an incomplete job #0, it would invite others to go compute cep2 when clearly not fit for it. Read the instructions, then opt a device in or out. Just tested on linux with an ancient and one cpu thread and it has a very hard time with the first part, so decided it will not run this project routinely, opted-out.
----------------------------------------What we're 'waiting' for is wcg developing skill to target heavy sets of tasks to heavy machines and lighter to, well less powerful. This would open up cep2 to a wider range of devices and consequently speed up the research. That ball is thoroughly in wcg/cleanenergy court, if they're interested to play that ball. 1) cleanenergy teaching wcg how to recognize heavy batches/molecules (pre-run a task for each batch for instance) 2) wcg classifying volunteer devices based on for instance their benchmark and active/on_frac hours. Then we would not have lost time. But how big is the problem fraction, is it 0.01 percent, 0.1 percent 1.0 percent, 5 percent, 10 percent and is this worth the effort in a context of for instance doubling the cep2 participation, again if cleanenergy is even interested, or can even handle that much incoming data? [Edit 1 times, last edit by Former Member at Oct 4, 2014 8:03:25 AM] |
||
|
AgrFan
Senior Cruncher USA Joined: Apr 17, 2008 Post Count: 358 Status: Offline Project Badges: |
I've been able to run 4 concurrent CEP2 units successfully on this box for many months. It's a Intel i5-650 8GB RAM running Ubuntu Server 64-bit. Plenty of horsepower. Let's see what happens with the second copy. My quess is it will be successful. This is what is so frustrating about this project.
----------------------------------------EDIT: second copy completed after 16 hours and received full credit. Job #0 took 12 hours. It failed in Job #6 with a 0x1 return code. [Edit 4 times, last edit by AgrFan at Oct 5, 2014 12:19:20 AM] |
||
|
|