Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 1
|
![]() |
Author |
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
World Community Grid requires work send to volunteered computers to be returned before an X number of days. Reason is to ensure the overall project batches do not get delayed. At the same time this facilitates the participation of devices that are on only few hours a day. E.g. home computers can process a project in the background whilst doing email, web browsing and other housekeeping chores. The principle is that every cycle counts and a work unit eventually does get completed. Frequent Checkpoint-Saving will let these jobs resume very near to where they were shut down the previous time.
----------------------------------------The deadlines for work generated, updated as of Sep. 06, 2013 are:
When the feeders get clogged [due to backlog of rush jobs not finding a wingman in their own homogeneity class], one of the ways to clear it is to drop the 'reliability' feature temporarily so that the workunits get assigned faster. In that case the repair units are given the original deadline period as per above. Rush jobs to make up for tasks missing ** deadlines are usually a fraction of the original project deadline, 1-5 days, the server computes for each device (host), but are generally send to those computers known to WCG for quick, valid returns and high current uptime history ***. The Rush deadline algorithm is:
Beta jobs have varying deadlines depending on criteria like pre-launch testing or bug resolution. Again they have short return time requirements. Tasks that were lost by the client and are resubmitted to same client get a new sent time equal to the date/time of resending. Their deadlines are then recalculated as a function of past device return history and original deadline. Most cases this works out to be the original deadline, sometimes later, sometimes earlier. The maximum total work queued a BOINC client allows is 10 days and can either be set in the My Grid > Device Manager > Device Profile > Profile of Choice > Custom Options > Cache, or in the versions 5.10 and higher of BOINC clients using the Advanced menu > Preferences > Network Usage > "Additional Buffer Days" option. BOINC will automatically manage the buffered work order through it's scheduler function so all can meet the individual assigned Task deadline. Rush jobs ++ will show messages like "Running - High Priority" and others like "Waiting to Run" if paused or preempted. It is not advisable to exceed 7 days of work buffer [Recommend maximum 4] if e.g. running the 10 day deadline projects, alone or in a mix, simply because of potential outages, particularly if work is only returned periodically. Take computer down time into consideration! Secondly, do not fill up the buffer with brand new projects as often they suffer child diseases and bugs in the new science software. An initial full size production could reveal those late ones and cause a client to process large volumes of work that might in fact be bad. WCG can remotely cancel that work when aware of a high failure rate, but only if the host is on-line and has a scheduled contact with the project servers. For information on the number of copies circulated to arrive at proper quorum validation visit the FAQ BOINC: Minimum Quorum &am... Work Units (Tasks)[/size ** Late Tasks generally do not get credit granted unless completed and sent back before the Extra Copy is returned. If not in time they get marked as "Too Late" on the Result Status page and get zero credit, except where WCG stopped the redistribution of additional quorum copies due work unit problems. Then credit will be granted manually: No need to alert staff! *** Fast / Reliable clients are determined by the following:
NB: ++ Because of database efficiency reasons, deadlines for "Rush" jobs (Repair & Make up for bad jobs) use the 40% rule (40% of initial quorum deadline). A request is outstanding to give these particular "Rush" jobs the same deadline as the original, with a minimum of the 40% rule. These Rush Jobs are only send to "reliable" clients that are known to have a very high "valid" rate and return results usually within 48 hours from submission to a device. 40% was chosen because occasionally a very long running job goes out. Then a short deadline could not be met. +++ A) Last 5 (five) checked results as valid. [Zero redundant sciences require >= 20 (twenty) consecutive validation to be allowed to run them alone.] B) An average return time of less than or equal to 2 days (48 hours) C) A maximum quota permission at time of any work assignment (120 per device core as at March 6, 2012) Results that require a second result copy for verification go to any host. ++ As of March 2013, user aborted tasks that have no computing time on them are reissued also to regular clients that are not full repair rated. This is because large abort events could cause the feeder shared memory priority of these 'repair' jobs led to work temporarily only going to 'reliable' clients and 'regular' client getting messages there was temporarily no work. Related topics: Return to Start Here FAQ Index
WCG
----------------------------------------Please help to make the Forums an enjoyable experience for All! [Edit 72 times, last edit by Former Member at Sep 6, 2013 7:33:48 AM] |
||
|
|
![]() |