World Community Grid - View Thread - WU Stuck in Waiting to send

World Community Grid Forums

Category: Completed Research

Forum: Drug Search for Leishmaniasis Forum

Thread: WU Stuck in Waiting to send

Quick Go »

No member browsing this thread

Thread Status: Active
Total posts in this thread: 16

[ ]

Author

This topic has been viewed 3404 times and has 15 replies

Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline


Re: WU Stuck in Waiting to send

Seems to me that the "until death do us part" was already reached [for DSFL/GFAM and Project 20] as before having these devices included in the Intel group led to invalid results... how much effort is acceptable to keep the last outpost in play?

From past conversations with knreed, there would be a feature coming that identifies a device if unsuccessful at science X Y or Z, and then reject further requests for a longer period. Not read if there's also the permanent, but to me there's no point of assigning tasks to devices that are known to always fail on same.

--//--

[Feb 2, 2012 5:08:57 PM]

Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline


Re: WU Stuck in Waiting to send

Ok, WCG gave Socket-A all the chances for DSFL/GFAM. We will miss you Socket-A: "Invalids did us part." wink

Speaking of a different part, time for me to look forward to considering the Piledriver, a.k.a part-2 of the AMD Bulldozer-class CPU family coming up next from AMD. I wish it will be an 8-core with hardware-assisted hyperthreading for a total of 16-cores.
;

[Feb 2, 2012 5:47:41 PM]

KWSN - A Shrubbery
Master Cruncher
Joined: Jan 8, 2006
Post Count: 1585
Status: Offline


Re: WU Stuck in Waiting to send

Thanks for the responses. As long as there are non-VINA applications, I'll keep running this machine while I can. So far, I have only needed to replace the CPU fan, but there's always the next failure looming on the horizon.

----------------------------------------

Distributed computing volunteer since September 27, 2000

[Feb 2, 2012 6:12:54 PM]

Ingleside
Veteran Cruncher
Norway
Joined: Nov 19, 2005
Post Count: 974
Status: Offline
Project Badges:

2 year badge for Human Proteome Folding - Phase 2

180 day badge for Discovering Dengue Drugs - Together

1 year badge for Nutritious Rice for the World

1 year badge for The Clean Energy Project

2 year badge for Help Fight Childhood Cancer

180 day badge for Influenza Antiviral Drug Search

2 year badge for Help Cure Muscular Dystrophy - Phase 2

1 year badge for Discovering Dengue Drugs - Together - Phase 2

2 year badge for The Clean Energy Project - Phase 2

2 year badge for Computing for Clean Water

2 year badge for Drug Search for Leishmaniasis

2 year badge for GO Fight Against Malaria

2 year badge for Computing for Sustainable Water

20 year badge for Mapping Cancer Markers

5 year badge for Uncovering Genome Mysteries

5 year badge for Outsmart Ebola Together

5 year badge for FightAIDS@Home - Phase 2

10 year badge for Microbiome Immunity Project

5 year badge for Africa Rainfall Project

20 year badge for OpenPandemics - COVID-19


Re: WU Stuck in Waiting to send

From past conversations with knreed, there would be a feature coming that identifies a device if unsuccessful at science X Y or Z, and then reject further requests for a longer period. Not read if there's also the permanent, but to me there's no point of assigning tasks to devices that are known to always fail on same.

While it's possible to permanently blacklist a specific computer, I'm not sure if it's possible to do this just for a specific application-version for individual computers...

But, atleast in cases like these there it's known that a whole class of old, slow computers is either erroring-out or only gives invalid results for an application, it's little point of relying on the per-application quota-limits that will after some time set quota to 1 due to all the errors.

A better solution is to use plan-classes, example remove the current DSFL/GFAM-applications and instead release them under an app_plan_SSE2. Since pre-64-bit Amd's don't have SSE2, and it's the same with old P3 and similar cpu's, these cpu's won't get any DSFL/GFRAM-work at all if the only available applications is of the app_plan_SSE2-type.

While BOINC currently don't include a SSE2-plan-class, it should be easy to use the SSE3-one as basis.

----------------------------------------

"I make so many mistakes. But then just think of all the mistakes I don't make, although I might."

[Feb 2, 2012 6:22:44 PM]

knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:

180 day badge for Human Proteome Folding

90 day badge for Human Proteome Folding - Phase 2

45 day badge for Help Cure Muscular Dystrophy - Phase 2

90 day badge for Computing for Clean Water

14 day badge for Uncovering Genome Mysteries

45 day badge for Outsmart Ebola Together

180 day badge for FightAIDS@Home - Phase 2

1 year badge for Microbiome Immunity Project

1 year badge for Africa Rainfall Project

180 day badge for OpenPandemics - COVID-19


Re: WU Stuck in Waiting to send

What happens is that the max_results_day field is added to the host_app_version level. This means that if a client errors out completely or returns only invalids for one app_version but runs other projects fine, then it will drop to 1 result max per day for that project. So it doesn't quite drive it to 0 but that is how it should be for an automatic system since if it is fixed, then it should automatically recover.

[Feb 2, 2012 10:31:56 PM]

Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline


Re: WU Stuck in Waiting to send

OK, great, at least at app version level, but distinctly remember the suggestion for it to be longer, since the CPU as a quorum validator problem is likely not going to go away. At least, the chance with this is substantially less for a 2 or more opted device to run dry. For the 'All Sciences' no worries, the 'if there is no work' for' rule maybe needing review. Mind you, with a quota of 80 per core/day, it will still take multiple days on a 2 and up science selected device to reach that 1 per day quota. Maybe the 'general' quota then also needs adjusting to also be at a science app level? Any such effort of course has to take the achievable gain into account, but to the individual it will most always work as a disappointment/frustration if her/his device gets hit with the issue.

--//--

[Feb 3, 2012 11:34:40 AM]

[ ]