Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 75
|
![]() |
Author |
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 964 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I wonder if the bad batch(es) of SCC1 tasks (for MyoD1-C)[1] are clogging up the work array and might be contributing to the problem? I've been seeing a lot of "No work is available" for everything but SCC1, and the "Tasks are committed to other platforms" message, which is presumably SCC1 as it isn't mentioned otherwise! :-)
As it is, the only "new" work I've seen in the last day or more has been retries for ARP1 tasks that have timed out and SCC1 MyoD1-C tasks (which promptly fail!) On a different note, there's also the matter of many ARP1 work units appearing to be stuck with a quorum but still awaiting validation[2] -- that's a common topic in the ARP1 forum Work Available thread, especially in the last week or so!... Cheers - Al. [1] There's a thread about this in the SCC1 forum but I don't know if it's been spotted by WCG; several of us have been expecting the batch(es) for that specific target to be withdrawn :-) [2] I usually process about a dozen ARP1 tasks a day, yet i have over 80 tasks stuck in "PVal jail" -- some of them have been there so long (with another result also waiting, so it really is "jail") that the "No Reply" tasks have been able to reply days later and still become third (or fourth!) validation candidates! If my small contribution is affected like that, it must be far worse for the heavier hitters :-( |
||
|
brian163
Cruncher USA Joined: Aug 11, 2007 Post Count: 8 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
@cyclops
I'm sure this has been brought up before but can we get status updates on some regular interval? (Every 3 hours?) "Still working on it" may not seem helpful to some but to others it's at least verification that the problem is being worked on and it just strikes me as a simple way to help address the "PR" issues around outages or other issues. I'm sure the team has to sleep so a "we're picking this up tomorrow at <time/zone>" would also likely prove insightful considering folks are in different time zones all around the world so their relative sense of "actively working on it" differs considerably. (You could be asleep right now ;-) ). Just my 2 cents. |
||
|
imakuni
Advanced Cruncher Joined: Jun 11, 2009 Post Count: 103 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I'm sure this has been brought up before but can we get status updates on some regular interval? (Every 3 hours?) Updates are pointless if they are meaningless. "We're working on it" doesn't do anything other than take away time to post a useless message that could be spent trying to diagnose the issue (I'm still not convinced the tech team knows exactly what they need to do). Not even "it's fixed guys!" or "we're out of beta, WCG is back baby!" holds any value anymore since, time and time again, it's been proven that the project is still on life support. If you want updates, keep an eye on the community. They'll post messages when WU and errors start flowing again. ![]() Want to have an image of yourself like this on? Check this thread: https://secure.worldcommunitygrid.org/forums/wcg/viewthread_thread,29840 |
||
|
Grumpy Swede
Master Cruncher Svíþjóð Joined: Apr 10, 2020 Post Count: 2177 Status: Recently Active Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Seems as if they found the problem. I just got the 5 MCM tasks that I asked for.
----------------------------------------[Edit 1 times, last edit by Grumpy Swede at May 16, 2023 10:32:32 PM] |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7668 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
If it is any consolation Denis@Home currently has about 70,000 work units available (22:30 UTC).
----------------------------------------Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
Grumpy Swede
Master Cruncher Svíþjóð Joined: Apr 10, 2020 Post Count: 2177 Status: Recently Active Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Well, I got my 5 MCM tasks, and now when I have crunched, uploaded, and reported 4 of them, the well is dry again.
----------------------------------------![]() Edit, added: Four _2 resend MCM tasks a while later. Not exactly a good flow of MCM tasks yet. [Edit 3 times, last edit by Grumpy Swede at May 17, 2023 12:25:50 AM] |
||
|
shanen0
Cruncher Joined: Feb 4, 2021 Post Count: 21 Status: Offline |
Time for a new project. But in general BOINC seems to have killed itself off.
|
||
|
Robokapp
Senior Cruncher Joined: Feb 6, 2012 Post Count: 249 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Time for a new project. But in general BOINC seems to have killed itself off. it does feel like we're the past rather than the future, doesn't it? |
||
|
bfmorse
Senior Cruncher US Joined: Jul 26, 2009 Post Count: 298 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
@Cyclops
----------------------------------------Noticed an unusual entry in my event.log today. See following: 5/17/2023 10:03:01 AM | World Community Grid | Started upload of ARP1_0018104_143_2_r523091775_1 5/17/2023 10:03:05 AM | World Community Grid | Finished upload of ARP1_0018104_143_2_r523091775_1 5/17/2023 10:03:06 AM | World Community Grid | Sending scheduler request: To fetch work. 5/17/2023 10:03:06 AM | World Community Grid | Reporting 1 completed tasks 5/17/2023 10:03:06 AM | World Community Grid | Requesting new tasks for CPU and Intel GPU 5/17/2023 10:03:07 AM | World Community Grid | Scheduler request completed: got 0 new tasks 5/17/2023 10:03:07 AM | World Community Grid | No tasks sent 5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for OpenPandemics - COVID 19 5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for OpenPandemics - COVID-19 - GPU 5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for Africa Rainfall Project 5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for Help Stop TB 5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for Mapping Cancer Markers 5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for Smash Childhood Cancer 5/17/2023 10:03:07 AM | World Community Grid | This computer has finished a daily quota of 1 tasks 5/17/2023 10:03:07 AM | World Community Grid | Project requested delay of 121 seconds This is especially noteworthy as MOST of the current SCC1 WU's are erroring out and SEEMS to be affecting the release of WU's for ALL other tasks. The highlighted entry seems to confirm that assumption: I return SCC1 WUs that have ERRORS, system not sending me ANY more WU's that might error out again. This particular computer system has EIGHT idle processes and EITHER there really are NO AVAILABLE WU's -OR- the system has deemed my computers as a FAILURE RISK and has throttled back WU's that get released to me in a preprogrammed attempt to minimize my impact on reliable data. All because the data set for several SCC1 WU''s appear to have been constructed with formatting errors, as pointed out by another volunteer. I had sent an email with representative ERRORED out SCC1 data but have not received your reply. Email still down? The lack of WU throughput is unfortunate from the perspective of the scientists' anticipating returns on the current batches, yes, and adds to the frustration of the volunteers as well! Bruce ETA: have removed SCC1 from active processing until issues have been resolved. [Edit 1 times, last edit by bfmorse at May 17, 2023 3:10:07 PM] |
||
|
Cyclops
Senior Cruncher Joined: Jun 13, 2022 Post Count: 295 Status: Offline |
@Cyclops Noticed an unusual entry in my event.log today. See following: 5/17/2023 10:03:01 AM | World Community Grid | Started upload of ARP1_0018104_143_2_r523091775_1 5/17/2023 10:03:05 AM | World Community Grid | Finished upload of ARP1_0018104_143_2_r523091775_1 5/17/2023 10:03:06 AM | World Community Grid | Sending scheduler request: To fetch work. 5/17/2023 10:03:06 AM | World Community Grid | Reporting 1 completed tasks 5/17/2023 10:03:06 AM | World Community Grid | Requesting new tasks for CPU and Intel GPU 5/17/2023 10:03:07 AM | World Community Grid | Scheduler request completed: got 0 new tasks 5/17/2023 10:03:07 AM | World Community Grid | No tasks sent 5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for OpenPandemics - COVID 19 5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for OpenPandemics - COVID-19 - GPU 5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for Africa Rainfall Project 5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for Help Stop TB 5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for Mapping Cancer Markers 5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for Smash Childhood Cancer 5/17/2023 10:03:07 AM | World Community Grid | This computer has finished a daily quota of 1 tasks 5/17/2023 10:03:07 AM | World Community Grid | Project requested delay of 121 seconds This is especially noteworthy as MOST of the current SCC1 WU's are erroring out and SEEMS to be affecting the release of WU's for ALL other tasks. The highlighted entry seems to confirm that assumption: I return SCC1 WUs that have ERRORS, system not sending me ANY more WU's that might error out again. This particular computer system has EIGHT idle processes and EITHER there really are NO AVAILABLE WU's -OR- the system has deemed my computers as a FAILURE RISK and has throttled back WU's that get released to me in a preprogrammed attempt to minimize my impact on reliable data. All because the data set for several SCC1 WU''s appear to have been constructed with formatting errors, as pointed out by another volunteer. I had sent an email with representative ERRORED out SCC1 data but have not received your reply. Email still down? The lack of WU throughput is unfortunate from the perspective of the scientists' anticipating returns on the current batches, yes, and adds to the frustration of the volunteers as well! Bruce ETA: have removed SCC1 from active processing until issues have been resolved. Hi bfmorse, we are looking into the SCC1 workunit errors and the effect they are having on the rest of the workunits being sent out. |
||
|
|
![]() |