Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 75
Posts: 75   Pages: 8   [ Previous Page | 1 2 3 4 5 6 7 8 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 20782 times and has 74 replies Next Thread
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 964
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: May 2023 workunit update

I wonder if the bad batch(es) of SCC1 tasks (for MyoD1-C)[1] are clogging up the work array and might be contributing to the problem? I've been seeing a lot of "No work is available" for everything but SCC1, and the "Tasks are committed to other platforms" message, which is presumably SCC1 as it isn't mentioned otherwise! :-)

As it is, the only "new" work I've seen in the last day or more has been retries for ARP1 tasks that have timed out and SCC1 MyoD1-C tasks (which promptly fail!)

On a different note, there's also the matter of many ARP1 work units appearing to be stuck with a quorum but still awaiting validation[2] -- that's a common topic in the ARP1 forum Work Available thread, especially in the last week or so!...

Cheers - Al.

[1] There's a thread about this in the SCC1 forum but I don't know if it's been spotted by WCG; several of us have been expecting the batch(es) for that specific target to be withdrawn :-)

[2] I usually process about a dozen ARP1 tasks a day, yet i have over 80 tasks stuck in "PVal jail" -- some of them have been there so long (with another result also waiting, so it really is "jail") that the "No Reply" tasks have been able to reply days later and still become third (or fourth!) validation candidates! If my small contribution is affected like that, it must be far worse for the heavier hitters :-(
[May 16, 2023 9:20:36 PM]   Link   Report threatening or abusive post: please login first  Go to top 
brian163
Cruncher
USA
Joined: Aug 11, 2007
Post Count: 8
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: May 2023 workunit update

@cyclops

I'm sure this has been brought up before but can we get status updates on some regular interval? (Every 3 hours?)

"Still working on it" may not seem helpful to some but to others it's at least verification that the problem is being worked on and it just strikes me as a simple way to help address the "PR" issues around outages or other issues.

I'm sure the team has to sleep so a "we're picking this up tomorrow at <time/zone>" would also likely prove insightful considering folks are in different time zones all around the world so their relative sense of "actively working on it" differs considerably. (You could be asleep right now ;-) ).

Just my 2 cents.
[May 16, 2023 10:00:34 PM]   Link   Report threatening or abusive post: please login first  Go to top 
imakuni
Advanced Cruncher
Joined: Jun 11, 2009
Post Count: 103
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: May 2023 workunit update

I'm sure this has been brought up before but can we get status updates on some regular interval? (Every 3 hours?)

Updates are pointless if they are meaningless. "We're working on it" doesn't do anything other than take away time to post a useless message that could be spent trying to diagnose the issue (I'm still not convinced the tech team knows exactly what they need to do). Not even "it's fixed guys!" or "we're out of beta, WCG is back baby!" holds any value anymore since, time and time again, it's been proven that the project is still on life support.

If you want updates, keep an eye on the community. They'll post messages when WU and errors start flowing again.
----------------------------------------

Want to have an image of yourself like this on? Check this thread: https://secure.worldcommunitygrid.org/forums/wcg/viewthread_thread,29840
[May 16, 2023 10:23:17 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Grumpy Swede
Master Cruncher
Svíþjóð
Joined: Apr 10, 2020
Post Count: 2177
Status: Recently Active
Project Badges:
Reply to this Post  Reply with Quote 
Re: May 2023 workunit update

Seems as if they found the problem. I just got the 5 MCM tasks that I asked for.
----------------------------------------
[Edit 1 times, last edit by Grumpy Swede at May 16, 2023 10:32:32 PM]
[May 16, 2023 10:30:28 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7668
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: May 2023 workunit update

If it is any consolation Denis@Home currently has about 70,000 work units available (22:30 UTC).
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[May 16, 2023 10:33:23 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Grumpy Swede
Master Cruncher
Svíþjóð
Joined: Apr 10, 2020
Post Count: 2177
Status: Recently Active
Project Badges:
Reply to this Post  Reply with Quote 
Re: May 2023 workunit update

Well, I got my 5 MCM tasks, and now when I have crunched, uploaded, and reported 4 of them, the well is dry again. sad

Edit, added: Four _2 resend MCM tasks a while later. Not exactly a good flow of MCM tasks yet.
----------------------------------------
[Edit 3 times, last edit by Grumpy Swede at May 17, 2023 12:25:50 AM]
[May 16, 2023 11:55:01 PM]   Link   Report threatening or abusive post: please login first  Go to top 
shanen0
Cruncher
Joined: Feb 4, 2021
Post Count: 21
Status: Offline
Reply to this Post  Reply with Quote 
Re: May 2023 workunit update

Time for a new project. But in general BOINC seems to have killed itself off.
[May 17, 2023 3:29:20 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Robokapp
Senior Cruncher
Joined: Feb 6, 2012
Post Count: 249
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: May 2023 workunit update

Time for a new project. But in general BOINC seems to have killed itself off.


it does feel like we're the past rather than the future, doesn't it?
[May 17, 2023 5:15:30 AM]   Link   Report threatening or abusive post: please login first  Go to top 
bfmorse
Senior Cruncher
US
Joined: Jul 26, 2009
Post Count: 298
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: May 2023 workunit update

@Cyclops

Noticed an unusual entry in my event.log today. See following:

5/17/2023 10:03:01 AM | World Community Grid | Started upload of ARP1_0018104_143_2_r523091775_1
5/17/2023 10:03:05 AM | World Community Grid | Finished upload of ARP1_0018104_143_2_r523091775_1
5/17/2023 10:03:06 AM | World Community Grid | Sending scheduler request: To fetch work.
5/17/2023 10:03:06 AM | World Community Grid | Reporting 1 completed tasks
5/17/2023 10:03:06 AM | World Community Grid | Requesting new tasks for CPU and Intel GPU
5/17/2023 10:03:07 AM | World Community Grid | Scheduler request completed: got 0 new tasks
5/17/2023 10:03:07 AM | World Community Grid | No tasks sent
5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for OpenPandemics - COVID 19
5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for OpenPandemics - COVID-19 - GPU
5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for Africa Rainfall Project
5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for Help Stop TB
5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for Mapping Cancer Markers
5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for Smash Childhood Cancer
5/17/2023 10:03:07 AM | World Community Grid | This computer has finished a daily quota of 1 tasks
5/17/2023 10:03:07 AM | World Community Grid | Project requested delay of 121 seconds

This is especially noteworthy as MOST of the current SCC1 WU's are erroring out and SEEMS to be affecting the release of WU's for ALL other tasks. The highlighted entry seems to confirm that assumption:

I return SCC1 WUs that have ERRORS, system not sending me ANY more WU's that might error out again.

This particular computer system has EIGHT idle processes and EITHER there really are NO AVAILABLE WU's -OR- the system has deemed my computers as a FAILURE RISK and has throttled back WU's that get released to me in a preprogrammed attempt to minimize my impact on reliable data.

All because the data set for several SCC1 WU''s appear to have been constructed with formatting errors, as pointed out by another volunteer. I had sent an email with representative ERRORED out SCC1 data but have not received your reply. Email still down?

The lack of WU throughput is unfortunate from the perspective of the scientists' anticipating returns on the current batches, yes, and adds to the frustration of the volunteers as well!

Bruce
ETA: have removed SCC1 from active processing until issues have been resolved.
----------------------------------------
[Edit 1 times, last edit by bfmorse at May 17, 2023 3:10:07 PM]
[May 17, 2023 2:52:13 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Cyclops
Senior Cruncher
Joined: Jun 13, 2022
Post Count: 295
Status: Offline
Reply to this Post  Reply with Quote 
Re: May 2023 workunit update

@Cyclops

Noticed an unusual entry in my event.log today. See following:

5/17/2023 10:03:01 AM | World Community Grid | Started upload of ARP1_0018104_143_2_r523091775_1
5/17/2023 10:03:05 AM | World Community Grid | Finished upload of ARP1_0018104_143_2_r523091775_1
5/17/2023 10:03:06 AM | World Community Grid | Sending scheduler request: To fetch work.
5/17/2023 10:03:06 AM | World Community Grid | Reporting 1 completed tasks
5/17/2023 10:03:06 AM | World Community Grid | Requesting new tasks for CPU and Intel GPU
5/17/2023 10:03:07 AM | World Community Grid | Scheduler request completed: got 0 new tasks
5/17/2023 10:03:07 AM | World Community Grid | No tasks sent
5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for OpenPandemics - COVID 19
5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for OpenPandemics - COVID-19 - GPU
5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for Africa Rainfall Project
5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for Help Stop TB
5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for Mapping Cancer Markers
5/17/2023 10:03:07 AM | World Community Grid | No tasks are available for Smash Childhood Cancer
5/17/2023 10:03:07 AM | World Community Grid | This computer has finished a daily quota of 1 tasks
5/17/2023 10:03:07 AM | World Community Grid | Project requested delay of 121 seconds

This is especially noteworthy as MOST of the current SCC1 WU's are erroring out and SEEMS to be affecting the release of WU's for ALL other tasks. The highlighted entry seems to confirm that assumption:

I return SCC1 WUs that have ERRORS, system not sending me ANY more WU's that might error out again.

This particular computer system has EIGHT idle processes and EITHER there really are NO AVAILABLE WU's -OR- the system has deemed my computers as a FAILURE RISK and has throttled back WU's that get released to me in a preprogrammed attempt to minimize my impact on reliable data.

All because the data set for several SCC1 WU''s appear to have been constructed with formatting errors, as pointed out by another volunteer. I had sent an email with representative ERRORED out SCC1 data but have not received your reply. Email still down?

The lack of WU throughput is unfortunate from the perspective of the scientists' anticipating returns on the current batches, yes, and adds to the frustration of the volunteers as well!

Bruce
ETA: have removed SCC1 from active processing until issues have been resolved.

Hi bfmorse, we are looking into the SCC1 workunit errors and the effect they are having on the rest of the workunits being sent out.
[May 17, 2023 3:21:23 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 75   Pages: 8   [ Previous Page | 1 2 3 4 5 6 7 8 | Next Page ]
[ Jump to Last Post ]
Post new Thread