Total posts in this thread: 260
This topic has been viewed 252410 times and has 259 replies.
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 983
Re: Project Status (First Post Updated)

[TL/DR summary -- assimilations still happening at a good rate, but file deletion and d/b purging falling behind...]

Adri suggests that some attempt at a balance between new work and tidying up after the assimilation problems is being made. It does indeed behave as if the MCM1 WU generator/feeder pipeline is deliberately left unfilled on occasion -- if that is happening, it might explain why I sometimes get sent nothing but retries and at other times retries seem to not get a look in :-)

For what it's worth, I suspect that the only difficulty that sending out new work creates for the post-assimilation clean-up is a slight increase in query times. The mass of successful assimilations is probably putting an equally heavy load on queries, if not a heavier one: transitioner queries don't see WUs waiting for assimilation, but as soon as they are assimilated the WUs become visible and may well become part of a transitioner backlog...

By the way, it seems that the file deleter(s) can't quite keep up with the assimilators -- I'm seeing the number of items in delete state 1 (waiting for file deletion) increasing at a fairly steady rate... Also I see a steady increase in items at delete state 2, indicating that the db_purge process seems unable to keep on top of the volume of WUs it needs to process at present, and (unfortunately) there's no real solution to that problem[*1] :-(

(As at 02:30 UTC on 2024-03-06, about 50% of my results assimilated since the 2024-02-29 assimilator(s) restart have been purged; about 40% are waiting to be purged and the other 10% or so are waiting for file deletion.)

As has often been pointed out, a lot of patience is going to be needed over the weeks it is likely to take for the assimilation, deletion and purging backlog to be completely cleared. And as has also been said more than once, at least WCG still exists, even if it isn't as resilient as it was when it had access to "free" IBM resources [but in this thread I'm probably preaching to the choir :-)]

Cheers - Al.

*1 I don't know whether db_purge would get through its work faster if there were no other BOINC activity going on! There's a lot of file-writing going on and a lot of database record deletion (and index restructuring) as well... (And no, it isn't possible to run more than one standard db_purge process at once!)

[Edit: added information on %ages purged and waiting.]
----------------------------------------
[Edit 1 times, last edit by alanb1951 at Mar 6, 2024 3:11:58 AM]
[Mar 6, 2024 3:03:22 AM]
adriverhoef
Master Cruncher
The Netherlands
Joined: Apr 3, 2009
Post Count: 2171

Reading Al's comments on Just Jake's problem(s), I noticed that one of my systems, an AMD Ryzen 7-laptop, is getting only OPNG-work lately (almost plentiful!) and no MCM1-work at all after 2024-03-04T16:42:47 (two days ago), when it was assigned 4 MCM1-tasks (about the size of its queue).

Haven't tried switching profiles (yet), nor shutting down BOINC temporarily, and (probably) will not as long as there are enough OPNG-tasks flowing in (in order to see if the problem resolves itself in the coming days).

Nevertheless, I think it's a good suggestion - "if you have a system that's not getting work" as Al said - to "try juggl(i)ng profiles". ;-)

Adri
[Mar 6, 2024 10:11:17 AM]
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 983

Thanks to Adri for spotting the typo :-)

As for not getting work on a particular system... I sometimes find that my Ryzens don't get as much work as my Intel systems if it's mostly retries being dished out. Sometimes the Intels might get a full quota [of retries] whilst the Ryzens get nothing at all :-(

It's probably all coincidental, of course :-)

On a more general issue regarding work issued -- I notice a fair number of new posts in other places bewailing the lack of work, and a check on my systems suggests there might well be more of an issue than previously...

Two of my systems are now out of work and the other two are well below quota. After a pause of several hours, one or two new tasks for WUs created yesterday showed up (well out of sequence!) but there's nowhere near enough to get up to quota...

So it does look as if we're more or less out of work for now (and OPNG supply seems to have dried up too, though that's not as surprising...) -- reasons as yet unknown :-)

Ah well, with luck it'll help ease the resource pressure on file delete and db_purge.

Cheers - Al.

P.S. there's a wonderful new thread in Website Support about a user not being able to load the results page(s). It turns out the user has nearly a million results; that'll stress the user's browser :-) [and it's an indication of just how much of a total backlog there might be...]
[Mar 6, 2024 1:57:44 PM]
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12436

I am finding I get a few units (Intel/Windows) at a time and only occasionally.

I, too, have difficulty getting my results page(s) and I only have 12,000! It starts with greyed out lines and sometimes has to be refreshed a time or two.

Mike
----------------------------------------
[Edit 1 times, last edit by Mike.Gibson at Mar 6, 2024 6:40:57 PM]
[Mar 6, 2024 6:40:03 PM]
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12436

Purging continues. Oldest now 13 November.

Mike
[Mar 6, 2024 6:54:46 PM]
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12436

Is the purging causing poor access to results?

Mike
[Mar 6, 2024 6:56:22 PM]
thunder7
Senior Cruncher
Netherlands
Joined: Mar 6, 2013
Post Count: 232

Quote (Mike.Gibson): "Is the purging causing poor access to results?"

Evidence suggests purging is causing bad communication ;-)
[Mar 6, 2024 8:36:01 PM]
adriverhoef
Master Cruncher
The Netherlands
Joined: Apr 3, 2009
Post Count: 2171

One machine of mine just got 50 OPNG-tasks in the past 20 minutes. Maybe the machine should visit a casino or buy a lottery ticket today. ;-)

Adri
[Mar 6, 2024 10:47:03 PM]
Grumpy Swede
Master Cruncher
Svíþjóð
Joined: Apr 10, 2020
Post Count: 2209

Lots of OPNG tasks coming in here too, Adri.
[Mar 6, 2024 11:19:41 PM]
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 995

Thank you for the updates on purging and getting WUs.
[Mar 7, 2024 12:09:53 AM]