Thread Status: Active
Total posts in this thread: 12
This topic has been viewed 1409 times and has 11 replies
JSYKES
Senior Cruncher
Joined: Apr 28, 2007
Post Count: 200
Status: Offline
Delayed polling for more WU's??

Since upgrading to 6.10.58, I've noticed that the client polls for work far less frequently, and altering the preference settings seems to have no effect on the polling frequency. For the last couple of days it has been down to running 5 WUs rather than 8, with 23 sitting waiting to be uploaded in the evening. The only way I can get the upload going is to log on to the WCG website; I can't force a 'network communication' to do the trick and ease the log-jam. This is a real pain, as I can't leave the machine on its own for more than 8 hours; otherwise the upload/download streams are massive and it sits idle, having run out of WUs.

Has anyone else noticed this since upgrading to the latest version?
----------------------------------------

[Oct 6, 2010 9:34:16 PM]
gb077492
Advanced Cruncher
Joined: Dec 24, 2004
Post Count: 96
Status: Offline
Re: Delayed polling for more WU's??

I have upgraded a number of machines, from uniprocessors up to 8 cores, some with HT and some not, and I've not seen this behaviour on any of them. I have noticed that the Projects view no longer tells you how long before it will retry if it fails to get any tasks, but I have no reason to think that the polling frequency is any different than it was before.

Maybe someone who follows the BOINC forums can say more...?
[Oct 6, 2010 10:58:22 PM]
Ingleside
Veteran Cruncher
Norway
Joined: Nov 19, 2005
Post Count: 974
Status: Offline
Re: Delayed polling for more WU's??

Since upgrading to 6.10.58, I've noticed that the client polls for work far less frequently, and altering the preference settings seems to have no effect on the polling frequency. For the last couple of days it has been down to running 5 WUs rather than 8, with 23 sitting waiting to be uploaded in the evening. The only way I can get the upload going is to log on to the WCG website; I can't force a 'network communication' to do the trick and ease the log-jam. This is a real pain, as I can't leave the machine on its own for more than 8 hours; otherwise the upload/download streams are massive and it sits idle, having run out of WUs.

Has anyone else noticed this since upgrading to the latest version?

Not asking for work seems to be the easy one in your instance...

If a BOINC project has more than 2 x #cores uploads pending, regardless of whether each upload is in progress or waiting, work requests to that project are blocked until enough uploads have finished.
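
As a rough illustration of that rule, here is a minimal Python sketch. It is not the BOINC client's code; the function name and the assumption of 8 cores (implied by the original post's "5 WU's rather than 8") are mine.

```python
def work_request_blocked(pending_uploads: int, n_cores: int) -> bool:
    """Hypothetical helper: True when a project's unfinished uploads
    (in progress or waiting) exceed 2 x the number of cores."""
    return pending_uploads > 2 * n_cores


# Original poster's situation, assuming an 8-core machine:
# 23 results waiting to upload against a limit of 2 * 8 = 16.
blocked_now = work_request_blocked(23, 8)
blocked_after_uploads = work_request_blocked(16, 8)
print(blocked_now, blocked_after_uploads)
```

Under this reading, work requests would resume once the backlog drops to 16 uploads or fewer.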

As for project backoffs for failed work requests: if you fail to connect to the project server at all, you'll get a normal random backoff of between 1 minute and 4 hours, just like before. This backoff shows up in BOINC Manager.
If, on the other hand, you connect to the server but don't get any work for a work request, you'll also get a random backoff. This backoff has an upper bound of 1 minute for the first failed request, and the upper bound is doubled for each successive failed work request, up to a maximum of 24 hours. The length of this backoff is shown if you select the project and hit "Properties". If you hit "Update", or if a task in the project finishes, this kind of backoff is zeroed out...
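
The doubling upper bound described above can be sketched roughly as follows. This is only an illustration in Python, not the client's implementation: the names are made up, and drawing the delay uniformly from zero up to the cap is an assumption, since the lower bound isn't stated here.

```python
import random

FIRST_CAP_S = 60          # upper bound after the 1st failed work request: 1 minute
MAX_CAP_S = 24 * 3600     # the upper bound never grows past 24 hours


def backoff_cap(failed_requests: int) -> int:
    """Upper bound (seconds) on the backoff after N successive failed work requests."""
    if failed_requests < 1:
        return 0  # zeroed out, e.g. after "Update" or a finished task
    return min(FIRST_CAP_S * 2 ** (failed_requests - 1), MAX_CAP_S)


def next_backoff(failed_requests: int, rng=random) -> float:
    """Random delay below the cap (uniform from 0 is an assumption)."""
    return rng.uniform(0, backoff_cap(failed_requests))


for n in (1, 2, 3, 11, 12):
    print(n, backoff_cap(n))
```

With these numbers the cap would reach the 24-hour ceiling at the 12th successive failed request.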

For uploads, you'll have the normal random backoff of between 1 minute and 4 hours. There's also something new, the project-wide backoff: after 3 failed transfers in a row, the whole project gets a similar random backoff.


But these backoffs will only happen if you've got connection problems. If you're permanently connected, uploads should happen immediately as tasks finish, just like in older clients.

I'm not aware of WCG having any connection problems, so if rebooting any modems and so on, and rebooting the computer, doesn't solve your connection problem, you'll need to give more info...
----------------------------------------


"I make so many mistakes. But then just think of all the mistakes I don't make, although I might."
[Oct 6, 2010 11:37:17 PM]
gb077492
Advanced Cruncher
Joined: Dec 24, 2004
Post Count: 96
Status: Offline
Re: Delayed polling for more WU's??

The length of this backoff is shown if you select the project and hit "Properties".


Thanks, Ingleside, I hadn't noticed the Properties button. (Maybe I need some new spectacles!)
[Oct 6, 2010 11:46:16 PM]
JSYKES
Senior Cruncher
Joined: Apr 28, 2007
Post Count: 200
Status: Offline
Re: Delayed polling for more WU's??

I have finally worked out what's going on. Thanks Ingleside, but the problem is not a back-off one. 6.10.58 does not poll for incremental additional WUs as the earlier version did. That version would poll the scheduler for XX secs of work, receive a few WUs, and then go through the same routine again a couple of hours later after completing another WU or two. The new routine seems to be that finished WUs are reported, but downloads only come 2 or 3 times a day, and then between 10 and 15 units at a time, with multiple WUs from a single project. I have just had 7 x C4CW, 5 x HFCC and 3 x CMD appear (21.00) as a single batch, with 2 x C4CW, 2 x HFCC and 6 x CMD at 06.00 this morning, also as a single batch.
----------------------------------------

[Oct 7, 2010 8:20:41 PM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Re: Delayed polling for more WU's??

So what is the "connect about every xxx days" set at?

edit: For that matter, why don't you tell us all the exact network and disk/buffer settings, including which options are selected on the Activity menu?
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 1 times, last edit by Sekerob at Oct 7, 2010 8:26:43 PM]
[Oct 7, 2010 8:24:06 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Delayed polling for more WU's??

This reminded me of this other thread, which BTW still affects one of my PCs from time to time.
[Oct 8, 2010 2:45:06 AM]
GB033533
Senior Cruncher
UK
Joined: Dec 8, 2004
Post Count: 202
Status: Offline
Re: Delayed polling for more WU's??

I've also just had this problem.
I only upgraded to 6.10.58 this week, and it's supposed to connect every .25 days.
But this morning I was running my last 2 wus, with all the others ready to report, and no obvious reason why it wasn't reporting them and asking for more.
So I manually asked for an update (which I usually try to avoid).
I'll watch it like a hawk in case it happens again, or increase the cache.
----------------------------------------

[Oct 8, 2010 8:51:22 AM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Re: Delayed polling for more WU's??

I suggest briefly setting Network to Suspended in the Activity menu and then back to "Network activity always available". Maybe a stuck bit... I had it some months ago, when it was constantly flip-flopping by itself.

If it's not too much trouble, please go through the stdoutdae.txt file to see whether anything is logged, and then also set these log_flags in cc_config.xml:

<network_status_debug>1</network_status_debug>
<work_fetch_debug>1</work_fetch_debug>

For info and instructions see: http://boinc.berkeley.edu/wiki/Cc_config.xml

You can re-read the config file from the Advanced menu. Changes to log flags do not require a client restart.
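
For anyone unsure where those two lines go, here is a small Python sketch that prints a complete cc_config.xml with both flags inside a <log_flags> section under the <cc_config> root, which is the layout described on the wiki page linked above. The function name is made up for illustration, and the script only prints the text; saving it as cc_config.xml in your BOINC data directory is left to you.

```python
import xml.etree.ElementTree as ET


def build_cc_config(log_flags):
    """Hypothetical helper: return cc_config.xml text with each given log flag set to 1."""
    root = ET.Element("cc_config")
    flags_el = ET.SubElement(root, "log_flags")
    for name in log_flags:
        ET.SubElement(flags_el, name).text = "1"
    if hasattr(ET, "indent"):  # pretty-printing is available from Python 3.9
        ET.indent(root)
    return ET.tostring(root, encoding="unicode")


# The two debug flags suggested in this post.
config_text = build_cc_config(["network_status_debug", "work_fetch_debug"])
print(config_text)
```

If you already have a cc_config.xml, merge the flags into the existing <log_flags> section rather than overwriting the file.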

I believe it was mitrichr who found, quite some months ago, that the cache on his hyperthreaded quad would fully deplete, leaving cores idle, unless he set the cache to 2 days **. With 2 days, Ready to Report results get forced up, since they are reported once they are older than 24 hours. Not sure, but maybe the GPU scheduler is part of the problem. I've got that function disabled with the <no_gpus>1</no_gpus> option in cc_config.xml.

Can everyone confirm whether the Transfers tab shows files to upload in this situation?

JSYKES, please expand on this bit "The only way I can get the upload going is to log on to the WCG website". You mean you have to sign in to WCG?

thanks

edit: I've been running 6.10.58 for a long time and have not experienced this. When WCG came out with their custom kit, I installed that over it and have not observed any behavioral changes.

** Can't remember whether the result files were uploaded and it was simply a question of no work being fetched. In that case, please also flip-flop Work fetch to No New Tasks and back to Allow New Tasks.
----------------------------------------
[Oct 8, 2010 10:51:41 AM]
Richard Mitnick
Veteran Cruncher
USA
Joined: Feb 28, 2007
Post Count: 583
Status: Offline
Re: Delayed polling for more WU's??

My crunching colleague told me that my problems had come up, so I thought I would check in.

That original i7-920 had problems keeping in work, which after a month of trying different settings was finally made to work properly by setting the Additional work = 2.0.

I then had a Core 2 Duo run out of work several times. Since this machine had been trouble-free for a couple of years, I thought it was just a strange confluence of events. But I applied the same solution, and Additional work = 1.0 did the trick.

I have since added an i5-520M, again Additional work = 1.0 works.

And, finally, I added an i7-840QM, again Additional work = 1.25 works fine.

I am running 6.10.43 (no GPU issues). I leave the Connect every at the default. Strangely, I believe that on the i5 and the i7-920 it is .3, while on the i7-840QM it is .1, and I never touched it.

I have one more machine, another Core 2 Duo, which does fine at Connect every = .3 and Additional work = 0.

My experience is that the one value to consider variable is the Additional work = . You can fiddle around with this, see how much work is cached, see what the Deadline dates are. As long as you can stay within those bounds, you will be fine.

My colleague says that he does not think that the Connect every does anything anyway.

Someone who knows better than my colleague and me might want to comment on his remarks regarding the Connect every.

Happy crunching!!
----------------------------------------
[Oct 9, 2010 8:33:33 PM]