Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 22
Posts: 22   Pages: 3   [ 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 3859 times and has 21 replies Next Thread
dondee
Advanced Cruncher
Joined: Jan 16, 2006
Post Count: 100
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
let's restart all of the running work units for the fun of it

Hi everyone,
I have noticed a new and annoying feature of boinc 7.16.6 for linux. When I reboot or start my computer most, if not all, of the work units running when the computer was shut down are restarted.
Has anyone else seen this problem?

dondee
[Apr 15, 2021 6:46:58 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Falconet
Master Cruncher
Portugal
Joined: Mar 9, 2009
Post Count: 3295
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: let's restart all of the running work units for the fun of it

That depends. With which projects are you seeing this?

If it's ARP, then it normal since that project only checkpoints once every 12.5% of progress (can't be changed, just the way it is, hibernating/suspending is one solution).


This can also happen with MIP at times.
----------------------------------------


AMD Ryzen 5 1600AF 6C/12T 3.2 GHz - 85W
AMD Ryzen 5 2500U 4C/8T 2.0 GHz - 28W
AMD Ryzen 7 7730U 8C/16T 3.0 GHz
[Apr 15, 2021 7:53:21 AM]   Link   Report threatening or abusive post: please login first  Go to top 
BobbyB
Veteran Cruncher
Canada
Joined: Apr 25, 2020
Post Count: 609
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: let's restart all of the running work units for the fun of it

Don't fully understand this.

Restart from zero (as in a brand new WU just downloaded thus loosing all work done)?
Or restart from the last checkpoint which is normal.

When I shutdown/reboot I first suspend the project (I only have WCG). If there is an ARP running I check the time since last checkpoint. If I see 2 hours I wait before I shutdown/reboot. The rule of thumb is 3 hours between checkpoints. 12% according to the above post.

Mine run 24/7 so I have this luxury thus they hardly ever need to shutdown or reboot. You may need to reboot like right now so this may not apply.

I run 7.16.6 on my linux boxes.
[Apr 15, 2021 4:54:15 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Brian Nixon
Cruncher
United Kingdom
Joined: Oct 27, 2020
Post Count: 9
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: let's restart all of the running work units for the fun of it

Checkpointing – saving the entire internal state of a running task in order to be able to resume from that point later – is a feature of each individual science application, not the BOINC client. It can be difficult, and some are better at it than others.

In BOINC Manager you can open a task’s properties and see CPU time since checkpoint. If that’s the same as CPU time, the task has never checkpointed and will start again from the beginning if it is stopped and restarted.

My only experience is with MIP, whose tasks appear to checkpoint somewhere on a scale from ‘rarely’ to ‘never’.
----------------------------------------
[Edit 1 times, last edit by Brian Nixon at Apr 15, 2021 7:29:50 PM]
[Apr 15, 2021 7:27:38 PM]   Link   Report threatening or abusive post: please login first  Go to top 
BobbyB
Veteran Cruncher
Canada
Joined: Apr 25, 2020
Post Count: 609
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: let's restart all of the running work units for the fun of it

I just checked a few MIPs and it checkpoints about every 6 minutes.
[Apr 15, 2021 7:46:37 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Brian Nixon
Cruncher
United Kingdom
Joined: Oct 27, 2020
Post Count: 9
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: let's restart all of the running work units for the fun of it

There must be a lot of variation, then. 116 of the 144 MIPs I currently have running have never checkpointed.
[Apr 15, 2021 8:51:38 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Falconet
Master Cruncher
Portugal
Joined: Mar 9, 2009
Post Count: 3295
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: let's restart all of the running work units for the fun of it

I believe there used to be MIP tasks that only had 1 structure to resolve and thus never even checkpointed. Others had 2 structures to resolve and only checkpointed once after finishing the first structure.
----------------------------------------


AMD Ryzen 5 1600AF 6C/12T 3.2 GHz - 85W
AMD Ryzen 5 2500U 4C/8T 2.0 GHz - 28W
AMD Ryzen 7 7730U 8C/16T 3.0 GHz
----------------------------------------
[Edit 1 times, last edit by Falconet at Apr 15, 2021 11:03:40 PM]
[Apr 15, 2021 9:37:50 PM]   Link   Report threatening or abusive post: please login first  Go to top 
dondee
Advanced Cruncher
Joined: Jan 16, 2006
Post Count: 100
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: let's restart all of the running work units for the fun of it

Falconet,
Thanks for the reply. This has started in the last week or so.
I am running only mips at this time.
dondee
[Apr 16, 2021 1:29:00 PM]   Link   Report threatening or abusive post: please login first  Go to top 
dondee
Advanced Cruncher
Joined: Jan 16, 2006
Post Count: 100
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: let's restart all of the running work units for the fun of it

BobbyB,
Thanks for the reply. I think a graphic is worth a million words.
But I cannot figure out how to insert a graphic into this space.
So I will try to explain what happens. When I rebooted my computer, just a few minutes ago, 16 work units varied from 5 percent to 70 percent completed. After rebooting ALL of the work units are starting at 0 percent. The work units have the same names so the work units did not delete, upload or whatever, they just started over at the beginning.
This started in the last week or so.
dondee
[Apr 16, 2021 2:19:40 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Falconet
Master Cruncher
Portugal
Joined: Mar 9, 2009
Post Count: 3295
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: let's restart all of the running work units for the fun of it

If it's what I described, then it's really just normal behaviour.
If you can post a result log or two of those workunits once they complete, we should know.
----------------------------------------


AMD Ryzen 5 1600AF 6C/12T 3.2 GHz - 85W
AMD Ryzen 5 2500U 4C/8T 2.0 GHz - 28W
AMD Ryzen 7 7730U 8C/16T 3.0 GHz
[Apr 16, 2021 2:24:52 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 22   Pages: 3   [ 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread