World Community Grid Forums
Thread Status: Active Total posts in this thread: 22
dondee
Advanced Cruncher Joined: Jan 16, 2006 Post Count: 100 Status: Offline |
Hi everyone,
I have noticed a new and annoying feature of BOINC 7.16.6 for Linux. When I reboot or start my computer, most, if not all, of the work units that were running when the computer was shut down are restarted. Has anyone else seen this problem? dondee |
||
|
Falconet
Master Cruncher Portugal Joined: Mar 9, 2009 Post Count: 3295 Status: Offline |
That depends. With which projects are you seeing this?
If it's ARP, then it's normal, since that project only checkpoints once every 12.5% of progress (this can't be changed, it's just the way it is; hibernating/suspending is one solution). This can also happen with MIP at times.
----------------------------------------
AMD Ryzen 5 1600AF 6C/12T 3.2 GHz - 85W AMD Ryzen 5 2500U 4C/8T 2.0 GHz - 28W AMD Ryzen 7 7730U 8C/16T 3.0 GHz |
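To put a number on the 12.5% figure in the post above, here is a minimal Python sketch of how much progress a task keeps after a restart. It assumes checkpoints land exactly on multiples of 12.5% of progress, which is a simplification; the function name is made up for illustration and is not part of BOINC or ARP.

```python
import math

# Percent of progress between checkpoints (the figure quoted in the post above).
CHECKPOINT_STEP = 12.5


def progress_after_restart(progress_percent: float, step: float = CHECKPOINT_STEP) -> float:
    """Return the progress a task would resume from, assuming it rolls back
    to the most recent checkpoint and checkpoints fall on multiples of `step`."""
    if not 0.0 <= progress_percent <= 100.0:
        raise ValueError("progress_percent must be between 0 and 100")
    return math.floor(progress_percent / step) * step


for p in (5.0, 12.5, 70.0):
    print(f"{p:5.1f}% done -> resumes at {progress_after_restart(p):5.1f}%")
```

Under that assumption, a task stopped before its first checkpoint goes back to 0%, while one stopped at 70% would resume from 62.5%.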
||
|
BobbyB
Veteran Cruncher Canada Joined: Apr 25, 2020 Post Count: 609 Status: Offline |
Don't fully understand this.
Restart from zero (as in a brand new WU just downloaded, thus losing all work done)? Or restart from the last checkpoint, which is normal? When I shut down/reboot I first suspend the project (I only have WCG). If there is an ARP running I check the time since the last checkpoint. If I see 2 hours, I wait before I shut down/reboot. The rule of thumb is 3 hours between checkpoints, or 12.5% of progress according to the above post. Mine run 24/7, so I have this luxury and they hardly ever need to shut down or reboot. You may need to reboot right now, so this may not apply. I run 7.16.6 on my Linux boxes. |
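The "wait if the next checkpoint is close" habit described above can be sketched in a few lines of Python. The 3-hour interval is only the rule of thumb from this post, not a guaranteed value, and the function names are hypothetical. Note also that BOINC reports CPU time since checkpoint, which may differ from wall-clock time.

```python
from datetime import timedelta

# Rule-of-thumb interval between ARP checkpoints from the post above (an estimate).
ASSUMED_INTERVAL = timedelta(hours=3)


def time_until_next_checkpoint(since_checkpoint: timedelta,
                               interval: timedelta = ASSUMED_INTERVAL) -> timedelta:
    """Estimate the time left until the next checkpoint; zero if one is overdue."""
    return max(interval - since_checkpoint, timedelta(0))


def worth_waiting(since_checkpoint: timedelta, patience: timedelta,
                  interval: timedelta = ASSUMED_INTERVAL) -> bool:
    """True if the estimated wait for the next checkpoint fits within `patience`."""
    return time_until_next_checkpoint(since_checkpoint, interval) <= patience


# Two hours since the last checkpoint, willing to wait up to 90 minutes.
print(time_until_next_checkpoint(timedelta(hours=2)))
print(worth_waiting(timedelta(hours=2), patience=timedelta(minutes=90)))
```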
||
|
Brian Nixon
Cruncher United Kingdom Joined: Oct 27, 2020 Post Count: 9 Status: Offline |
Checkpointing – saving the entire internal state of a running task in order to be able to resume from that point later – is a feature of each individual science application, not the BOINC client. It can be difficult, and some are better at it than others.
In BOINC Manager you can open a task’s properties and see CPU time since checkpoint. If that’s the same as CPU time, the task has never checkpointed and will start again from the beginning if it is stopped and restarted. My only experience is with MIP, whose tasks appear to checkpoint somewhere on a scale from ‘rarely’ to ‘never’.
----------------------------------------
[Edit 1 times, last edit by Brian Nixon at Apr 15, 2021 7:29:50 PM] |
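The check described above (CPU time equal to CPU time since checkpoint means the task has never checkpointed) can be written as a small Python sketch. The `TaskTimes` class, the function names, the one-second tolerance, and the sample values are all assumptions for illustration; the two numbers would be read off each task's properties by hand, and nothing here talks to the BOINC client.

```python
from dataclasses import dataclass


@dataclass
class TaskTimes:
    name: str
    cpu_time: float                   # total CPU time, in seconds
    cpu_time_since_checkpoint: float  # CPU time since the last checkpoint, in seconds


def never_checkpointed(task: TaskTimes, tolerance: float = 1.0) -> bool:
    """True if the task has run but its two CPU-time figures are (nearly) equal,
    meaning no checkpoint has been written yet."""
    if task.cpu_time <= 0:
        return False  # the task has not run, so there is nothing to lose
    return abs(task.cpu_time - task.cpu_time_since_checkpoint) <= tolerance


def at_risk(tasks: list[TaskTimes]) -> list[str]:
    """Names of tasks that would start over from the beginning if stopped now."""
    return [t.name for t in tasks if never_checkpointed(t)]


# Made-up sample values, not real task data.
tasks = [
    TaskTimes("task_a", cpu_time=3600.0, cpu_time_since_checkpoint=3600.0),
    TaskTimes("task_b", cpu_time=3600.0, cpu_time_since_checkpoint=240.0),
]
print(at_risk(tasks))
```

Here only `task_a` is flagged, because `task_b` last checkpointed four minutes of CPU time ago.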
||
|
BobbyB
Veteran Cruncher Canada Joined: Apr 25, 2020 Post Count: 609 Status: Offline |
I just checked a few MIPs and they checkpoint about every 6 minutes.
|
||
|
Brian Nixon
Cruncher United Kingdom Joined: Oct 27, 2020 Post Count: 9 Status: Offline |
There must be a lot of variation, then. 116 of the 144 MIPs I currently have running have never checkpointed.
|
||
|
Falconet
Master Cruncher Portugal Joined: Mar 9, 2009 Post Count: 3295 Status: Offline |
I believe there used to be MIP tasks that only had 1 structure to resolve and thus never even checkpointed. Others had 2 structures to resolve and only checkpointed once after finishing the first structure.
----------------------------------------
[Edit 1 times, last edit by Falconet at Apr 15, 2021 11:03:40 PM] |
||
|
dondee
Advanced Cruncher Joined: Jan 16, 2006 Post Count: 100 Status: Offline |
Falconet,
Thanks for the reply. This started in the last week or so. I am running only MIP tasks at this time. dondee |
||
|
dondee
Advanced Cruncher Joined: Jan 16, 2006 Post Count: 100 Status: Offline |
BobbyB,
Thanks for the reply. I think a graphic is worth a million words, but I cannot figure out how to insert a graphic into this space, so I will try to explain what happens. When I rebooted my computer just a few minutes ago, the 16 work units ranged from 5 percent to 70 percent complete. After rebooting, ALL of the work units started again at 0 percent. The work units have the same names, so they were not deleted, uploaded or whatever; they just started over from the beginning. This started in the last week or so. dondee |
||
|
Falconet
Master Cruncher Portugal Joined: Mar 9, 2009 Post Count: 3295 Status: Offline |
If it's what I described, then it's really just normal behaviour.
If you can post a result log or two of those workunits once they complete, we should know. |
||