Thread Status: Active
Total posts in this thread: 113
Posts: 113   Pages: 12
This topic has been viewed 6258 times and has 112 replies
adriverhoef
Master Cruncher
The Netherlands
Joined: Apr 3, 2009
Post Count: 2171
Status: Offline
Re: The Extremes thread

Should the Extreme generation, say x, of your targeted workunit inadvertently have more than 5 workunits and generations x-1 and x+1 contain 5 or fewer workunits, then posting your workunit from generation x is also allowed. ;-)
Adri,
When I posted, the latest figures for generations 107, 108 and 109 were 1, 7 and 2 respectively, so I am happy to conclude that my post falls within the "relaxed" rule. Phew! :-)
Interestingly, according to your excellent table linked in your original post in this thread, generation 108 now has only 3 workunits in it, so if I'd waited a bit before posting, I would have fallen within the "strict" rule. ;-)
Cheers,
Mark
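
The two rules discussed above can be sketched in Python. This is only an illustration, not anything the thread itself uses: the function name `classify_post` and the constant `LIMIT` are hypothetical, the "strict" rule is assumed to mean that the generation itself holds 5 or fewer workunits (which is how the remark about generation 108 reads), and a generation missing from the table is assumed to be empty.

```python
LIMIT = 5  # assumed threshold, taken from "5 or fewer workunits"


def classify_post(counts: dict, gen: int) -> str:
    """Classify a post for generation `gen` as "strict", "relaxed" or "not allowed".

    `counts` maps generation number -> number of workunits in it.
    Generations absent from `counts` are treated as empty (an assumption).
    """
    # Assumed strict rule: the targeted generation itself is small enough.
    if counts.get(gen, 0) <= LIMIT:
        return "strict"
    # Relaxed rule: the generation is too big, but both neighbours are small.
    neighbours = (counts.get(gen - 1, 0), counts.get(gen + 1, 0))
    if all(n <= LIMIT for n in neighbours):
        return "relaxed"
    return "not allowed"


# Figures quoted above for generations 107, 108 and 109 at posting time.
print(classify_post({107: 1, 108: 7, 109: 2}, 108))  # relaxed
# Later figure for generation 108 only.
print(classify_post({108: 3}, 108))  # strict
```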

Indeed, Mark,
The "History of the number of workunits within each generation" seems a valuable source for looking back chronologically at the number of workunits in each generation. In the table, I've allocated three characters for the number of workunits (num) in a generation (gen). A count of up to 999 is shown exactly; an exact multiple of 1000 below 10k is shown as 1k, 2k, 3k etc.; any other count between 1k and 10k is shown as >1k, >2k, >3k etc.; and counts of 10k and above are shown only as 10k, 11k, 12k etc. (there is no room for the ">"), e.g.:

gen num|gen num|gen num|gen num|gen num|gen num
135 5k|136 998|137 >9k|138 >1k|139 19k|140 10k
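
For anyone who wants to reproduce the three-character notation, here is a minimal Python sketch of the rule as described above. The function name `format_count` is hypothetical and this is not the script that actually builds the table; counts of 100k or more would need a fourth character, which the description does not cover, so they are simply passed through as "100k" and so on.

```python
def format_count(n: int) -> str:
    """Abbreviate a workunit count to fit a three-character column."""
    if n < 0:
        raise ValueError("a workunit count cannot be negative")
    if n < 1000:
        return str(n)  # up to 999: shown exactly
    thousands, remainder = divmod(n, 1000)
    if n < 10_000 and remainder:
        return f">{thousands}k"  # e.g. 9001..9999 -> ">9k"
    # Exact multiples of 1000, and everything from 10k upwards, drop the ">".
    # Note: from 100k upwards the result no longer fits in three characters.
    return f"{thousands}k"


for n in (5000, 998, 9500, 1001, 19999, 10000):
    print(n, format_count(n))
```

The sample values are invented; they were chosen only so that the output matches the six notations in the example row (5k, 998, >9k, >1k, 19k, 10k).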

NB: An important (older) part of it was made possible thanks to the big help of our friend Al.

Adri
[Jan 27, 2025 9:29:13 PM]
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12436
Status: Recently Active
Re: The Extremes thread

gj82854

Once a quorum has been reached, WCG should abort any outstanding units which have not reached the first checkpoint (12.5%). 12 minutes is not enough time to get to the first checkpoint.

Mike
[Jan 28, 2025 5:39:48 AM]
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12436
Status: Recently Active
Re: The Extremes thread

gj82854

ARP1_0033555_110_x seems to be a very difficult task. It has been running for about 32 hours and is only 85% complete (about 2.5% per hour). There are 8 other ARP1 tasks running on the same machine and they are running normally (almost 5% per hour). I seem to remember Kevin mentioning that there were tasks that were more difficult either due to weather conditions being modeled or geography or both.


Please check the TimeStep on this unit. If it has shortened then that would explain the longer calculation time.

Mike
[Jan 28, 2025 5:47:11 AM]
MJH333
Senior Cruncher
England
Joined: Apr 3, 2021
Post Count: 268
Status: Offline
Re: The Extremes thread

gj82854

ARP1_0033555_110_x seems to be a very difficult task. It has been running for about 32 hours and is only 85% complete (about 2.5% per hour). There are 8 other ARP1 tasks running on the same machine and they are running normally (almost 5% per hour). I seem to remember Kevin mentioning that there were tasks that were more difficult either due to weather conditions being modeled or geography or both.


Please check the TimeStep on this unit. If it has shortened then that would explain the longer calculation time.

Mike
Mike,
gj82854 and I have discussed this. See my post here and the following two posts.
Cheers,
Mark
[Jan 28, 2025 8:41:23 AM]
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 981
Status: Recently Active
Re: The Extremes thread

And another generation 112 WU:

ARP1_0034392_112_0  Arch    Valid      2025-01-27T16:48:15  2025-01-28T04:51:47  11.31/11.49
ARP1_0034392_112_1 Fedora Server Ab. 2025-01-27T16:48:12 2025-01-29T04:48:12
ARP1_0034392_112_2 Ubuntu Valid 2025-01-27T16:48:19 2025-01-27T23:26:19 6.4/6.41
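
Judging from the names posted in this thread, a task name appears to consist of four underscore-separated parts, with the generation in third place (112 here). A minimal Python sketch for pulling the generation out of a name follows; the field names `project`, `sample` and `replica` are guesses for illustration only, not official terminology.

```python
from typing import NamedTuple


class TaskName(NamedTuple):
    project: str     # e.g. "ARP1"
    sample: int      # hypothetical label for the second field
    generation: int  # the generation discussed in this thread
    replica: str     # kept as text, since names like "..._110_x" also appear


def parse_task_name(name: str) -> TaskName:
    """Split a name such as ARP1_0034392_112_0 into its four assumed parts."""
    project, sample, generation, replica = name.split("_")
    return TaskName(project, int(sample), int(generation), replica)


print(parse_task_name("ARP1_0034392_112_0").generation)  # 112
```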

Cheers - Al.
[Jan 28, 2025 10:24:06 AM]
gj82854
Advanced Cruncher
Joined: Sep 26, 2022
Post Count: 109
Status: Offline
Re: The Extremes thread

gj82854

Once a quorum has been reached, WCG should abort any outstanding units which have not reached the first checkpoint (12.5%). 12 minutes is not enough time to get to the first checkpoint.

Mike

I don't think that is correct. Usually, once a work unit starts, it's outside the server's control. The server doesn't know when the checkpoints are taken, as they are done by the app. ARP1_0033555_110_3 is still in progress as I write this and was issued at 18:46:31 UTC yesterday (that's almost a day ago). I'm sure it has reached a checkpoint by now. ARP1_0033555_110_4 was server aborted, probably because it hadn't started execution yet. Both work units were issued at the same time.
[Jan 28, 2025 3:43:50 PM]
gj82854
Advanced Cruncher
Joined: Sep 26, 2022
Post Count: 109
Status: Offline
Re: The Extremes thread

It seems like maybe 104 and 106 might be stuck. Haven't seen any completed WUs in 2 days. 107 might be stuck too.
[Jan 29, 2025 12:37:19 PM]
MJH333
Senior Cruncher
England
Joined: Apr 3, 2021
Post Count: 268
Status: Offline
Re: The Extremes thread

Adri,
Here's one from generation 124.
Workunit id 656170469
ARP1_0034391_124_0 Windows 10 Valid 2025-01-29 02:47:18 UTC 2025-01-29 13:05:20 UTC 10.14 / 10.21 602 / 619.2
ARP1_0034391_124_1 Windows 10 Valid 2025-01-29 02:47:04 UTC 2025-01-29 14:21:59 UTC 11.4 / 11.49 636.3 / 619.2
ARP1_0034391_124_2 Windows 10 In Progress 2025-01-29 02:47:06 UTC 2025-01-30 14:47:06 UTC
Cheers,
Mark
[Jan 29, 2025 3:04:53 PM]
catchercradle
Advanced Cruncher
England
Joined: Jan 16, 2009
Post Count: 133
Status: Offline
Re: The Extremes thread

ARP1_0034388_118_0 Linux Zorin Zorin OS 17.2 [6.8.0-51-generic|libc 2.35] In Progress 2025-01-29 09:46:15 UTC 2025-01-30 21:46:15 UTC
ARP1_0034388_118_1 Linux Fedora Fedora Linux 41 (Xfce) [6.12.7-200.fc41.x86_64|libc 2.40] In Progress 2025-01-29 09:46:31 UTC 2025-01-30 21:46:31 UTC
ARP1_0034388_118_2 Linux Ubuntu Ubuntu 24.10 [6.11.0-13-generic|libc 2.40] Pending Validation 2025-01-29 09:47:13 UTC 2025-01-29 15:44:25 UTC 5.16 / 5.17
[Jan 29, 2025 3:49:41 PM]
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12436
Status: Recently Active
Re: The Extremes thread

It seems like maybe 104 and 106 might be stuck. Haven't seen any completed WUs in 2 days. 107 might be stuck too.


Maybe 108 & 109 as well. The oldest mover in the last day was 110.

Mike
[Jan 30, 2025 9:06:09 PM]