Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 2802
Posts: 2802   Pages: 281   [ Previous Page | 168 169 170 171 172 173 174 175 176 177 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 577575 times and has 2801 replies Next Thread
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 11812
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Work Available

Thank you, Sgt. Joe

That is a tenth unstuck unit identified.

Only 50 to go!

Mike
[Jan 8, 2022 6:16:18 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 11812
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Work Available

Thank you, Crystal Pellet.

That unstuck one is going well - 2 generations in 2 days.

Mike
[Jan 8, 2022 6:18:56 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 11812
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Work Available

The leading ultra is now on the tail of the last stuck unit.

Mike
[Jan 8, 2022 6:23:04 PM]   Link   Report threatening or abusive post: please login first  Go to top 
geophi
Advanced Cruncher
U.S.
Joined: Sep 3, 2007
Post Count: 86
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Work Available

Just picked up a 071 ultra
ARP1_0010090_071_0
[Jan 8, 2022 7:15:14 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Hype
Cruncher
Germany
Joined: Nov 18, 2011
Post Count: 43
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Work Available

RAM is also a factor. I can run all 8 threads with ARP because it has 20 GB RAM, albeit taking an extra third more time than if I run 4 threads with ARP.

The problem with a large number of threads is that the checkpoints become more frequent. With 16 ARP threads out of 32, you are likely to have checkpoints every 5 minutes, say, if they are well spread. As they don't take the same amount of time, some checkpoints clash and clog up the machine. And if more than 2 clash you have a bigger problem.

Mike


I have 32 GB of RAM, so that should be fine I guess.
I did some testing with OPN1 and ARP.

I compared OPN1 WU runtimes between running 32 OPN1 WUs, then 24 OPN1 and 8 ARP, and 28 OPN1 and 4 ARP.
With 24 OPN1 and 8 ARP, OPN1 WUs on average are 18% slower.
With 28 OPN1 and 4 ARP, OPN1 WUs on average are 10% slower.
I might do more testing with more ARP, but is this normal and to be expected?
----------------------------------------

[Jan 8, 2022 7:51:48 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7232
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Work Available

I might do more testing with more ARP, but is this normal and to be expected?

Yes, there are also other bottlenecks in addition to memory bottlenecks. See Amdahl's Law
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Jan 8, 2022 8:10:14 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 11812
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Work Available

geophi

That is a known ultra and has moved 3 generations in only 2 days.

Mike
[Jan 8, 2022 8:48:54 PM]   Link   Report threatening or abusive post: please login first  Go to top 
geophi
Advanced Cruncher
U.S.
Joined: Sep 3, 2007
Post Count: 86
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Work Available

Just picked up a 092 unstuck

ARP1_0033316_092

Edit...this one is definitely running slower than the other unstuck units this PC has run. It's early but it looks like it could take 14 or 15 hours compared to the 9-10 my other unstuck tasks have taken. It's running on the 64 bit executable.
----------------------------------------
[Edit 1 times, last edit by geophi at Jan 8, 2022 9:32:50 PM]
[Jan 8, 2022 9:29:26 PM]   Link   Report threatening or abusive post: please login first  Go to top 
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 736
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Work Available

Another triplet... ARP1_0033633_096

This one is using the 64-bit application on my Linux Ryzen and is a bit more sluggish than usual, so I had a look at the file namelist.input in its slots directory -- the parameter I presume to be important here is time_step=24...

The other two ARP1 tasks running on that system at the same time both have the value 36 for that parameter (which tallies with your earlier post in this thread about the default time step being 36 seconds)

If that time-step change is typical for these unstuck cases, that would suggest about a 50% increase in run-time provided it runs on the same application version as usual.

It might be interesting if other folks could delve into that file for apparent stragglers and "awkward" tasks to see if they have any value other than 24 or 36 for that parameter :-)

Cheers - Al.
[Jan 8, 2022 10:25:09 PM]   Link   Report threatening or abusive post: please login first  Go to top 
geophi
Advanced Cruncher
U.S.
Joined: Sep 3, 2007
Post Count: 86
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Work Available

My formerly stuck unit has 24 for time_step as well.
[Jan 8, 2022 11:40:20 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 2802   Pages: 281   [ Previous Page | 168 169 170 171 172 173 174 175 176 177 | Next Page ]
[ Jump to Last Post ]
Post new Thread