Total posts in this thread: 98
David_L6
Senior Cruncher
USA
Joined: Aug 24, 2006
Post Count: 296
Status: Offline
Re: New BETA test - Sept 18, 2014 [ Issues Thread ]

I aborted the 3 that were running with no progress and a remaining time of "---" at 2.5 hours. I have one work unit running now, the last one that I received.
[Sep 18, 2014 11:16:46 PM]
ca05065
Senior Cruncher
Joined: Dec 4, 2007
Post Count: 327
Status: Recently Active
Re: New BETA test - Sept 18, 2014 [ Issues Thread ]

I do not have precise numerical evidence, but the progress of the 0011 series seems to have slowed down. They were 95% complete at 3 hours but have taken another 45 minutes for the next 3%.
[Sep 18, 2014 11:26:51 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New BETA test - Sept 18, 2014 [ Issues Thread ]

After 4h CPU time, mine are showing >99.7% progress but those figures are changing very slowly. That still includes _10_1437.
[Sep 18, 2014 11:32:35 PM]
ashrader330
Advanced Cruncher
Joined: Jan 6, 2008
Post Count: 97
Status: Offline
Re: New BETA test - Sept 18, 2014 [ Issues Thread ]

I have two work units that seem to be running fine, but the remaining time estimate is "---". Another weird thing is that the CPU time at last checkpoint is "---" too. I am running the 7.2.42 64-bit Linux client.

EDIT: As others have said, the two units slowed down significantly once they got higher than 98%. Still no checkpointing and no estimated time remaining, but the progress is continuing to go up.

My units are:
BETA_ugm1_ugm1_00011_0636_0
BETA_ugm1_ugm1_00012_1017_1
----------------------------------------

Run time: 4.2y HPF2, 6.9y FAAH, 7.9y HFCC, 20.8y HCC, 26.0y CEP2, 26.0y MCM, 2.1y UGM, 2.0y OET
WU: 4.8k HPF2, 12.3k FAAH, 12.4k HFCC, 135k HCC, 34.3k CEP2, 43.8k MCM, 4.2k UGM, 19.7k OET
----------------------------------------
[Edit 2 times, last edit by ashrader330 at Sep 19, 2014 2:43:27 AM]
[Sep 19, 2014 12:14:46 AM]
hendermd
Cruncher
United States
Joined: Apr 30, 2010
Post Count: 29
Status: Offline
Re: New BETA test - Sept 18, 2014 [ Issues Thread ]

I have two work units, BETA_ugm1_ugm1_0010_0010 and _ugm1_ugm1_0010_0012, stuck at 3.963%. I restarted one and it worked its way back up to 3.963%.

The following work units show no time remaining and still seem to be working slowly in the 99% range for the last two hours.

BETA_ugm1_ugm1_0011_0443
BETA_ugm1_ugm1_0011_0454
BETA_ugm1_ugm1_0011_0445
[Sep 19, 2014 12:56:17 AM]
tmedve
Senior Cruncher
USA
Joined: Nov 16, 2004
Post Count: 182
Status: Offline
Re: New BETA test - Sept 18, 2014 [ Issues Thread ]

I suspended my 4 Betas until I hear more about which direction I should take (keep running or abort). I realize I may have lost work since there were no checkpoints listed. I want to keep working on MCM overnight, and my queue will take me until the morning.

I gave one of them more than 6 hours and three of them more than 5 hours, and they are still at 0.000% complete with no checkpoints.
[Edit 1 times, last edit by tmedve at Sep 19, 2014 2:13:19 AM]
[Sep 19, 2014 2:07:28 AM]
Yarensc
Advanced Cruncher
USA
Joined: Sep 24, 2011
Post Count: 136
Status: Offline
Re: New BETA test - Sept 18, 2014 [ Issues Thread ]

After running for the hour and 15 minutes that was initially estimated as the Remaining Time, that column changed to "---".
I restarted the client since it had been 4 hours with no apparent progress, and it doesn't seem to have fixed anything. Is the fact that all the slots running betas have the file "boinc_lockfile" in them indicative of a write/file creation lock preventing creation of the checkpoint files?
[Sep 19, 2014 2:09:45 AM]
KWSN - A Shrubbery
Master Cruncher
Joined: Jan 8, 2006
Post Count: 1585
Status: Offline
Re: New BETA test - Sept 18, 2014 [ Issues Thread ]

Picking one system at random to comment on. It's an Intel i5 running Windows 8 (how I hate Windows 8). Anyway, the four units on there are near completion. The progress is incrementing by 1/1000 of a percent very slowly. It used to move that much every few seconds; now, as they approach 99%, they are down to 1/1000 of a percent every few minutes. At this rate, it could be some time, even though they do not appear to be actually stuck.
----------------------------------------

Distributed computing volunteer since September 27, 2000
----------------------------------------
[Edit 1 times, last edit by KWSN - A Shrubbery at Sep 19, 2014 2:30:42 AM]
[Sep 19, 2014 2:29:40 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New BETA test - Sept 18, 2014 [ Issues Thread ]

As mikefinn and genhos have noted earlier, stderr.txt in the slots folders for the stuck betas still contains just the line "Unable to open checkpoint file starting from 0", with no "500 query sequences compared" etc. in any of them. This is despite the progress saying >99.99% on all of them on my machines after around 7 to 8 hours CPU time.

FWIW, 3 of the earliest units that did complete quickly have turned Valid and 8 are in PVal, all BETA_ugm1_ugm1_00010.
----------------------------------------
[Edit 1 times, last edit by Former Member at Sep 19, 2014 3:27:01 AM]
[Sep 19, 2014 3:20:54 AM]
Yarensc
Advanced Cruncher
USA
Joined: Sep 24, 2011
Post Count: 136
Status: Offline
Re: New BETA test - Sept 18, 2014 [ Issues Thread ]

Hmm, after upgrading BOINC from 7.0.64 to 7.2.42, all my units that were stuck at 0% are now counting up normally. This is on Win 8.1 Pro.
[Sep 19, 2014 3:47:02 AM]