Thread Status: Active
Total posts in this thread: 7
This topic has been viewed 990 times and has 6 replies
Se Parle
Cruncher
Joined: Feb 9, 2006
Post Count: 15
Status: Offline
Long run time with no progress

I am currently running an HPF2 project that has been running for over 9 hours and has progressed ZERO percent. The problem also occurs when I'm running the FAAH project.

Device ID=306353, Processor=Pentium 4 1.8 GHz, memory=256 MB, storage=0.98 GB

I haven't been able to return a result since June 30.
[Jul 7, 2006 1:55:46 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Long run time with no progress

Hello Se Parle,
Reboot, update the project using BOINC Manager, make sure you are running 5.4.9, and check your Virtual Memory using something like Belarc Advisor from the Useful Utilities thread at http://www.worldcommunitygrid.org/forums/wcg/viewthread?thread=2490

Also, look at CPU Utilization using Task Manager. You might want to change the throttle setting: http://www.worldcommunitygrid.org/forums/wcg/viewthread?thread=2683#61655

Problems with FAAH ??! This is a puzzle. I have no single suggestion. Just start looking everywhere and see if something is funny.

Lawrence
[Jul 7, 2006 3:18:21 PM]
dclemens
Cruncher
Joined: Nov 17, 2004
Post Count: 4
Status: Offline
Re: Long run time with no progress

I've had the same problem for the last 4 or 5 days. I'm still at 0% even though this is a fast machine and usually returns 1 or 2 results a day. I'm on Agent Ver 3.0 (2844) working on the Human Proteome Folding Phase 2.

A second machine of mine has had a different problem for the last month. It retrieves data but then says "Agent paused, see www.wcgrid.org/cause.html", which didn't really help me understand what's going on.
[Jul 8, 2006 5:59:01 PM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Re: Long run time with no progress

Lawrence, I think Se Parle, like dclemens, is on the UD agent, not BOINC, but could the gentleman please confirm?

Se Parle, the fact that you have problems with both FAAH and HPF2 is not good. Is your virtual memory (a.k.a. pagefile or swapfile) set? You should normally find it on the C:\ drive. It usually has a size of 150% or more of your RAM. Verify that all your system parameters are normal using the utilities in the link provided by Lawrence.
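
As a rough illustration of the 150% rule of thumb above, the arithmetic can be sketched in Python. The function name and the default factor of 1.5 are assumptions made for this example only; the operating system may size the pagefile differently.

```python
def suggested_pagefile_mb(ram_mb: float, factor: float = 1.5) -> float:
    """Return a pagefile size following the '150% of RAM' rule of thumb.

    Hypothetical helper: 'factor' defaults to 1.5 (150%), the lower bound
    mentioned in the post; larger values are also consistent with it.
    """
    if ram_mb <= 0:
        raise ValueError("ram_mb must be positive")
    return ram_mb * factor


# Se Parle's machine reports 256 MB of RAM.
print(suggested_pagefile_mb(256))  # prints 384.0
```

So for the 256 MB machine in this thread, a pagefile of roughly 384 MB or more would match the rule of thumb.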

dclemens, your 2nd machine is very likely not suitable for running HPF2. Did it show something like "Receiving 3352 of 3352 bytes" prior to pausing **? If so, go to the individual device profiles @ WCG. You can deselect HPF2 for that machine. If it did FAAH before, it will continue with those only.

In all cases, if the version number on the 'i' screen of the UD agent (the fancy graphics page, right-hand corner) is 5.05.03 or lower and the work unit is stuck for many hours at zero %, not even 0.1%, you've unfortunately got hold of a bad work unit. Go to Task Manager and kill the UD_7234001.exe process (or one with a similar number). The UD agent will report the bad file back to WCG and retrieve a new WU with the newer version, just out.

** The 3k initial exchange with the server is the handshake and specification transmittal. If your machine has a processor of 550 MHz or faster and at least 256 MB of RAM (after deducting the RAM used by the display), and has an active swapfile, it should be able to run HPF2.
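
The minimum requirements in the footnote above can be written as a small check. This is only a sketch: the `Machine` class, its field names, and the function name are hypothetical, and the thresholds are simply the figures quoted in the post (550 MHz, 256 MB after the display's share, swapfile active), not an official WCG specification.

```python
from dataclasses import dataclass
from typing import List

# Thresholds as quoted in the post; treat them as assumptions, not a spec.
MIN_CPU_MHZ = 550
MIN_RAM_MB = 256


@dataclass
class Machine:
    cpu_mhz: float
    ram_mb: float
    display_ram_mb: float = 0.0  # RAM reserved by the display, if any
    swapfile_active: bool = True


def unmet_hpf2_requirements(m: Machine) -> List[str]:
    """Return the requirements the machine fails (empty list if none)."""
    problems = []
    if m.cpu_mhz < MIN_CPU_MHZ:
        problems.append("CPU below %d MHz" % MIN_CPU_MHZ)
    # RAM is counted after deducting what the display uses.
    if m.ram_mb - m.display_ram_mb < MIN_RAM_MB:
        problems.append("less than %d MB RAM after display deduction" % MIN_RAM_MB)
    if not m.swapfile_active:
        problems.append("swapfile not active")
    return problems


# A 500 MHz box fails on CPU speed; a 1.8 GHz, 256 MB machine passes
# as long as the display does not take a share of its RAM.
print(unmet_hpf2_requirements(Machine(cpu_mhz=500, ram_mb=256)))
print(unmet_hpf2_requirements(Machine(cpu_mhz=1800, ram_mb=256)))
```

Returning the list of failed requirements, rather than a single yes/no, makes it easier to see which setting to look at first.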

Let us know if you observe anything odd outside of the UD agent.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Jul 8, 2006 6:55:33 PM]
dclemens
Cruncher
Joined: Nov 17, 2004
Post Count: 4
Status: Offline
Re: Long run time with no progress

Thanks for the tips. I killed the UD_xx process in Task Manager and after communicating with the WCG server, my high-end machine is now progressing again.

My 2nd machine is a low-end box (500 MHz) so as you suggested, it's not powerful enough to work on HPF2. I've changed that machine's profile to not work on HPF2 but it is still Paused after receiving the 3366 bytes from the server. Maybe it needs an overnight to process the Profile change?

Thanks again for the help...
[Jul 10, 2006 3:12:23 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Long run time with no progress

FAAH has the same minimum requirements as HPF2. HPF1 has now finished, so if your machine can't process HPF2 then you will have to wait for the next project launch, or switch that machine to a different crunching project.

I am fairly certain that the next project will a) launch very soon and b) have lower system requirements. It is completely different to the molecular simulations we have been running on the grid so far.
[Jul 10, 2006 3:44:32 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Long run time with no progress

I had the same long CPU time (83 hours) with 0% complete while in HPF2.

I killed the process through Task Manager and WCG assigned me an FAAH project. Thanks for the good heads-up.
[Jul 11, 2006 8:03:16 AM]