Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 19
|
![]() |
Author |
|
TPCBF
Master Cruncher USA Joined: Jan 2, 2011 Post Count: 1957 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
And coming home tonight, the host from my first post in this thread has another blocking WU.
----------------------------------------Application Mapping Cancer Markers 7.41And again, how can the CPU time be larger than the elapsed time ? ![]() |
||
|
jay_Orlando
Senior Cruncher USA Joined: Jan 4, 2006 Post Count: 181 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Hey There.
----------------------------------------I am running Linux and having a problem with Matching Cancer Markers (MCM). Is this the same problem - or should I start another post? I have 11GB of memory, and 8 processors, yet I see - 12 different (by process ID ) boincmgr tasks running - each using 98 GB of VM - 10 processes of WebKit process - each using 97 GB of VM - 11 processes of WebKit process - each using 81.9 GB of VM I was running - 1 WU of Einstien with 1.0 dedicated CPU - 4 WU of MCM - 3 WU of MIP - I have - Let all WU complete; Reset and removed projetcs; Removed and reinstalled BOINC - and cold shutdown and restarts several times. of User/System times - about 25 % of each WU process is in system time (by htop and gkrellm) HOWEVER, vmstat and the system monitor shows less than 25% of my memory used. My BOINC Computing Preferences include: - Memory - CPU in use: 80% - Memory - CPU Not in use: 80% - Page/swap file : 75% System monitor shows NO swap file in use. (0 of 48.83 GiB) I am stumped. Also, Boinc Event Log shows no errors. Here is what it says:
THanks in ADVANCE!! jay PS Extensive memory tests show no error. Read/Write of every block in that partition holding /var shows No errors. I don't always get this problem. I don't always run MCM. I am aborting and letting all MCM tasks complete and will reboot and seen if the porblem continues. (( Stay safe out there!! )) jay ------------------ [edit] PPS The use of sytem time goes away whem no MCM WU running. However 32 (!) BOINC-related tasks are *still* using 98, 97, or 81GB of memory. [/edit] ![]() [Edit 1 times, last edit by jay_Orlando at Mar 31, 2020 2:34:30 PM] |
||
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12436 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Jay
Off topic but related to your set-up. I am running 1 Einstein with WCG, but my Einstein is running on GPU only, leaving 8 threads for WCG. Mike |
||
|
jay_Orlando
Senior Cruncher USA Joined: Jan 4, 2006 Post Count: 181 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Hello Mike,
----------------------------------------That is what I am doing. Einstein WU runs on GPU - but I have 1.0 CPU assigned to support the GPU - leaving 7 of my kernels to crunch WCG. Do you see any weirdness with MCM?? Jay ![]() |
||
|
Macroman
Advanced Cruncher Joined: Jun 4, 2005 Post Count: 112 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I had what I think is the same issue on an old machine running Windows 10. I find that it occasionally gets stuck at various points but my experience has been that I can suspend and restart the task and have it finish normally. I have found that often several hours of computation are wasted after the resume. I have considered creating a tool to monitor and automatically correct stuck tasks but have not undertaken this so far.
|
||
|
DE113936
Cruncher Joined: Mar 28, 2016 Post Count: 7 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I experienced this issue on Windows based systems in the past. After an undefined timespan the WU (Zika or MIP) resets itself, restarts from the last checkpoint and finishes successfully. If you want to speed up the process close BOINC with the option to stop all running tasks. Open task manager and search if there is a project related task still listed (in my case the affected task won’t stop gracefully) and aboard it. Restart BOINC and start processing again. I never experienced that the same task run in this issue twice.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Last I know BOINC can handle at least 200 slots, for each new job one is created whilst the slot for the last finished job is held until it is transmitted and reported. Then it is to go into a cleaning cycle and deleted. A never ending job will of course hold on to the slot but 'blocking slots' to the extend it affects any other jobs, don't think so.
|
||
|
TPCBF
Master Cruncher USA Joined: Jan 2, 2011 Post Count: 1957 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
And here is a new version of that issue:
----------------------------------------
![]() |
||
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12436 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Jay
If you restrict Einstein to BRP4, it only uses part of a thread, leaving 8 available for WCG. Mike |
||
|
|
![]() |