Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 38
|
![]() |
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Just seeing your ramdisk info addition, is the disk being back-upped in real time in intervals? Is this process locking pieces of the ram memory, causing bits to not get updated when they need to? Ask the developers at the alpha mail list would be my next step.
On ramdisks, saw this a little while ago about dynamic ramdisk sizing, freeing up memory if storage needs are small, growing when there's demand: http://betanews.com/2014/01/26/imdisk-toolkit-adds-dynamic-ram-disks/ |
||
|
Mgruben
Advanced Cruncher Joined: May 26, 2013 Post Count: 94 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Just seeing your ramdisk info addition, is the disk being back-upped in real time in intervals? Is this process locking pieces of the ram memory, causing bits to not get updated when they need to? Ask the developers at the alpha mail list would be my next step. The RAMdisk is not backed up to the SSD (personal choice; I know I risk losing days of work in a power outage). I do transfer the /slots directory to the SSD prior to system restarts and reload that data to the RAMdisk prior to restarting the boinc client.I am unsure about lockage. If the odd behavior you mention is actually present, I am unsure why it would only surface now, when the machine has been a dedicated CEP2 cruncher (until recently) since August 2013. ![]() |
||
|
Mgruben
Advanced Cruncher Joined: May 26, 2013 Post Count: 94 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
On ramdisks, saw this a little while ago about dynamic ramdisk sizing, freeing up memory if storage needs are small, growing when there's demand: http://betanews.com/2014/01/26/imdisk-toolkit-adds-dynamic-ram-disks/ The program you link to appears to mimic the behavior of linux' tmpfs and ramfs file systems (see also) (tmpfs has a hard limit [in my case 7.7GB] while ramfs doesn't really obey limits, but neither will hog those 7.7GB unless actually using them)![]() |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Pass, ran out of obvious ideas such as backtracking what may have changed at time of the trouble surfacing and doing thorough hardware diagnostics
|
||
|
Mgruben
Advanced Cruncher Joined: May 26, 2013 Post Count: 94 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Pass, ran out of obvious ideas such as backtracking what may have changed at time of the trouble surfacing and doing thorough hardware diagnostics Haha not a problem; thanks for your persistence at trying to work through my problems for me ![]() |
||
|
BobCat13
Senior Cruncher Joined: Oct 29, 2005 Post Count: 295 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Mgruben,
The next time you see this, could you run the following command from the /slots/ directory: echo */ | wc I just ran it on my Linux Mint install that has slots 0-6 (7 total) and the output looked like this: 1 7 21 1=/slots/ directory itself 7=0-6 numbered directories 21=total number of directories within /slots/ according to linux It appears . and .. are counted as directories, so that is why the total directories under /slots/ is 21 on my machine. Running several tasks of cep2 may add up with its subdirectories when counting the . and .. as well. |
||
|
Mgruben
Advanced Cruncher Joined: May 26, 2013 Post Count: 94 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Bobcat,
----------------------------------------The 400 slots error resumed this morning, so I followed your suggestion: [root@system home]# cd /var/lib/boinc/slotsThis command however does not appear to count beyond a depth of 1, unlike the following command: [root@system boinc]# find slots -mindepth 1 -type d | wc -l ![]() [Edit 2 times, last edit by Mgruben at Mar 28, 2014 10:00:38 AM] |
||
|
BobCat13
Senior Cruncher Joined: Oct 29, 2005 Post Count: 295 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
You are correct as echo does not go deep enough. Sorry about that.
I would be curious to see how many directories are in one of those slots that has a cep2 tasks in it. Could you locate which slot has a cep2 and cd into it and then run the find command again on that slot only? |
||
|
Mgruben
Advanced Cruncher Joined: May 26, 2013 Post Count: 94 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I would be curious to see how many directories are in one of those slots that has a cep2 tasks in it. Could you locate which slot has a cep2 and cd into it and then run the find command again on that slot only? Folder 1 contains a CEP2 WU which is suspended while BOINC lets rosetta catch up; it's directory count alone was 219.I note as an aside however that: (1) the 1000+ directory count noted above was when the rig was working on four Rosetta@Home work units. If this is a boinc-level problem, then the error's presence even outside of WCG-context would make sense, (2) even though the rig has been quiet (has not been giving 400 slot directory errors) for the past 6 hours, the current output of "find slots -mindepth 1 -type d | wc -l" is 1026. One would think that such a high slot directory count should cause errors to be thrown when boinc attempts to start new tasks, but apparently not. Log since this morning (exited boinc client to disable my slots RAMdisk then restarted after umounting) 1: 28-Mar-2014 04:52:32 (low) [] cc_config.xml not found - using defaults ![]() |
||
|
PMH_UK
Veteran Cruncher UK Joined: Apr 26, 2007 Post Count: 771 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Could it be a misleading message due to an issue creating a new slot directory ?
----------------------------------------If something like permissions were such that existing slot directories could be used but new ones not created you may get this message as code does not expect other failures creating a slot directory. Paul.
Paul.
|
||
|
|
![]() |