Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 38
|
![]() |
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
mgruben, you may want to post the content of a result log for one task that encountered this issue. Hopefully it records some error code.
|
||
|
Mgruben
Advanced Cruncher Joined: May 26, 2013 Post Count: 94 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Could it be a misleading message due to an issue creating a new slot directory ? I made sure to runIf something like permissions were such that existing slot directories could be used but new ones not created you may get this message as code does not expect other failures creating a slot directory. chown -R boinc:boinc /var/lib/boinc/slotsabout a day or two ago to eliminate the possibility of permissions errors ![]() |
||
|
Mgruben
Advanced Cruncher Joined: May 26, 2013 Post Count: 94 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
mgruben, you may want to post the content of a result log for one task that encountered this issue. Hopefully it records some error code. Unfortunately, I'm not sure which WU that would be; there are(1) WUs which are running, (2) WUs which are suspended, and (3) WUs which BOINC is trying to start when the 400-slot error is thrown. It's not presently throwing the 400-slot error, but when it does I'll see if any results logs have error codes in them and post. ![]() |
||
|
BobCat13
Senior Cruncher Joined: Oct 29, 2005 Post Count: 295 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I posted about this thread on the Boinc Alpha mailing list and David Anderson responded:
"Subdirectories of slot directories are not counted. The only reason for lots of slot directories that I can think of are: 1) lots of started jobs, each one getting preempted by the next. If this happens it's a bug in the scheduler. 2) a permission problem that prevents slot directories from getting cleared and reused. If we see this problem again lets look for signs of one of these." It isn't #1 since you only have 6 slot directories, so it could be a permissions problem. Paul may have the right idea that new slots can't be created, but already existing ones can be reused once a task completes. Next time you get the 400 slots error, try checking permissions on on the /slots/ directory. Maybe the individually numbered directories have write permission, but the parent /slots/ directory doesn't. |
||
|
Mgruben
Advanced Cruncher Joined: May 26, 2013 Post Count: 94 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
It isn't #1 since you only have 6 slot directories, so it could be a permissions problem. This seems like a valid deduction; I note that my previous attempt at ensuring that permissions were not the cause was to runsudo chown -R boinc:boinc /var/lib/boinc/slotsThough perhaps I am misunderstanding what the permissions of the slots directory should be set to (e.g., perhaps it's actually a chmod problem): ls -l /var/lib/boinc ![]() |
||
|
seippel
Former World Community Grid Tech Joined: Apr 16, 2009 Post Count: 392 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Mgruben,
When you are experiencing the problem, can you try to create and remove a test directory in the slots directory as the boinc user (from the command line)? Seippel |
||
|
Jim1348
Veteran Cruncher USA Joined: Jul 13, 2009 Post Count: 1066 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I run 12 GB ramdisks on Win7 64-bit machines that are backed up by a UPS and normally have no problems. But if a crash occurs for some reason, then the ramdisks can get corrupted, and I get strange error messages. I don't know about Linux, but I would try deleting the ramdisk and starting over.
|
||
|
Mgruben
Advanced Cruncher Joined: May 26, 2013 Post Count: 94 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
(Seeing your helpful posts; waiting on error to recur before I can test their suggestions)
----------------------------------------![]() |
||
|
|
![]() |