Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 38
Posts: 38   Pages: 4   [ 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 3557 times and has 37 replies Next Thread
Mgruben
Advanced Cruncher
Joined: May 26, 2013
Post Count: 94
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
"400 slots directories" work unit error

Hey all, the below message is repeated hundreds of times in my boinccmd --get_messages output for both the work units E219825_298_K.21.C14FH9N2OSSi2.00442751.2.set1d06_4 and E219830_386_K.21.C15FH7N2OSSe.00263858.0.set1d06_3:
2988: 10-Mar-2014 03:38:08 (internal error) [World Community Grid] [error] exceeded limit of 400 slot directories
2989: 10-Mar-2014 03:38:08 (internal error) [World Community Grid] [error] Can't create task for E219830_386_K.21.C15FH7N2OSSe.00263858.0.set1d06_3

----------------------------------------

[Mar 10, 2014 9:52:00 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: "400 slots directories" work unit error

Hello Mgruben,
Congratulations for being the first with that error message. I suggest that you reboot and see if you can get it again. If so, please post the first 50 or so lines in your event log so that everybody can see what sort of system you have.

Lawrence
[Mar 10, 2014 1:30:56 PM]   Link   Report threatening or abusive post: please login first  Go to top 
BobCat13
Senior Cruncher
Joined: Oct 29, 2005
Post Count: 295
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: "400 slots directories" work unit error

This really shouldn't require a reboot. The client is supposed to delete any empty slot directories upon each startup, so stopping the client and then starting it again should clean up the slots directory.
[Mar 10, 2014 2:34:47 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: "400 slots directories" work unit error

The client is supposed to reuse empty slots, suggesting there's crud in them, implying the client_state.xml and what else could be corrupted. If a client restart does not clear the situation, project reset, even a project detach add back. Of course, how on earth did it get to this state? 1 started job uses 1 slot, 1 complete job vacates the slot for reuse and if redundant, delete on restart.
[Mar 10, 2014 2:48:55 PM]   Link   Report threatening or abusive post: please login first  Go to top 
BobCat13
Senior Cruncher
Joined: Oct 29, 2005
Post Count: 295
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: "400 slots directories" work unit error

Don't know what caused that many slots directories, but at least it wasn't more than 12 million of them.

http://boinc.berkeley.edu/dev/forum_thread.php?id=8677

After that report, the client was set to ncpus *100 for maximum slot directories.

http://lists.ssl.berkeley.edu/pipermail/boinc_dev/2013-October/020451.html
[Mar 10, 2014 7:43:33 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Mgruben
Advanced Cruncher
Joined: May 26, 2013
Post Count: 94
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: "400 slots directories" work unit error

Client cleared itself up after about an hour or so, so the post is now heavily mooted to me (unfortunately for the resolution of this mystery error). While receiving the error, however, one core lay completely idle, so it's not harmless to uptime.

Note though that I only have the directories "0", "1", "2", "3," "4", and "5" in my /var/lib/boinc/slots directory, so my guess is that it may (well, must) be tallying subdirectories as well.
----------------------------------------

[Mar 11, 2014 2:36:22 AM]   Link   Report threatening or abusive post: please login first  Go to top 
petnek
Advanced Cruncher
Czech Republic
Joined: Mar 17, 2008
Post Count: 89
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: "400 slots directories" work unit error

Hello,
I have also error WUs.

1 on Xeon i5-1620 (32GB RAM)
E219856_ 415_ K.22.C17FH9N2O2.00417287.0.set1d06_ 2--

5 on Xeon W3530 (16GB RAM)
E219875_ 298_ K.22.C16FH8N3S2.00384194.4.set1d06_ 0--
E219875_ 303_ K.22.C15FH6N3O2Se.00259838.2.set1d06_ 0--
E219875_ 302_ K.22.C16FH8N3S2.00290903.2.set1d06_ 0--
E219875_ 515_ K.22.C15FH8N3OSSi.00392895.2.set1d06_ 0--
E219852_ 850_ K.21.C17FH11N2Si.00394230.2.set1d06_ 0--

Log is aslmost same for all these WUs.

Like I see, noone finished these work units yet. For everyone which crunch them is result error.

<core_client_version>6.10.18</core_client_version>
<![CDATA[
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[02:53:52] Number of jobs = 16
[02:53:52] Starting job 0,CPU time has been restored to 0.000000.
[02:56:28] Finished Job #0
[02:56:28] Starting job 1,CPU time has been restored to 146.687740.
[03:02:33] Finished Job #1
[03:02:33] Starting job 2,CPU time has been restored to 499.156400.
[04:49:17] Finished Job #2
[04:49:17] Starting job 3,CPU time has been restored to 6658.824685.
[04:56:34] Finished Job #3
[04:56:34] Starting job 4,CPU time has been restored to 7071.150928.
[05:00:51] Finished Job #4
[05:00:51] Starting job 5,CPU time has been restored to 7321.532533.
[05:05:29] Finished Job #5
[05:05:29] Starting job 6,CPU time has been restored to 7586.734233.
[05:09:52] Finished Job #6
[05:09:52] Starting job 7,CPU time has been restored to 7838.176644.
[05:15:46] Finished Job #7
[05:15:46] Starting job 8,CPU time has been restored to 8175.575607.
[05:19:28] Finished Job #8
[05:19:28] Starting job 9,CPU time has been restored to 8387.799368.
[05:27:29] Finished Job #9
[05:27:29] Starting job 10,CPU time has been restored to 8840.202268.
[05:37:17] Finished Job #10
[05:37:17] Starting job 11,CPU time has been restored to 9406.205096.
[05:43:13] Finished Job #11
[05:43:13] Starting job 12,CPU time has been restored to 9749.672497.
[06:24:40] Finished Job #12
[06:24:40] Starting job 13,CPU time has been restored to 12095.880737.
Quit requested: Exiting
[07:59:10] Number of jobs = 16
[07:59:10] Starting job 13,CPU time has been restored to 12095.880737.
Quit requested: Exiting
[10:52:37] Number of jobs = 16
[10:52:37] Starting job 13,CPU time has been restored to 12095.880737.
[11:32:44] Finished Job #13
[11:32:44] Starting job 14,CPU time has been restored to 14499.824947.
[12:09:42] Finished Job #14
[12:09:42] Starting job 15,CPU time has been restored to 16704.867882.
[13:01:15] Finished Job #15
13:01:19 (7148): called boinc_finish

</stderr_txt>
]]>

----------------------------------------

[Mar 12, 2014 5:54:08 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: "400 slots directories" work unit error

Hello petnek,

The work unit you show appears to have behaved perfectly normally, but has still been marked as in error. This appears to be the same problem as reported in this thread .
[Mar 12, 2014 10:46:57 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Mgruben
Advanced Cruncher
Joined: May 26, 2013
Post Count: 94
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: "400 slots directories" work unit error

It returns
3784: 25-Mar-2014 06:32:09 (internal error) [World Community Grid] [error] Can't create task for E220295_770_K.22.C18FH11OSeSi.00386875.3.set1d06_3
3785: 25-Mar-2014 06:32:09 (internal error) [World Community Grid] [error] exceeded limit of 400 slot directories
3786: 25-Mar-2014 06:32:09 (internal error) [World Community Grid] [error] Can't create task for E219964_187_K.22.C19FH11O2.00214978.4.set1d06_4
3787: 25-Mar-2014 06:32:09 (internal error) [World Community Grid] [error] exceeded limit of 400 slot directories
3788: 25-Mar-2014 06:32:09 (internal error) [World Community Grid] [error] Can't create task for E219964_913_K.21.C17FH13N2Si.00273997.2.set1d06_4
3789: 25-Mar-2014 06:32:16 (internal error) [World Community Grid] [error] exceeded limit of 400 slot directories
3790: 25-Mar-2014 06:32:16 (internal error) [World Community Grid] [error] Can't create task for E220295_770_K.22.C18FH11OSeSi.00386875.3.set1d06_3
3791: 25-Mar-2014 06:32:16 (internal error) [World Community Grid] [error] exceeded limit of 400 slot directories
3792: 25-Mar-2014 06:32:16 (internal error) [World Community Grid] [error] Can't create task for E219964_187_K.22.C19FH11O2.00214978.4.set1d06_4
3793: 25-Mar-2014 06:32:16 (internal error) [World Community Grid] [error] exceeded limit of 400 slot directories
3794: 25-Mar-2014 06:32:16 (internal error) [World Community Grid] [error] Can't create task for E219964_913_K.21.C17FH13N2Si.00273997.2.set1d06_4
3795: 25-Mar-2014 06:32:17 (internal error) [World Community Grid] [error] exceeded limit of 400 slot directories
3796: 25-Mar-2014 06:32:17 (internal error) [World Community Grid] [error] Can't create task for E220295_770_K.22.C18FH11OSeSi.00386875.3.set1d06_3
3797: 25-Mar-2014 06:32:17 (internal error) [World Community Grid] [error] exceeded limit of 400 slot directories
3798: 25-Mar-2014 06:32:17 (internal error) [World Community Grid] [error] Can't create task for E219964_187_K.22.C19FH11O2.00214978.4.set1d06_4
3799: 25-Mar-2014 06:32:17 (internal error) [World Community Grid] [error] exceeded limit of 400 slot directories
3800: 25-Mar-2014 06:32:17 (internal error) [World Community Grid] [error] Can't create task for E219964_913_K.21.C17FH13N2Si.00273997.2.set1d06_4
3801: 25-Mar-2014 06:32:24 (internal error) [World Community Grid] [error] exceeded limit of 400 slot directories
3802: 25-Mar-2014 06:32:24 (internal error) [World Community Grid] [error] Can't create task for E220295_770_K.22.C18FH11OSeSi.00386875.3.set1d06_3
3803: 25-Mar-2014 06:32:24 (internal error) [World Community Grid] [error] exceeded limit of 400 slot directories
3804: 25-Mar-2014 06:32:24 (internal error) [World Community Grid] [error] Can't create task for E219964_187_K.22.C19FH11O2.00214978.4.set1d06_4
3805: 25-Mar-2014 06:32:24 (internal error) [World Community Grid] [error] exceeded limit of 400 slot directories
3806: 25-Mar-2014 06:32:24 (internal error) [World Community Grid] [error] Can't create task for E219964_913_K.21.C17FH13N2Si.00273997.2.set1d06_4
3807: 25-Mar-2014 06:32:25 (internal error) [World Community Grid] [error] exceeded limit of 400 slot directories
3808: 25-Mar-2014 06:32:25 (internal error) [World Community Grid] [error] Can't create task for E220295_770_K.22.C18FH11OSeSi.00386875.3.set1d06_3
3809: 25-Mar-2014 06:32:25 (internal error) [World Community Grid] [error] exceeded limit of 400 slot directories
3810: 25-Mar-2014 06:32:25 (internal error) [World Community Grid] [error] Can't create task for E219964_187_K.22.C19FH11O2.00214978.4.set1d06_4
3811: 25-Mar-2014 06:32:25 (internal error) [World Community Grid] [error] exceeded limit of 400 slot directories
3812: 25-Mar-2014 06:32:25 (internal error) [World Community Grid] [error] Can't create task for E219964_913_K.21.C17FH13N2Si.00273997.2.set1d06_4

----------------------------------------

[Mar 25, 2014 11:34:18 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: "400 slots directories" work unit error

When you look in the /boinc/slots place now, does it show that many i.e. slots/399 as the highest? If not do slots plus sub-directories there off add up to this number? As lawrenceharding commented, not seen here before your report, there's something special about your system. Is it caching the disc structures and not writing the updates to disc? Look at write to disc delays. If there's a cache-flush command in linux, run that.
[Mar 25, 2014 2:21:30 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 38   Pages: 4   [ 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread