Thread Status: Active
Total posts in this thread: 23
This topic has been viewed 22949 times and has 22 replies
SekeRob
Master Cruncher
Joined: Jan 7, 2013
Post Count: 2741
Status: Offline
BOINC: Another scheduler instance is running for this host? [RESOLVED]

17433 World Community Grid 10/30/2015 12:11:50 PM [checkpoint] result FAH2_avx17587-ls_000006_0008_011_0 checkpointed
17434 World Community Grid 10/30/2015 12:12:37 PM [sched_op] Starting scheduler request
17435 World Community Grid 10/30/2015 12:12:37 PM [trickle] read trickle file projects/www.worldcommunitygrid.org/trickle_up_FAH2_avx38779-ls_000018_0002_009_0_1446202843.xml.sent
17436 World Community Grid 10/30/2015 12:12:37 PM Sending scheduler request: To report completed tasks.
17437 World Community Grid 10/30/2015 12:12:37 PM Reporting 1 completed tasks
17438 World Community Grid 10/30/2015 12:12:37 PM Requesting new tasks for CPU
17439 World Community Grid 10/30/2015 12:12:37 PM [sched_op] CPU work request: 5707.24 seconds; 0.00 devices
17440 World Community Grid 10/30/2015 12:12:40 PM Scheduler request completed: got 0 new tasks
17441 World Community Grid 10/30/2015 12:12:40 PM [sched_op] Server version 701
17442 World Community Grid 10/30/2015 12:12:40 PM Another scheduler instance is running for this host
17443 World Community Grid 10/30/2015 12:12:40 PM Project requested delay of 121 seconds
17444 World Community Grid 10/30/2015 12:12:40 PM [sched_op] Deferring communication for 00:02:01
17445 World Community Grid 10/30/2015 12:12:40 PM [sched_op] Reason: requested by project

This started cycling some time ago on 2 different hosts. The effect is that tasks that are Ready to Report do not get sent, because there are perpetual new back-offs/deferrals. Both hosts run FAHB, but the effect is on all apps; notably, CEP2, UGM2 and FAHB jobs won't report.
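
For anyone who wants to check their own Event Log for the same cycle, here is a rough Python sketch. It assumes the log was copied out as plain text in the same layout as the excerpt above (sequence number, optional project name, date, time, message); the function names are made up for this example and are not part of BOINC.

```python
import re
from datetime import datetime

# Assumed line layout, taken from the log excerpt in this post:
#   <seq> [project name] <M/D/YYYY> <h:mm:ss> <AM|PM> <message>
LINE_RE = re.compile(
    r"^(?P<seq>\d+)\s+(?P<project>.*?)\s*"
    r"(?P<stamp>\d{1,2}/\d{1,2}/\d{4} \d{1,2}:\d{2}:\d{2} [AP]M)\s+"
    r"(?P<msg>.*)$"
)
BUSY_MSG = "Another scheduler instance is running for this host"


def parse_line(line):
    """Return (timestamp, message) for one log line, or None if it does not match."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    stamp = datetime.strptime(m.group("stamp"), "%m/%d/%Y %I:%M:%S %p")
    return stamp, m.group("msg")


def busy_replies(lines):
    """Collect the timestamps of every 'another scheduler instance' reply."""
    hits = []
    for line in lines:
        parsed = parse_line(line)
        if parsed is not None and parsed[1] == BUSY_MSG:
            hits.append(parsed[0])
    return hits
```

Feeding the saved log lines to `busy_replies()` gives one timestamp per rejected request, so the gaps between them show how often the client is being turned away.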

Eventually I got tired of this and hit the Update button. The effect was 1 connect failure, on just the 1 client that actually wanted new work, and then the conflicting back-offs resumed on all of them:

17494 World Community Grid 10/30/2015 12:20:15 PM update requested by user
17495 World Community Grid 10/30/2015 12:20:16 PM sched RPC pending: Requested by user
17496 World Community Grid 10/30/2015 12:20:16 PM [sched_op] Starting scheduler request
17497 World Community Grid 10/30/2015 12:20:16 PM [trickle] read trickle file projects/www.worldcommunitygrid.org/trickle_up_FAH2_avx175573-ls_000074_0018_007_0_1446203862.xml.sent
17498 World Community Grid 10/30/2015 12:20:16 PM [trickle] read trickle file projects/www.worldcommunitygrid.org/trickle_up_FAH2_avx38779-ls_000018_0002_009_0_1446202843.xml.sent
17499 World Community Grid 10/30/2015 12:20:16 PM Sending scheduler request: Requested by user.
17500 World Community Grid 10/30/2015 12:20:16 PM Reporting 1 completed tasks
17501 World Community Grid 10/30/2015 12:20:16 PM Requesting new tasks for CPU
17502 World Community Grid 10/30/2015 12:20:16 PM [sched_op] CPU work request: 6487.39 seconds; 0.00 devices
17503 10/30/2015 12:20:22 PM Project communication failed: attempting access to reference site
17504 World Community Grid 10/30/2015 12:20:22 PM Scheduler request failed: SSL connect error
17505 World Community Grid 10/30/2015 12:20:22 PM [sched_op] Deferring communication for 00:01:34
17506 World Community Grid 10/30/2015 12:20:22 PM [sched_op] Reason: Scheduler request failed
17507 10/30/2015 12:20:24 PM Internet access OK - project servers may be temporarily down.
17508 World Community Grid 10/30/2015 12:20:41 PM [checkpoint] result FAH2_avx17587-ls_000006_0008_011_0 checkpointed
17509 World Community Grid 10/30/2015 12:20:42 PM Started upload of FAH2_avx17587-ls_000006_0008_011_0_5
17510 World Community Grid 10/30/2015 12:20:42 PM Started upload of FAH2_avx17587-ls_000006_0008_011_0_15
17511 World Community Grid 10/30/2015 12:20:48 PM Finished upload of FAH2_avx17587-ls_000006_0008_011_0_5
17512 World Community Grid 10/30/2015 12:20:48 PM Finished upload of FAH2_avx17587-ls_000006_0008_011_0_15
17513 World Community Grid 10/30/2015 12:21:57 PM [sched_op] Starting scheduler request
17514 World Community Grid 10/30/2015 12:21:57 PM [trickle] read trickle file projects/www.worldcommunitygrid.org/trickle_up_FAH2_avx175573-ls_000074_0018_007_0_1446203862.xml.sent
17515 World Community Grid 10/30/2015 12:21:57 PM [trickle] read trickle file projects/www.worldcommunitygrid.org/trickle_up_FAH2_avx17587-ls_000006_0008_011_0_1446204041.xml
17516 World Community Grid 10/30/2015 12:21:57 PM [trickle] read trickle file projects/www.worldcommunitygrid.org/trickle_up_FAH2_avx38779-ls_000018_0002_009_0_1446202843.xml.sent
17517 World Community Grid 10/30/2015 12:21:57 PM Sending scheduler request: To send trickle-up message.
17518 World Community Grid 10/30/2015 12:21:57 PM Reporting 1 completed tasks
17519 World Community Grid 10/30/2015 12:21:57 PM Requesting new tasks for CPU
17520 World Community Grid 10/30/2015 12:21:57 PM [sched_op] CPU work request: 6658.54 seconds; 0.00 devices
17521 World Community Grid 10/30/2015 12:22:01 PM Scheduler request completed: got 0 new tasks
17522 World Community Grid 10/30/2015 12:22:01 PM [sched_op] Server version 701
17523 World Community Grid 10/30/2015 12:22:01 PM Another scheduler instance is running for this host
17524 World Community Grid 10/30/2015 12:22:01 PM Project requested delay of 121 seconds
17525 World Community Grid 10/30/2015 12:22:01 PM [sched_op] Deferring communication for 00:02:01

The logograte user was locked [how/why that happened is the root-cause question to me]. uplinger reset some processes on the go.
----------------------------------------
[Edit 1 times, last edit by SekeRob* at Oct 30, 2015 2:55:07 PM]
[Oct 30, 2015 11:30:23 AM]
depriens
Senior Cruncher
The Netherlands
Joined: Jul 29, 2005
Post Count: 350
Status: Offline
Re: BOINC: Another scheduler instance is running for this host?

Same here, unable to report completed tasks or download new ones.
----------------------------------------

[Oct 30, 2015 11:36:21 AM]
cjslman
Master Cruncher
Mexico
Joined: Nov 23, 2004
Post Count: 2082
Status: Offline
Re: BOINC: Another scheduler instance is running for this host?

Yep, seeing the same thing: "Another scheduler instance is running for this host" in the event log, and finished WUs not being uploaded. :(

CJSL

Crunching like there's no tomorrow...
----------------------------------------
I follow the Gimli philosophy: "Keep breathing. That's the key. Breathe."
Join The Cahuamos Team


[Oct 30, 2015 11:49:17 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: BOINC: Another scheduler instance is running for this host?

Same here. It started about 1/2 hour ago.
[Oct 30, 2015 11:50:28 AM]
SekeRob
Master Cruncher
Joined: Jan 7, 2013
Post Count: 2741
Status: Offline
Re: BOINC: Another scheduler instance is running for this host?

The WCG site seems sluggish too, with connects failing at times [maybe many people are checking in to see what's going on]. To try to make some progress on the matter, I suspended networking, waited a while, and then looked at the Project properties: no back-offs running. I allowed networking again, and the same [standard] cycle of 2:01 minutes resumed.

2108 World Community Grid 10/30/2015 1:01:13 PM Scheduler request completed
2109 World Community Grid 10/30/2015 1:01:13 PM [sched_op] Server version 701
2110 World Community Grid 10/30/2015 1:01:13 PM Another scheduler instance is running for this host
2111 World Community Grid 10/30/2015 1:01:13 PM Project requested delay of 121 seconds
2112 World Community Grid 10/30/2015 1:01:13 PM [sched_op] Deferring communication for 00:02:01
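
The 2:01 cycle lines up with the 121-second delay the project asks for in the log above. Purely as an illustration of that arithmetic (the function name is hypothetical, and this is not how the BOINC client itself formats the value), a short Python sketch:

```python
import re

# Matches the message text seen in the log excerpt above.
DELAY_RE = re.compile(r"Project requested delay of (\d+) seconds")


def delay_as_clock(message):
    """Turn a 'Project requested delay of N seconds' message into HH:MM:SS.

    Returns None when the message does not contain a requested delay.
    """
    m = DELAY_RE.search(message)
    if m is None:
        return None
    seconds = int(m.group(1))
    hours, rest = divmod(seconds, 3600)
    minutes, secs = divmod(rest, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"
```

Calling `delay_as_clock("Project requested delay of 121 seconds")` returns `"00:02:01"`, the same figure shown in the "Deferring communication" line.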
[Oct 30, 2015 12:02:50 PM]
asdavid
Veteran Cruncher
FRANCE
Joined: Nov 18, 2004
Post Count: 521
Status: Offline
Re: BOINC: Another scheduler instance is running for this host?

It gives me some comfort to know I am not alone in having this problem. :(
----------------------------------------
Anne-Sophie

[Oct 30, 2015 12:03:08 PM]
gio777
Advanced Cruncher
Georgia
Joined: Dec 8, 2004
Post Count: 69
Status: Offline
Re: BOINC: Another scheduler instance is running for this host?

OK, just continuing to work.
----------------------------------------

[Oct 30, 2015 12:28:25 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: BOINC: Another scheduler instance is running for this host?

Same thing here. I thought this was due to an outdated client, so I went ahead and updated it. Obviously, nothing got resolved...

P.S. These forums also give an SSL connect error occasionally...
----------------------------------------
[Edit 1 times, last edit by Former Member at Oct 30, 2015 12:59:59 PM]
[Oct 30, 2015 12:54:50 PM]
foxfire
Advanced Cruncher
United States
Joined: Sep 1, 2007
Post Count: 121
Status: Offline
Re: BOINC: Another scheduler instance is running for this host?

I have the same problem when attempting to report completed tasks. Also, I can't pull Result Status using the API (https://secure.worldcommunitygrid.org/api/mem...By=DeviceId&xml=true).

Getting:

This page can’t be displayed
Turn on TLS 1.0, TLS 1.1, and TLS 1.2 in Advanced settings and try connecting to https://secure.worldcommunitygrid.org again. If this error persists, it is possible that this site uses an unsupported protocol. Please contact the site administrator
----------------------------------------

[Oct 30, 2015 1:19:26 PM]
Localizer
Cruncher
Joined: Feb 10, 2006
Post Count: 12
Status: Offline
Re: BOINC: Another scheduler instance is running for this host?

I'm getting alternating messages of 'Scheduler request failed: SSL connect error' and 'Another scheduler instance is running for this host'.

Not really much of an issue now, as I have a couple of days of WUs in my queue, but it will be very messy if it is not fixed until after the weekend!!
[Oct 30, 2015 1:23:27 PM]