Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 143
Posts: 143   Pages: 15   [ Previous Page | 1 2 3 4 5 6 7 8 9 10 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 165190 times and has 142 replies Next Thread
Sandvika
Advanced Cruncher
United Kingdom
Joined: Apr 27, 2007
Post Count: 112
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Scheduled Maint. July 18, 14:00 UTC, extended?

Ouch, all my Valid workunits uploaded at or after 14:21:56 UTC 18 July have turned Invalid. More tidying up to do, techs!
Yes, same here - so much for the statement that there'll be no loss of data!!!

Bang goes my machines reliability status😫


Yes, 918 invalid WUs, not including the 2969 WUs still listed as "In progress" that are all complete and awaiting upload. After the previous outage I eventually got credit for such WUs but they were all re-issued to and recrunched by others first sad
----------------------------------------

[Jul 19, 2017 1:05:48 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sandvika
Advanced Cruncher
United Kingdom
Joined: Apr 27, 2007
Post Count: 112
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Scheduled Maint. July 18, 14:00 UTC, extended?

Care to share how to add another project under BOINC client?


Tools -> Add project -> Add projects run by other researchers or organizations

Then take your pick and register smile
----------------------------------------

[Jul 19, 2017 1:09:02 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Jim1348
Veteran Cruncher
USA
Joined: Jul 13, 2009
Post Count: 1066
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Scheduled Maint. July 18, 14:00 UTC, extended?

I actually run Rosetta along with WCG anyway, so it is all relatively painless, except to see all the files stuck in pending transfers.
[Jul 19, 2017 1:16:31 PM]   Link   Report threatening or abusive post: please login first  Go to top 
dango
Senior Cruncher
Joined: Jul 27, 2009
Post Count: 307
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Scheduled Maint. July 18, 14:00 UTC, extended?

So the data that you access when you upload and download files sits on a clustered file system. The maintenance window yesterday was scheduled to install the latest kernel on the servers. We completed all the servers associated with our databases, load balancing and website with no issue. We updated the first server associated with this file system with no issue.

However, after rebooting the second server, it marked its disks as 'unrecovered'. The cluster file system has a mechanism for recovering and restoring normal operations, but there was a second issue that is causing that process to run at a much slower pace. We are working on talking to 3rd layer support for the clustered file system software to find out if there is a faster way that we can run the recovery utility.

We do not expect any lose of data, but the utility is extremely careful which makes it very slow in running.


GPFS? I like GPFS smile smile biggrin
[Jul 19, 2017 1:23:15 PM]   Link   Report threatening or abusive post: please login first  Go to top 
jayrope
Cruncher
Joined: Jul 23, 2016
Post Count: 46
Status: Offline
Reply to this Post  Reply with Quote 
Re: Scheduled Maint. July 18, 14:00 UTC, extended?

We do not expect any lose of data, but the utility is extremely careful which makes it very slow in running.


Better to make sure first data is intact. Take your time.
[Jul 19, 2017 1:26:55 PM]   Link   Report threatening or abusive post: please login first  Go to top 
KerSamson
Master Cruncher
Switzerland
Joined: Jan 29, 2007
Post Count: 1672
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Scheduled Maint. July 18, 14:00 UTC, extended?

Ouch, all my Valid workunits uploaded at or after 14:21:56 UTC 18 July have turned Invalid. More tidying up to do, techs!

I am surprised about the above statement, since, last June, I had about 2'000 WUs computed by 6 machines on two different locations, and I did not lose any results.
@tonyh205: did you try to contact the tech team at this time (now it is probably too late for troubles related to the June outage)?
Cheers,
Yves
----------------------------------------
[Jul 19, 2017 1:51:43 PM]   Link   Report threatening or abusive post: please login first  Go to top 
kkwok
Cruncher
Joined: Nov 23, 2004
Post Count: 5
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Scheduled Maint. July 18, 14:00 UTC, extended?

Thank you very much indeed.
[Jul 19, 2017 1:53:48 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Scheduled Maint. July 18, 14:00 UTC, extended?

Ouch, all my Valid workunits uploaded at or after 14:21:56 UTC 18 July have turned Invalid. More tidying up to do, techs!

I am surprised about the above statement, since, last June, I had about 2'000 WUs computed by 6 machines on two different locations, and I did not lose any results.
@tonyh205: did you try to contact the tech team at this time (now it is probably too late for troubles related to the June outage)?
Cheers,
Yves
I suspect they're rather busy at the moment sad
I hope they'll pick up the Valid-to-Invalid issue once they have the file store back in operation.
[Jul 19, 2017 1:57:08 PM]   Link   Report threatening or abusive post: please login first  Go to top 
jhindo
Former World Community Grid Admin
Joined: Aug 25, 2009
Post Count: 250
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Scheduled Maint. July 18, 14:00 UTC, extended?

Yes, we'll address the invalids once we restore operations. We'll also extend task deadlines.

Thanks everyone for your patience and support!

Juan
[Jul 19, 2017 2:26:23 PM]   Link   Report threatening or abusive post: please login first  Go to top 
SekeRob
Master Cruncher
Joined: Jan 7, 2013
Post Count: 2741
Status: Offline
Reply to this Post  Reply with Quote 
Re: Scheduled Maint. July 18, 14:00 UTC, extended?

Ouch, all my Valid workunits uploaded at or after 14:21:56 UTC 18 July have turned Invalid. More tidying up to do, techs!

All my PVer/PVal from 13:56:10 UTC i.e. everything uploaded / reported but not yet validated during the maintenance till that cut off at 15:58 UTC (18th) went Invalid. Valid turning Invalid is a novel pathway.

All clients are powered down as they were idling, and this one I've set to not communicate while working through the last of the last.
----------------------------------------
[Edit 1 times, last edit by SekeRob* at Jul 19, 2017 2:55:58 PM]
[Jul 19, 2017 2:53:18 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 143   Pages: 15   [ Previous Page | 1 2 3 4 5 6 7 8 9 10 | Next Page ]
[ Jump to Last Post ]
Post new Thread