World Community Grid Forums
Thread Status: Active Total posts in this thread: 143
Sandvika
Advanced Cruncher United Kingdom Joined: Apr 27, 2007 Post Count: 112 Status: Offline |
Ouch, all my Valid workunits uploaded at or after 14:21:56 UTC 18 July have turned Invalid. More tidying up to do, techs!
Yes, same here - so much for the statement that there'll be no loss of data!!! Bang goes my machines' reliability status 😫
Yes, 918 invalid WUs, not including the 2969 WUs still listed as "In progress" that are all complete and awaiting upload. After the previous outage I eventually got credit for such WUs, but they were all re-issued to and recrunched by others first. |
||
|
Sandvika
Advanced Cruncher United Kingdom Joined: Apr 27, 2007 Post Count: 112 Status: Offline |
Care to share how to add another project under BOINC client?
Tools -> Add project -> Add projects run by other researchers or organizations. Then take your pick and register. |
||
|
Jim1348
Veteran Cruncher USA Joined: Jul 13, 2009 Post Count: 1066 Status: Offline |
I actually run Rosetta along with WCG anyway, so it is all relatively painless, except to see all the files stuck in pending transfers.
|
||
|
dango
Senior Cruncher Joined: Jul 27, 2009 Post Count: 307 Status: Offline |
So the data that you access when you upload and download files sits on a clustered file system. The maintenance window yesterday was scheduled to install the latest kernel on the servers. We completed all the servers associated with our databases, load balancing and website with no issue. We updated the first server associated with this file system with no issue. However, after rebooting the second server, it marked its disks as 'unrecovered'. The cluster file system has a mechanism for recovering and restoring normal operations, but there was a second issue that is causing that process to run at a much slower pace. We are working on talking to 3rd layer support for the clustered file system software to find out if there is a faster way that we can run the recovery utility. We do not expect any loss of data, but the utility is extremely careful which makes it very slow in running.
GPFS? I like GPFS |
||
|
jayrope
Cruncher Joined: Jul 23, 2016 Post Count: 46 Status: Offline |
We do not expect any loss of data, but the utility is extremely careful which makes it very slow in running.
Better to make sure first that the data is intact. Take your time. |
||
|
KerSamson
Master Cruncher Switzerland Joined: Jan 29, 2007 Post Count: 1672 Status: Offline |
Ouch, all my Valid workunits uploaded at or after 14:21:56 UTC 18 July have turned Invalid. More tidying up to do, techs!
I am surprised by the above statement, since last June I had about 2'000 WUs computed by 6 machines at two different locations, and I did not lose any results. @tonyh205: did you try to contact the tech team at that time (now it is probably too late for troubles related to the June outage)?
Cheers, Yves |
||
|
kkwok
Cruncher Joined: Nov 23, 2004 Post Count: 5 Status: Offline |
Thank you very much indeed.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Ouch, all my Valid workunits uploaded at or after 14:21:56 UTC 18 July have turned Invalid. More tidying up to do, techs!
I am surprised by the above statement, since last June I had about 2'000 WUs computed by 6 machines at two different locations, and I did not lose any results. @tonyh205: did you try to contact the tech team at that time (now it is probably too late for troubles related to the June outage)? Cheers, Yves
I hope they'll pick up the Valid-to-Invalid issue once they have the file store back in operation. |
||
|
jhindo
Former World Community Grid Admin Joined: Aug 25, 2009 Post Count: 250 Status: Offline |
Yes, we'll address the invalids once we restore operations. We'll also extend task deadlines.
Thanks everyone for your patience and support! Juan |
||
|
SekeRob
Master Cruncher Joined: Jan 7, 2013 Post Count: 2741 Status: Offline |
Ouch, all my Valid workunits uploaded at or after 14:21:56 UTC 18 July have turned Invalid. More tidying up to do, techs!
All my PVer/PVal from 13:56:10 UTC, i.e. everything uploaded / reported but not yet validated during the maintenance till that cut-off at 15:58 UTC (18th), went Invalid. Valid turning Invalid is a novel pathway. All clients are powered down as they were idling, and this one I've set to not communicate while working through the last of the last.
[Edit 1 times, last edit by SekeRob* at Jul 19, 2017 2:55:58 PM] |
||