Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Closed
Total posts in this thread: 196
Posts: 196   Pages: 20   [ Previous Page | 7 8 9 10 11 12 13 14 15 16 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 1697624 times and has 195 replies Next Thread
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 1951
Status: Offline
Project Badges:
Re: Hardware Recovery Update

Doctor Jurasica,

How about double points for a few weeks once the work servers are running to help make up the downtime??
IMHO, this is a pretty stupid idea that would just cater to all those point *****s...

Any resource should be much better be used to fix outstanding problems, like the not properly working validator (which didn't get fixed for about 3 weeks BEFORE the meltdown) or on catching up with the missing stats from June though Sept 28th 2022...

Ralf
----------------------------------------

----------------------------------------
[Edit 1 times, last edit by Cyclops at Mar 20, 2023 2:44:25 PM]
[Mar 20, 2023 6:35:47 AM]   Link   Report threatening or abusive post: please login first  Go to top 
shauge
Cruncher
Joined: Dec 10, 2005
Post Count: 19
Status: Offline
Project Badges:
Re: Hardware Recovery Update

Can we get a status update of the recovery?
I look at twitter for updates and this is the last update I see from the 10th of March:
Update: The storage server was revived yesterday late afternoon. Both database filesystems mounted as before, but the science filesystem did not. It needs a repair; erasing the old log first.

----------------------------------------

----------------------------------------
[Edit 1 times, last edit by shauge at Mar 20, 2023 10:27:59 AM]
[Mar 20, 2023 10:27:22 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Grumpy Swede
Master Cruncher
Svíþjóð
Joined: Apr 10, 2020
Post Count: 2167
Status: Offline
Project Badges:
Re: Hardware Recovery Update

Doctor Jurasica,

How about double points for a few weeks once the work servers are running to help make up the downtime??
IMHO, this is a pretty stupid idea that would just cater to all those point whores...

Any resource should be much better be used to fix outstanding problems, like the not properly working validator (which didn't get fixed for about 3 weeks BEFORE the meltdown) or on catching up with the missing stats from June though Sept 28th 2022...

Ralf
+1, I fully agree with Ralf.
[Mar 20, 2023 11:15:50 AM]   Link   Report threatening or abusive post: please login first  Go to top 
adrianxw
Senior Cruncher
Denmark
Joined: Apr 13, 2008
Post Count: 192
Status: Offline
Project Badges:
Re: Hardware Recovery Update

+ Another one.
[Mar 20, 2023 11:40:52 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Greg_BE
Advanced Cruncher
Joined: May 9, 2016
Post Count: 82
Status: Offline
Project Badges:
Re: Hardware Recovery Update

It's almost time to remove this project and find another one to take its place.
Meltdown entering how many weeks now?
What happens after the meltdown? More bugs and delays?
What is happening to the research of the projects that depend on this group to distribute their science needs? It's almost time for them to break out on their own here on BOINC.
----------------------------------------
[Edit 1 times, last edit by Greg_BE at Mar 20, 2023 12:07:28 PM]
[Mar 20, 2023 12:05:47 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Falconet
Master Cruncher
Portugal
Joined: Mar 9, 2009
Post Count: 3295
Status: Offline
Project Badges:
Re: Hardware Recovery Update

My assumption has been that WCG just needs to be at the same level of technology and expertise that it was when IBM ran the show, so that would be the plan, the 'business vision' you rightly expect.

I'm sorry to have to disagree, since the provided statement is not a business plan.
I would never start an investment project with such a limited statement. When I ask for a quantified business plan, I mean at least:
  • Overall objectives
  • Quantified current situation and configuration
  • Target configuration: incl. hardware, software, personnel, energy, support, repair and maintenance
  • Gap analysis
  • Remediation respectively improvement measures
  • Steering committee
  • Governance rules
  • Incomes and sponsoring
  • Monitoring, controls, and reporting

Based on the 2022 communication, I guessed that Krembil inherited the WCG hardware and software platform from IBM. The messages of the last days show finally a totally different reality.
Without knowing the exact reality, without knowing what is effectively needed, without being able to calculate how much human, financial, technical (i.e. expertise and knowledge) resources are required, it is not reasonable to go ahead since the troubles will recurrently occur and the contributors will become tired by the troubles.

I would prefer to experience a stop now and that Krembil takes time to honestly assess if a reliable solution is reachable.
  • Maybe yes: I will be happy to further support WCG
  • Maybe no: in this case an alternate reliable solution must be found or, otherwise, a controlled shutdown must be considered. I will be very sad, but with a lot of good memories and the feeling to have probably contributed to some sciences during the last 15 years.

In all cases, the current situation is highly disappointing and disturbing.
Cheers,
Yves

It was always clear that WCG transitioned from the IBM cloud to their own servers. At least it was to me.
----------------------------------------


AMD Ryzen 5 1600AF 6C/12T 3.2 GHz - 85W
AMD Ryzen 5 2500U 4C/8T 2.0 GHz - 28W
AMD Ryzen 7 7730U 8C/16T 3.0 GHz
[Mar 20, 2023 12:10:49 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12370
Status: Offline
Project Badges:
Re: Hardware Recovery Update

I never expected the hardware to be shifted to Toronto. Software - yes, but changing from Cloud storage would require re-configuration of the software.

Academia is notoriously always short of funding and personnel so taking on WCG was a massive undertaking for Krembil. At least they did, for which we are grateful. I am sure they didn't know how much they were taking on, hence the length of time it has taken to get this far.

Since then, they will have been operating on a shoestring and have been running everything flat out so it is not surprising that equipment failures occur. They need our support - crunching, financial and emotional!

Mike
[Mar 20, 2023 1:57:44 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Grumpy Swede
Master Cruncher
Svíþjóð
Joined: Apr 10, 2020
Post Count: 2167
Status: Offline
Project Badges:
Re: Hardware Recovery Update

Well, WCG crunching have been down for this long:

Last update host XML 2023-02-28 13:11:22 UTC (19 days 18:29:03 old)
Last update user XML 2023-03-01 01:21:01 UTC (19 days 06:19:24 old)
Last update team XML 2023-03-01 01:21:01 UTC (19 days 06:19:24 old)

----------------------------------------
[Edit 1 times, last edit by Grumpy Swede at Mar 20, 2023 2:32:30 PM]
[Mar 20, 2023 2:29:20 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Cyclops
Senior Cruncher
Joined: Jun 13, 2022
Post Count: 295
Status: Offline
Re: Hardware Recovery Update

Can we get a status update of the recovery?
I look at twitter for updates and this is the last update I see from the 10th of March:
Update: The storage server was revived yesterday late afternoon. Both database filesystems mounted as before, but the science filesystem did not. It needs a repair; erasing the old log first.

Hi shauge, we are working on the issue and will provide another update when we have substantial news to share. Thanks for your patience.
[Mar 20, 2023 2:30:19 PM]   Link   Report threatening or abusive post: please login first  Go to top 
deltavee
Ace Cruncher
Texas Hill Country
Joined: Nov 17, 2004
Post Count: 4884
Status: Offline
Project Badges:
Re: Hardware Recovery Update

Goodbye everyone.
----------------------------------------
4849
[Mar 20, 2023 3:32:14 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 196   Pages: 20   [ Previous Page | 7 8 9 10 11 12 13 14 15 16 | Next Page ]
[ Jump to Last Post ]
Post new Thread