Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 74
Posts: 74   Pages: 8   [ Previous Page | 1 2 3 4 5 6 7 8 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 580793 times and has 73 replies Next Thread
Jurisica
World Community Grid Admin, Mapping Cancer Markers and Help Conquer Cancer Scientist
Joined: Feb 28, 2007
Post Count: 87
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: 2023-07-31 Update (MCM1 issue resolved)

Thank you Mike - you have summarized it extremely well.

As my lab has benefit from the Grid on 2 projects - we wanted to save the WCG - but finding resources to run is harder than it seemed. Not only some grants (RFP - request for proposals) that were supposed to happen 2 years ago - never did; Grid is larger than any funding organization wants to consider (it is not cancer only, it is not climate only, ..., it is not computational only) and it is not US or Canada only ...
our institution does not provide any support for it (no space, people, $, ..)
(and as you know - science teams do not pay for the service either).

We continue to run it on the side with the funds I have -- but as it is -- some volunteers complaining about lack of communications, lack of technical expertise and response, lack of resources is like me complaining to some of the volunteers why do the not run 1000 CPUs from their basement (i know some are close) - or why do they go on vacation. we are as much volunteers on this as any of other volunteers.


But hopefully, together we will make it work.

thank you
igor
[Aug 11, 2023 12:00:55 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Jurisica
World Community Grid Admin, Mapping Cancer Markers and Help Conquer Cancer Scientist
Joined: Feb 28, 2007
Post Count: 87
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: 2023-07-31 Update (MCM1 issue resolved)

Thank you Grumpy Swede - not sure about the issue over the last weekend (long weekend in Canada) - but MCM and SCC seems to be steady in supply. We will have a look - could still be the issue we had about week ago - which forced us to modify the WU scheduler. Since stats are now fixed (hopefully) - we can look more into the WUs

thanks
igor
[Aug 11, 2023 12:03:58 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Jurisica
World Community Grid Admin, Mapping Cancer Markers and Help Conquer Cancer Scientist
Joined: Feb 28, 2007
Post Count: 87
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: 2023-07-31 Update (MCM1 issue resolved)

Ralf - to clarify - there are guidelines that we adopted from IBM. Some volunteers were moderated while some were black listed for violating them (being critical is not one of those reasons). We continue this policy. It is not the communication person (as we do not have a permanent staff) who is responsible.

Also - each working day moderated postings are reviewed - and no post was fully deleted. It may be that weekend created some delay.


thanks
igor
[Aug 11, 2023 12:10:35 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7697
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: 2023-07-31 Update (MCM1 issue resolved)

Dr. Jurisica:

Thank you for the information. Many of us are aware your resources are stretched. Your staff deserves any time off they get. However, good communication should be cheap, quick and timely. A mere sentence or two in the forums would be welcomed to acknowledge a problem or outage.
Thank you for your time and efforts.
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Aug 11, 2023 2:34:33 AM]   Link   Report threatening or abusive post: please login first  Go to top 
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 1957
Status: Recently Active
Project Badges:
Reply to this Post  Reply with Quote 
Re: 2023-07-31 Update (MCM1 issue resolved)

Ralf - to clarify - there are guidelines that we adopted from IBM. Some volunteers were moderated while some were black listed for violating them (being critical is not one of those reasons). We continue this policy. It is not the communication person (as we do not have a permanent staff) who is responsible.

Also - each working day moderated postings are reviewed - and no post was fully deleted. It may be that weekend created some delay.


thanks
igor
Sorry, but this is NOT correct.

I am not talking about delays over a weekend, as bad that is by itself. There have been sometime delays by a week or more. And yes, some messages have completely disappeared as well. And if those posts then are being released after being "reviewed", why is it that there is for days no acknowledgement of reported issues until all the sudden, days after an initial post (attempt) and subsequent multiple posts to the same issue, there is acknowledgement or at least a "we are investigating" post from your communication person, attributing the last, random person to mention that same issue, as if it is the first time anyone at WCG Towers has heard about it?
Why, if those emails are being indeed reviewed "every working day", did some of those posts where I have tried to address the "communication person" in a PS to the message, been posted verbatim? And that particular part not being "partially" deleted?

I don't think that there was ever a real reason for any moderation in the first place, unless it was simply in order to silence someone who is a bit more critical than others and just doesn't swallow every platitude and/or lie/misinformation from your "communication person". And certainly not for months on end. This is practically censorship.

If you would bother to read the forum messages, the number one concern, from dozens of people, for more than a year now, is your utter lack of communication. I am not the only one who is complaining about this. A lot of long time volunteers have given up over that and left, not only because of your unfortunate streak of "bad luck" ever since you took over the project. That basic form of communication what we ask for doesn't cost a fortrune. Mostly just a real will to improve the situation. Not just being satisfied with the status quo.

And it is not only on the forum. There is the same lack of information on Facebook and Twitter as well. And just as many others complaining there about the exact same main issue: total lack of communication on your part.

And as I mentioned in my previous post, your "support" email isn't working pretty much since end of last year! Also a fact that a lot of people on the forum complain about. Not just me.It wouldn't take me more than an hour to cobble together a mailbag server that then would be able to relay emails to and from who ever the "tech person(s)" for WCG is/are.But again, total crickets.

And as I mentioned in my previous post, with that lack of communication, it is quite conceivable that any person/entity otherwise inclined to provide the necessary funding will think at least twice about doing so.

And on the issue of Krembil not supporting the project, you have completely ignored my question, months ago, why Krembil then gets to plaster their name all over the place?

And no, it doesn't seem that you appreciate your volunteers, there have been several people over the last 18 months offered to actively help, having technical expertise, at times for decades, and thus don't just swallow any fish tale that is being put out there. For 8-9 months not a single word, just when the <beep> finally hit the fan and the whole show fell down because of a hardware failure back in spring, just after that, you bothered to tell us how much in need of money you are and that you wouldn't let just anyone actively help you out. Yes, when one of the fish tales being presented recently is that "some DHCP client on the servers failed", sorry, I am again not the only one who is questioning the technical expertise on your end.
No, I do not run a 1000 CPUs out of a basement, it's rather for more than 12 years now on average two dozen everyday working computers that provide spare resources for what seems worthy causes, switching from other projects after having a lucky close brush with stomach cancer the year before. Yes, there are a few pointw****s who can't get enough points and thus think they have to run a server farm just to crunch on DC projects, but I am sure that there are far more people like me, who do this out of less selfish reasons. And there is no need to mock those people then, who are not asking for more than just honesty and more timely communication from your end.

Ralf
----------------------------------------

[Aug 11, 2023 4:57:37 AM]   Link   Report threatening or abusive post: please login first  Go to top 
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 1957
Status: Recently Active
Project Badges:
Reply to this Post  Reply with Quote 
Re: 2023-07-31 Update (MCM1 issue resolved)

Thank you Grumpy Swede - not sure about the issue over the last weekend (long weekend in Canada) - but MCM and SCC seems to be steady in supply. We will have a look - could still be the issue we had about week ago - which forced us to modify the WU scheduler. Since stats are now fixed (hopefully) - we can look more into the WUs

thanks
igor
Again, not really an honest answer.
I had (tried to) posted on Friday, BEFORE the long weekend, that I noticed that the number o newf WUs started to decline and first upload errors showed up, as that has happened now repeatedly, just in time for another weekend. Yes, a lot of the people didn't immediately notice that this would be a long weekend in Canada, but even an acknowledgement didn't happen until Wednesday, after WUs started to show up again. And our Swedish friend had posted about this about an hour or two after my intended post (which I could not find looking for it, again) about that issue on Friday. There certainly was from Friday night until Tuesday morning no supply (beside a handful of resends among several volunteers) of either MCM1 nor SCC1. I might simply notice this a bit earlier than some others, who just think they need to run with days worth of cached WUs and might simply not notice such things until well after the problem started. That was one of the reasons why I complained about this habit of some of my fellow volunteers that this hoarding of WUs is a really bad idea.

Likewise, my post (and trying to send an email) on the fact that the external stats seem to have not being processed at some point on Thursday July 20th, almost a day before the system took an unexpected dive days before the previously announced data center outage was not acknowledged until well after the system was back up, and then only by responding another volunteer (among others) to mention this yet again days later. At least something in that regard something happened tonight, when I got back from work.

Ralf
----------------------------------------

[Aug 11, 2023 5:22:44 AM]   Link   Report threatening or abusive post: please login first  Go to top 
NixChix
Veteran Cruncher
United States
Joined: Apr 29, 2007
Post Count: 1187
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: 2023-07-31 Update (MCM1 issue resolved)

Dr. Jurisica:

Thank you for the information. Many of us are aware your resources are stretched. Your staff deserves any time off they get. However, good communication should be cheap, quick and timely. A mere sentence or two in the forums would be welcomed to acknowledge a problem or outage.
Thank you for your time and efforts.
Cheers

+1
----------------------------------------

[Aug 11, 2023 7:24:12 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Falconet
Master Cruncher
Portugal
Joined: Mar 9, 2009
Post Count: 3295
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: 2023-07-31 Update (MCM1 issue resolved)

Bringing back Community Advisers could be very helpful regarding communication.
----------------------------------------


AMD Ryzen 5 1600AF 6C/12T 3.2 GHz - 85W
AMD Ryzen 5 2500U 4C/8T 2.0 GHz - 28W
AMD Ryzen 7 7730U 8C/16T 3.0 GHz
[Aug 11, 2023 1:19:37 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Spiderman
Advanced Cruncher
United States
Joined: Jul 13, 2020
Post Count: 117
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: 2023-07-31 Update (MCM1 issue resolved)

"...On my part, this is the only way to reach someone at WCG, as their support email isn't working, since end of last year now!..."


1) I haven't needed to contact them since last month, but my last reply from WCG Support was July 7th, so email *is* working. Prior to that I have replies from them in May and June of this year as well.

2) I have a very high opinion of TigerLily and believe she is truly helping. Dr Jurisica summed it up well in the paragraphs above/below this thread concerning his folks' efforts.

[Feedback]

3) @Ralf, if you find yourself being moderated, perhaps that might be a reflection point to think about. I find your comments & tone often to be rather caustic to the point of being unbecoming of someone of your intellect. Perhaps if you'd tone it down and not interject such negativity, your posts would have much more impact. Speaking from 45-years of I/S background and working with Teams, I think you have the potential to be an excellent leader, but the sarcasm sinks the boat. Everyone on the forum doesn't *have* to publicly know your moderation issues.

[/Feedback]

--

I have a firm belief that we will look back on these early days in coming years and realize just what great effort Jurisica Labs did on a shoestring budget.

I started with a single machine 3-yrs ago. Today I have 15 machines running 24x7, and have plans to continue adding as I can.

WCG thanks!
----------------------------------------
[Edit 1 times, last edit by Spiderman at Aug 12, 2023 12:16:30 AM]
[Aug 11, 2023 5:40:02 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Aperture_Science_Innovators
Advanced Cruncher
United States
Joined: Jul 6, 2009
Post Count: 139
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: 2023-07-31 Update (MCM1 issue resolved)

Thank you Mike - you have summarized it extremely well.

As my lab has benefit from the Grid on 2 projects - we wanted to save the WCG - but finding resources to run is harder than it seemed. Not only some grants (RFP - request for proposals) that were supposed to happen 2 years ago - never did; Grid is larger than any funding organization wants to consider (it is not cancer only, it is not climate only, ..., it is not computational only) and it is not US or Canada only ...
our institution does not provide any support for it (no space, people, $, ..)
(and as you know - science teams do not pay for the service either).

We continue to run it on the side with the funds I have -- but as it is -- some volunteers complaining about lack of communications, lack of technical expertise and response, lack of resources is like me complaining to some of the volunteers why do the not run 1000 CPUs from their basement (i know some are close) - or why do they go on vacation. we are as much volunteers on this as any of other volunteers.


But hopefully, together we will make it work.

thank you
igor



Thank you, as always, for keeping WCG going. A WCG that faces intermittent technical difficulties is far better than no WCG. Thanks for detailing some of the challenges you've faced. It hadn't occurred to me that by being a broader project, it would make it harder to find funding, but makes sense when you put it like that.
----------------------------------------

[Aug 11, 2023 6:14:46 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 74   Pages: 8   [ Previous Page | 1 2 3 4 5 6 7 8 | Next Page ]
[ Jump to Last Post ]
Post new Thread