Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 33
|
![]() |
Author |
|
keputnam
Cruncher Joined: Aug 25, 2014 Post Count: 19 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Before they lose all credibility?
They lost that when they shut down the old server before the new one was fully tested |
||
|
TPCBF
Master Cruncher USA Joined: Jan 2, 2011 Post Count: 1957 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Before they lose all credibility? Nah, that wasn't possible. But the whole move could have been better planned (from the announcement to the shutdown) and executed (after the shutdown). This all would always have involved some downtime. But the amount of downtime and the haphazard approach of getting things up and running could have been better handled, even with more limited resources as under IBM...They lost that when they shut down the old server before the new one was fully tested Ralf ![]() |
||
|
Paul Schlaffer
Senior Cruncher USA Joined: Jun 12, 2005 Post Count: 245 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Hi, And thus risking that a possible DDoS thread is being added to the heap of issues... :(yep, download isn't well working. use the really dirty way and create a crontask with "/usr/bin/boinccmd --network_available" every 10sec (no need to run it as root) or with windows : "C:\Program Files\boinc\boinccmd" --network_available The quotes are required because of the space between Program and Files. https://www.worldcommunitygrid.org/forums/wcg/viewpostinthread?post=675622 Ralf I agree. Setting up a script to hammer the server every 10 seconds doesn't help the problem, it creates a new one. Hopefully people are wise enough not to go down that road. ![]() “Where an excess of power prevails, property of no sort is duly respected. No man is safe in his opinions, his person, his faculties, or his possessions.” – James Madison (1792) |
||
|
HyperComputing
Advanced Cruncher Joined: Aug 10, 2019 Post Count: 74 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Great, there will be more bandwidth for downloads.
----------------------------------------
My GPU compute WUs as fast as I'm crushing bubble wrap.
|
||
|
KerSamson
Master Cruncher Switzerland Joined: Jan 29, 2007 Post Count: 1677 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Setting up a script to hammer the server every 10 seconds doesn't help the problem, it creates a new one. Hopefully people are wise enough not to go down that road. The problem is that the only 2 machines I have not running the script at this time are running dry because of incomplete download during days. The whole situation is totally creepy. What I don't understand is the reason why Krembil is unable to solve the problem in a timely manner. The problem is so huge and constantly occurring that it should not be as difficult to identify it and to search for the root cause(s) and to solve it. Is the Krembil staff finally able to perform failure analysis and to conduct investigation? I am fully aware that IT can be very complex and failure solving can be demanding. However regarding WCG we speak about BIG failures permanently occurring since weekS / months. In an industrial context it would be simply impossible! The "hard WCG core" is willing to help, to actively contribute to failure investigation, to share knowledge ... instead of leveraging this willingness, members are ignored and they have to pay electricity bill for nothing. In June 2020, one of my hosts experienced a very strange issue. After reporting the trouble, I had direct contact with Keith for performing several tests until we solved the problem. It occurred very rarely and it was accordingly relatively difficult to reproduce. Nevertheless after a couple of tests, validating step-by-step the root cause identification and afterwards the solution, we were able to solve the problem within less than 10 days, mostly because I was not full-time available for performing the tests faster. We expect such an openess and a willingness to solve TOGETHER the problems. Cheers, Yves ---------------------------------------- [Edit 2 times, last edit by KerSamson at Oct 6, 2022 7:23:22 PM] |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7697 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
KerSamson: +1
----------------------------------------Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
Eirik Redd
Cruncher Joined: Dec 28, 2008 Post Count: 5 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
This evening unlike the last three weeks, downloads from Krembil's version of WCG are working a little bit better. A little bit. Most downloads of the MCM fail at least three or six times after showing 107 bytes downloaded and then -- download failed. Connected, downloaded 107 bytes ou of 3--hundred bytes, failed, retry in an hour? WTF
Bigger downloads, when they don't fail on the first few bytes, which mostly they fail, then the downloads succeed IBM left Krembil with a serious problem. I'm a senile (elderly) person who sometimes hits the retry again :) (and again and again each minute) button on the downloads. Some people might think that is looking like paint dry. And the downloads are O so slow, better than dial-up, but so slow, I have a lously 50Mibi adsl but downloads only about 50 Kibi:? not good Did IBM dump WCG on Krembil with no hardware or software support? Kinda looks like it - but I dunno,. whatever, I keep downloading (super slow) and supporting WCG |
||
|
Eirik Redd
Cruncher Joined: Dec 28, 2008 Post Count: 5 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Right you are
|
||
|
Eirik Redd
Cruncher Joined: Dec 28, 2008 Post Count: 5 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Yup, it is a great annoyance.
"What I don't understand is the reason why Krembil is unable to solve the problem in a timely manner." Did IBM dump WCG on Krembil? with no hardware of software support?? Just asking :) Duh :) I do own a very few shares of IBM, not enough to be heard at the next AGM. |
||
|
Just1vet
Cruncher Joined: Nov 9, 2005 Post Count: 25 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
You can see by my stats I take this hobby seriously.
I only have one machine on out of my entire farm. It has 32 hungry cores. It is impossible to keep those cores full. Or even close to full. Actually, I am very lucky to even have 3 cores crunching. I'm not even going to bring the entire farm online until my one machine that is on, operates correctly. |
||
|
|
![]() |