Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 33
Posts: 33   Pages: 4   [ Previous Page | 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 5310 times and has 32 replies Next Thread
keputnam
Cruncher
Joined: Aug 25, 2014
Post Count: 19
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Download Issues, Redux (or why I gave up on BOINC last time)

Before they lose all credibility?

They lost that when they shut down the old server before the new one was fully
tested
[Oct 5, 2022 9:52:20 PM]   Link   Report threatening or abusive post: please login first  Go to top 
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 1957
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Download Issues, Redux (or why I gave up on BOINC last time)

Before they lose all credibility?

They lost that when they shut down the old server before the new one was fully
tested
Nah, that wasn't possible. But the whole move could have been better planned (from the announcement to the shutdown) and executed (after the shutdown). This all would always have involved some downtime. But the amount of downtime and the haphazard approach of getting things up and running could have been better handled, even with more limited resources as under IBM...

Ralf
----------------------------------------

[Oct 5, 2022 10:05:55 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Paul Schlaffer
Senior Cruncher
USA
Joined: Jun 12, 2005
Post Count: 245
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Download Issues, Redux (or why I gave up on BOINC last time)

Hi,
yep, download isn't well working.

use the really dirty way and create a crontask with
"/usr/bin/boinccmd --network_available" every 10sec
(no need to run it as root)

or with windows :
"C:\Program Files\boinc\boinccmd" --network_available
The quotes are required because of the space between Program and Files.

https://www.worldcommunitygrid.org/forums/wcg/viewpostinthread?post=675622
And thus risking that a possible DDoS thread is being added to the heap of issues... :(

Ralf


I agree. Setting up a script to hammer the server every 10 seconds doesn't help the problem, it creates a new one. Hopefully people are wise enough not to go down that road.
----------------------------------------

“Where an excess of power prevails, property of no sort is duly respected. No man is safe in his opinions, his person, his faculties, or his possessions.” – James Madison (1792)
[Oct 5, 2022 11:24:04 PM]   Link   Report threatening or abusive post: please login first  Go to top 
HyperComputing
Advanced Cruncher
Joined: Aug 10, 2019
Post Count: 74
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Download Issues, Redux (or why I gave up on BOINC last time)

Great, there will be more bandwidth for downloads.
----------------------------------------
My GPU compute WUs as fast as I'm crushing bubble wrap.
[Oct 6, 2022 1:47:29 AM]   Link   Report threatening or abusive post: please login first  Go to top 
KerSamson
Master Cruncher
Switzerland
Joined: Jan 29, 2007
Post Count: 1677
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Download Issues, Redux (or why I gave up on BOINC last time)

Setting up a script to hammer the server every 10 seconds doesn't help the problem, it creates a new one. Hopefully people are wise enough not to go down that road.

The problem is that the only 2 machines I have not running the script at this time are running dry because of incomplete download during days.
The whole situation is totally creepy.
What I don't understand is the reason why Krembil is unable to solve the problem in a timely manner.
The problem is so huge and constantly occurring that it should not be as difficult to identify it and to search for the root cause(s) and to solve it.
Is the Krembil staff finally able to perform failure analysis and to conduct investigation?
I am fully aware that IT can be very complex and failure solving can be demanding. However regarding WCG we speak about BIG failures permanently occurring since weekS / months.
In an industrial context it would be simply impossible!
The "hard WCG core" is willing to help, to actively contribute to failure investigation, to share knowledge ... instead of leveraging this willingness, members are ignored and they have to pay electricity bill for nothing.

In June 2020, one of my hosts experienced a very strange issue. After reporting the trouble, I had direct contact with Keith for performing several tests until we solved the problem. It occurred very rarely and it was accordingly relatively difficult to reproduce. Nevertheless after a couple of tests, validating step-by-step the root cause identification and afterwards the solution, we were able to solve the problem within less than 10 days, mostly because I was not full-time available for performing the tests faster.

We expect such an openess and a willingness to solve TOGETHER the problems.
Cheers,
Yves
----------------------------------------
----------------------------------------
[Edit 2 times, last edit by KerSamson at Oct 6, 2022 7:23:22 PM]
[Oct 6, 2022 7:15:58 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7697
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Download Issues, Redux (or why I gave up on BOINC last time)

KerSamson: +1

Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Oct 6, 2022 8:00:15 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Eirik Redd
Cruncher
Joined: Dec 28, 2008
Post Count: 5
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Download Issues, Redux (or why I gave up on BOINC last time)

This evening unlike the last three weeks, downloads from Krembil's version of WCG are working a little bit better. A little bit. Most downloads of the MCM fail at least three or six times after showing 107 bytes downloaded and then -- download failed. Connected, downloaded 107 bytes ou of 3--hundred bytes, failed, retry in an hour? WTF
Bigger downloads, when they don't fail on the first few bytes, which mostly they fail, then the downloads succeed
IBM left Krembil with a serious problem. I'm a senile (elderly) person who sometimes hits the retry again :) (and again and again each minute) button on the downloads. Some people might think that is looking like paint dry.
And the downloads are O so slow, better than dial-up, but so slow, I have a lously 50Mibi adsl but downloads only about 50 Kibi:?
not good
Did IBM dump WCG on Krembil with no hardware or software support? Kinda looks like it - but I dunno,.
whatever, I keep downloading (super slow) and supporting WCG
[Oct 7, 2022 1:12:20 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Eirik Redd
Cruncher
Joined: Dec 28, 2008
Post Count: 5
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Download Issues, Redux (or why I gave up on BOINC last time)

Right you are
[Oct 7, 2022 1:17:13 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Eirik Redd
Cruncher
Joined: Dec 28, 2008
Post Count: 5
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Download Issues, Redux (or why I gave up on BOINC last time)

Yup, it is a great annoyance.
"What I don't understand is the reason why Krembil is unable to solve the problem in a timely manner."

Did IBM dump WCG on Krembil? with no hardware of software support?? Just asking :)
Duh :)

I do own a very few shares of IBM, not enough to be heard at the next AGM.
[Oct 7, 2022 1:31:30 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Just1vet
Cruncher
Joined: Nov 9, 2005
Post Count: 25
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Download Issues, Redux (or why I gave up on BOINC last time)

You can see by my stats I take this hobby seriously.
I only have one machine on out of my entire farm. It has 32 hungry cores. It is impossible to keep those cores full. Or even close to full. Actually, I am very lucky to even have 3 cores crunching.

I'm not even going to bring the entire farm online until my one machine that is on, operates correctly.
[Oct 7, 2022 2:11:38 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 33   Pages: 4   [ Previous Page | 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread