Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 62
|
![]() |
Author |
|
Eric_Kaiser
Veteran Cruncher Germany (Hessen) Joined: May 7, 2013 Post Count: 1047 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Thanks for looking into this issue and fixing it.
----------------------------------------![]() |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Looks like we are having file transfer problems again. I'm getting the same HTTP errors as described above. On attempting retries, some complete the download and some fail. Both upload and download files are being impacted
|
||
|
JMrkvicka
Cruncher United States Joined: Jul 14, 2005 Post Count: 33 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
My Uncle Yogi would say we've deja vu all over again...
----------------------------------------![]() |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
My uploads are jammed up again.
![]() Tue 28 Feb 2017 14:23:37 AEDT | | Internet access OK - project servers may be temporarily down. Tue 28 Feb 2017 14:23:40 AEDT | | Project communication failed: attempting access to reference site Tue 28 Feb 2017 14:23:40 AEDT | World Community Grid | Temporarily failed upload of FAHV_1002289_3j3q-7M-P4_Rigid_12526_0_r246707194_0: transient HTTP error Tue 28 Feb 2017 14:23:41 AEDT | | Internet access OK - project servers may be temporarily down. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
started here between 20:45 and 21:00 CDT. I just noticed it.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Another night, another set of upload/download issues....
----------------------------------------Getting to be monotonous.... Seems to always happen after the 0000 UTC stats run.... Is the server running out of sockets? [Edit 1 times, last edit by Doneske at Mar 1, 2017 3:54:35 AM] |
||
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Doneske,
I got alerts last night about the server having issues near the 0500-0600 UTC time. When I reviewed the server, I did not notice anything that was scheduled to run during that time on the machine and the logs do not point to anything specific. I have refreshed the server that was struggling in hopes it clears something. We are still working on a root cause. Thanks, -Uplinger |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
It seemed to start about 8:45 PM CDT (0245 UTC) and I was still getting HTTP errors at 1:15AM CDT (0715). Then the errors ended. This was happening on all 19 machines and, just as before, some transfers would be successful and some wouldn't be. It looks like an attempt is made to transfer (upload or download) and sometimes a connection is made and sometimes not.
----------------------------------------It just seems suspicious that the system runs all day with few or no errors, at least on my machines, and then right after the 0000 UTC stats activity, we start getting these upload/download errors. Another thing I've noticed in the past is the Website runs slightly slower after the stats for a period of time. It may not be the Webserver itself but maybe the back end pieces it connects to in order to display data like the DB maybe. IT just feels like something is happening on the infrastructure right after the stats process. If all the network sockets are in use when the attempt to transfer happens, it would get an error. Not knowing how the server piece is configured, you may have stated that BOINC could handle 1000 connections but the underlying UNIX networking is only configured to handle 512 max socket connections in the TCPIP configuration. The most connections would be limited to 512 and that probably wouldn't show up in the BOINC logs but might show in the UNIX syslog. Maybe not. All this is just guessing on my part... [Edit 2 times, last edit by Doneske at Mar 1, 2017 3:23:12 PM] |
||
|
RTS48
Veteran Cruncher Bolivia Joined: Aug 2, 2009 Post Count: 1350 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I'm not having problems with the upload or download process but I notice that the User totals and averages do not update. Also that Free-DC and BAM stats seem to have got woefully behind.
----------------------------------------
Rod Peel
Santa Cruz Bolivia South America ![]() ![]() |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I'm not having problems with the upload or download process but I notice that the User totals and averages do not update. Also that Free-DC and BAM stats seem to have got woefully behind. as in the exports have not run since [DIR] Parent Directory - [ ] db_dump.xml 28-Feb-2017 00:36 681 [CMP] host.gz 28-Feb-2017 00:34 454M [ ] tables.xml 28-Feb-2017 00:36 5.5K [CMP] team.gz 28-Feb-2017 00:16 2.5M [CMP] user.gz 28-Feb-2017 00:16 33M |
||
|
|
![]() |