Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 148
Posts: 148   Pages: 15   [ Previous Page | 2 3 4 5 6 7 8 9 10 11 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 17323 times and has 147 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

Nine on one desktop, 8 valid, 1 pending, runtime range from 2.81 to 7.23, the longest off batch 0026. Batch 0025 and 0028 ran fairly equal.

Sgt.joe, was that heartbeat problem, ahum, on linux mint, iirc.
[Oct 9, 2014 1:42:09 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7697
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

Sgt.joe, was that heartbeat problem, ahum, on linux mint, iirc.


Absolutely on Linux mint. I think I have it pinned down to times when my range extender(wireless) has lost its connection and attempting to re-establish communication. During this time cpu usage will drop to zero until either communication is re-established or it gives up trying. I have seen this with numerous MCM1 units, but they seem to recover without a problem and become valid, even if this occurs multiple times.

Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Oct 9, 2014 10:47:57 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Jason1478963
Senior Cruncher
United States
Joined: Sep 18, 2005
Post Count: 295
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

Sgt.Joe we have seen that with our linux machines as well. Its even worse when it happens with CEP2 as it can error your whole cache. This seems to happen to me anytime my client cant find the internet on the linux machines.
----------------------------------------

[Oct 10, 2014 12:04:52 AM]   Link   Report threatening or abusive post: please login first  Go to top 
KWSN - A Shrubbery
Master Cruncher
Joined: Jan 8, 2006
Post Count: 1585
Status: Offline
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

Yes, had to completely abandon wireless on any Xnix machine. Any instability in the connection at all crashes out the work. It's bad enough when my wired internet goes down. They don't error immediately because the machines can see the router, but it does cause all sorts of havoc.
----------------------------------------

Distributed computing volunteer since September 27, 2000
[Oct 10, 2014 12:30:32 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7697
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

Well so far I have been unable to find a solution, but wireless is my only option for 5 of my machines. At least MCM1 appears to deal with the issue the vast majority of the time. If anyone has a fix I would love to try it.
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Oct 10, 2014 1:30:46 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

Now wired the linux ubuntu with wrapped ethernet cable to the windows machine which never has this issue with wifi, then set windows to share the internet connection. After that it's completely configuration free, the linux on a different ip range connects to windows, where windows acts as the router with the 192.xxx.yyy.1 ip. What I'm suggesting is, if you have the linux devices nearby and a single windows up close, connect the linuxes to a router by cable and the router to the windows machine and let that act as internet sharing device. Somewhere there was also mention of running a power off command with iwconfig to stop energy saving kicking in, i.e. your wifi not going to sleep. Then it transmits on maximum all the time.

But here's an entirely different question to technicians or close beta observers. Is this project same as mcm using symbol linking to the project folder? Iow, there's only one copy of the application software and various other permanent/semi-permanent data files in the project folder and each task slot simply has an symbolic link to those files, so no copying is required, something we really really would like to see for cep2.
[Oct 10, 2014 7:24:12 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7697
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

Now wired the linux ubuntu with wrapped ethernet cable to the windows machine which never has this issue with wifi, then set windows to share the internet connection. After that it's completely configuration free, the linux on a different ip range connects to windows, where windows acts as the router with the 192.xxx.yyy.1 ip. What I'm suggesting is, if you have the linux devices nearby and a single windows up close, connect the linuxes to a router by cable and the router to the windows machine and let that act as internet sharing device. Somewhere there was also mention of running a power off command with iwconfig to stop energy saving kicking in, i.e. your wifi not going to sleep. Then it transmits on maximum all the time.

Unfortunately I do not have any windows machines at that location. Maybe I need to put one there. Currently all the machines are wired through a switch and then to the range extender. They are all controlled through a kvm box. Thanks for the tip.
I tried messing with the iwconfig settings but there was no effect.
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Oct 10, 2014 10:55:49 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

Just noticed that a beta repair job had slipped in, all the 0's in batch 25 cool.

BETA_ ugm1_ ugm1_ 00025_ 0000_ 3-- - In Progress 10/10/14 14:56:43 12/10/14 00:32:42 0.00 0.0 / 0.0
BETA_ ugm1_ ugm1_ 00025_ 0000_ 2-- 721 Error 10/10/14 14:54:36 10/10/14 14:56:41 0.00 145.1 / 0.0
BETA_ ugm1_ ugm1_ 00025_ 0000_ 1-- 721 Pending Validation 06/10/14 14:54:35 07/10/14 00:18:19 5.06 104.7 / 0.0
BETA_ ugm1_ ugm1_ 00025_ 0000_ 0-- - No Reply 06/10/14 14:54:34 10/10/14 14:54:34 0.00 0.0 / 0.0

The Result Log for the errored unit is:

<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
Can't get shared memory segment name: shmget() failed
</message>

_1 log looks normal.

_3 is progressing ok, checkpoints every 2 minutes. Edit: now Valid.
----------------------------------------
[Edit 1 times, last edit by Former Member at Oct 10, 2014 7:07:38 PM]
[Oct 10, 2014 4:28:52 PM]   Link   Report threatening or abusive post: please login first  Go to top 
yoro42
Ace Cruncher
United States
Joined: Feb 19, 2011
Post Count: 8979
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

Wingmen for these six will change on October 11th at approximately 17:23 when the deadline is reached. Hope you get a reissue. Good luck!

BETA_ ugm1_ ugm1_ 00026_ 0352_ 0-- Pending Validation 10/7/14 17:22:55 10/8/14 03:24:45 9.71 / 9.87 204.1 / 0.0
BETA_ ugm1_ ugm1_ 00026_ 0370_ 0-- Pending Validation 10/7/14 17:22:55 10/7/14 23:03:03 5.46 / 5.52 147.1 / 0.0
BETA_ ugm1_ ugm1_ 00026_ 0340_ 0-- Pending Validation 10/7/14 17:22:53 10/8/14 00:20:08 6.56 / 6.87 145.5 / 0.0
BETA_ ugm1_ ugm1_ 00026_ 0298_ 0-- Pending Validation 10/7/14 17:22:51 10/8/14 07:59:46 6.66 / 6.66 155.5 / 0.0
BETA_ ugm1_ ugm1_ 00026_ 0324_ 0-- Pending Validation 10/7/14 17:22:51 10/8/14 01:02:45 7.44 / 7.53 127.3 / 0.0
BETA_ ugm1_ ugm1_ 00026_ 0272_ 0-- Pending Validation 10/7/14 17:22:49 10/8/14 03:25:16 9.32 / 9.88 167.2 / 0.0
BETA_ ugm1_ ugm1_ 00026_ 0231_ 1-- Pending Validation 10/7/14 17:22:49 10/8/14 03:58:01 9.67 / 10.42 176.5 / 0.0
----------------------------------------

[Oct 11, 2014 4:08:08 AM]   Link   Report threatening or abusive post: please login first  Go to top 
seippel
Former World Community Grid Tech
Joined: Apr 16, 2009
Post Count: 392
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

More work units are now being released. The application version is 7.22. This beta is a potential fix for some of the sizing issues we saw in the last round of beta with some work units.

Seippel
[Oct 11, 2014 2:27:02 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 148   Pages: 15   [ Previous Page | 2 3 4 5 6 7 8 9 10 11 | Next Page ]
[ Jump to Last Post ]
Post new Thread