Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 148
|
![]() |
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Nine on one desktop, 8 valid, 1 pending, runtime range from 2.81 to 7.23, the longest off batch 0026. Batch 0025 and 0028 ran fairly equal.
Sgt.joe, was that heartbeat problem, ahum, on linux mint, iirc. |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7697 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Sgt.joe, was that heartbeat problem, ahum, on linux mint, iirc. Absolutely on Linux mint. I think I have it pinned down to times when my range extender(wireless) has lost its connection and attempting to re-establish communication. During this time cpu usage will drop to zero until either communication is re-established or it gives up trying. I have seen this with numerous MCM1 units, but they seem to recover without a problem and become valid, even if this occurs multiple times. Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
Jason1478963
Senior Cruncher United States Joined: Sep 18, 2005 Post Count: 295 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Sgt.Joe we have seen that with our linux machines as well. Its even worse when it happens with CEP2 as it can error your whole cache. This seems to happen to me anytime my client cant find the internet on the linux machines.
----------------------------------------![]() |
||
|
KWSN - A Shrubbery
Master Cruncher Joined: Jan 8, 2006 Post Count: 1585 Status: Offline |
Yes, had to completely abandon wireless on any Xnix machine. Any instability in the connection at all crashes out the work. It's bad enough when my wired internet goes down. They don't error immediately because the machines can see the router, but it does cause all sorts of havoc.
----------------------------------------![]() Distributed computing volunteer since September 27, 2000 |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7697 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Well so far I have been unable to find a solution, but wireless is my only option for 5 of my machines. At least MCM1 appears to deal with the issue the vast majority of the time. If anyone has a fix I would love to try it.
----------------------------------------Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Now wired the linux ubuntu with wrapped ethernet cable to the windows machine which never has this issue with wifi, then set windows to share the internet connection. After that it's completely configuration free, the linux on a different ip range connects to windows, where windows acts as the router with the 192.xxx.yyy.1 ip. What I'm suggesting is, if you have the linux devices nearby and a single windows up close, connect the linuxes to a router by cable and the router to the windows machine and let that act as internet sharing device. Somewhere there was also mention of running a power off command with iwconfig to stop energy saving kicking in, i.e. your wifi not going to sleep. Then it transmits on maximum all the time.
But here's an entirely different question to technicians or close beta observers. Is this project same as mcm using symbol linking to the project folder? Iow, there's only one copy of the application software and various other permanent/semi-permanent data files in the project folder and each task slot simply has an symbolic link to those files, so no copying is required, something we really really would like to see for cep2. |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7697 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Now wired the linux ubuntu with wrapped ethernet cable to the windows machine which never has this issue with wifi, then set windows to share the internet connection. After that it's completely configuration free, the linux on a different ip range connects to windows, where windows acts as the router with the 192.xxx.yyy.1 ip. What I'm suggesting is, if you have the linux devices nearby and a single windows up close, connect the linuxes to a router by cable and the router to the windows machine and let that act as internet sharing device. Somewhere there was also mention of running a power off command with iwconfig to stop energy saving kicking in, i.e. your wifi not going to sleep. Then it transmits on maximum all the time. Unfortunately I do not have any windows machines at that location. Maybe I need to put one there. Currently all the machines are wired through a switch and then to the range extender. They are all controlled through a kvm box. Thanks for the tip. I tried messing with the iwconfig settings but there was no effect. Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Just noticed that a beta repair job had slipped in, all the 0's in batch 25
----------------------------------------![]() BETA_ ugm1_ ugm1_ 00025_ 0000_ 3-- - In Progress 10/10/14 14:56:43 12/10/14 00:32:42 0.00 0.0 / 0.0 BETA_ ugm1_ ugm1_ 00025_ 0000_ 2-- 721 Error 10/10/14 14:54:36 10/10/14 14:56:41 0.00 145.1 / 0.0 BETA_ ugm1_ ugm1_ 00025_ 0000_ 1-- 721 Pending Validation 06/10/14 14:54:35 07/10/14 00:18:19 5.06 104.7 / 0.0 BETA_ ugm1_ ugm1_ 00025_ 0000_ 0-- - No Reply 06/10/14 14:54:34 10/10/14 14:54:34 0.00 0.0 / 0.0 The Result Log for the errored unit is: <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> Can't get shared memory segment name: shmget() failed </message> _1 log looks normal. _3 is progressing ok, checkpoints every 2 minutes. Edit: now Valid. [Edit 1 times, last edit by Former Member at Oct 10, 2014 7:07:38 PM] |
||
|
yoro42
Ace Cruncher United States Joined: Feb 19, 2011 Post Count: 8979 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Wingmen for these six will change on October 11th at approximately 17:23 when the deadline is reached. Hope you get a reissue. Good luck!
----------------------------------------BETA_ ugm1_ ugm1_ 00026_ 0352_ 0-- Pending Validation 10/7/14 17:22:55 10/8/14 03:24:45 9.71 / 9.87 204.1 / 0.0 BETA_ ugm1_ ugm1_ 00026_ 0370_ 0-- Pending Validation 10/7/14 17:22:55 10/7/14 23:03:03 5.46 / 5.52 147.1 / 0.0 BETA_ ugm1_ ugm1_ 00026_ 0340_ 0-- Pending Validation 10/7/14 17:22:53 10/8/14 00:20:08 6.56 / 6.87 145.5 / 0.0 BETA_ ugm1_ ugm1_ 00026_ 0298_ 0-- Pending Validation 10/7/14 17:22:51 10/8/14 07:59:46 6.66 / 6.66 155.5 / 0.0 BETA_ ugm1_ ugm1_ 00026_ 0324_ 0-- Pending Validation 10/7/14 17:22:51 10/8/14 01:02:45 7.44 / 7.53 127.3 / 0.0 BETA_ ugm1_ ugm1_ 00026_ 0272_ 0-- Pending Validation 10/7/14 17:22:49 10/8/14 03:25:16 9.32 / 9.88 167.2 / 0.0 BETA_ ugm1_ ugm1_ 00026_ 0231_ 1-- Pending Validation 10/7/14 17:22:49 10/8/14 03:58:01 9.67 / 10.42 176.5 / 0.0 ![]() |
||
|
seippel
Former World Community Grid Tech Joined: Apr 16, 2009 Post Count: 392 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
More work units are now being released. The application version is 7.22. This beta is a potential fix for some of the sizing issues we saw in the last round of beta with some work units.
Seippel |
||
|
|
![]() |