World Community Grid Forums
Category: Completed Research Forum: The Clean Energy Project - Phase 2 Forum Thread: [Resolved] CEP2 Validation Issues |
Thread Status: Active Total posts in this thread: 33
armstrdj
Former World Community Grid Tech Joined: Oct 21, 2004 Post Count: 695 Status: Offline Project Badges: |
We are aware of the validation issues with the new CEP2 workunits and are investigating. At this point we have all the data we need and are working on a solution.
Thanks, armstrdj [Edit 1 times, last edit by armstrdj at Aug 2, 2014 8:46:02 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hi
That has to be a mess to sort out; better you than me. What I have seen is strange. I do a work unit that comes up as an error, and every one after that (going on 6 now) also does. But then a work unit that ran in 1.00 hr and 0.52 hr and came up as an error on two different computers, I ran in 1.70 hr and it came up valid. I may have an older 2000 Lenovo at 1.67 GHz, but it seems to be doing very well. So I have to ask: 1) How much of this do you think is overclocking? 2) Could some computers made for speed and media performance be inherently unstable for real-world number crunching? I'll just say I'm not in this for a prize, just happy to help. Thanks for your time. |
||
|
armstrdj
Former World Community Grid Tech Joined: Oct 21, 2004 Post Count: 695 Status: Offline Project Badges: |
We have disabled the CEP2 validator for now while we get the changes made and the new one in place. Thanks for everyone's patience while we fix this issue.
Thanks, armstrdj |
||
|
Falconet
Master Cruncher Portugal Joined: Mar 9, 2009 Post Count: 3265 Status: Offline Project Badges: |
"Hi. That has to be a mess to sort out; better you than me. What I have seen is strange. I do a work unit that comes up as an error, and every one after that (going on 6 now) also does. But then a work unit that ran in 1.00 hr and 0.52 hr and came up as an error on two different computers, I ran in 1.70 hr and it came up valid. I may have an older 2000 Lenovo at 1.67 GHz, but it seems to be doing very well. So I have to ask: 1) How much of this do you think is overclocking? 2) Could some computers made for speed and media performance be inherently unstable for real-world number crunching? I'll just say I'm not in this for a prize, just happy to help. Thanks for your time."
I would say that was just dumb luck. Your CPU should be fine, unless there is some specific instruction that your CPU doesn't support because of its age, but I doubt that; otherwise the application would fail to even start.
AMD Ryzen 5 1600AF 4C/8T 3.2 GHz - 85W
AMD Ryzen 5 2500U 4C/8T 2.0 GHz - 28W
Intel Z3740 4C/4T 1.8 GHz - 6W |
||
|
armstrdj
Former World Community Grid Tech Joined: Oct 21, 2004 Post Count: 695 Status: Offline Project Badges: |
The CEP2 validator has been fixed and is running again and caught up. We did what we could to revalidate workunits so most users should see a change in their result status. If any users feel they have a workunit which may still have the wrong status please post the workunit name so we can investigate. We will continue to monitor things on our end but if anyone continues to see validation issues with CEP2 going forward please post to this thread.
Thanks, armstrdj |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Thanks for sorting this out so quickly guys, we really appreciate it - and thanks to our great crunchers for sticking with us!
Your Harvard CEP Team |
||
|
simjoe
Cruncher Joined: Dec 4, 2013 Post Count: 35 Status: Offline Project Badges: |
Thanks a lot to all of you for that great job, all of my new style WU's changed to valid.
There are still a lot of old style WU's like E224167_854_I.64.C46H24N6O12.00238469.3.set1d06 in error, but if I remember right I read somewhere here that this was expected, and it was never an issue for me. Thanks and regards |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
"...if anyone continues to see validation issues with CEP2 going forward please post to this thread."
I'm not sure if this is the kind of thing you are referring to, but I have these E224xxx tasks that are showing as errors:
E224286_243_J.65.C60H32N2S3.00032514.4.set1d06
E224255_008_I.68.C53H27N7O8.00091433.0.set1d06
E224232_137_I.66.C52H28N8O6.00379166.4.set1d06
E224287_565_J.65.C62H34S3.00034874.1.set1d06
E224261_755_I.68.C58H28N4O6.00288515.4.set1d06
E224280_326_J.66.C52H22N4O8S2.00072648.4.set1d06
E224272_475_J.61.C54H28N4S3.00098585.0.set1d06
E224237_307_I.65.C53H32N4O8.00090263.0.set1d06
MarkR |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
"I would say that was just dumb luck. Your CPU should be fine, unless there is some specific instruction that your CPU doesn't support because of its age, but I doubt that; otherwise the application would fail to even start." |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
okay I tried to copy in what I was responding to, but that failed somehow.... so here goes.
I think you missed my point. As CPU speeds go up, near the top end of everything else, there has to come a point when some things just start to fail, be it tied to the complexity of the problem or just because of errors from data transfer within the unit. In all my years working in Quality Control I have seen it happen time and again... it can't be just dumb luck. Also, I never said I was having a problem; rather, before any problem was corrected, I ran work units that came up valid after they had run on other systems and come up in error. There is always an upper limit for any tool, depending on the task. My computer, as far as I know, has no issues... I, on the other hand, have a few.
But as long as I am here again... how many times can one work unit fail before it's no longer passed around? I have two that failed, and it seems they are going on the 7th try. Also, what happens with the points/time everyone donated to show it's a DOA WU?
What the heck, one more: as this seems to be the 2nd validator problem in the past few weeks, why does it seem everyone's reliability goes down the drain over an issue that had nothing to do with the data crunchers? Is there no list of reliable units to start back up with, or is there one and I'm not seeing its effects because everything now requires double validation? Yes, I'm sure everything is subject to validation checks anyway, but the current pattern isn't consistent with what happened just before the validators dropped out. Just asking... [Edit 1 times, last edit by Former Member at Aug 4, 2014 9:09:31 AM] |
||
|
|