World Community Grid Forums
Thread Status: Active Total posts in this thread: 36
Keith Myers
Senior Cruncher USA Joined: Apr 6, 2021 Post Count: 193 Status: Offline
Surprised that nobody has commented on all the tasks that have had -255 errors across all my hosts and GPUs for the past several days.
5 hosts - 13 GPUs. So obviously nothing is wrong with any of my hardware, as it runs all my other projects just fine. Badly formatted tasks, I guess.
----------------------------------------
A proud member of the OFA (Old Farts Association)
[Edit 1 times, last edit by Keith Myers at Jan 12, 2022 4:22:56 PM]
||
|
Richard Haselgrove
Senior Cruncher United Kingdom Joined: Feb 19, 2021 Post Count: 360 Status: Offline
I've just checked my history, and I'm showing just one single -255 error at the moment. That was on an Intel HD 4600 iGPU running Windows.
Over the last couple of weeks, I've been returning between 300 and 600 tasks per day, mostly using NVidia GTX 16xx cards under both Linux and Windows. I don't see your problem here.
(edit - that's one error against 719 valid results, to give an idea of scale and ratio)
[Edit 1 times, last edit by Richard Haselgrove at Jan 12, 2022 4:51:09 PM]
||
|
deltavee
Ace Cruncher Texas Hill Country Joined: Nov 17, 2004 Post Count: 4890 Status: Offline
Keith,
I run 500-600 OPNG WUs daily and there are no errors in my last 1000 returns. I don't think the problem is badly formatted tasks.
||
|
JohnDK
Advanced Cruncher Denmark Joined: Feb 17, 2010 Post Count: 77 Status: Offline
No errors on my Linux and Windows hosts.
----------------------------------------
Intel i7-6850K / 16GB / RTX 3090 / 2x RTX 3080 Ti / RTX 3070 Ti
AMD Ryzen 9 5950X / 32GB / RTX 2080 Ti
||
|
Grumpy Swede
Master Cruncher Svíþjóð Joined: Apr 10, 2020 Post Count: 2201 Status: Recently Active
No errors on my Windows 8.1 host, with an Nvidia GTX 980 Strix GPU.
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 977 Status: Offline
Keith,
Out of interest, what happened to the wingmen on the failed tasks?
For what it's worth, I've not seen any issues here [yet, says he...] but then I've only got 1 GPU in each of two (Linux) machines, and only my 1660Ti is trying to do two OPNG at a time (no real benefit from trying two at a time with a 1050Ti!...)
The tasks over the last couple of days are no longer dominated by larger ligands with lots of branches - this means more jobs per task; since the beginning of 11th January I've not seen a single task with fewer than 60 jobs, and the average number of jobs per task has climbed from below 20 to over 80! I've noticed that my task completion monitor is finding it harder to get the information it needs from client_state.xml within the time limits I've set - this suggests a resource choke-point within the local BOINC infrastructure (and that's on a 3700X which only has the 1660Ti and never runs more than 10 or 11 BOINC tasks at once!)
If you're still having issues, post up some task or workunit examples and maybe we can have a dig for issues...
Cheers - Al.
||
|
sam6861
Advanced Cruncher Joined: Mar 31, 2020 Post Count: 107 Status: Offline
On this website, go up to "My Contribution", then "Results". Look for an error and click on the work unit: did other computers on the same work unit get an error as well?
Click on "Error" on your task. What does it show?
If other tasks on the same work unit work fine, but your tasks error on all your computers, then this could be a wrong configuration, missing packages, missing libraries, a bad driver install, or a possibly corrupt file or data copied to all your computers.
OPNG: 84 Valid, 4 pending, 1 in progress. Works fine for me on both AMD and NVidia.
Ryzen 3900x, 32GB, Win10, AMD 5500 XT 8GB, NVidia GT 1030
Ryzen 2700x, 32GB, Win10, AMD RX 550 4GB
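The check suggested in the post above (compare your own result with the wingmen on the same work unit) can be written down as a small rule of thumb. This is only a sketch of that reasoning: the function name `diagnose` and the outcome strings "Valid" and "Error" are made up for illustration, it does not talk to the website, and "suspect" is as strong as the conclusion gets.

```python
def diagnose(my_outcome, wingman_outcomes):
    """Rule of thumb for one work unit.

    my_outcome and wingman_outcomes use plain strings such as
    "Valid" or "Error" (hypothetical labels, typed in by hand).
    """
    if my_outcome != "Error":
        return "no problem on this host"
    if any(o == "Error" for o in wingman_outcomes):
        # Someone else failed on the same work unit as well.
        return "other hosts errored too: suspect the work unit"
    if any(o == "Valid" for o in wingman_outcomes):
        # Others finished it fine, so look at drivers, libraries, config.
        return "only this host errored: suspect the local setup"
    return "undetermined: no wingman result yet"


if __name__ == "__main__":
    print(diagnose("Error", ["Error", "Valid"]))
    print(diagnose("Error", ["Valid", "Valid"]))
    print(diagnose("Error", []))
```

Running it over several failed work units is more telling than a single one: if the answer is "suspect the local setup" every time, the configuration, driver and library list above is the place to start.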
||
|
Freewill
Cruncher United States Joined: Mar 28, 2006 Post Count: 41 Status: Offline
Keith, I was getting code 255 on my XPS 15 laptop with old Intel Graphics 530 and GTX 960M for every GPU task, both with almost no run time on the task. I did some updates to Ubuntu and rebooted and they went away. Silly question, but have you rebooted? ;)
Not sure if it's the same problem you're having, but you can check this error here: https://www.worldcommunitygrid.org/contribution/results/183526597/log
[Edit 1 times, last edit by Freewill at Jan 13, 2022 1:13:43 AM]
||
|
Keith Myers
Senior Cruncher USA Joined: Apr 6, 2021 Post Count: 193 Status: Offline
Yes, almost all my errored tasks have at least one wingman who also errored the task.
All the hosts are updated daily for the security updates and have been rebooted at least once every day.
All my error tasks have this at the beginning of the stderr.txt output.
<core_client_version>7.17.0</core_client_version>
<![CDATA[
<message>
process exited with code 255 (0xff, -1)</message>
<stderr_txt>
I only run single tasks on any card.
https://www.worldcommunitygrid.org/contribution/results/186574616/log
https://www.worldcommunitygrid.org/contribution/results/186578178/log
https://www.worldcommunitygrid.org/contribution/results/186578130/log
https://www.worldcommunitygrid.org/contribution/results/186577541/log
https://www.worldcommunitygrid.org/contribution/results/186576514/log
I have no clue why my tasks are all failing. I have done nothing different to any of my hosts from the last time I got work and the tasks were validating. The only thing that has changed was the incremental kernel security updates.
I am also running gpugrid python, milkyway separation and einstein gamma-ray tasks on all the GPUs and they all run fine. Only one GPU task per project on any card.
----------------------------------------
A proud member of the OFA (Old Farts Association)
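The message line quoted in the post above reports the exit status three ways: 255, 0xff and -1. A minimal sketch of how those numbers relate, assuming the last value is the same low byte read as a signed 8-bit integer. That reading matches this one line, but it is an assumption about the message format and is not taken from the BOINC source. The helper names are made up for illustration.

```python
import re

# Message line copied from the stderr header quoted in the post above.
HEADER = "<message> process exited with code 255 (0xff, -1)</message>"

PATTERN = re.compile(
    r"process exited with code (\d+) \((0x[0-9a-fA-F]+), (-?\d+)\)"
)


def parse_exit_line(line):
    """Return (decimal, hex_value, signed) from the message line, or None."""
    m = PATTERN.search(line)
    if m is None:
        return None
    return int(m.group(1)), int(m.group(2), 16), int(m.group(3))


def as_signed_byte(code):
    """Read the low 8 bits of an exit status as a signed 8-bit integer.

    Assumption for illustration only: 255 -> -1, 1 -> 1.
    """
    low = code & 0xFF
    return low - 256 if low >= 128 else low


if __name__ == "__main__":
    decimal, hex_value, signed = parse_exit_line(HEADER)
    print(decimal, hex(hex_value), signed, as_signed_byte(decimal))
    # prints: 255 0xff -1 -1
```

So 255, 0xff and -1 are one byte written three ways. The number by itself says only that the process ended with a non-zero status; the rest of the stderr output in the linked logs is where any actual reason would show up.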
||
|
Keith Myers
Senior Cruncher USA Joined: Apr 6, 2021 Post Count: 193 Status: Offline
Richard, what the heck is a -255 error? I looked at the old BOINC error list page and it is not one of the mentioned ones.
----------------------------------------
A proud member of the OFA (Old Farts Association)
||
|