Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 36
Posts: 36   Pages: 4   [ 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 6256 times and has 35 replies Next Thread
Keith Myers
Senior Cruncher
USA
Joined: Apr 6, 2021
Post Count: 193
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
All OPNG tasks erroring on all gpus-all hosts

Surprised that nobody has commented on all the tasks that have had -255 errors across all my hosts and gpus for the past several days.

5 hosts - 13 gpus

So obviously nothing wrong with any of my hardware as they run all my other projects just fine.

Badly formatted tasks I guess.
----------------------------------------

A proud member of the OFA (Old Farts Association)
----------------------------------------
[Edit 1 times, last edit by Keith Myers at Jan 12, 2022 4:22:56 PM]
[Jan 12, 2022 4:22:28 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Richard Haselgrove
Senior Cruncher
United Kingdom
Joined: Feb 19, 2021
Post Count: 360
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All OPNG tasks erroring on all gpus-all hosts

I've just checked my history, and I'm showing just one single -255 error at the moment. That was on an Intel HD 4600 iGPU running Windows.

Over the last couple of weeks, I've been returning between 300 and 600 tasks per day, mostly using NVidia GTX 16xx cards under both Linux and Windows. I don't see your problem here.

(edit - that's one error against 719 valid results, to give an idea of scale and ratio)
----------------------------------------
[Edit 1 times, last edit by Richard Haselgrove at Jan 12, 2022 4:51:09 PM]
[Jan 12, 2022 4:46:04 PM]   Link   Report threatening or abusive post: please login first  Go to top 
deltavee
Ace Cruncher
Texas Hill Country
Joined: Nov 17, 2004
Post Count: 4890
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All OPNG tasks erroring on all gpus-all hosts

Keith,
I run 500-600 OPNG WUs daily and there are no errors in my last 1000 returns. I don't think the problem is badly formatted tasks.
[Jan 12, 2022 8:48:54 PM]   Link   Report threatening or abusive post: please login first  Go to top 
JohnDK
Advanced Cruncher
Denmark
Joined: Feb 17, 2010
Post Count: 77
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All OPNG tasks erroring on all gpus-all hosts

No errors on my linux and windows hosts.
----------------------------------------
Intel i7-6850K / 16GB / RTX 3090 / 2x RTX 3080 Ti / RTX 3070 Ti
AMD Ryzen 9 5950X / 32GB / RTX 2080 Ti
[Jan 12, 2022 9:05:57 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Grumpy Swede
Master Cruncher
Svíþjóð
Joined: Apr 10, 2020
Post Count: 2201
Status: Recently Active
Project Badges:
Reply to this Post  Reply with Quote 
Re: All OPNG tasks erroring on all gpus-all hosts

No errors on my Windows 8.1 host, with a Nvidia GTX980 Strix GPU.
[Jan 12, 2022 10:28:10 PM]   Link   Report threatening or abusive post: please login first  Go to top 
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 977
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All OPNG tasks erroring on all gpus-all hosts

Keith,

Out of interest, what happened to the wingmen on the failed tasks???

For what it's worth, I've not seen any issues here [yet, says he...] but then I've only got 1 GPU in each of two (Linux) machines, and only my 1660Ti is trying to do two OPNG at a time (no real benefit from trying two at a time with a 1050Ti!...)

The tasks over the last couple of days are no longer dominated by larger ligands with lots of branches - this means more jobs per task; since the beginning of 11th January I've not seen a single task with less than 60 jobs, and the average number of jobs per task has climbed from below 20 to over 80! I've noticed that my task completion monitor is finding it harder to get the information it needs from client_state.xml within the time limits I've set - this suggests a resource choke-point within local BOINC infrastructure (and that's on a 3700X which only has the 1660Ti and never runs more than 10 or 11 BOINC tasks at once!)

If you're still having issues, post up some task or workunit examples and maybe we can have a dig for issues...

Cheers - Al.
[Jan 12, 2022 11:58:30 PM]   Link   Report threatening or abusive post: please login first  Go to top 
sam6861
Advanced Cruncher
Joined: Mar 31, 2020
Post Count: 107
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All OPNG tasks erroring on all gpus-all hosts

On this website, go up to "My Contribution", "Results". Look for error, click on work unit, does other computers on the same work unit got error as well?
Click on "Error" on your task, what does this show?

If other tasks on same work unit works fine, but your tasks on all your computers all errors, then this can possibly be wrong configuration, missing packages, missing libraries, bad driver install, or if copied a possibly corrupt file/data to all your computers.

OPNG: 84 Valid, 4 pending, 1 in progress. Works fine for me on both AMD and NVidia.
Ryzen 3900x, 32GB, Win10, AMD 5500 XT 8GB, NVidia GT 1030
Ryzen 2700x, 32GB, Win10, AMD RX 550 4GB
[Jan 13, 2022 12:05:47 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Freewill
Cruncher
United States
Joined: Mar 28, 2006
Post Count: 41
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All OPNG tasks erroring on all gpus-all hosts

Keith, I was getting code 255 on my xps 15 laptop with old Intel Graphics 530 and GTX 960M for every GPU task. Both with almost no run time on the task. I did some updates to Ubuntu and rebooted and they went away. Silly question, but have you rebooted? ;)

Not sure if same problem you're having, but you can check this error here:
https://www.worldcommunitygrid.org/contribution/results/183526597/log
----------------------------------------
[Edit 1 times, last edit by Freewill at Jan 13, 2022 1:13:43 AM]
[Jan 13, 2022 1:12:28 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Keith Myers
Senior Cruncher
USA
Joined: Apr 6, 2021
Post Count: 193
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All OPNG tasks erroring on all gpus-all hosts

Yes, almost all my errored tasks have at least one wingman who also errored the task.
All the host are updated daily for the security updates and have been rebooted at least once every day.

All my error tasks have this at the beginning of the stderr.txt output.
<core_client_version>7.17.0</core_client_version>
<![CDATA[
<message>
process exited with code 255 (0xff, -1)</message>
<stderr_txt>

I only run single tasks on any card.

https://www.worldcommunitygrid.org/contribution/results/186574616/log
https://www.worldcommunitygrid.org/contribution/results/186578178/log
https://www.worldcommunitygrid.org/contribution/results/186578130/log
https://www.worldcommunitygrid.org/contribution/results/186577541/log
https://www.worldcommunitygrid.org/contribution/results/186576514/log

I have no clue why my tasks are all failing. I have done nothing different to any of my hosts from the last time I got work and the tasks were validating. The only thing that has changed was the incremental kernel security updates.

I am also running gpugrid python, milkyway separation and einstein gamma-ray tasks on all the gpus and they all run fine. Only one gpu task per project on any card.
----------------------------------------

A proud member of the OFA (Old Farts Association)
[Jan 13, 2022 3:53:59 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Keith Myers
Senior Cruncher
USA
Joined: Apr 6, 2021
Post Count: 193
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All OPNG tasks erroring on all gpus-all hosts

Richard, what the heck is a -255 error? I looked at the old BOINC error list page and it is not one of the mentioned ones.
----------------------------------------

A proud member of the OFA (Old Farts Association)
[Jan 13, 2022 3:57:41 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 36   Pages: 4   [ 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread