Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 9
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 1938 times and has 8 replies Next Thread
KerSamson
Master Cruncher
Switzerland
Joined: Jan 29, 2007
Post Count: 1673
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Computation Error at start - Some WUs of batch OPN1_0029411_xxxxx

It seems that batch OPN1_0029411_xxxxx causes an error after about 35 sec run:
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
process exited with code 255 (0xff, -1)
</message>
<stderr_txt>
INFO:[13:33:31] Start AutoGrid...
autogrid4: Successful Completion.
INFO:[13:34:03] End AutoGrid...

INFO:[13:34:04] Start AutoDock for ZINC000067911426-ACR2.45_RX1--5re9_001--CYS145_wcgsplit2.dpf(Job #0)...
INFO: In AutoDock main_autodock()
Beginning AutoDock...

</stderr_txt>
]]>

At least, WUs 00098 and 09318 have a problem.
In the both cases, the wingmen - running Windows 10 - did experience the same issue.
Cheers,
Yves
---
PS: Up-to-date Ubuntu 14.04 x64, AMD Phenom II x6. The system is otherwise running failure free.
----------------------------------------
----------------------------------------
[Edit 2 times, last edit by KerSamson at Dec 31, 2020 1:36:37 PM]
[Dec 31, 2020 12:47:58 PM]   Link   Report threatening or abusive post: please login first  Go to top 
adriverhoef
Master Cruncher
The Netherlands
Joined: Apr 3, 2009
Post Count: 2170
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Computation Error at start - Batch OPN1_0029411_xxxxx

It seems that batch OPN1_0029411_xxxxx causes an error after about 35 sec run:
[…]
At least, WUs 00098 and 09318 have a problem.

Yves,
My experience with OPN1_0029411_xxxxx comprises two tasks so far. Both finished successfully a few days ago:
$ wcglog -0 -e OPN1_0029411_
App CpuTime Elapsed Claimed Granted ModTime Exit Outc SentTime ReceivedTime Name
opn1 3.57 3.62 73.8 73.8 1609025476 0 1 2020-12-26T19:53:59 2020-12-26T23:31:12 OPN1_0029411_06423_0
opn1 2.53 2.60 80.2 80.2 1609075489 0 1 2020-12-26T18:17:37 2020-12-27T13:24:44 OPN1_0029411_02288_0
App CpuTime Elapsed Claimed Granted ModTime Exit Outc SentTime ReceivedTime Name
OPN1_0029411_ n=2 CpuHours=6.09 Hrs/n=3.047
EDIT: Added '-0' to affirm that there were no faulty OPN1_0029411 tasks here.
----------------------------------------
[Edit 1 times, last edit by adriverhoef at Dec 31, 2020 2:18:31 PM]
[Dec 31, 2020 1:18:50 PM]   Link   Report threatening or abusive post: please login first  Go to top 
KerSamson
Master Cruncher
Switzerland
Joined: Jan 29, 2007
Post Count: 1673
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Computation Error at start - Some WUs of batch OPN1_0029411_xxxxx

Hi Adri,
thank you for your feedback. It is strange ...
I verified my WU collection and I had only the two mentioned WUs from batch 29411 and both failed (incl. for the corresponding wingmen); the rest of the OPN1 seems to be OK.
Cheers,
Yves
---
PS: Based on your feedback, I modified the thread title.
----------------------------------------
[Dec 31, 2020 1:36:11 PM]   Link   Report threatening or abusive post: please login first  Go to top 
adriverhoef
Master Cruncher
The Netherlands
Joined: Apr 3, 2009
Post Count: 2170
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Computation Error at start - Some WUs of batch OPN1_0029411_xxxxx

At least it's something not to be worried about, Yves, since your wingmen also failed to complete their tasks successfully.
PS: Based on your feedback, I modified the thread title.
Good thinking! smile
[Dec 31, 2020 2:26:44 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1322
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Computation Error at start - Some WUs of batch OPN1_0029411_xxxxx

Also 1 failed task (OPN1_0029411_05208) from this batch, but not just after start but later in the job.

Me and 2 wingman errors after 0.38, 0.53 and 0.89 hours when starting this ...

Start AutoDock for ZINC000004544593-ACR2.45_RX1--5re9_001--CYS145.dpf(Job #1)...
Start AutoDock for ZINC000004544593-ACR2.45_RX1--5re9_001--CYS145.dpf(Job #1)...
Start AutoDock for ZINC000004544593-ACR2.45_RX1--5re9_001--CYS145.dpf(Job #1)...

... after having done 48 dockings of another target before.

https://www.worldcommunitygrid.org/ms/device/...s.do?workunitId=463288871
[Dec 31, 2020 2:38:26 PM]   Link   Report threatening or abusive post: please login first  Go to top 
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Computation Error at start - Some WUs of batch OPN1_0029411_xxxxx

There are a few things that could cause this. But I have two suspicions before even looking into it.

1. Something in the random seed is bonkers (unlikely especially on 2 in the same batch)
2. The dpf file that was provided to us was generated improperly, We have seen it a few times where the file provided did not have proper closing tag, or it was repeated. (This is more likely, especially since it's two in the same batch).

We will investigate when back from holiday as it is usually very infrequent. I've fixed maybe 5 the whole project for option #2 and option #1 should be cleared up with updates to the build scripts a few months ago.

Thanks,
-Uplinger
[Dec 31, 2020 4:55:14 PM]   Link   Report threatening or abusive post: please login first  Go to top 
KerSamson
Master Cruncher
Switzerland
Joined: Jan 29, 2007
Post Count: 1673
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Computation Error at start - Some WUs of batch OPN1_0029411_xxxxx

Hi Keith,
happy new year to you and your colleagues smile
I did report the problem as soon as I did notice it, not because I worried about it, but because I suspected a bug in the batch generation.
Cheers,
Yves
----------------------------------------
[Jan 1, 2021 8:07:01 AM]   Link   Report threatening or abusive post: please login first  Go to top 
BobbyB
Veteran Cruncher
Canada
Joined: Apr 25, 2020
Post Count: 609
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Computation Error at start - Some WUs of batch OPN1_0029411_xxxxx

OPN1_ 0029411_ 08656_0 through _3 failed

<core_client_version>7.14.3</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code -1 (0xffffffff)</message>
<stderr_txt>
INFO:[17:01:01] Start AutoGrid...

autogrid4: Successful Completion.
INFO:[17:01:32] End AutoGrid...
INFO:[17:01:32] Start AutoDock for ZINC000102190983_2-ACR2.42_RX1--5re9_001--CYS145_wcgsplit4.dpf(Job #0)...
INFO: In AutoDock main_autodock()
Beginning AutoDock...

</stderr_txt>
]]>
[Jan 1, 2021 4:57:26 PM]   Link   Report threatening or abusive post: please login first  Go to top 
FAHE
Advanced Cruncher
Australia
Joined: Apr 27, 2007
Post Count: 122
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Computation Error at start - Some WUs of batch OPN1_0029411_xxxxx

OPN1_0029411_00971.....

0, 1 and 2 failed, 3 trying valiantly......
----------------------------------------

[Jan 2, 2021 1:27:46 AM]   Link   Report threatening or abusive post: please login first  Go to top 
[ Jump to Last Post ]
Post new Thread