Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 30
Posts: 30   Pages: 3   [ Previous Page | 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 170123 times and has 29 replies Next Thread
supdood
Senior Cruncher
USA
Joined: Aug 6, 2015
Post Count: 333
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: High proportion of repair jobs

Result Log
Result Name: SCC1_ 0000000_ Bct-A_ 44112_ 1--
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<message>
app_version download error: couldn't get input files:
<file_xfer_error>
<file_name>scc1_image07_7.08.tga</file_name>
<error_code>-120</error_code>
<error_message>signature verification failed</error_message>

</file_xfer_error>

</message>
]]>

Looks like the checksum for that file failed. See if you can re-download the scc1 files. The default location on Win machines is C:\ProgramData\BOINC\projects\www.worldcommunitygrid.org

Someone with more experience with these errors may have a better suggestion or cautions.
----------------------------------------
Crunch with BOINC team USA
www.boincusa.com

[Jan 27, 2017 1:18:00 PM]   Link   Report threatening or abusive post: please login first  Go to top 
branjo
Master Cruncher
Slovakia
Joined: Jun 29, 2012
Post Count: 1892
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: High proportion of repair jobs

Mumak wrote:
Almost all tasks I'm getting are resends.. There must be something wrong here.

EDIT: First error on my side:
SCC1_ 0000000_ Bct-A_ 29429_ 0--
(unknown error) - exit code 194 (0xc2)
From log:
Output file SCC1_0000000_Bct-A_29429_0_r1794313948_0 for task SCC1_0000000_Bct-A_29429_0 absent


From 247 WU's I have received so far, only 17 are resends. The rest are binaries.
----------------------------------------

Crunching@Home since January 13 2000. Shrubbing@Home since January 5 2006

----------------------------------------
[Edit 1 times, last edit by branjo at Jan 27, 2017 4:11:04 PM]
[Jan 27, 2017 4:10:12 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7697
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: High proportion of repair jobs

Result Log
Result Name: SCC1_ 0000000_ Bct-A_ 44112_ 1--
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<message>
app_version download error: couldn't get input files:
<file_xfer_error>
<file_name>scc1_image07_7.08.tga</file_name>
<error_code>-120</error_code>
<error_message>signature verification failed</error_message>

</file_xfer_error>

</message>
]]>

Looks like the checksum for that file failed. See if you can re-download the scc1 files. The default location on Win machines is C:\ProgramData\BOINC\projects\www.worldcommunitygrid.org

Someone with more experience with these errors may have a better suggestion or cautions.

Well, I left the machine alone to see what would happen, and that file must have finally downloaded correctly because I now have 1 valid and 10 more pending validation. It was the first 10 which all went to error, at least now all appears well.
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Jan 27, 2017 4:54:23 PM]   Link   Report threatening or abusive post: please login first  Go to top 
keithhenry
Ace Cruncher
Senile old farts of the world ....uh.....uh..... nevermind
Joined: Nov 18, 2004
Post Count: 18665
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: High proportion of repair jobs

Result Log
Result Name: SCC1_ 0000000_ Bct-A_ 44112_ 1--
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<message>
app_version download error: couldn't get input files:
<file_xfer_error>
<file_name>scc1_image07_7.08.tga</file_name>
<error_code>-120</error_code>
<error_message>signature verification failed</error_message>

</file_xfer_error>

</message>
]]>

Looks like the checksum for that file failed. See if you can re-download the scc1 files. The default location on Win machines is C:\ProgramData\BOINC\projects\www.worldcommunitygrid.org

Someone with more experience with these errors may have a better suggestion or cautions.

Well, I left the machine alone to see what would happen, and that file must have finally downloaded correctly because I now have 1 valid and 10 more pending validation. It was the first 10 which all went to error, at least now all appears well.
Cheers


Joe, I have seen this before with both new projects and new versions of a project. Somehow, BOINC is trying to start wu's before all of the files have downloaded completely. It clears up fairly shortly just like you saw. One thought to prevent this is to suspend computation while downloading until all these one off files complete. They also tend to be the larger files to come down so it can depend on where they end up in the line of files downloading. Early on usually is best as they complete first before the wu files. It's the other way around that tends to see this problem.
----------------------------------------
Join/Website/IMODB



[Jan 27, 2017 6:29:42 PM]   Link   Report threatening or abusive post: please login first  Go to top 
adriverhoef
Master Cruncher
The Netherlands
Joined: Apr 3, 2009
Post Count: 2172
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: High proportion of repair jobs

Saw it happening that files were being downloaded but got stuck Downloading for a few minutes:

26-Jan-2017 21:06:11 [World Community Grid] Started download of wcgrid_scc1_vina_7.08_x86_64-pc-linux-gnu
26-Jan-2017 21:06:11 [World Community Grid] Started download of wcgrid_scc1_gfx_7.08_x86_64-pc-linux-gnu
26-Jan-2017 21:06:11 [World Community Grid] Started download of scc1_image01_7.08.tga
26-Jan-2017 21:06:11 [World Community Grid] Started download of scc1_image02_7.08.tga
26-Jan-2017 21:06:11 [World Community Grid] Started download of scc1_image03_7.08.tga
26-Jan-2017 21:06:11 [World Community Grid] Started download of scc1_image04_7.08.tga
26-Jan-2017 21:06:11 [World Community Grid] Started download of scc1_image05_7.08.tga
26-Jan-2017 21:06:11 [World Community Grid] Started download of scc1_image06_7.08.tga
26-Jan-2017 21:06:12 [World Community Grid] Finished download of scc1_image06_7.08.tga <-- 06 OK
26-Jan-2017 21:06:12 [World Community Grid] Started download of scc1_image07_7.08.tga
26-Jan-2017 21:06:13 [World Community Grid] Finished download of scc1_image05_7.08.tga <-- 05, 06 OK
26-Jan-2017 21:06:13 [World Community Grid] Started download of scc1_image08_7.08.tga
26-Jan-2017 21:06:14 [World Community Grid] Finished download of scc1_image01_7.08.tga <-- 01, 05-06 OK
26-Jan-2017 21:06:14 [World Community Grid] Finished download of scc1_image03_7.08.tga <-- 01, 03, 05-06 OK
26-Jan-2017 21:06:14 [World Community Grid] Started download of scc1_image09_7.08.tga
26-Jan-2017 21:06:14 [World Community Grid] Started download of scc1.Bct-A.pdbqt
26-Jan-2017 21:06:15 [World Community Grid] Finished download of scc1.Bct-A.pdbqt
26-Jan-2017 21:06:15 [World Community Grid] Started download of 4bff5b91d110aeb2610311c13c64d66c.job
26-Jan-2017 21:06:16 [World Community Grid] Finished download of scc1_image08_7.08.tga <-- 01, 03, 05-06, 08 OK
26-Jan-2017 21:06:16 [World Community Grid] Finished download of 4bff5b91d110aeb2610311c13c64d66c.job
26-Jan-2017 21:06:16 [World Community Grid] Started download of 6225a25b379521f4336eed3bfd2baf4b.zip
26-Jan-2017 21:06:16 [World Community Grid] Started download of 7b9ab1629c06472491f3bcbb7e30fb86.pdbqt
26-Jan-2017 21:06:17 [World Community Grid] Finished download of 6225a25b379521f4336eed3bfd2baf4b.zip
26-Jan-2017 21:06:17 [World Community Grid] Finished download of 7b9ab1629c06472491f3bcbb7e30fb86.pdbqt
26-Jan-2017 21:06:19 [World Community Grid] Finished download of scc1_image02_7.08.tga <-- 01, 02, 03, 05-06, 08 OK
26-Jan-2017 21:06:26 [World Community Grid] Finished download of scc1_image09_7.08.tga <-- 01-03, 05-06, 08, 09 OK
26-Jan-2017 21:06:34 [World Community Grid] Finished download of scc1_image04_7.08.tga <-- 01-03, 04, 05-06, 08-09 OK
26-Jan-2017 21:06:55 [World Community Grid] Finished download of wcgrid_scc1_gfx_7.08_x86_64-pc-linux-gnu
26-Jan-2017 21:06:56 [World Community Grid] Finished download of wcgrid_scc1_vina_7.08_x86_64-pc-linux-gnu
26-Jan-2017 21:16:40 [---] Project communication failed: attempting access to reference site
26-Jan-2017 21:16:40 [World Community Grid] Temporarily failed download of scc1_image07_7.08.tga: transient HTTP error
26-Jan-2017 21:16:41 [World Community Grid] Started download of scc1_image07_7.08.tga
26-Jan-2017 21:16:45 [---] Internet access OK - project servers may be temporarily down.
26-Jan-2017 21:17:15 [World Community Grid] Finished download of scc1_image07_7.08.tga <-- 01-06, 07, 08-09 OK (complete)
26-Jan-2017 21:18:18 [World Community Grid] Message from task: 0
26-Jan-2017 21:18:18 [World Community Grid] Computation for task FAHV_1001034_3j3y-bJ-P4_8723_0 finished
26-Jan-2017 21:18:18 [World Community Grid] Starting task SCC1_0000000_Bct-A_10408_1
26-Jan-2017 21:18:20 [World Community Grid] Started upload of FAHV_1001034_3j3y-bJ-P4_8723_0_r518328708_0
26-Jan-2017 21:18:23 [World Community Grid] Finished upload of FAHV_1001034_3j3y-bJ-P4_8723_0_r518328708_0
26-Jan-2017 21:18:53 [World Community Grid] Sending scheduler request: To fetch work.
26-Jan-2017 21:18:53 [World Community Grid] Reporting 3 completed tasks
26-Jan-2017 21:18:53 [World Community Grid] Requesting new tasks for CPU
26-Jan-2017 21:18:56 [World Community Grid] Scheduler request completed: got 1 new tasks
26-Jan-2017 21:18:58 [World Community Grid] Started download of 80f6cee796bf942d3316c7edc9cc1306.job
26-Jan-2017 21:18:58 [World Community Grid] Started download of 67a82fab766661164d6266263481f465.zip
26-Jan-2017 21:19:00 [World Community Grid] Finished download of 80f6cee796bf942d3316c7edc9cc1306.job
26-Jan-2017 21:19:00 [World Community Grid] Finished download of 67a82fab766661164d6266263481f465.zip
26-Jan-2017 21:19:10 [World Community Grid] Computation for task OET1_0004444_x3Q7Bp_rig_50203_0 finished
26-Jan-2017 21:19:10 [World Community Grid] Starting task SCC1_0000000_Bct-A_11387_1
etc.
----------------------------------------
[Edit 1 times, last edit by adriverhoef at Jan 28, 2017 12:19:48 PM]
[Jan 28, 2017 10:43:29 AM]   Link   Report threatening or abusive post: please login first  Go to top 
adriverhoef
Master Cruncher
The Netherlands
Joined: Apr 3, 2009
Post Count: 2172
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: High proportion of repair jobs

EDIT: First error on my side:
SCC1_ 0000000_ Bct-A_ 29429_ 0--
(unknown error) - exit code 194 (0xc2)
From log:
Output file SCC1_0000000_Bct-A_29429_0_r1794313948_0 for task SCC1_0000000_Bct-A_29429_0 absent

Second SCC1 job on my Android device also died - with an ExitStatus 7:

Result Log
Result Name: SCC1_ 0000000_ Bct-A_ 75236_ 1--

<core_client_version>7.4.41</core_client_version>
<![CDATA[
<message>
process got signal 7
</message>
<stderr_txt>

</stderr_txt>
]]>

My first two jobs were downloaded as a set at the same time where the first one (74875_2) succeeded and the second one (75236_1) failed with an error:
SCC1_0000000_Bct-A_74875_2-- Pending Validation 1/27/17 08:46:30 	1/28/17 10:50:45 	7.54 / 7.65 	56.0 / 0.0
SCC1_0000000_Bct-A_75236_1-- Error 1/27/17 08:46:30 1/28/17 10:50:45 0.00 / 0.00 59.1 / 0.0

From the log it appeared that some outputfile was absent and that the second job failed almost immediately:

Sat Jan 28 11:49:13 CET 2017|World Community Grid|Computation for task SCC1_0000000_Bct-A_74875_2 finished
Sat Jan 28 11:49:13 CET 2017|World Community Grid|Starting task SCC1_0000000_Bct-A_75236_1
Sat Jan 28 11:49:14 CET 2017|World Community Grid|Computation for task SCC1_0000000_Bct-A_75236_1 finished
Sat Jan 28 11:49:14 CET 2017|World Community Grid|Output file SCC1_0000000_Bct-A_75236_1_r1352087123_0 for task SCC1_0000000_Bct-A_75236_1 absent
Sat Jan 28 11:49:15 CET 2017|World Community Grid|Started upload of SCC1_0000000_Bct-A_74875_2_r1791366140_0
Sat Jan 28 11:49:19 CET 2017|World Community Grid|Finished upload of SCC1_0000000_Bct-A_74875_2_r1791366140_0
Sat Jan 28 11:50:41 CET 2017|World Community Grid|Sending scheduler request: To report completed tasks.
Sat Jan 28 11:50:41 CET 2017|World Community Grid|Reporting 2 completed tasks

[Jan 28, 2017 1:04:43 PM]   Link   Report threatening or abusive post: please login first  Go to top 
orangepeel13
Cruncher
USA
Joined: Jul 22, 2014
Post Count: 11
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: High proportion of repair jobs

I have all of the same error on every SCC1 I have gotten:

Result Log
Result Name: SCC1_ 0000000_ Bct-A_ 44112_ 1--
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<message>
app_version download error: couldn't get input files:
<file_xfer_error>
<file_name>scc1_image07_7.08.tga</file_name>
<error_code>-120</error_code>
<error_message>signature verification failed</error_message>
</file_xfer_error>

</message>
]]>
I am suspending this project until I can figure out why. This is on a machine which has run MCM flawlessly for months. It also ran quite a number of the beta jobs for this project with no problems. Any insight on this from anyone would be appreciated.
Thanks


I am getting the same kind of error on one of my Ubuntu VMs:

Result Log

Result Name: SCC1_ 0000001_ Bct-A_ 43708_ 0--
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
app_version download error: couldn't get input files:
<file_xfer_error>
<file_name>wcgrid_scc1_vina_7.08_x86_64-pc-linux-gnu</file_name>
<error_code>-120 (RSA key check failed for file)</error_code>
<error_message>signature verification failed</error_message>
</file_xfer_error>

</message>
]]>


Is this something with the files corrupted during download? I run other projects just fine, so it doesn't seem like it should be my machine that is the problem.
[Jan 28, 2017 5:21:18 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: High proportion of repair jobs

If it's any consolation, there's a thread from 6 months ago about this, "Download fails -186 [encompassing -119 MD5 and -120 RSA Key fails]", but there was no specific solution other than maybe techs resetting a cache. The problem might be "caused" by a download from the CDN failing. You might like to try adding file_xfer_debug logging to see if it's still the same.
[Jan 28, 2017 6:01:12 PM]   Link   Report threatening or abusive post: please login first  Go to top 
NixChix
Veteran Cruncher
United States
Joined: Apr 29, 2007
Post Count: 1187
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: High proportion of repair jobs

What is your queue size? Are you clients downloading work and immediately starting on them? I didn't have any errors for SCC, but since I use a queue size of 0.5 day, all downloaded jobs have been downloaded for around 12 hours before running. That should have been more than enough time to complete all the downloads.

Cheers coffee
----------------------------------------

[Jan 28, 2017 6:10:21 PM]   Link   Report threatening or abusive post: please login first  Go to top 
orangepeel13
Cruncher
USA
Joined: Jul 22, 2014
Post Count: 11
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: High proportion of repair jobs

What is your queue size? Are you clients downloading work and immediately starting on them? I didn't have any errors for SCC, but since I use a queue size of 0.5 day, all downloaded jobs have been downloaded for around 12 hours before running. That should have been more than enough time to complete all the downloads.

Cheers coffee


I have my queues set for about a day. I'm running FAAH along with this new project, and they are going fine. I have 4 Ubuntu VMs and checking on them I see that another of them has failed all the SCC jobs so far also. So none have worked so far.
----------------------------------------
[Edit 1 times, last edit by orangepeel13 at Jan 29, 2017 3:04:56 AM]
[Jan 29, 2017 3:03:23 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 30   Pages: 3   [ Previous Page | 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread