Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 16
Posts: 16   Pages: 2   [ Previous Page | 1 2 ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 1558 times and has 15 replies Next Thread
Jacob Klein
Cruncher
Joined: May 31, 2007
Post Count: 28
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: File size too big

Wow, thanks for the responses guys!

There are plenty of hints here, about increasing result file limits... hope World Community Grid finds the configuration problem and fixes it, so our resources aren't continuing to be wasted.
----------------------------------------
[Edit 1 times, last edit by Jacob Klein at Aug 12, 2014 11:12:43 AM]
[Aug 12, 2014 11:11:46 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: File size too big

The file size was always large can't find it now because we all have cookies but in the required computer for each project if I remember right CEP2 was listed as needing to up load between 20-200MB of data with 1GB extra ram on stand by.

I think you need a computer not running WCG to find it again, since all I can find now are log in problem pages.
[Aug 12, 2014 1:10:07 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: File size too big

Hi all.

The problem here is simply that there are a small number of jobs towards the end of what was first sent out that have a large number of electrons. This pretty much equates to having a large number of basis functions, which is what determines the size of the large binary file that BOINC is saying is too big.

In the old style of work unit, it was a calculation in a basically minimal basis set that was getting exposed to this limit - and these calculations have far fewer basis functions per atom than the one we have changed to.

For the vast majority of work units, this file falls under the size limit of BOINC and so was not causing a problem, and frankly I did not expect this to be a problem at all, based on the in-house testing I had done. However, this is research and the unexpected frequently occurs (it is what makes it interesting after all!). Rather than continue to send new jobs, we decided to profile the jobs which BOINC was refusing to send to see if we can allow larger files to be transferred to *minimise* the wasted time on the grid. Because of this, and due to the fact that failed jobs get sent multiple times, the failure of these jobs is more visible since no smaller jobs (for which this would not be a problem) are being sent until we change the settings (which makes sense for diagnosing errors).

Siedentopf: whilst that is a viable theory based upon what has been happening, that is not what the issue is here. As reported, the validation problem was due to two different 'styles' of work unit being on the grid at the same time (something which has not happened at all in the CEP2 until now) and was quickly and completely resolved. With regard to the calculations themselves, I gave a pretty complete summary of why jobs might fail here : https://secure.worldcommunitygrid.org/forums/wcg/viewpostinthread?post=465685 and you can add the current BOINC file size limit to that. It is the complete nature of our type of high throughput screening that some jobs will fail for various reasons, and there is pretty much nothing that can be done to completely remove failures - it is the nature of the quantum calculations. Rest assured we do our very best not to send things to the grid which are flawed or rushed.

Your Harvard CEP Team
[Aug 12, 2014 1:44:10 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: File size too big

Cleanenergy

Well thank you for that, so this isn't the black hole I had expected it to be. Also never said you would intend to rush flawed work, just that it went boom. So by Maritime Law the sinking ship is your problem, sorry. However, yes the details have bailed you out.

Since 1958 when I started to decode words within my older brothers Popular Science I was fascinated with the idea of solar cells when they were set to power Vanguard. I was told they made free energy and dad wouldn't be running around screaming flipping off the lights. None of us were ever as traumatized as I have already made it sound, but my need to see it, the need to make it real for everyone, has only gotten bigger. I'€™m always reading anything I can find dealing with current PV tech..

Try if you can to forgive me my zeal, my tendency to see conspiracy to avoid clean energy. Tesla tells me they will not be putting solar panels on the bodies of their cars any time soon. I toss up my hands and ask "why not"?

I'm the one person that would pack my dogs and I into a car and move to anyplace I could make this hands on. So if you ever do find anyone starting up World Grid has my e-mail address, not the first time I camped on a doorstep.

Indulge me, I assume the current work units are not within the data base, so how can we see the molecules getting crunched, or is that unavailable until they are done and placed in the data base? It may kill my Mr. Potato Head theory, but science is finding out what is right over plausible. Also I don'€™t know as much about basis sets as I would like, 4 kids and a wife packed my school books and presented me a hard hat, what form of basis sets are you computing with?

I hope you resolve the size problem; I for one don'€™t care how big they get or how long they run. Once had a FEM program run for 73 days, wasn't going to stop it so I picked up a new computer to surf on.

Thanks for your time.

Oh wait I did say rushed didn't I..... okay my bad sorry. That was a bad day in every respect.
----------------------------------------
[Edit 2 times, last edit by Former Member at Aug 13, 2014 3:30:20 AM]
[Aug 12, 2014 10:11:25 PM]   Link   Report threatening or abusive post: please login first  Go to top 
armstrdj
Former World Community Grid Tech
Joined: Oct 21, 2004
Post Count: 695
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: File size too big

We have increased the file size limit for CEP2 and when it starts again this issue should be solved.

Thanks,
armstrdj
[Aug 21, 2014 3:34:17 PM]   Link   Report threatening or abusive post: please login first  Go to top 
yoro42
Ace Cruncher
United States
Joined: Feb 19, 2011
Post Count: 8976
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: File size too big

First time I've seen/noticed this error.
Hope this helps.

upload failure: <file_xfer_error>
<file_name>E225101_581_S.322.C45H31N3.HUVQRXZSXJIVIL-UHFFFAOYSA-N.17_s1_14_1_1</file_name>
<error_code>-131 (file size too big)</error_code>
</file_xfer_error>


E225101_ 581_ S.322.C45H31N3.HUVQRXZSXJIVIL-UHFFFAOYSA-N.17_ s1_ 14_ 1-- Dexter Error 8/19/14 06:49:39 8/20/14 04:27:45 11.05 / 11.32 314.9 / 0.0


Result Log

Result Name: E225101_ 581_ S.322.C45H31N3.HUVQRXZSXJIVIL-UHFFFAOYSA-N.17_ s1_ 14_ 1--
<core_client_version>7.2.47</core_client_version>
<![CDATA[
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[09:11:15] Number of jobs = 8
[09:11:15] Starting job 0,CPU time has been restored to 0.000000.
[14:55:18] Finished Job #0
[14:55:18] Starting job 1,CPU time has been restored to 20127.171020.
[15:22:14] Finished Job #1
[15:22:14] Starting job 2,CPU time has been restored to 21718.927223.
[15:42:58] Finished Job #2
[15:42:58] Starting job 3,CPU time has been restored to 22946.249490.
[16:07:52] Finished Job #3
[16:07:52] Starting job 4,CPU time has been restored to 24419.242133.
[16:29:15] Finished Job #4
[16:29:15] Starting job 5,CPU time has been restored to 25688.793871.
[16:49:40] Finished Job #5
[16:49:40] Starting job 6,CPU time has been restored to 26897.786021.
Application exited with RC = 0x1
[20:27:20] Finished Job #6
[20:27:20] Starting job 7,CPU time has been restored to 39792.033076.
[20:27:20] Skipping Job #7
20:27:29 (1492): called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
<file_name>E225101_581_S.322.C45H31N3.HUVQRXZSXJIVIL-UHFFFAOYSA-N.17_s1_14_1_1</file_name>
<error_code>-131 (file size too big)</error_code>
</file_xfer_error>

</message>
]]>
----------------------------------------

[Aug 23, 2014 12:41:14 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 16   Pages: 2   [ Previous Page | 1 2 ]
[ Jump to Last Post ]
Post new Thread