Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 22
Posts: 22   Pages: 3   [ 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 1563 times and has 21 replies Next Thread
depriens
Senior Cruncher
The Netherlands
Joined: Jul 29, 2005
Post Count: 350
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Something going on with WU ex379_2A ??


Workunit Name Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
ex379_2A In Progress 06/12/2006 08:14:09 06/19/2006 08:14:09 0.00 0 / 0
ex379_2A In Progress 06/11/2006 01:43:06 06/18/2006 01:43:06 0.00 0 / 0
ex379_2A Error 06/10/2006 23:34:47 06/11/2006 01:32:57 0.84 6 / 0
ex379_2A Error 06/08/2006 18:35:18 06/12/2006 08:13:17 11.56 105 / 0
ex379_2A Error 06/06/2006 16:42:08 06/08/2006 18:34:31 7.56 57 / 0
ex379_2A Error 06/06/2006 16:38:30 06/10/2006 23:32:58 2.21 23 / 0
ex379_2A In Progress 06/06/2006 16:37:08 06/13/2006 16:37:08 0.00 0 / 0


A lot of CPU time is wasted on this workunit. 11.56 hours from my side. crying Shouldn't there be a way that a workunit gets aborted when there are more than 3 errors? At this moment additional workunits are sent out to others and I guess there will probably be more errors.
How is the policy of handling these workunits? Will there be new workunits until there is a valid quorum?
----------------------------------------

[Jun 12, 2006 8:36:06 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Something going on with WU ex379_2A ??

No, five errors and the work unit will be automatically rejected and reported to WCG for analysis.

Correction: four errors.
----------------------------------------
[Edit 1 times, last edit by Former Member at Jun 12, 2006 11:39:13 AM]
[Jun 12, 2006 8:55:53 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Something going on with WU ex379_2A ??

Asked the very question in another thread about 4 weeks ago, no response.....now we know its 5 errors on 1 WU.

thx

@!DePriens.....had quite a few EX....., which crunched all flawlessly. A few took 7 to 9 hours, rest 2 to 4 hours, so must be a fluke..... It's hot in Holland is it not...MBM5 for readings, Speedfan for realtime reading and active automated fan control at specified temp! It's taken 5 minutes CPU time in the 10 days since last boot. biggrin
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Jun 12, 2006 9:49:16 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Something going on with WU ex379_2A ??

I think you did get a response eventually - that's how I know now.
[Jun 12, 2006 10:10:00 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Something going on with WU ex379_2A ??

Ah sorry , i did not see that added comment:
Added: I just reread your question. I think that I did not read it correctly. knreed has said that 4 Errors causes a work unit to be pulled for review but I do not know what the other limits are. Obviously, it will make a difference if every result is different, if 1 is different and another was not returned, etc. The simple title 'Inconclusive' does not give us enough information to determine just which case is happening. The general rule is that the valid solution is determined by a majority of results (minimum size of majority = 3). So 3 identical results plus 2 different results will result in validation.


Here you actually are saying, that pulling takes place at 4 errors, or were you meaning 4 inconclusives.......

In the DePriens Case, those 3 other crunches are unstoppable and who knows do they turn valid!

Enjoy the heat up in the northern zones....still only 18c down south.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 1 times, last edit by Sekerob at Jun 12, 2006 10:24:53 AM]
[Jun 12, 2006 10:22:33 AM]   Link   Report threatening or abusive post: please login first  Go to top 
depriens
Senior Cruncher
The Netherlands
Joined: Jul 29, 2005
Post Count: 350
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Something going on with WU ex379_2A ??

Okay, thanks for the info. So if there'll be another error this WU will be rejected.

The error occured just when the benchmarks started to run:
2006-06-11 21:23:39 [---] Suspending computation - running CPU benchmarks
2006-06-11 21:23:39 [World Community Grid] Pausing task ex379_2A_3 (removed from memory)
2006-06-11 21:23:39 [---] Suspending network activity - running CPU benchmarks
2006-06-11 21:23:40 [World Community Grid] Unrecoverable error for result ex379_2A_3 ( - exit code -1073741819 (0xc0000005))
2006-06-11 21:23:40 [World Community Grid] Deferring scheduler requests for 1 minutes and 0 seconds
2006-06-11 21:23:40 [---] Rescheduling CPU: application exited
2006-06-11 21:23:40 [World Community Grid] Computation for task ex379_2A_3 finished
2006-06-11 21:23:41 [---] Running CPU benchmarks
2006-06-11 21:24:40 [---] Benchmark results:
2006-06-11 21:24:40 [---] Number of CPUs: 1
2006-06-11 21:24:40 [---] 1411 floating point MIPS (Whetstone) per CPU
2006-06-11 21:24:40 [---] 2935 integer MIPS (Dhrystone) per CPU
2006-06-11 21:24:40 [---] Finished CPU benchmarks
2006-06-11 21:24:42 [---] Resuming computation
2006-06-11 21:24:42 [---] Rescheduling CPU: Resuming computation
2006-06-11 21:24:42 [---] Resuming network activity
2006-06-11 21:24:42 [World Community Grid] Starting task faah0585_d284cb471_x1hpv_00_1 using faah version 509
2006-06-11 21:24:43 [World Community Grid] Started upload of file ex379_2A_3_0
2006-06-11 21:25:15 [World Community Grid] Finished upload of file ex379_2A_3_0
2006-06-11 21:25:15 [World Community Grid] Throughput 44666 bytes/sec


I've had this once more:
2006-05-17 17:10:06 [---] Suspending computation - running CPU benchmarks
2006-05-17 17:10:06 [World Community Grid] Pausing task ew873_02_0 (removed from memory)
2006-05-17 17:10:06 [---] Suspending network activity - running CPU benchmarks
2006-05-17 17:10:07 [World Community Grid] Unrecoverable error for result ew873_02_0 ( - exit code -1073741819 (0xc0000005))
2006-05-17 17:10:07 [World Community Grid] Deferring scheduler requests for 1 minutes and 0 seconds
2006-05-17 17:10:08 [---] Rescheduling CPU: application exited
2006-05-17 17:10:08 [World Community Grid] Computation for task ew873_02_0 finished
2006-05-17 17:10:09 [---] Running CPU benchmarks
2006-05-17 17:11:07 [---] Benchmark results:
2006-05-17 17:11:07 [---] Number of CPUs: 1
2006-05-17 17:11:07 [---] 1399 floating point MIPS (Whetstone) per CPU
2006-05-17 17:11:07 [---] 2893 integer MIPS (Dhrystone) per CPU
2006-05-17 17:11:07 [---] Finished CPU benchmarks
2006-05-17 17:11:08 [---] Resuming computation
2006-05-17 17:11:08 [---] Rescheduling CPU: Resuming computation
2006-05-17 17:11:08 [---] Resuming network activity
2006-05-17 17:11:08 [World Community Grid] Starting task faah0510_d134cb465_x1hpv_02_2 using faah version 509
2006-05-17 17:11:09 [World Community Grid] Started upload of file ew873_02_0_0
2006-05-17 17:11:31 [World Community Grid] Finished upload of file ew873_02_0_0
2006-05-17 17:11:31 [World Community Grid] Throughput 43942 bytes/sec


Temperature is not a problem, yes it's over 30degC over here at the moment, but my processor runs only a few degrees hotter now. 53degC constantly. Normally i'm around 46-47 degC.
----------------------------------------

[Jun 12, 2006 12:49:01 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Something going on with WU ex379_2A ??

I've had errors coming at me until i unticked the option to keep the suspended process in memory....it writes it to disc and you loose a few minutes as it returns to last checkpoint saved. Also, i changed the 'save to disc' from 60 seconds to 5 minutes a few days ago, which also improved thruput a few percentage points. Not booted in 10 days, so considered the hi reliability.

Interesting, your whetstone is 100 better, your dhrystone 150 less....always running truly flatline 60c on CPU and 42c on MB and 43c on HD! I'm told this is best for durability.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Jun 12, 2006 1:28:01 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Something going on with WU ex379_2A ??

Hello depriens,
According to your first post, the 4th Error was returned at 06/12/2006 08:13:17. This should have marked that work unit for review. Another work unit was sent out at 06/12/2006 08:14:09. Apparently the replacement work unit is dispatched before the error count is checked against the limit of 4. smile

Well, that is something to add to the list of desired program fixes.
Lawrence
[Jun 12, 2006 4:37:00 PM]   Link   Report threatening or abusive post: please login first  Go to top 
depriens
Senior Cruncher
The Netherlands
Joined: Jul 29, 2005
Post Count: 350
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Something going on with WU ex379_2A ??

Glad to hear that the error (and this post) was good for something in the end. Hopefully you can fix this minor flaw in a later stage. Thanks for all replies. biggrin
----------------------------------------

[Jun 12, 2006 6:48:28 PM]   Link   Report threatening or abusive post: please login first  Go to top 
depriens
Senior Cruncher
The Netherlands
Joined: Jul 29, 2005
Post Count: 350
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Something going on with WU ex379_2A ??

Another one with the same error as the two errors mentioned above: ex621_02

And -again- with running the CPU Benchmarks. Only this time on another computer.
Strange that this ONLY happens to computers which run BOINC 5.4.9 (2 computers).
All my other computers (11) run the previous BOINC version (5.3?) and they never had any problem with running the benchmarks.

I'm starting to believe that it has something to do with the newest BOINC client. Have there been any changes to the Benchmark routine?
----------------------------------------

[Jun 13, 2006 8:56:07 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 22   Pages: 3   [ 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread