Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 148
Posts: 148   Pages: 15   [ Previous Page | 5 6 7 8 9 10 11 12 13 14 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 17301 times and has 147 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

Got them running under Ubuntu 14.04 lts. Checkpointing appears to adhere to the setting of 120 seconds at least, one being made shortly after that interval. Good for booting.

Runtimes, the 0622, 0069, 0727 seem to be heading for 5 hours if percent progress is accurate. 0454 going for 1:40, 0300 for 1:18.

The downloads per task were pretty hefty, up to 10mb for the txt file, each task with it's own. Suppose this is because a random sampling was taken from 50 different batches. No two tasks are of the same batch.

Uploads observed of up to 1.44mb during previous test. Suspended networking to see what's with this test. Is there linearity in size versus runtime?

Graphics. When launching from terminal with sudo boincmgr, the ugm graphics window stays up, with the counting progress bar. Also tested graphics for the one cep2 running, whilst at it. Nothing at all, not even a brief blip of a window outline. The app was recently upgraded to 700. The agent is latest berkeley test build 7.4.22.
[Oct 12, 2014 8:38:17 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1323
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

The downloads per task were pretty hefty, up to 10mb for the txt file, each task with it's own. Suppose this is because a random sampling was taken from 50 different batches. No two tasks are of the same batch.

I got 8 tasks on 1 machine and received 13 beta19.*txt files for them.
This 13 files are used in the 8 tasks to compare sequences, where always one _a_.txt is compared with one _b_.txt

13 files:
beta19.ugm1_ugm1_00002_a_0009.txt
beta19.ugm1_ugm1_00002_a_0014.txt
beta19.ugm1_ugm1_00002_a_0020.txt
beta19.ugm1_ugm1_00002_a_0022.txt
beta19.ugm1_ugm1_00002_a_0023.txt
beta19.ugm1_ugm1_00002_b_0006.txt
beta19.ugm1_ugm1_00002_b_0013.txt
beta19.ugm1_ugm1_00002_b_0020.txt
beta19.ugm1_ugm1_00002_b_0027.txt
beta19.ugm1_ugm1_00004_a_0010.txt
beta19.ugm1_ugm1_00004_b_0027.txt
beta19.ugm1_ugm1_00033_a_0026.txt
beta19.ugm1_ugm1_00033_b_0020.txt

ugm1_ugm1_00002_a_0020.txt compared to ugm1_ugm1_00002_b_0020.txt
ugm1_ugm1_00002_a_0020.txt compared to ugm1_ugm1_00002_b_0006.txt
ugm1_ugm1_00002_a_0023.txt compared to ugm1_ugm1_00002_b_0020.txt
ugm1_ugm1_00002_a_0009.txt compared to ugm1_ugm1_00002_b_0027.txt
ugm1_ugm1_00002_a_0014.txt compared to ugm1_ugm1_00002_b_0013.txt
ugm1_ugm1_00002_a_0022.txt compared to ugm1_ugm1_00002_b_0006.txt
ugm1_ugm1_00004_a_0010.txt compared to ugm1_ugm1_00004_b_0027.txt
ugm1_ugm1_00033_a_0026.txt compared to ugm1_ugm1_00033_b_0020.txt
[Oct 12, 2014 9:16:54 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7697
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

Got several of these on various flavors of Linux Mint. So far on this batch all have turned valid even though this message appears, sometimes more than once.
BETA_ ugm1_ ugm1_ 00032_ 0128_ 0--
...
26000 query sequences compared.
26500 query sequences compared.
22:28:39 (20644): No heartbeat from client for 30 sec - exiting
22:28:39 (20644): timer handler: client dead, exiting
Checkpoint restored: 26894
27000 query sequences compared.
22:29:44 (20958): No heartbeat from client for 30 sec - exiting
22:29:44 (20958): timer handler: client dead, exiting
Checkpoint restored: 27010
22:31:45 (20962): No heartbeat from client for 30 sec - exiting
22:31:45 (20962): timer handler: client dead, exiting
Checkpoint restored: 27125
27500 query sequences compared.
28000 query sequences compared.
...

Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Oct 12, 2014 9:34:54 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

Strange. I reported earlier that the first batch of beta jobs for this new project ran hot on my lappie, but since then I didn't notice any repeat until today's batch, which is also running hot. Maybe the new build has picked up more "efficient" libraries as a side effect?

If anyone is interested it's one of these:

Processors: 2, GenuineIntel, Intel(R) Core(TM)2 Duo CPU T9500 @ 2.60GHz [Family 6 Model 23 Stepping 6]
fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 nx lm vmx tm2 pbe
[Oct 12, 2014 10:31:48 PM]   Link   Report threatening or abusive post: please login first  Go to top 
ThreadRipper
Veteran Cruncher
Sweden
Joined: Apr 26, 2007
Post Count: 1322
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

All tasks received where BOINC-data is on a RAM-disk, crash directly after the start.
CEP2's are running fine from that folder.

ERROR: could not initialize graphics pointer in shared memory.

BETA_ betaugm1_ ugm1_ 00036_ 0303_ 0-- 3166874 Error 10/12/14 20:12:16 10/12/14 20:13:37 0.00 / 0.00 0.0 / 0.0
BETA_ betaugm1_ ugm1_ 00036_ 0291_ 0-- 3166874 Error 10/12/14 20:12:16 10/12/14 20:13:37 0.00 / 0.00 0.0 / 0.0
BETA_ betaugm1_ ugm1_ 00036_ 0219_ 0-- 3166874 Error 10/12/14 20:12:16 10/12/14 20:13:37 0.00 / 0.00 0.0 / 0.0
BETA_ betaugm1_ ugm1_ 00036_ 0147_ 1-- 3166874 Error 10/12/14 20:12:16 10/12/14 20:13:37 0.00 / 0.00 0.0 / 0.0
BETA_ betaugm1_ ugm1_ 00032_ 0601_ 1-- 3166874 Error 10/12/14 19:51:07 10/12/14 19:56:34 0.00 / 0.00 0.0 / 0.0


Interesting...I have finally received some beta WUs (6) on my main machine and I am also running RAM-disk. I shall see how they behave once they start...which they should do within the next few hours.
----------------------------------------

Join The International Team: https://www.worldcommunitygrid.org/team/viewTeamInfo.do?teamId=CK9RP1BKX1

AMD TR2990WX @ PBO, 64GB Quad 3200MHz 14-17-17-17-1T, RX6900XT @ Stock
AMD 3800X @ PBO
AMD 2700X @ 4GHz
[Oct 12, 2014 10:35:00 PM]   Link   Report threatening or abusive post: please login first  Go to top 
KWSN - A Shrubbery
Master Cruncher
Joined: Jan 8, 2006
Post Count: 1585
Status: Offline
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

The only batch that appears to have unpredictable run times is 0025. At 4 hours, the fastest is completing, the slowest is at 20% all on the same machine.

All other batches appear to be consistent.
----------------------------------------

Distributed computing volunteer since September 27, 2000
[Oct 12, 2014 11:38:18 PM]   Link   Report threatening or abusive post: please login first  Go to top 
yoro42
Ace Cruncher
United States
Joined: Feb 19, 2011
Post Count: 8979
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

28 still running.

3 Pending Validation:
BETA_ betaugm1_ ugm1_ 00015_ 0773_ 1-- Coltrane Pending Validation 10/12/14 19:30:32 10/13/14 01:48:03 3.02 / 3.79 103.7 / 0.0
BETA_ betaugm1_ ugm1_ 00015_ 0557_ 1-- Coltrane Pending Validation 10/12/14 19:30:32 10/13/14 01:11:47 1.65 / 1.98 54.2 / 0.0
BETA_ betaugm1_ ugm1_ 00015_ 0737_ 0-- Coltrane Pending Validation 10/12/14 19:30:32 10/13/14 00:34:18 1.49 / 1.88 51.3 / 0.0

1 error:

BETA_ betaugm1_ ugm1_ 00015_ 0593_ 1-- Coltrane Error 10/12/14 19:30:32 10/13/14 01:09:02 1.67 / 2.05 56.2 / 0.0

Result Log

Result Name: BETA_ betaugm1_ ugm1_ 00015_ 0593_ 1--
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Unable to open checkpoint file starting from 0
500 query sequences compared.
1000 query sequences compared.
1500 query sequences compared.
2000 query sequences compared.
2500 query sequences compared.
3000 query sequences compared.
3500 query sequences compared.
4000 query sequences compared.
4500 query sequences compared.
5000 query sequences compared.
5500 query sequences compared.
6000 query sequences compared.
6500 query sequences compared.
7000 query sequences compared.
7500 query sequences compared.
8000 query sequences compared.
8500 query sequences compared.
9000 query sequences compared.
9500 query sequences compared.
10000 query sequences compared.
10500 query sequences compared.
11000 query sequences compared.
11500 query sequences compared.
12000 query sequences compared.
12500 query sequences compared.
13000 query sequences compared.
13500 query sequences compared.
14000 query sequences compared.
14500 query sequences compared.
15000 query sequences compared.
15500 query sequences compared.
16000 query sequences compared.
16500 query sequences compared.
17000 query sequences compared.
17500 query sequences compared.
Run complete, CPU time: 6009.642123
18:07:08 (24092): called boinc_finish

</stderr_txt>
<message>
finish file present too long
</message>
]]>
----------------------------------------

----------------------------------------
[Edit 1 times, last edit by yoro42 at Oct 13, 2014 2:43:34 AM]
[Oct 13, 2014 2:17:58 AM]   Link   Report threatening or abusive post: please login first  Go to top 
rbotterb
Senior Cruncher
United States
Joined: Jul 21, 2005
Post Count: 401
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

I actually got 4 WUs on this round (Finally after missing out on the last couple rounds!). I tried starting, stopping, restarting again all four - looks to be behaving OK. One WU already has completed - in PV mode waiting for wingman to complete. The other three are running on my 4 CPs on my laptop. They don't seem to be running too hot on my laptop (HP dv7 4 core, 6 GB memory), at least not any more so than the MCM1 WUs I'm now normally crunching at night and weekends.
[Oct 13, 2014 3:30:59 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1323
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

The next four tasks of batch 11 are behaving strange IMO.
No time left displayed anymore and no checkpoints made (Windows)

World Community Grid	BETA_betaugm1_ugm1_00011_0489_0	7.22 Beta Test	09:33:38 (09:33:31)	78,843	-	16 Oct 21:26:20	Running	100,0	[0] 09:33:31	51.78 MB	10.38 MB	VM1	
World Community Grid BETA_betaugm1_ugm1_00011_0748_0 7.22 Beta Test 09:33:13 (09:33:03) 78,820 - 16 Oct 21:26:20 Running 100,0 [0] 09:33:03 52.58 MB 12.01 MB VM1
World Community Grid BETA_betaugm1_ugm1_00011_0293_0 7.22 Beta Test 09:32:03 (09:32:00) 78,752 - 16 Oct 21:26:20 Running 100,0 [0] 09:32:00 51.78 MB 10.38 MB VM1
World Community Grid BETA_betaugm1_ugm1_00011_0503_0 7.22 Beta Test 09:30:38 (09:30:33) 78,670 - 16 Oct 21:26:20 Running 100,0 [0] 09:30:33 51.78 MB 11.67 MB VM1

Edit: The same for 1 task of batch 12 on a Linux machine:

World Community Grid BETA_betaugm1_ugm1_00012_0069_1 7.22 Beta Test 09:34:28 (09:34:29) 87,477 - 16 Oct 21:26:30 Running 100,0 [0] 09:34:29 36.00 MB 4.58 MB VM3
----------------------------------------
[Edit 1 times, last edit by Crystal Pellet at Oct 13, 2014 5:25:50 AM]
[Oct 13, 2014 5:22:36 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

8 caught before uploading, network was suspended:
BETA_betaugm1_ugm1_00019_0615_0_0 0.000 2204.40 K 00:00:00 0.00 Kbps Upload pending
BETA_betaugm1_ugm1_00024_0286_0_0 0.000 73.93 K 00:00:00 0.00 Kbps Upload pending
BETA_betaugm1_ugm1_00024_0027_1_0 0.000 167.33 K 00:00:00 0.00 Kbps Upload pending
BETA_betaugm1_ugm1_00008_0776_0_0 0.000 228.55 K 00:00:00 0.00 Kbps Upload pending
BETA_betaugm1_ugm1_00008_0720_1_0 0.000 356.94 K 00:00:00 0.00 Kbps Upload pending
BETA_betaugm1_ugm1_00008_0629_1_0 0.000 266.59 K 00:00:00 0.00 Kbps Upload pending
BETA_betaugm1_ugm1_00008_0671_1_0 0.000 304.73 K 00:00:00 0.00 Kbps Upload pending
BETA_betaugm1_ugm1_00008_0741_1_0 0.000 278.54 K 00:00:00 0.00 Kbps Upload pending

6 without time left, cpu time and progress percent still incrementing, same as previously reported, for these no more checkpoint debug entries in event log.
7.22 beta19 BETA_betaugm1_ugm1_00012_0377_0 10:07:40 (09:54:00) 97.75 77.811 02:53:17 12-Oct-2014 9:27:20 PM 03d,13:49:27 Running [0] 09:54:00 48.88 MB 8.64 MB
7.22 beta19 BETA_betaugm1_ugm1_00011_0272_1 10:07:40 (09:56:17) 98.13 77.811 02:53:17 12-Oct-2014 9:25:11 PM 03d,13:47:19 Running [0] 09:56:17 48.96 MB 8.33 MB
7.22 beta19 BETA_betaugm1_ugm1_00011_0125_0 10:07:40 (09:58:32) 98.50 77.811 02:53:17 12-Oct-2014 9:25:11 PM 03d,13:47:19 Running [0] 09:58:32 49.28 MB 7.92 MB
7.22 beta19 BETA_betaugm1_ugm1_00012_0727_0 10:09:55 (10:03:33) 98.96 87.193 - 12-Oct-2014 9:27:42 PM 03d,13:47:58 Running [0] 10:03:33 47.66 MB 4.10 MB
7.22 beta19 BETA_betaugm1_ugm1_00011_0622_0 10:10:21 (10:04:10) 98.99 87.212 - 12-Oct-2014 9:25:32 PM 03d,13:45:48 Running [0] 10:04:10 47.67 MB 3.64 MB
7.22 beta19 BETA_betaugm1_ugm1_00011_0069_1 10:10:08 (10:03:48) 98.96 87.202 - 12-Oct-2014 9:25:32 PM 03d,13:45:48 Running [0] 10:03:48 48.40 MB 4.41 MB

Do these actually complete when hitting 100 percent?

Suspended 1 of these with laim off, and let it sit a little. Then resumed it. No progress or time was lost, no regression to last checkpoint. Is it really doing anything?

13-Oct-2014 8:03:01 AM task BETA_betaugm1_ugm1_00011_0069_1 suspended by user
13-Oct-2014 8:03:01 AM Starting task MCM1_0008230_0143_1
13-Oct-2014 8:03:50 AM task MCM1_0008230_0753_0 resumed by user
13-Oct-2014 8:03:50 AM task BETA_betaugm1_ugm1_00011_0069_1 resumed by user

No heartbeat issues for ubuntu, but this one is as discussed twisted ethernet cable connected to a windows node, no wifi. The second of the 2 heartbeat related lines is only found to be reported at cpdn and the apparently now deceased qcn.

22:28:39 (20644): No heartbeat from client for 30 sec - exiting
22:28:39 (20644): timer handler: client dead, exiting

It's new these tasks do not die permanently, run to end and go valid. A enhanced piece of code in latest wrapper/api or agent? For cep2 it was fatal on linux in past.
[Oct 13, 2014 6:11:03 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 148   Pages: 15   [ Previous Page | 5 6 7 8 9 10 11 12 13 14 | Next Page ]
[ Jump to Last Post ]
Post new Thread