Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 148
|
![]() |
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Got them running under Ubuntu 14.04 lts. Checkpointing appears to adhere to the setting of 120 seconds at least, one being made shortly after that interval. Good for booting.
Runtimes, the 0622, 0069, 0727 seem to be heading for 5 hours if percent progress is accurate. 0454 going for 1:40, 0300 for 1:18. The downloads per task were pretty hefty, up to 10mb for the txt file, each task with it's own. Suppose this is because a random sampling was taken from 50 different batches. No two tasks are of the same batch. Uploads observed of up to 1.44mb during previous test. Suspended networking to see what's with this test. Is there linearity in size versus runtime? Graphics. When launching from terminal with sudo boincmgr, the ugm graphics window stays up, with the counting progress bar. Also tested graphics for the one cep2 running, whilst at it. Nothing at all, not even a brief blip of a window outline. The app was recently upgraded to 700. The agent is latest berkeley test build 7.4.22. |
||
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1323 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
The downloads per task were pretty hefty, up to 10mb for the txt file, each task with it's own. Suppose this is because a random sampling was taken from 50 different batches. No two tasks are of the same batch. I got 8 tasks on 1 machine and received 13 beta19.*txt files for them. This 13 files are used in the 8 tasks to compare sequences, where always one _a_.txt is compared with one _b_.txt 13 files: beta19.ugm1_ugm1_00002_a_0009.txt beta19.ugm1_ugm1_00002_a_0014.txt beta19.ugm1_ugm1_00002_a_0020.txt beta19.ugm1_ugm1_00002_a_0022.txt beta19.ugm1_ugm1_00002_a_0023.txt beta19.ugm1_ugm1_00002_b_0006.txt beta19.ugm1_ugm1_00002_b_0013.txt beta19.ugm1_ugm1_00002_b_0020.txt beta19.ugm1_ugm1_00002_b_0027.txt beta19.ugm1_ugm1_00004_a_0010.txt beta19.ugm1_ugm1_00004_b_0027.txt beta19.ugm1_ugm1_00033_a_0026.txt beta19.ugm1_ugm1_00033_b_0020.txt ugm1_ugm1_00002_a_0020.txt compared to ugm1_ugm1_00002_b_0020.txt ugm1_ugm1_00002_a_0020.txt compared to ugm1_ugm1_00002_b_0006.txt ugm1_ugm1_00002_a_0023.txt compared to ugm1_ugm1_00002_b_0020.txt ugm1_ugm1_00002_a_0009.txt compared to ugm1_ugm1_00002_b_0027.txt ugm1_ugm1_00002_a_0014.txt compared to ugm1_ugm1_00002_b_0013.txt ugm1_ugm1_00002_a_0022.txt compared to ugm1_ugm1_00002_b_0006.txt ugm1_ugm1_00004_a_0010.txt compared to ugm1_ugm1_00004_b_0027.txt ugm1_ugm1_00033_a_0026.txt compared to ugm1_ugm1_00033_b_0020.txt |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7697 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Got several of these on various flavors of Linux Mint. So far on this batch all have turned valid even though this message appears, sometimes more than once.
----------------------------------------BETA_ ugm1_ ugm1_ 00032_ 0128_ 0-- ... 26000 query sequences compared. 26500 query sequences compared. 22:28:39 (20644): No heartbeat from client for 30 sec - exiting 22:28:39 (20644): timer handler: client dead, exiting Checkpoint restored: 26894 27000 query sequences compared. 22:29:44 (20958): No heartbeat from client for 30 sec - exiting 22:29:44 (20958): timer handler: client dead, exiting Checkpoint restored: 27010 22:31:45 (20962): No heartbeat from client for 30 sec - exiting 22:31:45 (20962): timer handler: client dead, exiting Checkpoint restored: 27125 27500 query sequences compared. 28000 query sequences compared. ... Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Strange. I reported earlier that the first batch of beta jobs for this new project ran hot on my lappie, but since then I didn't notice any repeat until today's batch, which is also running hot. Maybe the new build has picked up more "efficient" libraries as a side effect?
If anyone is interested it's one of these: Processors: 2, GenuineIntel, Intel(R) Core(TM)2 Duo CPU T9500 @ 2.60GHz [Family 6 Model 23 Stepping 6] fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 nx lm vmx tm2 pbe |
||
|
ThreadRipper
Veteran Cruncher Sweden Joined: Apr 26, 2007 Post Count: 1322 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
All tasks received where BOINC-data is on a RAM-disk, crash directly after the start. CEP2's are running fine from that folder. ERROR: could not initialize graphics pointer in shared memory. BETA_ betaugm1_ ugm1_ 00036_ 0303_ 0-- 3166874 Error 10/12/14 20:12:16 10/12/14 20:13:37 0.00 / 0.00 0.0 / 0.0 BETA_ betaugm1_ ugm1_ 00036_ 0291_ 0-- 3166874 Error 10/12/14 20:12:16 10/12/14 20:13:37 0.00 / 0.00 0.0 / 0.0 BETA_ betaugm1_ ugm1_ 00036_ 0219_ 0-- 3166874 Error 10/12/14 20:12:16 10/12/14 20:13:37 0.00 / 0.00 0.0 / 0.0 BETA_ betaugm1_ ugm1_ 00036_ 0147_ 1-- 3166874 Error 10/12/14 20:12:16 10/12/14 20:13:37 0.00 / 0.00 0.0 / 0.0 BETA_ betaugm1_ ugm1_ 00032_ 0601_ 1-- 3166874 Error 10/12/14 19:51:07 10/12/14 19:56:34 0.00 / 0.00 0.0 / 0.0 Interesting...I have finally received some beta WUs (6) on my main machine and I am also running RAM-disk. I shall see how they behave once they start...which they should do within the next few hours. ![]() Join The International Team: https://www.worldcommunitygrid.org/team/viewTeamInfo.do?teamId=CK9RP1BKX1 AMD TR2990WX @ PBO, 64GB Quad 3200MHz 14-17-17-17-1T, RX6900XT @ Stock AMD 3800X @ PBO AMD 2700X @ 4GHz |
||
|
KWSN - A Shrubbery
Master Cruncher Joined: Jan 8, 2006 Post Count: 1585 Status: Offline |
The only batch that appears to have unpredictable run times is 0025. At 4 hours, the fastest is completing, the slowest is at 20% all on the same machine.
----------------------------------------All other batches appear to be consistent. ![]() Distributed computing volunteer since September 27, 2000 |
||
|
yoro42
Ace Cruncher United States Joined: Feb 19, 2011 Post Count: 8979 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
28 still running.
----------------------------------------3 Pending Validation: BETA_ betaugm1_ ugm1_ 00015_ 0773_ 1-- Coltrane Pending Validation 10/12/14 19:30:32 10/13/14 01:48:03 3.02 / 3.79 103.7 / 0.0 BETA_ betaugm1_ ugm1_ 00015_ 0557_ 1-- Coltrane Pending Validation 10/12/14 19:30:32 10/13/14 01:11:47 1.65 / 1.98 54.2 / 0.0 BETA_ betaugm1_ ugm1_ 00015_ 0737_ 0-- Coltrane Pending Validation 10/12/14 19:30:32 10/13/14 00:34:18 1.49 / 1.88 51.3 / 0.0 1 error: BETA_ betaugm1_ ugm1_ 00015_ 0593_ 1-- Coltrane Error 10/12/14 19:30:32 10/13/14 01:09:02 1.67 / 2.05 56.2 / 0.0 Result Log Result Name: BETA_ betaugm1_ ugm1_ 00015_ 0593_ 1-- <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> Unable to open checkpoint file starting from 0 500 query sequences compared. 1000 query sequences compared. 1500 query sequences compared. 2000 query sequences compared. 2500 query sequences compared. 3000 query sequences compared. 3500 query sequences compared. 4000 query sequences compared. 4500 query sequences compared. 5000 query sequences compared. 5500 query sequences compared. 6000 query sequences compared. 6500 query sequences compared. 7000 query sequences compared. 7500 query sequences compared. 8000 query sequences compared. 8500 query sequences compared. 9000 query sequences compared. 9500 query sequences compared. 10000 query sequences compared. 10500 query sequences compared. 11000 query sequences compared. 11500 query sequences compared. 12000 query sequences compared. 12500 query sequences compared. 13000 query sequences compared. 13500 query sequences compared. 14000 query sequences compared. 14500 query sequences compared. 15000 query sequences compared. 15500 query sequences compared. 16000 query sequences compared. 16500 query sequences compared. 17000 query sequences compared. 17500 query sequences compared. Run complete, CPU time: 6009.642123 18:07:08 (24092): called boinc_finish </stderr_txt> <message> finish file present too long </message> ]]> ![]() [Edit 1 times, last edit by yoro42 at Oct 13, 2014 2:43:34 AM] |
||
|
rbotterb
Senior Cruncher United States Joined: Jul 21, 2005 Post Count: 401 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I actually got 4 WUs on this round (Finally after missing out on the last couple rounds!). I tried starting, stopping, restarting again all four - looks to be behaving OK. One WU already has completed - in PV mode waiting for wingman to complete. The other three are running on my 4 CPs on my laptop. They don't seem to be running too hot on my laptop (HP dv7 4 core, 6 GB memory), at least not any more so than the MCM1 WUs I'm now normally crunching at night and weekends.
|
||
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1323 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
The next four tasks of batch 11 are behaving strange IMO.
----------------------------------------No time left displayed anymore and no checkpoints made (Windows) World Community Grid BETA_betaugm1_ugm1_00011_0489_0 7.22 Beta Test 09:33:38 (09:33:31) 78,843 - 16 Oct 21:26:20 Running 100,0 [0] 09:33:31 51.78 MB 10.38 MB VM1 Edit: The same for 1 task of batch 12 on a Linux machine: World Community Grid BETA_betaugm1_ugm1_00012_0069_1 7.22 Beta Test 09:34:28 (09:34:29) 87,477 - 16 Oct 21:26:30 Running 100,0 [0] 09:34:29 36.00 MB 4.58 MB VM3 [Edit 1 times, last edit by Crystal Pellet at Oct 13, 2014 5:25:50 AM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
8 caught before uploading, network was suspended:
BETA_betaugm1_ugm1_00019_0615_0_0 0.000 2204.40 K 00:00:00 0.00 Kbps Upload pending BETA_betaugm1_ugm1_00024_0286_0_0 0.000 73.93 K 00:00:00 0.00 Kbps Upload pending BETA_betaugm1_ugm1_00024_0027_1_0 0.000 167.33 K 00:00:00 0.00 Kbps Upload pending BETA_betaugm1_ugm1_00008_0776_0_0 0.000 228.55 K 00:00:00 0.00 Kbps Upload pending BETA_betaugm1_ugm1_00008_0720_1_0 0.000 356.94 K 00:00:00 0.00 Kbps Upload pending BETA_betaugm1_ugm1_00008_0629_1_0 0.000 266.59 K 00:00:00 0.00 Kbps Upload pending BETA_betaugm1_ugm1_00008_0671_1_0 0.000 304.73 K 00:00:00 0.00 Kbps Upload pending BETA_betaugm1_ugm1_00008_0741_1_0 0.000 278.54 K 00:00:00 0.00 Kbps Upload pending 6 without time left, cpu time and progress percent still incrementing, same as previously reported, for these no more checkpoint debug entries in event log. 7.22 beta19 BETA_betaugm1_ugm1_00012_0377_0 10:07:40 (09:54:00) 97.75 77.811 02:53:17 12-Oct-2014 9:27:20 PM 03d,13:49:27 Running [0] 09:54:00 48.88 MB 8.64 MB 7.22 beta19 BETA_betaugm1_ugm1_00011_0272_1 10:07:40 (09:56:17) 98.13 77.811 02:53:17 12-Oct-2014 9:25:11 PM 03d,13:47:19 Running [0] 09:56:17 48.96 MB 8.33 MB 7.22 beta19 BETA_betaugm1_ugm1_00011_0125_0 10:07:40 (09:58:32) 98.50 77.811 02:53:17 12-Oct-2014 9:25:11 PM 03d,13:47:19 Running [0] 09:58:32 49.28 MB 7.92 MB 7.22 beta19 BETA_betaugm1_ugm1_00012_0727_0 10:09:55 (10:03:33) 98.96 87.193 - 12-Oct-2014 9:27:42 PM 03d,13:47:58 Running [0] 10:03:33 47.66 MB 4.10 MB 7.22 beta19 BETA_betaugm1_ugm1_00011_0622_0 10:10:21 (10:04:10) 98.99 87.212 - 12-Oct-2014 9:25:32 PM 03d,13:45:48 Running [0] 10:04:10 47.67 MB 3.64 MB 7.22 beta19 BETA_betaugm1_ugm1_00011_0069_1 10:10:08 (10:03:48) 98.96 87.202 - 12-Oct-2014 9:25:32 PM 03d,13:45:48 Running [0] 10:03:48 48.40 MB 4.41 MB Do these actually complete when hitting 100 percent? Suspended 1 of these with laim off, and let it sit a little. Then resumed it. No progress or time was lost, no regression to last checkpoint. Is it really doing anything? 13-Oct-2014 8:03:01 AM task BETA_betaugm1_ugm1_00011_0069_1 suspended by user 13-Oct-2014 8:03:01 AM Starting task MCM1_0008230_0143_1 13-Oct-2014 8:03:50 AM task MCM1_0008230_0753_0 resumed by user 13-Oct-2014 8:03:50 AM task BETA_betaugm1_ugm1_00011_0069_1 resumed by user No heartbeat issues for ubuntu, but this one is as discussed twisted ethernet cable connected to a windows node, no wifi. The second of the 2 heartbeat related lines is only found to be reported at cpdn and the apparently now deceased qcn. 22:28:39 (20644): No heartbeat from client for 30 sec - exiting 22:28:39 (20644): timer handler: client dead, exiting It's new these tasks do not die permanently, run to end and go valid. A enhanced piece of code in latest wrapper/api or agent? For cep2 it was fatal on linux in past. |
||
|
|
![]() |