Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 70
|
![]() |
Author |
|
jay_Orlando
Senior Cruncher USA Joined: Jan 4, 2006 Post Count: 181 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
BTW, here is data from BOINC startup log (from syslog) about the CPU WITH errors:
----------------------------------------
![]() |
||
|
jay_Orlando
Senior Cruncher USA Joined: Jan 4, 2006 Post Count: 181 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
and here is info about CPU WITHOUT errors:
----------------------------------------
![]() |
||
|
jay_Orlando
Senior Cruncher USA Joined: Jan 4, 2006 Post Count: 181 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Making it more readable - I hope.
----------------------------------------Here is a side-by-side difference. I am still getting errors on the AMC chip and not on the intel chip.
![]() |
||
|
jay_Orlando
Senior Cruncher USA Joined: Jan 4, 2006 Post Count: 181 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
The differences in features are:
----------------------------------------perfc and perfctr_core tsc and no tsc ![]() |
||
|
jay_Orlando
Senior Cruncher USA Joined: Jan 4, 2006 Post Count: 181 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
2 and 3 June 2021 - Update.
----------------------------------------On the AMD machine with the failing MIP1 WU, I have installed many fortran libraries and slowed down all options in the BIOS. There has been a change. Instead of failing in less than 2 or 3 minutes, the WU can run for about an hour. None complete without failing. for example(from https://www.worldcommunitygrid.org/ms/viewBoi...atus=-1&projectId=123
The error results now look like:
![]() |
||
|
jay_Orlando
Senior Cruncher USA Joined: Jan 4, 2006 Post Count: 181 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Since MIP1 is ending, I'll call it quits - except if anyone has a firm lead.
----------------------------------------T H A N K S !! Jay ![]() |
||
|
xdarma
Cruncher Joined: Oct 4, 2014 Post Count: 5 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I confirm the failure of all WUs on linux with Amd CPU.
Maybe can be helpful, thus I report the instructions of the tested CPUs: Phenom II x4 960t (K10 core): fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm 3dnowext 3dnow constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf pni monitor cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt nodeid_msr cpb hw_pstate vmmcall npt lbrv svm_lock nrip_save pausefilter FX 8300 (Piledriver core): fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 popcnt aes xsave avx f16c lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs xop skinit wdt lwp fma4 tce nodeid_msr tbm topoext perfctr_core perfctr_nb cpb hw_pstate ssbd vmmcall bmi1 arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold Ryzen 7 3700x (Zen2 core): fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3 hw_pstate sme ssbd mba sev ibpb stibp vmmcall fsgsbase bmi1 avx2 smep bmi2 cqm rdt_a rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero irperf xsaveerptr wbnoinvd arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold avic v_vmsave_vmload vgif umip rdpid overflow_recov succor smca |
||
|
adriverhoef
Master Cruncher The Netherlands Joined: Apr 3, 2009 Post Count: 2159 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Just the other night my AMD machine received this executable:
-rwxr-xr-x. 1 boinc boinc 80977064 Jul 26 20:07 wcgrid_mip1_rosetta_7.16_i686-pc-linux-gnuIt's a new machine, since two weeks in service, running all WCG projects, so also MIP1. This setup has been running without problems, until today, when I started seeing Computation Errors. They all happen to be related to the i686 binary, because all the MIP1 tasks that have been running ever since with the x86_64 MIP1 binary are Valid. There isn't any valid i686 MIP1 task so far. |
||
|
gb009761
Master Cruncher Scotland Joined: Apr 6, 2005 Post Count: 2982 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Personally adriverhoef I wouldn't worry about it - as the MIP project is coming to an end within days.
----------------------------------------![]() |
||
|
adriverhoef
Master Cruncher The Netherlands Joined: Apr 3, 2009 Post Count: 2159 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Has anybody tried removing the i686 MIP1 binary from their Linux system by linking the x86_64 MIP1 binary to it, I mean by doing this:
if cd ~boinc/projects/www.worldcommunitygrid.org/; thenI did this at home before I went to work. When I came back, all 79 returned MIP1 results during my absence turned out to be Valid and some of them thought they had been run by the i686 MIP1 binary when in fact they have been run by the x86_64 MIP1 binary. ![]() |
||
|
|
![]() |