Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 79
|
![]() |
Author |
|
BarryAZ
Cruncher United States Joined: Mar 28, 2006 Post Count: 19 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Ah -- OK -- so we can expect things to return to live in a few hours then. Still in 'maintenance' after several hours.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Just noticed I wasn't crunching any tasks, and I was wondering what was going on. This why forums are helpful, I am just glad something didn't get messed up in my settings.
|
||
|
cjslman
Master Cruncher Mexico Joined: Nov 23, 2004 Post Count: 2082 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Yeap... I'm seeing the same (tasks ready to report that are not moving off the BOINC manager task list and VUs that are in Pending Validation status).
----------------------------------------knreed: thnx for the post... hope things get straighten out soon. CJSL Crunching for a better world... |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
MStenholm, (in reply to a removed post)
Only having a part of day work buffer work for GPU's even when your setting is 4 days is, because WCG has set a ceiling of WU's allowed "in progress" for a device. The GPU limit is about 4000, the CPU used to be about 80 per processor... don't know what it is today... subject to change without notice. The CPU part I have 3 days worth on devices, but that's just 156 CPU tasks on my Octo for instance. |
||
|
MStenholm
Advanced Cruncher Denmark Joined: Jan 7, 2010 Post Count: 97 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Even folding@home is more reliable....and people with boots in both camps would know it says a lot. The nice feature of BOINC with a buffer simple doesn't work for more then a 3-4 hour stop. A four day setting (minimum buffer) last less then 4 hours for me (both 7.0.42 and the older 7 version). Set and leave is no longer possible with the GPU work.
----------------------------------------![]() |
||
|
branjo
Master Cruncher Slovakia Joined: Jun 29, 2012 Post Count: 1892 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
MStenholm, (in reply to a removed post) Only having a part of day work buffer work for GPU's even when your setting is 4 days is, because WCG has set a ceiling of WU's allowed "in progress" for a device. The GPU limit is about 4000, the CPU used to be about 80 per processor... don't know what it is today... subject to change without notice. The CPU part I have 3 days worth on devices, but that's just 156 CPU tasks on my Octo for instance. My (OK, not my, but for my rig ![]() Cheers ![]() ![]() Crunching@Home since January 13 2000. Shrubbing@Home since January 5 2006 ![]() [Edit 1 times, last edit by branjo at Jan 9, 2013 8:39:41 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
We have started things up again. Unfortunately we didn't get as much performance increase as we hoped. We have ordered additional memory for the servers which we expect to get installed in the next 2-3 weeks. -- knreed [Jan 9, 2013 8:45:26 PM] post I have reserved memory-hardware within my arm's reach as a contingency just in case my machines would need them. I have a dozen of them memory-sticks. The specs of that memory is amazing: 4-TBytes per stick, DDR-version-10, octo-channel, 1.2-Terahertz, 1-mWatt power-consumption. Just give me a call if you need some... ![]() Seriously now, what prospects are we looking at for ramping up things back to normal operational-speed system-wide -- 2 to 3 weeks? ![]() ; ; andzgridPost#799 ; |
||
|
TPCBF
Master Cruncher USA Joined: Jan 2, 2011 Post Count: 1951 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Well, while all systems seem to be able again to report finished and receive new WUs, is it possible that someone still needs to kick the half-daily stats update cycle on the server(s) in the **** or is this going to happen by itself at a later time?
----------------------------------------![]() EDIT: Of course, as soon as I posted the above, the stats did in fact update, though apparently only with a fraction of the WUs reported after the outage... ![]() Ralf ![]() [Edit 2 times, last edit by TPCBF at Jan 10, 2013 1:30:29 AM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hi.
I take it that this message has to do with the current problems, never got this one before. Thu 10 Jan 2013 16:15:38 EST | World Community Grid | Sending scheduler request: To report completed tasks. Thu 10 Jan 2013 16:15:38 EST | World Community Grid | Reporting 2 completed tasks, not requesting new tasks Thu 10 Jan 2013 16:15:42 EST | | Project communication failed: attempting access to reference site Thu 10 Jan 2013 16:15:42 EST | World Community Grid | Scheduler request failed: Unrecognized or bad HTTP Content or Transfer-Encoding Thu 10 Jan 2013 16:15:43 EST | | Internet access OK - project servers may be temporarily down. |
||
|
knreed
Former World Community Grid Tech Joined: Nov 8, 2004 Post Count: 4504 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
We have started things up again. Unfortunately we didn't get as much performance increase as we hoped. We have ordered additional memory for the servers which we expect to get installed in the next 2-3 weeks. -- knreed [Jan 9, 2013 8:45:26 PM] post I have reserved memory-hardware within my arm's reach as a contingency just in case my machines would need them. I have a dozen of them memory-sticks. The specs of that memory is amazing: 4-TBytes per stick, DDR-version-10, octo-channel, 1.2-Terahertz, 1-mWatt power-consumption. Just give me a call if you need some... ![]() Seriously now, what prospects are we looking at for ramping up things back to normal operational-speed system-wide -- 2 to 3 weeks? ![]() ; ; andzgridPost#799 ; I've tweaked a number of things and the backlog for hcc1 validation is now only 47,000 workunits and getting smaller. The processes that delete files on the server after archiving the good results and the processes that delete records from the database are ~1,000,000 records behind but also catching up after the changes just made. The database itself is about 158GB and it is running on 64GB of RAM. We will be increasing the database server to 128GB of RAM. The storage for the database server is on a shared SAN device with 15k RPM disks in a RAID 5 config. We will be looking things over in detail over the next few months to determine if we should get our own dedicated SAN device and put the databases on SDD drives in order to ensure high performance over the next several years. We will also be looking at replacing the db servers with more powerful servers that can have up to 256GB (or more) of RAM. However, these types of bigger hardware purchases take much more time to review, approve and install (i.e. many months). |
||
|
|
![]() |