Total posts in this thread: 79
This topic has been viewed 5309 times and has 78 replies
BarryAZ
Cruncher
United States
Joined: Mar 28, 2006
Post Count: 19
Status: Offline
Re: Server maintenance??

Ah -- OK -- so we can expect things to return to life in a few hours then. Still in 'maintenance' after several hours.
[Jan 9, 2013 7:25:17 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Server maintenance??

Just noticed I wasn't crunching any tasks, and I was wondering what was going on. This is why forums are helpful; I am just glad nothing got messed up in my settings.
[Jan 9, 2013 7:42:54 PM]
cjslman
Master Cruncher
Mexico
Joined: Nov 23, 2004
Post Count: 2082
Status: Offline
Re: Server maintenance??

Yeap... I'm seeing the same (tasks ready to report that are not moving off the BOINC manager task list and VUs that are in Pending Validation status).

knreed: thnx for the post... hope things get straightened out soon.

CJSL

Crunching for a better world...
----------------------------------------
I follow the Gimli philosophy: "Keep breathing. That's the key. Breathe."
Join The Cahuamos Team


[Jan 9, 2013 7:51:15 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Server maintenance??

MStenholm, (in reply to a removed post)

The reason you only get part of a day's work buffered for GPUs, even when your setting is 4 days, is that WCG has set a ceiling on the number of WUs allowed "in progress" per device. The GPU limit is about 4000; the CPU limit used to be about 80 per processor... don't know what it is today... subject to change without notice. On the CPU side I have 3 days' worth on my devices, but that's just 156 CPU tasks on my Octo, for instance.
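
To make the ceiling concrete, here is a minimal sketch of how an "in progress" cap can cut a requested buffer short. It is an illustration only, not WCG's actual scheduler logic: the function name and the tasks-per-day throughput are assumptions, while the 4-day setting and the roughly 4000-task GPU limit come from this thread.

```python
def effective_buffer_days(requested_days, tasks_per_day, in_progress_cap):
    """Days of work actually held when a per-device task cap applies.

    tasks_per_day is the device's throughput (hypothetical input);
    in_progress_cap is the server-side ceiling on tasks "in progress".
    """
    if tasks_per_day <= 0:
        raise ValueError("tasks_per_day must be positive")
    wanted_tasks = requested_days * tasks_per_day
    granted_tasks = min(wanted_tasks, in_progress_cap)
    return granted_tasks / tasks_per_day


# Hypothetical fast GPU rig: 24,000 tasks per day, 4-day setting, cap of 4000.
days = effective_buffer_days(4, 24_000, 4_000)
print(f"Buffer lasts about {days * 24:.1f} hours")
```

With a throughput that high, the cap rather than the 4-day setting decides how long the buffer lasts; a slower device would never hit the cap and would get its full setting.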
[Jan 9, 2013 8:08:39 PM]
MStenholm
Advanced Cruncher
Denmark
Joined: Jan 7, 2010
Post Count: 97
Status: Offline
Re: Server maintenance??

Even folding@home is more reliable... and people with boots in both camps would know that says a lot. The nice feature of BOINC, the buffer, simply doesn't work for more than a 3-4 hour stop. A four-day setting (minimum buffer) lasts less than 4 hours for me (both 7.0.42 and the older 7 version). Set and leave is no longer possible with the GPU work.
[Jan 9, 2013 8:14:50 PM]
branjo
Master Cruncher
Slovakia
Joined: Jun 29, 2012
Post Count: 1892
Status: Offline
Re: Server maintenance??

MStenholm, (in reply to a removed post)

The reason you only get part of a day's work buffered for GPUs, even when your setting is 4 days, is that WCG has set a ceiling on the number of WUs allowed "in progress" per device. The GPU limit is about 4000; the CPU limit used to be about 80 per processor... don't know what it is today... subject to change without notice. On the CPU side I have 3 days' worth on my devices, but that's just 156 CPU tasks on my Octo, for instance.


My (OK, not mine, but my rig's) GPU limit is 600 (which is approx. 1 to 1.5 days).

Cheers
----------------------------------------

Crunching@Home since January 13 2000. Shrubbing@Home since January 5 2006

----------------------------------------
[Edit 1 times, last edit by branjo at Jan 9, 2013 8:39:41 PM]
[Jan 9, 2013 8:39:07 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Server maintenance??

We have started things up again. Unfortunately we didn't get as much performance increase as we hoped. We have ordered additional memory for the servers which we expect to get installed in the next 2-3 weeks. -- knreed [Jan 9, 2013 8:45:26 PM] post
I have reserved memory-hardware within arm's reach as a contingency, just in case my machines need it. I have a dozen of those memory-sticks. The specs of that memory are amazing: 4-TBytes per stick, DDR-version-10, octo-channel, 1.2-Terahertz, 1-mWatt power-consumption. Just give me a call if you need some...

Seriously now, what timeframe are we looking at for ramping things back up to normal operational speed system-wide -- 2 to 3 weeks?
;
; andzgridPost#799
;
[Jan 9, 2013 9:57:52 PM]
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 1951
Status: Offline
Re: Server maintenance??

Well, while all systems seem to be able to report finished WUs and receive new ones again, is it possible that someone still needs to kick the half-daily stats update cycle on the server(s) in the ****, or is this going to happen by itself at a later time?

EDIT: Of course, as soon as I posted the above, the stats did in fact update, though apparently only with a fraction of the WUs reported after the outage...

Ralf
----------------------------------------

----------------------------------------
[Edit 2 times, last edit by TPCBF at Jan 10, 2013 1:30:29 AM]
[Jan 10, 2013 1:25:50 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Server maintenance??

Hi.

I take it that this message has to do with the current problems; I never got this one before.

Thu 10 Jan 2013 16:15:38 EST | World Community Grid | Sending scheduler request: To report completed tasks.
Thu 10 Jan 2013 16:15:38 EST | World Community Grid | Reporting 2 completed tasks, not requesting new tasks
Thu 10 Jan 2013 16:15:42 EST | | Project communication failed: attempting access to reference site
Thu 10 Jan 2013 16:15:42 EST | World Community Grid | Scheduler request failed: Unrecognized or bad HTTP Content or Transfer-Encoding
Thu 10 Jan 2013 16:15:43 EST | | Internet access OK - project servers may be temporarily down.
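
For anyone who wants to scan their own event log for this pattern, here is a rough sketch. It assumes each line has the three pipe-separated fields seen above (timestamp, project, message), with the project field left empty for client-wide messages; the function names are made up for illustration and are not part of BOINC.

```python
LOG = """\
Thu 10 Jan 2013 16:15:38 EST | World Community Grid | Sending scheduler request: To report completed tasks.
Thu 10 Jan 2013 16:15:38 EST | World Community Grid | Reporting 2 completed tasks, not requesting new tasks
Thu 10 Jan 2013 16:15:42 EST | | Project communication failed: attempting access to reference site
Thu 10 Jan 2013 16:15:42 EST | World Community Grid | Scheduler request failed: Unrecognized or bad HTTP Content or Transfer-Encoding
Thu 10 Jan 2013 16:15:43 EST | | Internet access OK - project servers may be temporarily down.
"""


def parse_line(line):
    """Split one log line into (timestamp, project, message) strings."""
    # Split on the first two pipes only, so the message text stays whole.
    timestamp, project, message = (part.strip() for part in line.split("|", 2))
    return timestamp, project, message


def failed_messages(log_text):
    """Return the parsed entries whose message mentions a failure."""
    entries = [parse_line(line) for line in log_text.splitlines() if line.strip()]
    return [entry for entry in entries if "failed" in entry[2].lower()]


for timestamp, project, message in failed_messages(LOG):
    print(timestamp, "|", project or "(client)", "|", message)
```

The timestamp is kept as plain text here, since time zone abbreviations such as EST do not parse reliably across platforms.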
[Jan 10, 2013 5:20:39 AM]
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Re: Server maintenance??

We have started things up again. Unfortunately we didn't get as much performance increase as we hoped. We have ordered additional memory for the servers which we expect to get installed in the next 2-3 weeks. -- knreed [Jan 9, 2013 8:45:26 PM] post
I have reserved memory-hardware within arm's reach as a contingency, just in case my machines need it. I have a dozen of those memory-sticks. The specs of that memory are amazing: 4-TBytes per stick, DDR-version-10, octo-channel, 1.2-Terahertz, 1-mWatt power-consumption. Just give me a call if you need some...

Seriously now, what timeframe are we looking at for ramping things back up to normal operational speed system-wide -- 2 to 3 weeks?
;
; andzgridPost#799
;


I've tweaked a number of things and the backlog for hcc1 validation is now only 47,000 workunits and getting smaller. The processes that delete files on the server after archiving the good results and the processes that delete records from the database are ~1,000,000 records behind but also catching up after the changes just made. The database itself is about 158GB and it is running on 64GB of RAM. We will be increasing the database server to 128GB of RAM.

The storage for the database server is on a shared SAN device with 15k RPM disks in a RAID 5 config. We will be looking things over in detail over the next few months to determine if we should get our own dedicated SAN device and put the databases on SSD drives in order to ensure high performance over the next several years. We will also be looking at replacing the db servers with more powerful servers that can have up to 256GB (or more) of RAM. However, these types of bigger hardware purchases take much more time to review, approve and install (i.e. many months).
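
As a rough back-of-the-envelope check on the figures above (a database of about 158GB, 64GB of RAM now, 128GB planned, 256GB possible on new servers), the sketch below computes the largest share of the database that could sit in memory. It assumes, optimistically, that all RAM is available for caching, which will not hold on a real server; the function name is made up for illustration.

```python
DB_SIZE_GB = 158  # approximate database size quoted in the post


def max_cached_share(ram_gb, db_gb=DB_SIZE_GB):
    """Upper bound on the fraction of the database that fits in RAM."""
    return min(1.0, ram_gb / db_gb)


for ram_gb in (64, 128, 256):
    share = max_cached_share(ram_gb)
    print(f"{ram_gb}GB RAM: at most {share:.0%} of the database in memory")
```

By this crude measure the planned upgrade moves the server from holding well under half of the database in memory to holding most of it, which is why the remaining reads from the shared SAN should drop noticeably.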
[Jan 10, 2013 4:49:47 PM]