Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 3319
|
![]() |
Author |
|
knreed
Former World Community Grid Tech Joined: Nov 8, 2004 Post Count: 4504 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Updated stats:
----------------------------------------Average Generation: 80.6 Pace (average time to complete a generation): 4.25 days timestamp_first_indexed generation num_units completed_last_day I didn't like the idea of holding 2.5% of workunits before releasing the next leading generation just to keep the spread from getting too wide, so we are only holding 0.5% and will just let the spread be wider. Right now we are designating generations more than 7 behind the lead as stragglers (i.e. at the moment that is generations 78 and earlier). I will leave it here for the next week and see how things go. The next generation or two will arrive quickly but then it is should settle into a pace of about every 4.25 days. One item of interest - the generations 80-85 (the lead generation) completed 23.0% of the units that were in them as of 24 hours ago while the generations 78 and earlier completed 34.5% of the units that were in them as of 24 hours ago. This is another way of looking how sending the stragglers to reliable hosts is able to complete them faster. Generation 79 completed 28.1% of its results . [Edit 2 times, last edit by knreed at Aug 6, 2021 1:45:26 PM] |
||
|
erich56
Senior Cruncher Austria Joined: Feb 24, 2007 Post Count: 295 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
any idea when new WUs will become available ?
|
||
|
Acibant
Advanced Cruncher USA Joined: Apr 15, 2020 Post Count: 126 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() |
any idea when new WUs will become available ? They received quite a few additional resources from participants after a recent news posting asking people to increase their commitment in terms of downloaded tasks. They have very little backlog, meaning that very soon after a unit of one generation is validated it goes right out again as a newer generation unit.It's possible that if anyone has too large of a cache on their computer they could adjust that down to allow some to go to others, but beyond that it looks like most all work units will be in an assigned state from here on out. They could be more aggressive and limit more work units to machines that return quickly to make more available to that particular group but then that runs against their stated goal of not having work units waiting for available resources. ![]() |
||
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12436 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
ARP are readily available, but not necessarily immediately. Maybe in a few minutes.
Your machine is best restricted to half its threads on ARP and the rest on OPN & MCM, which are shorter duration projects. That way your machine will ask for more ARP whenever one of OPN/MCM units is uploaded. As an example, say you have an 8 thread machine, restrict ARP to 4 threads and OPN & MCM to 2 each using app_config.xml and the cache in your profile to 5, 3 & 3 respectively. That way you have a spare waiting on each. Just in case you run short of one of the projects, up one of OPN or MCM by 1 in app_config.xml. Scale those figures up or down according to the threads on your machine. Mike |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I found the wait to be as long as one hour in some instances. I would run a longer queue so that work will be available to the thread during the "drought". I have removed my machines from ARP1 due to the lack of work. Just like I predicted the lengthening distribution spread, I'm predicting that members will get tired of waiting for work and move to other projects ot sub-projects and then, in about 2 months, the work flow will drop under 15,000 WUs per day. Then they will be asking for more cores again. LIke they say, fool me once......
|
||
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12436 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
With the 8 thread machine I used as an example, with a 24 hour CPU time per unit and 4 active units, the average time between updates of ARP units would be 6 hours. One spare unit would easily cover the 1 hour wait that you describe.
Scale that up to a 24 thread machine with 12 ARP units running and 8 hours CPU time per unit, you would have an average time between updates of ARP units of 40 minutes. There you would need to have a cache of 14 to cover your 1 hour wait. So in my example for an 8 thread machine I suggested a cache of 5 ARP units, scaling that up for a 24 thread machine would give you a cache of 15 ARP units, so 3 spare which would take 2 hours to clear, double your maximum waiting time. Where is the downtime in that? Given that the remaining threads would be on OPN and/or MCM, with spares, you have double protection from downtime. The OPN/MCM units would also be creating frequent updates which would also call for ARP units, so you should have no problem. Mike |
||
|
knreed
Former World Community Grid Tech Joined: Nov 8, 2004 Post Count: 4504 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I found the wait to be as long as one hour in some instances. I would run a longer queue so that work will be available to the thread during the "drought". I have removed my machines from ARP1 due to the lack of work. Just like I predicted the lengthening distribution spread, I'm predicting that members will get tired of waiting for work and move to other projects ot sub-projects and then, in about 2 months, the work flow will drop under 15,000 WUs per day. Then they will be asking for more cores again. LIke they say, fool me once...... I'm very confused as to why you think we fooled you. One of the mistakes that I think volunteer computing has done is created this perception that work will continuously available to run and that there is some form of issue if work is intermittent. Most researchers don't operate this way. It is usually a much iterative process of run a bunch of work, analyze the data, have new ideas and run more work. However, many projects that were first attracted to volunteer computing needed much more computing power that they couldn't get elsewhere so they tended to have enormous needs for computing power and could run continuously for years. This is great and I'm happy that volunteer computing can meet that need. However, there is much larger class of research projects that have a need for a lot of computing - but not at a such a scale that they can provide continuous work. Right now in volunteer computing this tends to be viewed as a "problem" rather than the norm. WCG with our multiple sub-projects and BOINC as a whole is designed to support this intermittent model. You can set a preferences for multiple projects and if there currently isn't work for one, you will get it from another. That is ok and great. If you think that our African Rainfall Project, ClimatePrediction.net and Einstein@Home are the worthy projects you want to run. Great! You client will rotate through each each BOINC project asking for work and sometimes you will get a job from that project and sometimes not but overall your client will continuously do work for one of them. The same thing could be achieved just within WCG. You could sign up for African Rainfall Project, Help Stop TB and Smash Childhood Cancer (which are all intermittent project) and then check the box for "If there is no work available for the project(s) I have selected above, please send me work from another project.". If there is work for one of your preferred projects, then you will get it. If not, you will get work from Mapping Cancer Markers or OpenPandemics and your client will continuously process work. I do not see why you should unsign up from African Rainfall Project just because work is not continuously available for it. As far as the pace of the project. The biggest value to the researchers for the African Rainfall Project is in getting the full set of data for the project. The best way to measure how fast the project is running is by how fast the average generation is moving forward. Over the past two months with the work we have been doing on the backend to get the next generation back out as fast as possible as well as recruiting additional volunteers to help accelerate the project we have been able to reduce the time to move the average generation forward from 1 generation every 6 days down to 1 generation every 4.3 days. That is a huge acceleration in the project. If we had 20,000 jobs sitting around waiting for a client to request them like we had 2 months ago, that would mean that about 1/3rd of the units on the project were just waiting around and not making progress which slows the project. I'm recording the size of jobs available to send every 20 minutes and it generally shows only 100-150 jobs ready to send which is close to as low as we can get. That means that often a request for work for African Rainfall Project will get a job, but not always. This is the way it needs to be to complete the project quickest and get the full data into the hands of the researchers. This has caused the distribution of units between generations to spread out some, but this only matters at the very end where the time between the first unit to reach its final generation and the last unit to reach its final generation might be a little bit longer (maybe 3-4 weeks longer). However, if we have removed many months from the overall length of the project (which we have) then this is a huge gain for the project. Can you help me understand what it is that I don't understand about why you think we fooled you? [Edit 1 times, last edit by knreed at Aug 7, 2021 3:50:21 PM] |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7697 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Bravo Kevin. Well said and well explained. Sometimes volunteers can get quite myopic about their work and neither understand nor appreciate the work which goes on behind the scenes. Thank you for your hard work and explanations. I, for one, appreciate the information on the structure and operations of each project. Keep up the good work.
----------------------------------------Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
pwhidden
Cruncher USA Joined: Nov 17, 2004 Post Count: 32 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
My first unit from generation 086... ARP1_0011268_086_3
----------------------------------------![]() ![]() [Edit 1 times, last edit by pwhidden at Aug 7, 2021 6:15:04 PM] |
||
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12436 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Thank you, Paul.
086 indicates we are at about 47.0%, but for this month I am again assuming 2 generations behind to allow for the stragglers, so 45.9%. The latest interval is just 1.48856 days and the 10-interval average is down to 3.01736 days. The end date forecast would have been May 2022, but, based on Kevin Reed's data on the stragglers, I expect it to be about October or November 2022. I would expect the next generation to start about 10 August. Mike |
||
|
|
![]() |