News on Project Outages

Message boards : Projects : News on Project Outages
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · Next

AuthorMessage
Profile Peter
Avatar

Send message
Joined: 7 Sep 09
Posts: 167
Canada
Message 38677 - Posted: 26 Jun 2011, 12:50:13 UTC - in response to Message 38674.  

Yup -- I did that early in the week, I've other CPU projects to pick up the slack (Einstein, Spinhenge, POEM plus some others)



Edit: I forgot to add that it might be an idea to suspend Aqua in the meanwhile so Boinc doesn't waste time trying to contact Aqua's servers.




If any WU's are in the Transfers tab, they will still keep trying regardless of settings. I had hoped Boinc would suspend the project entirely.
ID: 38677 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 38686 - Posted: 26 Jun 2011, 20:09:56 UTC - in response to Message 38677.  

Correct, while one has project specific control for suspend and no new work, there is no project specific control for network access. It is something which also has been problematic for other projects in the past when they have had extended offline issues with work trying to be either reported or uploaded. (Projects like SETI and Climate come to mind, along with the now apparently terminated Dnet project).




If any WU's are in the Transfers tab, they will still keep trying regardless of settings. I had hoped Boinc would suspend the project entirely.

ID: 38686 · Report as offensive
Profile Peter
Avatar

Send message
Joined: 7 Sep 09
Posts: 167
Canada
Message 38687 - Posted: 26 Jun 2011, 20:15:12 UTC - in response to Message 38686.  

That's too bad. I was just trying to save Boinc, and my computer of course, some work, but I guess it's not really a problem.
ID: 38687 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 38694 - Posted: 28 Jun 2011, 4:29:21 UTC

I was kind of hoping to see the Aqua home page pop up today - not expecting their database to show until later in the week. But no home page today -- guess they are deep into oh my goodness mode over there.
ID: 38694 · Report as offensive
Bernd

Send message
Joined: 24 Aug 09
Posts: 91
United States
Message 38695 - Posted: 28 Jun 2011, 5:41:08 UTC

Update on The Lattice Project. They had a validation problem over the weekend. It turned out to be a problem caused by the sever crash. The validation script has been rerun on all WU in the last batch and most have been validated. They have found a bug in one of the science apps with extra long runtimes on 64 bit windows systems with AMD processors with the 64 bit science app taking up to twice as long to run. They are looking into the code for a fix before the next batch is sent out.
ID: 38695 · Report as offensive
Odicin

Send message
Joined: 26 May 11
Posts: 5
Germany
Message 38700 - Posted: 28 Jun 2011, 21:12:21 UTC

Aqua is up now with good news:

AQUA server is recovering from a serious problem
Last week the AQUA server had to be shut down because many configuration files were accidentally deleted. We are working on gradually repairing the damage, and will start submitting new work as soon as we are reasonably confident that everything is fine. This could take a few days. We believe the database is intact and no one has lost any credit. We apologize for any inconvenience this may have caused our volunteers.


Regards Odi
ID: 38700 · Report as offensive
Profile Peter
Avatar

Send message
Joined: 7 Sep 09
Posts: 167
Canada
Message 38704 - Posted: 28 Jun 2011, 21:27:03 UTC - in response to Message 38700.  

Aqua is up now with good news:

AQUA server is recovering from a serious problem
Last week the AQUA server had to be shut down because many configuration files were accidentally deleted. We are working on gradually repairing the damage, and will start submitting new work as soon as we are reasonably confident that everything is fine. This could take a few days. We believe the database is intact and no one has lost any credit. We apologize for any inconvenience this may have caused our volunteers.


Regards Odi


Good news, thanks.
ID: 38704 · Report as offensive
Profile Peter
Avatar

Send message
Joined: 7 Sep 09
Posts: 167
Canada
Message 38706 - Posted: 28 Jun 2011, 23:56:51 UTC - in response to Message 38704.  

Orbit appears to be down, not that they've had any work for ages now as they are awaiting a government grant. Maybe this is good news...?
ID: 38706 · Report as offensive
Profile Peter
Avatar

Send message
Joined: 7 Sep 09
Posts: 167
Canada
Message 38709 - Posted: 29 Jun 2011, 14:29:16 UTC - in response to Message 38706.  

..Orbit is back up and it wasn't good news...still no work.
ID: 38709 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 38712 - Posted: 29 Jun 2011, 16:05:22 UTC

Still no new work from Aqua - not a surprised -- they noted that late this week would be about the earliest for new work.

New outage -- Spinhenge -- went offline as of 7:30AM PDT -- like a number of BOINC sites, air conditioning and local power has been a problem there in the past month and might have resurfaced, there are limits to baling wire, chewing gum and duct tape.
ID: 38712 · Report as offensive
Profile Byron Leigh Hatch @ team Carl ...
Avatar

Send message
Joined: 30 Aug 05
Posts: 505
Canada
Message 38715 - Posted: 29 Jun 2011, 20:34:38 UTC

ClimatePrediction.net News and Announcements

CPDN Main Project


After a number of false starts with faulty greenhouse gas (GHG) forcing parameters the second phase of the RAPIT project has now started in earnest.
HadCM3N tasks have names in the format hadcm3n_{umid}_{start year}_40_* (where 40 is the number of model years the task runs for).
{umid} is a 4 character universal model identity, with the first character being the main indicator of the type of model being run as follows:

  • 'o', 'p' and 'q' are control models with no GHG forcing. These should continue through the resubmission processes at 1940 and 1980 (and beyond if the scientists decide that's required) with no problems.

  • at the first resubmission (hadcm3n_{umid}_1940_40_*) each successful control should, in addition, spawn a new series with a range of GHG forcing parameters. This has been the problem area, with the {umid} first character set as follows:


'r' - a large batch of workunits which all fail at the end of the first model year before a trickle is generated. Most of these have been completed but if you have a task with the name format hadcm3n_rXXX_1940_40_ it should be aborted.

's' - another large batch, this time failing at the end of the 10th model year, just before the trickle and upload file are generated. If you have a task with the name format hadcm3n_sXXX_1940_40_ it should be aborted. The workunits for these tasks have been cancelled on the server to prevent reissues.

'b' to 'i' - small test batches which will fail at the end of the first or 10th model year. These should be aborted.

'j' - a small test batch with GHG forcing brought forward from 1950 to 1941. These tasks should complete as long as the climate doesn't go wild.

'k' - a small stress test batch with highly variable GHG forcing. These tasks should also complete as long as the climate doesn't go wild.

't' - this is the large batch of work currently being generated. Tasks with the name format hadcm3n_tXXX_1940_40_ should complete as long as the climate doesn't go wild.

A small number of hadcm3n_tXXX_1980_40_ workunits have been generated from 'o' series tasks which have already completed the second phase. This shouldn't have happened (the _tXXX_1980_ batch should be a continuation of the _tXXX_1940_ batch). These have been cancelled on the server to prevent reissues and should be aborted.

NOTE: tasks with the name format hadcm3n_tXXX_1980_40_ will start appearing again in 2 or 3 weeks and will be from genuine resubmission workunits. These should not be aborted.

Thyme Lawn
Forum moderator

ID: 38715 · Report as offensive
Profile Gundolf Jahn

Send message
Joined: 20 Dec 07
Posts: 1069
Germany
Message 38717 - Posted: 29 Jun 2011, 21:32:12 UTC - in response to Message 38715.  

Those are interesting informations on CPDN but have nothing to do with an outage.

There is already a thread about ClimatePrediction.Net (AKA CPDN) in this subforum. It's admittedly a bit old, but you could also start a new one.

Gruß,
Gundolf
ID: 38717 · Report as offensive
Profile Byron Leigh Hatch @ team Carl ...
Avatar

Send message
Joined: 30 Aug 05
Posts: 505
Canada
Message 38719 - Posted: 29 Jun 2011, 22:42:15 UTC - in response to Message 38717.  

Oops sorry about that.
you are right.
this thread should only be about News on Project Outages.
I didn't know or realize :(
thank you Gundolf Jahn for pointing me to that thread.
about ClimatePrediction.Net (AKA CPDN) in this sub forum
Best Wishes
Byron
ID: 38719 · Report as offensive
Odicin

Send message
Joined: 26 May 11
Posts: 5
Germany
Message 38722 - Posted: 30 Jun 2011, 4:56:54 UTC

A (new?) Dnetc Webserver is online, but only with standard starting page. This gives me hope, that the boinc server is online again in anytime soon.

This is the default web page for this server.


Regards Odi
ID: 38722 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 38724 - Posted: 30 Jun 2011, 5:05:53 UTC - in response to Message 38722.  

Frankly, *if* they come back up, I think I'll be in observer only mode for a fair while....

A (new?) Dnetc Webserver is online, but only with standard starting page. This gives me hope, that the boinc server is online again in anytime soon.




ID: 38724 · Report as offensive
Profile Byron Leigh Hatch @ team Carl ...
Avatar

Send message
Joined: 30 Aug 05
Posts: 505
Canada
Message 38726 - Posted: 30 Jun 2011, 10:07:18 UTC

There seems to be a project outage at Spinenge@home
Dose any one have any info ?
ID: 38726 · Report as offensive
Profile Peter
Avatar

Send message
Joined: 7 Sep 09
Posts: 167
Canada
Message 38731 - Posted: 30 Jun 2011, 13:11:44 UTC - in response to Message 38726.  

Spinhenge just came back up. ;-)
ID: 38731 · Report as offensive
Profile NullCoding*
Avatar

Send message
Joined: 10 Jan 11
Posts: 58
United States
Message 38735 - Posted: 30 Jun 2011, 15:51:26 UTC

AQUA's webpage is functional but there is no new work yet. I believe outstanding WUs can still be submitted but it doesn't appear the validator is up yet. Apparently there should be no point loss and credits will still be granted where they are and have been due. Good news indeed!

I believe Collatz was down earlier, for maintenance it said. I couldn't reach the site at all but it's all working fine now.

For those who like DNETC and/or want to participate in the RC5-72 effort through BOINC, try Moo! Wrapper. Same/similar credits and runtime for both CPU and GPU apps (~45 min on OC GTX460, not too bad). It is the DNETC Client app.
ID: 38735 · Report as offensive
Odicin

Send message
Joined: 26 May 11
Posts: 5
Germany
Message 38747 - Posted: 2 Jul 2011, 0:05:58 UTC

Dnetc seems to be online again...

Regards Odi
ID: 38747 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 38848 - Posted: 8 Jul 2011, 18:59:24 UTC
Last modified: 8 Jul 2011, 19:03:54 UTC

From the Aqua project admins:

"In preparation for a credit rollback, I've emptied the queues of all work. Please feel free to cancel any tasks that you may be running.

Please run tasks from other BOINC projects till we sort out this credit mess."

Since dealing with the credit mess involves not only some project work, but also some corrective BOINC server code, I'd expect this to take some time.

Basically, aside from the message boards, consider Aqua to be offline -- best case -- some time next week.
ID: 38848 · Report as offensive
Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · Next

Message boards : Projects : News on Project Outages

Copyright © 2024 University of California. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.