Thread 'News on Project Outages'

Message boards : Projects : News on Project Outages

BarryAZ
Joined: 4 Sep 09
Posts: 381
United States
Message 36274 - Posted: 30 Dec 2010, 14:43:58 UTC - in response to Message 36257.  

MW is entering their third day of being unreachable -- I wonder if they will remain offline until early next week. If so, it would be their longest period of being totally offline in years. Not that I would characterize that as an achievement.
ID: 36274
Gary Charpentier
Joined: 23 Feb 08
Posts: 2486
United States
Message 36288 - Posted: 31 Dec 2010, 2:36:32 UTC - in response to Message 36274.  

MW is entering their third day of being unreachable -- I wonder if they will remain offline until early next week. If so, it would be their longest period of being totally offline in years. Not that I would characterize that as an achievement.

http://setiathome.berkeley.edu/forum_thread.php?id=62590
FYI the Milkyway Project is completely down. Admins are aware of it but, due to the holiday, restoration of service is not expected until Monday.

We apologize for any inconvenience and thank the Seti Mods for allowing this post.

Blurf
Milkyway Forum Moderator
Project Admin


ID: 36288
Bernd
Joined: 24 Aug 09
Posts: 91
United States
Message 36311 - Posted: 1 Jan 2011, 20:45:35 UTC

MW and CPDN seem to both be back online at the moment.
ID: 36311
Jord
Volunteer tester
Help desk expert
Joined: 29 Aug 05
Posts: 15540
Netherlands
Message 36331 - Posted: 3 Jan 2011, 20:52:09 UTC

Collatz is down. Just as I was trying to figure out what the maximum RAC is on my ATI HD4850. ;)
ID: 36331
idahofisherman
Joined: 11 Aug 06
Posts: 154
United States
Message 36332 - Posted: 3 Jan 2011, 21:25:38 UTC

Anyone know what is happening with CPLAN? It's been down for one week now with no communication.
ID: 36332
Bernd
Joined: 24 Aug 09
Posts: 91
United States
Message 36337 - Posted: 4 Jan 2011, 6:26:13 UTC - in response to Message 36331.  

Collatz is down. Just as I was trying to figure out what the maximum RAC is on my ATI HD4850. ;)

Collatz is back up. According to the front page someone (Project admin) bumped a power cord doing work with a new Server Rack installation. The server came back up with disk errors so maintenance was run as well as some upgrades.
ID: 36337
skilledbachelor
Joined: 4 Jan 11
Posts: 4
Canada
Message 36340 - Posted: 4 Jan 2011, 14:48:10 UTC

rosetta@home appears to be down as of approx. seven hours ago.
ID: 36340
BarryAZ
Joined: 4 Sep 09
Posts: 381
United States
Message 36344 - Posted: 5 Jan 2011, 5:28:51 UTC

Collatz is down again -- and Dnetc is offline as well -- bummer.
ID: 36344
BarryAZ
Joined: 4 Sep 09
Posts: 381
United States
Message 36346 - Posted: 5 Jan 2011, 14:55:25 UTC - in response to Message 36344.  

Collatz remains offline (about 12 hours so far), Dnetc is back and running though.
ID: 36346
Warped
Joined: 25 Aug 08
Posts: 39
South Africa
Message 36347 - Posted: 5 Jan 2011, 17:32:54 UTC - in response to Message 36340.  

rosetta@home appears to be down as of approx. seven hours ago.


It's still down.

I'm getting the following:
On Boinc Manager "Scheduler request failed: Error 403"

On the Website "You don't have permission to access /rosetta/ on this server."

Anyone know what's going on?

ID: 36347
BarryAZ
Joined: 4 Sep 09
Posts: 381
United States
Message 36348 - Posted: 5 Jan 2011, 21:29:21 UTC

Anyone have any information about Collatz -- been offline for a day and a half. That project has been very solid for the past year with only a few shorter outages.
ID: 36348
arkayn
Joined: 21 Mar 09
Posts: 33
United States
Message 36350 - Posted: 6 Jan 2011, 5:08:41 UTC - in response to Message 36348.  

Anyone have any information about Collatz -- been offline for a day and a half. That project has been very solid for the past year with only a few shorter outages.


It just came back up, has the following news on the front page now.

Server Status - Good and Bad
The server is up but the work generator is not. That is on purpose. The server reports that the boot drive needs to be replaced, which means reinstalling Linux, Apache, MySQL, etc. Unfortunately, that disk also contains the partition for the WUs waiting to be sent out. I won't have time to replace it until this weekend. So, until then, I'd rather be cautious and not generate any new work just in case Mr. Murphy is still hanging around. You should be able to return any completed results in the meantime so your BOINC client doesn't get overwhelmed with the number of workunits it tries to monitor.
ID: 36350
dcdc
Joined: 29 Aug 06
Posts: 82
United Kingdom
Message 36353 - Posted: 6 Jan 2011, 19:39:37 UTC - in response to Message 36347.  

rosetta@home appears to be down as of approx. seven hours ago.


It's still down.

I'm getting the following:
On Boinc Manager "Scheduler request failed: Error 403"

On the Website "You don't have permission to access /rosetta/ on this server."

Anyone know what's going on?



Still down... I'm nearly out of jobs on a few machines now. It'd be helpful if the forum was located elsewhere from the project servers so we could see what was going on...
ID: 36353
David Ball
Joined: 2 Dec 06
Posts: 69
United States
Message 36355 - Posted: 6 Jan 2011, 20:51:45 UTC - in response to Message 36353.  

The Rosetta web server now says

The project's fileserver has crashed. We're working to get things back online as soon as possible. Thanks for your patience. -KEL 01/06/2011
ID: 36355
mo.v
Joined: 13 Aug 06
Posts: 778
United Kingdom
Message 36361 - Posted: 7 Jan 2011, 10:43:10 UTC

CPDN main project

uploader.oerc

This upload server has been out of action because its disk filled up. Copying such large quantities of data to another machine takes a long time. It is now running, so thank you, Milo!

There must be a very large number of files from EU regional and HadCM models waiting to upload. They cannot all upload simultaneously so there will still be upload delays, perhaps for another day.
ID: 36361
BarryAZ
Joined: 4 Sep 09
Posts: 381
United States
Message 36364 - Posted: 7 Jan 2011, 15:24:33 UTC - in response to Message 36361.  

Collatz is running for now, but not generating new work. The boot partition/drive needs to be replaced -- Slicker plans to do that over the weekend. Here's hoping for success there.

Dnetc came back up earlier in the week, but as of about an hour ago (7AM PST) it is offline -- no access to the home page.
ID: 36364
Bernd
Joined: 24 Aug 09
Posts: 91
United States
Message 36373 - Posted: 8 Jan 2011, 10:27:16 UTC

Rosetta is online again, sort of. The web page and forums are up, but the rest is down. The message on the front page is as follows:


Jan 7, 2010
Well, our luck ran out. The SAN controller that has been causing so much trouble in the last few months finally tipped over in a rather destructive fashion, corrupting the binary tree on which the filesystem is based. We're trying to rebuild the thing but the sheer number of files in the filesystem (> 10M files) makes this process very, very slow. We're bringing the project up from a recent backup (12/09/10) but the backup wasn't a perfect replica of the environment, so we're having to scramble to get all the parts working together again. We only need a few more weeks and then our new, next generation SAN will be ready to be put into place... I just thought the old one would last a few more weeks. I apologize for the hassle and appreciate your patience as we get things online again... KEL 01/07/11


Does not look good for a fast recovery.
ID: 36373
whynot
Joined: 8 May 10
Posts: 90
Ukraine
Message 36376 - Posted: 8 Jan 2011, 15:43:15 UTC - in response to Message 36136.  


The SZDGR (SZDG Research Facility) project will be closed. We want to thank the participants for their help! The project will reopen later, we will let you know the details.


There's no clue as to when the closure and the reopening will happen. Looks like it's up to the local admins.


Everything is back to normal. It was clarified here that SZDGR isn't SZTAKI itself. The "participants" were specially invited guests. SZTAKI itself is up and running. No SNAFU to see here, move along, move along.

I'm counting for science,
points just make me sick.
ID: 36376
Bernd
Joined: 24 Aug 09
Posts: 91
United States
Message 36382 - Posted: 9 Jan 2011, 21:19:26 UTC

Collatz seems to be offline right now. I thought the repairs were to be done yesterday (Saturday)? Did they come back up after repairs?
ID: 36382
Bernd
Joined: 24 Aug 09
Posts: 91
United States
Message 36383 - Posted: 10 Jan 2011, 5:09:36 UTC

Collatz is back online. I was off by one day on the date of the repair; it was today (Sunday).
ID: 36383

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.