Thread 'News on Project Outages'

Message boards : Projects : News on Project Outages
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 15 · 16 · 17 · 18 · 19 · 20 · 21 · Next

AuthorMessage
NaRoon

Send message
Joined: 17 Feb 15
Posts: 2
United States
Message 60347 - Posted: 17 Feb 2015, 2:36:31 UTC - in response to Message 60269.  

Constellation has been down for several hours


Now several days.


Anyone have a status on the constellation Project? Their website is down also.
ID: 60347 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 60358 - Posted: 17 Feb 2015, 16:49:43 UTC

Collatz is offline -- this time the online run between going offline was about 9 days.

When it goes offline it takes between 1 day and 2 weeks before Slicker is able to restart it.

From his previous reports, it seems he's got an intermittant which leaves no trace of the failure mode. So it resurfaces and when he is able to restart the server, Collatz is back online.
ID: 60358 · Report as offensive
ProfileGary Charpentier
Avatar

Send message
Joined: 23 Feb 08
Posts: 2486
United States
Message 60491 - Posted: 23 Feb 2015, 17:51:15 UTC

Seti has gone down.
ID: 60491 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 60502 - Posted: 24 Feb 2015, 0:08:05 UTC - in response to Message 60491.  
Last modified: 24 Feb 2015, 1:07:39 UTC

Seti appears back up

Collatz which recovered quickly from one of its periodic outages a few days ago, went offline again yesterday and remains offline.

One frustrating thing there is when Collatz goes back online, there is little explanation as to the outage -- and no advance notice of the next one. Seems just like a frustrating head scratcher there.


GPUGrid appears to be having some communications issues. It has been rather slow connection wise (and with it's large uploads and downloads that can be problematic), but today the problem appears more severe.

For GPUGrid -- seems it isn't their hardware that is amiss, but rather their internet connectivity. It isn't absolutely gone, just dial up modem slow -- which is REALLY bad for their uploads and downloads.

They are pretty much a daytime outfit -- so I suspect the outage won't get looked at until tomorrow.
ID: 60502 · Report as offensive
ProfileBlurf

Send message
Joined: 18 Jul 11
Posts: 217
United States
Message 60506 - Posted: 24 Feb 2015, 14:26:34 UTC

Climate is downloading files but I can't upload completed work to them
ID: 60506 · Report as offensive
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 5121
United Kingdom
Message 60507 - Posted: 24 Feb 2015, 14:42:58 UTC - in response to Message 60506.  

Climate is downloading files but I can't upload completed work to them

Yes, the current disk space at the British Atmospheric Data Centre (BADC) is full. CPDN project staff have been notified, but they will need to persuade BADC staff to do some housekeeping before the storage facility can accept more uploads.

Meanwhile, the CPDN regional models should upload results to their respective regional upload servers without problems.
ID: 60507 · Report as offensive
ProfileBlurf

Send message
Joined: 18 Jul 11
Posts: 217
United States
Message 60529 - Posted: 24 Feb 2015, 20:37:36 UTC - in response to Message 60507.  


Yes, the current disk space at the British Atmospheric Data Centre (BADC) is full. CPDN project staff have been notified, but they will need to persuade BADC staff to do some housekeeping before the storage facility can accept more uploads.

Meanwhile, the CPDN regional models should upload results to their respective regional upload servers without problems.


Thx for the update, Richard.
ID: 60529 · Report as offensive
Dr Who Fan
Avatar

Send message
Joined: 10 May 07
Posts: 1424
United States
Message 60560 - Posted: 27 Feb 2015, 6:04:32 UTC
Last modified: 27 Feb 2015, 6:56:55 UTC

WUProp@Home has been down for 30 to 60 minutes as of this posting.
Getting "Database not found" errors, unable to login or get past home page. Site has been a bit buggy last 24+ hours and not always updating reported computing hours.

------ EDIT ------
WUProp@Home is up and running again but there is currently about 1.94 hours backlog on the Transitioner.
ID: 60560 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 60563 - Posted: 27 Feb 2015, 21:06:24 UTC

Collatz is back online at the moment.

GPUGrid seems to be online -- though it still appears to have communications problems -- very slow access and sometimes it times out.
ID: 60563 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 60564 - Posted: 27 Feb 2015, 21:57:43 UTC - in response to Message 60563.  

The access problems at GPUGrid appear now to have it essentially non-accessible.

Those problems surfaced to a degree last week.

They were severe early this week, and got marginally better mid week.

They are now at the same severe status that was manifest early this week.

As it is the weekend back at their shop, I suspect it will be until next week that the problem even get's acknowledged, let alone identified and then resolved.




Collatz is back online at the moment.

GPUGrid seems to be online -- though it still appears to have communications problems -- very slow access and sometimes it times out.
ID: 60564 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 60565 - Posted: 28 Feb 2015, 15:50:32 UTC

GPUGrid remains essentially unaccessible -- we can hope that their staff:

1) Acknowledges that there is a problem
2) Identifies the problem
3) Works toward resolving the problem

For GPUGrid, which has very large uploads and downloads, fast thruput is critical, I am sure they know that. Also, with an outage of days, the backlog is likely to place an even greater stress on thruput so the longer the outage, the longer the recovery.

Collatz is back to its periodic crash tricks as well -- started last night. For Collatz, it should return to online status within 1, 2 or 10 days -- with no determination as what causes it to break.
ID: 60565 · Report as offensive
ProfileBlurf

Send message
Joined: 18 Jul 11
Posts: 217
United States
Message 60566 - Posted: 28 Feb 2015, 16:40:54 UTC

Wow...guess it's Outage Weekend:

1) Seti-can't get new work

2) Citizen's Science Grid-Can't get work

3) GPUrid-*Poof* Dead

4) Collatz-*Poof* Dead
ID: 60566 · Report as offensive
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 5121
United Kingdom
Message 60567 - Posted: 28 Feb 2015, 17:12:17 UTC - in response to Message 60566.  

And this BOINC-dev site was practically inaccessible this morning. I wonder what the common factor is. Linux servers falling over? All running BOINC server software? External DDOS attack?
ID: 60567 · Report as offensive
ProfileJord
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 29 Aug 05
Posts: 15542
Netherlands
Message 60568 - Posted: 28 Feb 2015, 19:09:31 UTC - in response to Message 60567.  
Last modified: 28 Feb 2015, 19:09:46 UTC

And this BOINC-dev site was practically inaccessible this morning.

Whenever that happens, use the HTTPS address instead of the HTTP address. The pounding seems to happen on the HTTP address only.
ID: 60568 · Report as offensive
ProfileBlurf

Send message
Joined: 18 Jul 11
Posts: 217
United States
Message 60570 - Posted: 1 Mar 2015, 3:23:44 UTC - in response to Message 60566.  

Updates:

1) Seti-can't get new work -->BACK UP

2) Citizen's Science Grid-Can't get work -->BACK UP

3) GPUrid-*Poof* Dead -->BACK UP

4) Collatz-*Poof* Dead -->Site up but down for maintenance
ID: 60570 · Report as offensive
ProfileBlurf

Send message
Joined: 18 Jul 11
Posts: 217
United States
Message 60592 - Posted: 1 Mar 2015, 12:54:02 UTC

Can't upload files back to Seti this morning
ID: 60592 · Report as offensive
Bill Walker

Send message
Joined: 13 Dec 07
Posts: 24
Canada
Message 60624 - Posted: 3 Mar 2015, 1:53:23 UTC - in response to Message 60347.  

Constellation has been down for several hours


Now several days.


Anyone have a status on the constellation Project? Their website is down also.


I get occasional connects to the site, but haven't completed a download in a couple of weeks. They all come up with various error messages, usually

"2015-03-02 3:03:22 PM | | Project communication failed: attempting access to reference site"

But it keeps trying to send me WUs.
ID: 60624 · Report as offensive
Dr Who Fan
Avatar

Send message
Joined: 10 May 07
Posts: 1424
United States
Message 60681 - Posted: 5 Mar 2015, 12:12:17 UTC

Collatz is sorta up but appears to have ANOTHER Database Error:
Can not sign in to your account... User ID/Login failure with "No such ID/Database Error messages."
Can not report work... "Message from server: Invalid or missing account key. To fix, remove and add this project."
ID: 60681 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 60687 - Posted: 5 Mar 2015, 17:56:25 UTC - in response to Message 60681.  

Indeed, the gremlin that has periodically been plaguing Collatz with something of a mechanics shoulder shrug response of a restart and a "I wonder what isn't working right", has not, absent aggressive problem solving, decided to force complete action.

Now the database will either have to be restored from existing backups (which might have the gremlin embedded there), or the project may need an absolute and complete rebuild and recode from scratch.

Personally, after all the glitches we've seen over the past several months, and recognizing that this has been a one persona labor of love, I am NOT sanguine of seeing a return to action for Collatz in the need future.

I lost about 10 hours work since it happened over night -- minor drama there -- I've shifted over to other projects and simply deleted the Collatz project fro my workstations.

Personally, over the past year, I had been shifting over to Moowrap, Milkyway, GPUGrid and PrimeGrid gradually, so it is more a one day bump to me.

I wish him luck on a project rebuild -- and suspect he's gotten more than a bit frustrated with the issues over the past several months -- but he too has a life to lead.



Collatz is sorta up but appears to have ANOTHER Database Error:
Can not sign in to your account... User ID/Login failure with "No such ID/Database Error messages."
Can not report work... "Message from server: Invalid or missing account key. To fix, remove and add this project."
ID: 60687 · Report as offensive
ProfileTigers Dave

Send message
Joined: 24 Dec 05
Posts: 52
United States
Message 60688 - Posted: 5 Mar 2015, 18:11:40 UTC - in response to Message 60687.  
Last modified: 5 Mar 2015, 18:11:52 UTC

Oddly enough, Collatz is still accepting completed tasks. So, I am going to keep crunching my Collatz work units until I run out of them. Hopefully, by that point Collatz will be back. In the meantime, I am going to look for alternatives. Not too many projects run on GPUs under Mac OS, particularly some of my older slower GPUs like NVIDIA GeForce 320M, NVIDIA GeForce 9400, and NVIDIA GeForce 8800 GT.
ID: 60688 · Report as offensive
Previous · 1 . . . 15 · 16 · 17 · 18 · 19 · 20 · 21 · Next

Message boards : Projects : News on Project Outages

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.