News on Project Outages
Author | Message |
---|---|
Joined: 21 Nov 10 Posts: 3 |
flashawk: the electrical work was supposed to be complete by Monday...then the system re-build/repair work would start...don't forget Rule 1 of IT: everything takes longer than it takes !!! |
Joined: 19 Sep 10 Posts: 24 |
CPDN was down last Friday even before they began the scheduled test of the electrical system at OERC. As you have pointed out, they had a broken RAID last week that took several days to fix. They probably couldn’t even work on it over the weekend. I think that Weather@home works off the same servers, so it too would be down. I too have WUs ready to report. At least the zip files are uploading properly. |
Joined: 4 Jul 08 Posts: 82 |
"At least the zip files are uploading properly..." They are? Lucky you... :-) I've been getting the dreaded "Internet access OK - project servers may be temporarily down" message for the last 48-plus hours. :-( |
Joined: 18 Jul 11 Posts: 217 |
Seti forum appears to be down... |
Joined: 5 Oct 06 Posts: 5121 |
"Seti forum appears to be down..." There appears to be some maintenance work scheduled on backup servers at the CoLo facility. http://systemstatus.berkeley.edu/ (CMR: 2278) |
Joined: 30 May 12 Posts: 356 |
Thanks, Richard! :) I'll post this in the "SETI is down" cafe. |
Joined: 23 Feb 08 Posts: 2486 |
Looks like Seti is back up. |
Joined: 23 Feb 08 Posts: 2486 |
Looks like they crashed again. |
Joined: 19 Sep 10 Posts: 24 |
"At least the zip files are uploading properly..." Yes, one of my hadcm3n WUs finished overnight and the zip files uploaded just fine. Of course, the WU won’t be able to report until the outage is over. That makes 2 that I have sitting on my machines that are “ready to report”. Hopefully they will get it sorted out soon, as I now have a vacant core that I am running MalariaControl on until I can get new work from CP. |
Joined: 29 Aug 05 Posts: 15542 |
Andy Bowery, system admin CPDN wrote: Hi All, |
Joined: 4 Jul 08 Posts: 82 |
Andy Bowery, system admin CPDN wrote: "So you will see that we have the majority of services restored now!" Unfortunately, I cannot see that... :-( I don't seem to be able to access any parts of the site that I used to be able to access: climateprediction.net, my account page, the forums, etc. The only response I get in the BOINC manager to update requests is:
8/8/2013 2:27:45 PM update requested by user
8/8/2013 2:27:46 PM Fetching scheduler list
8/8/2013 2:28:11 PM Project communication failed: attempting access to reference site
8/8/2013 2:28:13 PM Internet access OK - project servers may be temporarily down.
This is the same on all hosts running on three different networks. Are these parts of the project that still aren't back? Did something else change that I might have missed? Or could it be that there's so much traffic now that not everybody is getting through? |
Joined: 16 Apr 06 Posts: 386 |
"... Unfortunately, I cannot see that... :-( I don't seem to be able to access any parts of the site that I used to be able to access: climateprediction.net, my account page, the forums, etc. ..." Yes, doesn't look like it stayed up for very long. I'm not altogether surprised; they had to unexpectedly move everything onto a brand-new server because the original one was falling to pieces. The new configuration probably isn't quite right yet (CPDN has a lot of customisations on top of the standard version of BOINC). I have to confess, I will be pleased when everything is up again because I only have one model remaining. My money would be on tomorrow. |
Joined: 4 Jul 08 Posts: 82 |
"Yes, doesn't look like it stayed up for very long. I'm not altogether surprised; they had to unexpectedly move everything onto a brand-new server because the original one was falling to pieces. The new configuration probably isn't quite right yet (CPDN has a lot of customisations on top of the standard version of BOINC). I have to confess, I will be pleased when everything is up again because I only have one model remaining. My money would be on tomorrow." Thanks, Mike. I'm sure it hasn't been easy for them and hopefully some stability and reliability will be reached soon. I'll keep crunching my models and take another dose of patience pills... ;-) MarkR |
Joined: 29 Aug 05 Posts: 15542 |
It could also be a DNS problem, where we just have to wait until the server's DNS record has propagated again to all DNS servers out there before we see the site or people can upload to it. |
Joined: 19 Sep 10 Posts: 24 |
It is now early Friday morning in the U.K. If they don’t get the problems fixed by end of business today, does that mean that they will be shut out of the server room at OERC until Monday morning? |
Joined: 16 Apr 06 Posts: 386 |
Just had an update from the CPDN admins - climateprediction.net is up & accessible, but while climateapps2 (the key BOINC server) is now up, it does not seem to be accessible from outside their local network. They are trying to figure out the problem, but at least we know that there are signs of life!! |
Joined: 16 Apr 06 Posts: 386 |
A further update - the firewall is now fixed, and should be letting through connections to climateapps2. But the DNS settings for the server need to propagate through the internet (allow up to a day for this to happen everywhere). |
Joined: 4 Jul 08 Posts: 82 |
Thanks for the updates, Mike. I can get to climateprediction.net but am now getting "403 Forbidden" trying to get anywhere on climateapps2. Is that DNS related? (No sarcasm intended...I honestly don't know). |
Joined: 16 Apr 06 Posts: 386 |
Nope, it means that you are getting through to the server (hence the DNS must be OK), but the server itself isn't accepting web connections. Jord has let the admins know. |
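The reasoning in the reply above (a "403 Forbidden" proves the name resolved and the server was reached) can be sketched as a small probe. This is only an illustration, not anything the CPDN admins used: the function names `interpret` and `probe` are made up here, and the host passed in the usage comment is a placeholder, since the thread never gives the full hostname of climateapps2.

```python
import socket
import urllib.error
import urllib.request
from typing import Optional


def interpret(resolved: bool, status: Optional[int]) -> str:
    """Map probe results to a rough diagnosis (pure function, no network)."""
    if not resolved:
        return "name did not resolve: a DNS problem, or the record has not reached this resolver yet"
    if status is None:
        return "name resolved but no HTTP reply: server down or blocked, e.g. by a firewall"
    if status == 403:
        return "server reached but refusing the request: not a DNS problem"
    if 200 <= status < 400:
        return "server reachable and answering"
    return f"server reached, HTTP status {status}"


def probe(host: str, timeout: float = 10.0) -> str:
    """Resolve the name first, then try a plain HTTP request (hypothetical helper)."""
    try:
        socket.getaddrinfo(host, 80)
    except socket.gaierror:
        return interpret(False, None)
    try:
        with urllib.request.urlopen(f"http://{host}/", timeout=timeout) as reply:
            status = reply.status
    except urllib.error.HTTPError as err:
        # The server answered, just not with a success code (403, 500, ...).
        status = err.code
    except (urllib.error.URLError, OSError):
        # No usable HTTP reply at all: refused, timed out, filtered.
        status = None
    return interpret(True, status)


# Usage (placeholder host, makes real network requests):
# print(probe("example.org"))
```

The order of the checks mirrors the argument in the post: an HTTP status of any kind can only arrive after name resolution has already succeeded.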
Joined: 9 Nov 05 Posts: 123 |
Docking has had some failure; their home page looks like this:
Warning: session_start() [function.session-start]: open(/var/lib/php/session/sess_mtr1qu1l7jbg8tlivr4f3lg6c4, O_RDWR) failed: Read-only file system (30) in /boinc/projects/docking/html_v2/project/project.inc on line 32
And my account page says this:
Forbidden
And BOINC says:
So 11 Aug 2013 16:44:26 CEST | Docking | [fxd] starting upload, upload_offset -1
So 11 Aug 2013 16:44:26 CEST | Docking | Started upload of 1m0b1hsg_mod0014crossdockinghiv1_119842_387017_0_0
So 11 Aug 2013 16:44:26 CEST | Docking | [file_xfer] URL: http://docking.cis.udel.edu/docking_cgi/file_upload_handler
So 11 Aug 2013 16:44:26 CEST | Docking | [fxd] starting upload, upload_offset -1
So 11 Aug 2013 16:44:26 CEST | Docking | Started upload of 1m0b1hsg_mod0014crossdockinghiv1_119842_387017_0_1
So 11 Aug 2013 16:44:26 CEST | Docking | [file_xfer] URL: http://docking.cis.udel.edu/docking_cgi/file_upload_handler
So 11 Aug 2013 16:44:26 CEST | Docking | [fxd] starting upload, upload_offset 0
So 11 Aug 2013 16:44:26 CEST | Docking | Started upload of 1m0b1hsg_mod0014crossdockinghiv1_119842_387017_0_2
So 11 Aug 2013 16:44:26 CEST | Docking | [file_xfer] URL: http://docking.cis.udel.edu/docking_cgi/file_upload_handler
So 11 Aug 2013 16:44:27 CEST | Docking | [file_xfer] http op done; retval 0 (Success)
So 11 Aug 2013 16:44:27 CEST | Docking | [error] Error reported by file upload server: can't open log file
So 11 Aug 2013 16:44:27 CEST | Docking | [file_xfer] parsing upload response: <data_server_reply> <status>1</status> <message>can't open log file</message></data_server_reply>
So 11 Aug 2013 16:44:27 CEST | Docking | [file_xfer] parsing status: -127
So 11 Aug 2013 16:44:27 CEST | Docking | [file_xfer] http op done; retval 0 (Success)
So 11 Aug 2013 16:44:27 CEST | Docking | [error] Error reported by file upload server: can't open log file
So 11 Aug 2013 16:44:27 CEST | Docking | [file_xfer] parsing upload response: <data_server_reply> <status>1</status> <message>can't open log file</message></data_server_reply>
So 11 Aug 2013 16:44:27 CEST | Docking | [file_xfer] parsing status: -127
So 11 Aug 2013 16:44:27 CEST | Docking | [file_xfer] http op done; retval 0 (Success)
So 11 Aug 2013 16:44:27 CEST | Docking | [error] Error reported by file upload server: can't open log file
So 11 Aug 2013 16:44:27 CEST | Docking | [file_xfer] parsing upload response: <data_server_reply> <status>1</status> <message>can't open log file</message></data_server_reply>
So 11 Aug 2013 16:44:27 CEST | Docking | [file_xfer] parsing status: -127
So 11 Aug 2013 16:44:27 CEST | Docking | [file_xfer] file transfer status -127 (transient upload error)
So 11 Aug 2013 16:44:27 CEST | Docking | Temporarily failed upload of 1m0b1hsg_mod0014crossdockinghiv1_119842_387017_0_0: transient upload error
So 11 Aug 2013 16:44:27 CEST | Docking | [file_xfer] project-wide xfer delay for 18296.980512 sec
So 11 Aug 2013 16:44:27 CEST | Docking | Backing off 1 hr 6 min 47 sec on upload of 1m0b1hsg_mod0014crossdockinghiv1_119842_387017_0_0
So 11 Aug 2013 16:44:27 CEST | Docking | [file_xfer] file transfer status -127 (transient upload error)
So 11 Aug 2013 16:44:27 CEST | Docking | Temporarily failed upload of 1m0b1hsg_mod0014crossdockinghiv1_119842_387017_0_1: transient upload error
So 11 Aug 2013 16:44:27 CEST | Docking | [file_xfer] project-wide xfer delay for 14446.881880 sec
So 11 Aug 2013 16:44:27 CEST | Docking | Backing off 2 hr 0 min 20 sec on upload of 1m0b1hsg_mod0014crossdockinghiv1_119842_387017_0_1
So 11 Aug 2013 16:44:27 CEST | Docking | [file_xfer] file transfer status -127 (transient upload error)
So 11 Aug 2013 16:44:27 CEST | Docking | Temporarily failed upload of 1m0b1hsg_mod0014crossdockinghiv1_119842_387017_0_2: transient upload error
So 11 Aug 2013 16:44:27 CEST | Docking | [file_xfer] project-wide xfer delay for 14269.344979 sec
So 11 Aug 2013 16:44:27 CEST | Docking | Backing off 1 hr 14 min 32 sec on upload of 1m0b1hsg_mod0014crossdockinghiv1_119842_387017_0_2
Greetings from Saenger. For questions about BOINC, look in the BOINC-Wiki |
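For anyone wanting to pull the useful bits out of a client log like the one above, here is a minimal sketch. It only reads fields that are visible in the posted log: the `<data_server_reply>` fragment and the "Backing off ..." lines. The helper names `parse_reply` and `backoff_seconds` are invented for this example, and the regular expression assumes the "N hr N min N sec" wording seen in this post; other client versions may print it differently.

```python
import re
import xml.etree.ElementTree as ET
from typing import Optional, Tuple

# Matches e.g. "Backing off 1 hr 6 min 47 sec on upload of <name>".
# Treating the hour and minute parts as optional is an assumption.
BACKOFF = re.compile(
    r"Backing off (?:(\d+) hr )?(?:(\d+) min )?(\d+) sec on upload of (\S+)"
)


def parse_reply(xml_text: str) -> Tuple[int, str]:
    """Return (status, message) from an upload server reply fragment."""
    root = ET.fromstring(xml_text)
    status = int(root.findtext("status", default="0"))
    message = root.findtext("message", default="")
    return status, message


def backoff_seconds(line: str) -> Optional[Tuple[str, int]]:
    """Return (file name, backoff in seconds) for a 'Backing off' log line."""
    match = BACKOFF.search(line)
    if match is None:
        return None
    hours, minutes, seconds, name = match.groups()
    total = int(hours or 0) * 3600 + int(minutes or 0) * 60 + int(seconds)
    return name, total


reply = ("<data_server_reply> <status>1</status> "
         "<message>can't open log file</message></data_server_reply>")
print(parse_reply(reply))

line = ("So 11 Aug 2013 16:44:27 CEST | Docking | Backing off 1 hr 6 min 47 sec "
        "on upload of 1m0b1hsg_mod0014crossdockinghiv1_119842_387017_0_0")
print(backoff_seconds(line))
```

In this log the server message ("can't open log file") sits alongside the "Read-only file system" warning from the project's web page, and the client itself labels the result a transient upload error and retries after the backoff.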
Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document
under the terms of the GNU Free Documentation License,
Version 1.2 or any later version published by the Free Software Foundation.