Thread 'News on project outages.'

Message boards : Projects : News on project outages.
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 5 · 6 · 7 · 8 · 9 · 10 · 11 . . . 15 · Next

AuthorMessage
ZPM
Avatar

Send message
Joined: 14 Mar 09
Posts: 215
United States
Message 25034 - Posted: 25 May 2009, 1:22:05 UTC - in response to Message 25018.  

update: vtu for me is unreachable at home and work.....

I recommend Secunia PSI: http://secunia.com/vulnerability_scanning/personal/
ID: 25034 · Report as offensive
David Ball

Send message
Joined: 2 Dec 06
Posts: 69
United States
Message 25052 - Posted: 26 May 2009, 5:42:34 UTC

Spinhenge is back up!
ID: 25052 · Report as offensive
mo.v
Avatar

Send message
Joined: 13 Aug 06
Posts: 778
United Kingdom
Message 25186 - Posted: 3 Jun 2009, 15:50:38 UTC

CPDN main project

Milo said this morning (Wednesday):

'cpdn-upload1.comlab is shown as down at the moment. The server is running but it's shut down apache as the data partition is full. I've got nowhere else to put the data at the moment so this may well cause a problem for hadam3p uploads until I can obtain more hardware.'

and this afternoon:

'There's a tiny amount of space left (it's only 99% full) so I've turned apache back on for a short time in the hope that a few more such as this will clear. I'll have to shut it down before I go home, though.'

When Milo stops file uploads it will affect all model types. Attempts to upload files will produce BOINC messages like 'No space left on device/server/disk'. It is best to avoid multiple failures of file uploads. If you have any type of CPDN model near completion it would be a good idea to suspend the model and run other tasks instead until Milo creates more space on the file upload server. If you suspend models near completion you can allow BOINC network activity. (But the CPDN server will probably refuse to send you new models while you have models suspended.)

If you prefer not to suspend models near completion you should suspend BOINC network activity until Milo solves the problem.

If your computer is without work because you cannot complete models and upload files, you could attach to another project and fetch work from there (Tools > Attach to project and use the same email address as when you attached to CPDN). To fetch more work you may need Advanced > Preferences > increase additional work buffer > OK.
ID: 25186 · Report as offensive
mo.v
Avatar

Send message
Joined: 13 Aug 06
Posts: 778
United Kingdom
Message 25253 - Posted: 6 Jun 2009, 16:05:48 UTC

CPDN main project

The problem described in the above post continues.

Uploader1atm is currently disabled. It normally accepts trickle-ups. Because there are several upload servers, some (but not all) models are unable to upload trickles. If your model(s) cannot upload trickles you may prefer to suspend BOINC network activity all or most of the time.
ID: 25253 · Report as offensive
ZPM
Avatar

Send message
Joined: 14 Mar 09
Posts: 215
United States
Message 25260 - Posted: 6 Jun 2009, 21:27:02 UTC - in response to Message 25253.  

VTU@HOME Back in service and seems to be running much better....

I recommend Secunia PSI: http://secunia.com/vulnerability_scanning/personal/
ID: 25260 · Report as offensive
mo.v
Avatar

Send message
Joined: 13 Aug 06
Posts: 778
United Kingdom
Message 25280 - Posted: 7 Jun 2009, 21:25:44 UTC

CPDN main project

As Milo predicted, the disk of trickle upload server uploader.oerc has also filled up and is now disabled as can be seen on the server status page. More models (though not all) are now unable to upload their trickles. When the upload servers are activated again they will accept the delayed trickles with no harm to the models.

Because we cannot at the moment upload model files, the previous advice to suspend models before they complete remains valid.

ID: 25280 · Report as offensive
ProfileKSMarksPsych
Avatar

Send message
Joined: 30 Oct 05
Posts: 1239
United States
Message 25293 - Posted: 8 Jun 2009, 13:40:30 UTC

Einstein@Home is temporarily down for some database maintenance. Admin is predicting this should be about an hour or so.
ID: 25293 · Report as offensive
ZPM
Avatar

Send message
Joined: 14 Mar 09
Posts: 215
United States
Message 25346 - Posted: 11 Jun 2009, 14:14:48 UTC

vtu@home down... unknown.

I recommend Secunia PSI: http://secunia.com/vulnerability_scanning/personal/
ID: 25346 · Report as offensive
mo.v
Avatar

Send message
Joined: 13 Aug 06
Posts: 778
United Kingdom
Message 25396 - Posted: 12 Jun 2009, 23:18:54 UTC

CPDN main project

Update on server situation

The three problematic upload servers are

cpdn-upload1.comlab (disk space)
uploader1.atm (broken power supply)
uploader.oerc (disk space)

Milo is still moving data from cpdn-upload1.comlab. He has not yet been able to replace the power supply because of a delivery delay. He has moved over 3TB of data from uploader.oerc and this server is now up and running.

However, some CPDN models upload their zip files to more than one server. Each HadAM3P model uploads files to all those three servers. So if you have had BOINC network activity enabled, some of your HadAM3P zip files will have uploaded to uploader.oerc but the other files will still be unable to upload. Please do not try to make them upload by pressing the Retry Now button.

HadCM (80-year model) zip files upload to climateapps3 and uploader.oerc so these no longer present a problem.

We are still recommending that members should suspend HadAM3P models before they complete. If you do this you can allow BOINC network activity as much as you want.

If you have zip files in the Transfers tab that cannot upload it is a good idea to keep BOINC network activity suspended as much as possible.

Many thanks to ClimatePrediction members for your patience and good humour during this outage.
ID: 25396 · Report as offensive
mo.v
Avatar

Send message
Joined: 13 Aug 06
Posts: 778
United Kingdom
Message 25404 - Posted: 13 Jun 2009, 11:10:18 UTC

CPDN main project

cpdn-upload1.comlab is now up and running. Please see the above post for how this will affect HadAM3P zip file uploads.
ID: 25404 · Report as offensive
ZPM
Avatar

Send message
Joined: 14 Mar 09
Posts: 215
United States
Message 25415 - Posted: 13 Jun 2009, 15:32:09 UTC - in response to Message 25404.  

Aqua@home down....

I recommend Secunia PSI: http://secunia.com/vulnerability_scanning/personal/
ID: 25415 · Report as offensive
ZPM
Avatar

Send message
Joined: 14 Mar 09
Posts: 215
United States
Message 25421 - Posted: 14 Jun 2009, 4:07:48 UTC - in response to Message 25415.  

Aqua@home back up and running...

vtu@home is still down.. yet some files have uploaded... site page is down.

6/14/2009 12:07:25 AM vtu@home Scheduler request failed: Couldn't connect to server


I recommend Secunia PSI: http://secunia.com/vulnerability_scanning/personal/
ID: 25421 · Report as offensive
ZPM
Avatar

Send message
Joined: 14 Mar 09
Posts: 215
United States
Message 25455 - Posted: 15 Jun 2009, 15:40:00 UTC - in response to Message 25421.  

vtu@home, somewhat back up but running slower than a p3 with xp installed.

I recommend Secunia PSI: http://secunia.com/vulnerability_scanning/personal/
ID: 25455 · Report as offensive
ZPM
Avatar

Send message
Joined: 14 Mar 09
Posts: 215
United States
Message 25462 - Posted: 16 Jun 2009, 4:22:31 UTC - in response to Message 25455.  

vtu@home, no response to...... anything....

i got 100 fresh wu early yesterday, but now, can't upload them and stuck on the uploading portion of boinc....

I recommend Secunia PSI: http://secunia.com/vulnerability_scanning/personal/
ID: 25462 · Report as offensive
mo.v
Avatar

Send message
Joined: 13 Aug 06
Posts: 778
United Kingdom
Message 25467 - Posted: 16 Jun 2009, 14:12:22 UTC
Last modified: 16 Jun 2009, 14:12:46 UTC

CPDN main project

Milo said just before lunchtime 'Fortunately the power supply [for upload server uploader1.atm] did arrive today and has been fitted. Unfortunately the server has suffered disk failures in its RAID array. Fortunately it's only the OS disks not the data. Unfortunately I can't gain access to the room to do an OS install at the moment, but should be able to do so in the afternoon.'.

Zip files allocated to uploader1.atm should not time out until until Friday 19 June, two weeks after its PSU failed. Even if Milo doesn't get this server running today there's no need to panic. If you have zip files stuck in the Transfers tab, allow BOINC network activity now to upload the files allocated to the other two servers.

If you have HadAM3P models suspended before completion, please keep them suspended for a couple of days after Milo gets uploader1.atm up and running again. After activation this server will receive a big backlog of delayed zip file uploads and we should give priority to files that have been stuck in members' Transfers tab for many days.

Here is the CPDN server status page.
ID: 25467 · Report as offensive
ProfileByron Leigh Hatch @ team Carl ...
Avatar

Send message
Joined: 30 Aug 05
Posts: 505
Canada
Message 25495 - Posted: 17 Jun 2009, 13:14:52 UTC




as of this posting can any one else connect to SETI@home? I can't connect to SETI@home ... I can connect to Google, Einstein@Home etc. etc. ... as of this posting.
ID: 25495 · Report as offensive
ProfileGundolf Jahn

Send message
Joined: 20 Dec 07
Posts: 1069
Germany
Message 25497 - Posted: 17 Jun 2009, 13:26:05 UTC - in response to Message 25495.  
Last modified: 17 Jun 2009, 13:26:34 UTC

as of this posting can any one else connect to SETI@home? I can't connect to SETI@home ... I can connect to Google, Einstein@Home etc. etc. ... as of this posting.

Nope. And the cricket graphs have dropped from 100% to zero about an hour ago. So, I think we'll have to wait a couple of hours until workday begins in Berkeley.

Gruß,
Gundolf
Computer sind nicht alles im Leben. (Kleiner Scherz)
ID: 25497 · Report as offensive
ProfileJord
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 29 Aug 05
Posts: 15542
Netherlands
Message 25502 - Posted: 17 Jun 2009, 14:36:51 UTC - in response to Message 25497.  

Prior to that, the scheduler contacts would already bounce off. Perhaps that they've taken everything down to go use one of the new servers. At least it went off without notice, one minute it was there, the next... poof.
ID: 25502 · Report as offensive
ProfileGundolf Jahn

Send message
Joined: 20 Dec 07
Posts: 1069
Germany
Message 25504 - Posted: 17 Jun 2009, 14:52:43 UTC - in response to Message 25502.  

...Perhaps that they've taken everything down to go use one of the new servers...

At 5:30 in the morning? ;-)
ID: 25504 · Report as offensive
ProfileJord
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 29 Aug 05
Posts: 15542
Netherlands
Message 25506 - Posted: 17 Jun 2009, 15:03:12 UTC - in response to Message 25504.  

Sure, why not? Perhaps someone who couldn't sleep, or perhaps they thought it would give the least mayhem to do it at a ridiculously early hour. It's what ISPs do... ;-)
ID: 25506 · Report as offensive
Previous · 1 . . . 5 · 6 · 7 · 8 · 9 · 10 · 11 . . . 15 · Next

Message boards : Projects : News on project outages.

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.