News on Project Outages

Message boards : Projects : News on Project Outages
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 33 · 34 · 35 · 36 · 37 · Next

AuthorMessage
Profile Keith Myers
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 17 Nov 16
Posts: 643
United States
Message 106197 - Posted: 27 Nov 2021, 3:33:00 UTC

why would they bunker tasks?

No idea. Not aware of any challenges for Universe.
But they did the same thing two weeks ago. Dumped 7X their average work today. Everybody should get a nice credit bump today from all their pendings finally clearing back to normalcy.
https://stats.free-dc.org/team/uni/3
ID: 106197 · Report as offensive     Reply Quote
Profile Jan Henrik
Avatar

Send message
Joined: 5 Jul 20
Posts: 23
Message 106207 - Posted: 27 Nov 2021, 21:33:22 UTC

CAMK is online again

universe@home website displays "shutdown for maintenance"

could upload but all stuck in "ready to report"

updates pushed back 1 hour so no new tasks

for the middle of a Thanksgiving/Black Friday weekend that's some progress
"less than a pixel"
ID: 106207 · Report as offensive     Reply Quote
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 4678
United Kingdom
Message 106217 - Posted: 28 Nov 2021, 9:17:23 UTC

GPUGrid have let their server https certificate expire. No uploads or downloads, no scheduler contact, website can't be accessed unless you bypass the warnings.
ID: 106217 · Report as offensive     Reply Quote
Profile Jan Henrik
Avatar

Send message
Joined: 5 Jul 20
Posts: 23
Message 106228 - Posted: 29 Nov 2021, 3:32:01 UTC - in response to Message 106197.  


. . . Everybody should get a nice credit bump today from all their pendings finally clearing back to normalcy.


to me it looks like rather everybody got a credit-average cut

could download tasks again, but still got a lot of pendings

besides that; universe is all "norminal" again . . .
"less than a pixel"
ID: 106228 · Report as offensive     Reply Quote
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 4678
United Kingdom
Message 106229 - Posted: 29 Nov 2021, 8:41:31 UTC

GPUGrid has been repaired, with a new certificate valid until 27 February 2022 (watch out for a repeat performance then). But the server will be very busy with recovery today.
ID: 106229 · Report as offensive     Reply Quote
Jimbocous
Avatar

Send message
Joined: 1 Oct 15
Posts: 388
United States
Message 106419 - Posted: 14 Dec 2021, 21:21:10 UTC

Milkyway seems to be down.
ID: 106419 · Report as offensive     Reply Quote
Harri Liljeroos

Send message
Joined: 25 Jul 18
Posts: 29
Finland
Message 106480 - Posted: 22 Dec 2021, 8:59:02 UTC

https://einsteinathome.org/ seems to be down.
ID: 106480 · Report as offensive     Reply Quote
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 4678
United Kingdom
Message 106481 - Posted: 22 Dec 2021, 10:14:01 UTC - in response to Message 106480.  
Last modified: 22 Dec 2021, 10:55:17 UTC

https://einsteinathome.org/ seems to be down.
Seems to be affecting the website and the scheduler only: completed tasks can be uploaded, but not reported.

I think the upload servers are in Europe, but the web servers are in America. It may be this afternoon (European time) before they can investigate.

Edit: I was wrong - they're back.
ID: 106481 · Report as offensive     Reply Quote
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 4678
United Kingdom
Message 106483 - Posted: 22 Dec 2021, 15:19:57 UTC

Bernd wrote:
A few hours ago the Einstein@Home servers experienced a hard shutdown. Apparently several safety measurements failed, for most of these it's still unclear why. We're trying to get the servers back up running and replace the ones with terminal failures and are working on restoring all services before Friday. Until then you may experience some limitations. In particular the O3AS workunit generator isn't running.
ID: 106483 · Report as offensive     Reply Quote
Profile mlviper
Avatar

Send message
Joined: 7 Sep 20
Posts: 33
Germany
Message 106640 - Posted: 2 Jan 2022, 21:48:58 UTC
Last modified: 2 Jan 2022, 21:49:30 UTC

Happy New Year! :D

universe@home seems down, website unreachable, and no new WUs received.
ID: 106640 · Report as offensive     Reply Quote
robsmith
Volunteer tester
Help desk expert

Send message
Joined: 25 May 09
Posts: 999
United Kingdom
Message 106655 - Posted: 3 Jan 2022, 19:58:59 UTC - in response to Message 106640.  

The log in page is back, but totally unresponsive....
ID: 106655 · Report as offensive     Reply Quote
Profile Keith Myers
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 17 Nov 16
Posts: 643
United States
Message 106672 - Posted: 4 Jan 2022, 19:27:47 UTC

The Universe website has been unreachable for over a day.
ID: 106672 · Report as offensive     Reply Quote
Michael Blucher

Send message
Joined: 3 Jan 20
Posts: 6
Message 106678 - Posted: 5 Jan 2022, 10:38:41 UTC

Anyone know what happened to Universe at Home: https://universeathome.pl/universe/home.php
The project site appears to be unreachable.

Regards

Michael Blucher
ID: 106678 · Report as offensive     Reply Quote
Les Bayliss
Help desk expert

Send message
Joined: 25 Nov 05
Posts: 1590
Australia
Message 106679 - Posted: 5 Jan 2022, 11:33:11 UTC

it's at a University, so, while the University site is up, it seems that the BOINC project isn't, and everyone will probably have to wait until the project owner is back on site.
ID: 106679 · Report as offensive     Reply Quote
Profile Jan Henrik
Avatar

Send message
Joined: 5 Jul 20
Posts: 23
Message 106681 - Posted: 5 Jan 2022, 13:50:18 UTC - in response to Message 106678.  

Anyone know what happened to Universe at Home: https://universeathome.pl/universe/home.php
The project site appears to be unreachable.

Regards

Michael Blucher


When the site was up (briefly and somewhat wobbly) I found this;

Krzysztof 'krzyszp' Piszcek [the administrator] posted in the "No tasks" - thread at 1 Jan 2022,16:46:36 UTC


"The recent server problem comes as with current load we have massive traffic between main and storage server and between main and database server.

I'm working in it but still can't recognize what exactly makes troubles in whole system."



Their servers where going down(and quietly restarted mostly on weekends) with increasing frequency.

IMHO they take their time now to look what's really going on there.
"less than a pixel"
ID: 106681 · Report as offensive     Reply Quote
Dr Who Fan
Avatar

Send message
Joined: 10 May 07
Posts: 798
United States
Message 106682 - Posted: 5 Jan 2022, 14:18:18 UTC - in response to Message 106681.  

krzyszp posted this over at BOINCstats:
Index :: The Projects :: Universe@Home server crash

On January 1, the project server had a serious physical failure. After trying to restart, it started for several hours, but stopped responding shortly after.
As it turned out, at least one of his hard drives had physically crashed.
As the main server of the project is also the oldest of the machines used in the project, we currently do not have any disk to replace the one that has died, and the other two that are also already old and need to be replaced.
We will be ordering new disks soon, and before the end of January I will go to Warsaw to replace them and restart the project server, unfortunately I can't do it earlier.
All user data and computed tasks are completely safe on a separate machine (the database server is physically different machine), while all computed results are also stored on a another machine.
Regards,

Krzysztof 'krzyszp' Piszczek
Boinc@Poland team member.
ID: 106682 · Report as offensive     Reply Quote
Profile Jord
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 29 Aug 05
Posts: 14825
Netherlands
Message 106683 - Posted: 5 Jan 2022, 14:43:25 UTC

Meanwhile at BOINCstats:
Secure Connection Failed

An error occurred during a connection to boincstats.com.

- The page you are trying to view cannot be shown because the authenticity of the received data could not be verified.
- Please contact the website owners to inform them of this problem.
ID: 106683 · Report as offensive     Reply Quote
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 4678
United Kingdom
Message 106684 - Posted: 5 Jan 2022, 14:52:10 UTC - in response to Message 106683.  

The BOINCstats website (as seen from here) is using a Cloudflare certificate valid from 11 August 2021 to 10 August 2022.

I've sometimes seen problems establishing a secure connection to a busy and active server: I think https requires significantly more server resource to establish the connection than http. If server operators have succumbed to the pressure for 'https for everything', without upgrading their hardware, glitches can happen. They usually clear within a few minutes or hours.
ID: 106684 · Report as offensive     Reply Quote
Profile Jord
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 29 Aug 05
Posts: 14825
Netherlands
Message 106685 - Posted: 5 Jan 2022, 15:24:58 UTC - in response to Message 106684.  

At least Firefox says that an error occurred, Chrome is less obvious:
This site can’t be reached

The webpage at https://boincstats.com/ might be temporarily down or it may have moved permanently to a new web address.
ERR_HTTP2_PROTOCOL_ERROR
ID: 106685 · Report as offensive     Reply Quote
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 4678
United Kingdom
Message 106686 - Posted: 5 Jan 2022, 15:37:49 UTC - in response to Message 106685.  

It was Chrome (under Windows 7) that let me retrieve those certificate details.
ID: 106686 · Report as offensive     Reply Quote
Previous · 1 . . . 33 · 34 · 35 · 36 · 37 · Next

Message boards : Projects : News on Project Outages

Copyright © 2022 University of California. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.