Posts by Thyme Lawn

1) Message boards : Projects : News on Project Outages (Message 106176)
Posted 25 Nov 2021 by Thyme Lawn
Post:
All accesses to TN-Grid have been timing out since between 0520 and 0715 UTC today.

Edit: The website is back up, but uploads and scheduler requests are still failing.
2) Message boards : Projects : News on Project Outages (Message 103016)
Posted 16 Feb 2021 by Thyme Lawn
Post:
Asteroids@home is back up after a 12-week outage.
3) Message boards : Projects : News on Project Outages (Message 80149)
Posted 13 Aug 2017 by Thyme Lawn
Post:
WuProp is partially OFFLINE - Appears to be a DATABASE CRASH or other DATABASE ISSUES.
"8/12/2017 21:39:14 | WUProp@Home | Server can't open database"

WUProp@Home is back up with no obvious issues after its database connection problem.
4) Message boards : Projects : News on Project Outages (Message 79160)
Posted 21 Jun 2017 by Thyme Lawn
Post:
It looks like DENIS@Home has gone AWOL:

  • The project home page displays the message Your PHP installation appears to be missing the MySQL extension which is required by WordPress.
  • Other pages display the error Fatal error: Call to undefined function mysql_pconnect() in /home/boincadm/projects/denisathome/html/inc/db_conn.inc on line 46
  • Scheduler requests are failing with the message Server error: feeder not running

5) Message boards : Projects : News on Project Outages (Message 75724)
Posted 5 Feb 2017 by Thyme Lawn
Post:
DENIS@Home came back up at around 13:00 UTC
6) Message boards : Projects : News on Project Outages (Message 75704)
Posted 4 Feb 2017 by Thyme Lawn
Post:
DENIS@Home went down some time between 22:00 and 01:00 UTC (http://www.downforeveryoneorjustme.com/denis.usj.es).
7) Message boards : Projects : News on Project Outages (Message 65947)
Posted 10 Dec 2015 by Thyme Lawn
Post:
Announcement by the CPDN project team:

We will be taking the project offline tomorrow (Wednesday 9th December) from 10am (UK time) in order to take a snapshot of the database. This is part of the process of the re-configuration of a slave database machine. Once this snapshot process has completed we will bring the project back online again, we anticipate that this process will take a minimum of 24 hours to complete. We apologise in advance for any inconvenience.

The planned work has been completed and CPDN is now back online.
8) Message boards : Projects : News on Project Outages (Message 65879)
Posted 8 Dec 2015 by Thyme Lawn
Post:
Announcement by the CPDN project team:

We will be taking the project offline tomorrow (Wednesday 9th December) from 10am (UK time) in order to take a snapshot of the database. This is part of the process of the re-configuration of a slave database machine. Once this snapshot process has completed we will bring the project back online again, we anticipate that this process will take a minimum of 24 hours to complete. We apologise in advance for any inconvenience.
9) Message boards : Projects : Is Volpex dead? (Message 60145)
Posted 6 Feb 2015 by Thyme Lawn
Post:
The transitioner backlog has cleared and completed tasks for job number 9302 have just been successfully reported.
10) Message boards : Projects : Is Volpex dead? (Message 60143)
Posted 6 Feb 2015 by Thyme Lawn
Post:
Volpex seems to be failing to acknowledge reports of completed work for job number 9302 (workunits 5 and 6) but new tasks are still being sent for WU 6. I have a single main (WU 5) and 82 workers (WU 6) waiting to be reported, with run times ranging from 0:21 to 18:37. The scheduler request file for those 83 tasks weighs in at 4.3MB and with work fetch disabled it takes 3 minutes for the blank reply to arrive.

The website is extremely slow and according to the server status page there's currently an 8 hour transitioner backlog:

Tasks ready to send 0
Tasks in progress 6,548
Workunits waiting for validation 1
Workunits waiting for assimilation 2
Workunits waiting for file deletion 0
Tasks waiting for file deletion 0
Transitioner backlog (hours) 8
11) Message boards : Projects : Is Volpex dead? (Message 60020)
Posted 28 Jan 2015 by Thyme Lawn
Post:
The 7 week long networking problem has been resolved and new work is now being generated (the project's job log in the BOINC data directory shows that I've successfully completed tasks from 21 workunits in the past 2 days).
12) Message boards : Projects : ClimatePrediction.Net (AKA CPDN) NEWS (Message 59991)
Posted 27 Jan 2015 by Thyme Lawn
Post:
The project will have some scheduled downtime on Thursday and Friday.

The scheduled downtime has been pushed back a week and will now be on Thursday 5th and Friday 6th February.
13) Message boards : Projects : ClimatePrediction.Net (AKA CPDN) NEWS (Message 59987)
Posted 27 Jan 2015 by Thyme Lawn
Post:
CPDN will have some scheduled downtime on Thursday and Friday.

This is to allow the underlying hardware to be configured to accept a tape backup system as part of the 'near-line' storage.

Jonathan will be taking the opportunity to move the database backup to a different server, to give more resilience in case of hardware failures.

The downtime ought to be no more than a few hours on Thursday, but Jonathan does acknowledge that he's said that many times before!

All of the virtualised servers will be offline briefly:

  • climateprediction.net
  • trillionthtonne.net
  • climateapps2.oerc.ox.ac.uk
  • cpdn-upload2.oerc.ox.ac.uk
  • cpdn-results2.oerc.ox.ac.uk
  • database server

14) Message boards : Projects : QMC@home/cleanmobility.now (Message 59893)
Posted 21 Jan 2015 by Thyme Lawn
Post:
I had an email correspondence with Martin 6 months ago about a number of problems with the cleanmobility application:

  • checkpointing (failure to maintain the geometry optimization count, resulting in the additional calculations for the first cycle being repeated every time a task is restarted).
  • progress (jumps to 100% when it is starts its "FINAL ENERGY EVALUATION AT THE STATIONARY POINT" calculations). The task I monitored was at 100% for 180 hours (only 44.5 of them were necessary, the excess being due to a first cycle repeat following an enforced computer restart).
  • deadline (14 day limit for tasks which can run high priority for well over 5 weeks on a C2Q Q6600).
  • workunit settings (initial quorum=1, replication=2). Combining that with the deadline problems means that lots of tasks can end up being needlessly reissued, with only the first reported one being credited and the results from the others being thrown away. I reported one task after it had run high priority for 40 days only for it to be flagged as invalid because someone else had a successful completion 2 days earlier. I aborted another task after 45 days because the WU had been purged (doing valid science for other projects was more important than completing something which was going to end up in the bit bucket).

15) Message boards : Projects : News on Project Outages (Message 59891)
Posted 21 Jan 2015 by Thyme Lawn
Post:
WUProp@home is currently down.

The web site pages say that "The project's database server is down" and (unsurprisingly) scheduler requests are failing:
21/01/2015 12:39:25 | WUProp@Home | Reporting 1 completed tasks, requesting new tasks for CPU
21/01/2015 12:39:27 | WUProp@Home | Scheduler request completed: got 0 new tasks
21/01/2015 12:39:27 | WUProp@Home | Server error: feeder not running
16) Message boards : Projects : News on Project Outages (Message 52609)
Posted 17 Feb 2014 by Thyme Lawn
Post:
CPDN is currently down due to what appears to be a VM infrastructure failure. This is unlikely to be fixed until tomorrow.

CPDN is back up.
17) Message boards : Projects : News on Project Outages (Message 52592)
Posted 16 Feb 2014 by Thyme Lawn
Post:
CPDN is currently down due to what appears to be a VM infrastructure failure. This is unlikely to be fixed until tomorrow.
18) Message boards : Projects : News on Project Outages (Message 52131)
Posted 24 Jan 2014 by Thyme Lawn
Post:
malariacontrol.net went offline at around 0900 UTC on Friday 24 January and will be down for at least this weekend. All pages are currently generating the following message:

Site is temporary unavailable.

Sorry, we had a major failure! We're dealing with it and should be back next week, but the server will be down at least over this weekend.

We apologize for any inconvenience.
19) Message boards : Projects : QMC News - Project Moving, URL Change (Message 50965)
Posted 22 Oct 2013 by Thyme Lawn
Post:
All of the QMC server programs are running again. A new cleanmobility.now task was downloaded 30 minutes ago and immediately started running high priority (as all of the previous ones did).
20) Message boards : Projects : News on Project Outages (Message 50816)
Posted 9 Oct 2013 by Thyme Lawn
Post:
This looks like having been the state at QMC@home for at least the last 36 hours:

The website has no direct link to it, but the server status page does exist and it tallies with what my work requests are being told:

08-Oct-2013 22:24:40 [QMC@HOME] Sending scheduler request: To fetch work.
08-Oct-2013 22:24:40 [QMC@HOME] Requesting new tasks for CPU
08-Oct-2013 22:24:41 [QMC@HOME] Scheduler request completed: got 0 new tasks
08-Oct-2013 22:24:41 [QMC@HOME] Server error: feeder not running


Next 20

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.