Message boards : Projects : News on Project Outages
Author | Message |
---|---|
Send message Joined: 30 Mar 20 Posts: 372 |
Yeah, no support after business hours. Incredible. To the question "How long should I expect to wait for support?" on this page: https://helpwiki.sharcnet.ca/wiki/FAQ, the answer is: "Unfortunately Compute Canada/SHARCNET does not have adequate funding to provide support 24 hours a day, 7 days a week. User support and system monitoring is limited to regular business hours: there is no official support on weekends or holidays, or outside 9:00 - 17:00 EST. Please note that this includes monitoring of our systems and operations, so typically when there are problems overnight or on weekends/holidays system notices will not be posted until the next business day." So it's no wonder that everything, including the migration from IBM, takes such a long time compared to when WCG was run by IBM. That state of affairs is not going to work in the long run. If there's no support outside of business hours, WCG will slowly fade away. |
Send message Joined: 30 Mar 20 Posts: 372 |
WCG New update, 15 minutes ago: "Update #5: The storage server was revived yesterday late afternoon. Both database filesystems mounted as before, but the science filesystem did not. It needs a repair; erasing the old log first." |
Send message Joined: 3 Mar 23 Posts: 14 |
Just find other projects to use your computer time. No use complaining. Nothing is going to change. "Came, took offense, left." (= Perhaps, before whipping up further hysteria that "everything is lost", wait until this story ends and only THEN draw any conclusions (especially calls to abandon the project)? |
Send message Joined: 29 Aug 05 Posts: 71 |
As we have learned by now, SHARCNET (Shared Hierarchical Academic Research Computing Network) has free access to Compute Canada for academic research. https://youtu.be/hWkWAaNBILs?t=146 Free makes sense. I don't see a flow of cash to the project. Limited service makes sense for a free service. It's actually amazing to have any service at all for no charge! After all, somebody (the Canadian taxpayer) is paying for replacement parts, labour, delivery, etc. It also makes sense that this system is now overburdened by World Community Grid. It was not set up with the intention of hosting anything like a huge BOINC project. Good on these people for still trying to help us. They are relentless :) |
Send message Joined: 17 Nov 16 Posts: 863 |
Asteroids@home is back online. |
Send message Joined: 19 May 15 Posts: 123 |
Dennis currently telling me it has no work available. (Wonders if anybody ever reads anything on the projects or just connects blindly...) DENIS is releasing work in large batches as they fine-tune their models. They just finished the last batch and posted the results to News. Ironically, it's one main researcher who is overseeing the project, and he is a professor at the University (there seems to be a team in the background analyzing things, though). He posts and communicates more than the whole Krembil team... I do wonder if the communications intern doesn't know what to post or if they aren't letting her post. Someday I expect to see in an interview: "I was a communications intern at Krembil, but they never wanted to let me post updates about failures occurring at the project..." |
Send message Joined: 19 May 15 Posts: 123 |
Asteroids@home is back online. Asteroids@home periodically runs out of work. They came back to activity only recently, after a hiatus of a few years when their old hardware bit the dust. It's one person who is running the project, probably on a shoestring budget. I'm sure he'd be ecstatic if he got the rounding error of the budget LHC has. ^_^ |
Send message Joined: 28 Jun 20 Posts: 68 |
Asteroids@home is back online. Maybe. But I have tasks stuck in uploading, can't access my account, and can't get to their message boards or the Home Page. S. Gaber Oldsmar, FL |
Send message Joined: 10 May 07 Posts: 1329 |
Asteroids@home is back online. I have not had any problems accessing the website from DFW Metro area in Texas nor server access to send/receive tasks since the new certificate was installed. Restart your web browser and/or empty the browser cache to clean out old information it might contain. Then try accessing the forums. For stuck tasks in BOINC go to the transfers tab in advanced view and select 4 to 6 Asteroids tasks and retry upload until you have successfully transferred all. |
Send message Joined: 28 Jun 20 Posts: 68 |
Asteroids@home is back online. Still getting downloads from Universe. But all 26 of my tasks in Transfers say "Upload pending: project backoff." |
Send message Joined: 28 Jun 10 Posts: 2518 |
(Wonders if anybody ever reads anything on the projects or just connect blindly...) I did have a look at their forums but obviously not carefully enough! |
Send message Joined: 10 May 07 Posts: 1329 |
It has been nearly 2 weeks since WCG crashed & burned into the ether. Another Monday 1/2 gone and nothing but crickets from Krembil about what, if anything, is happening with the RAID STORAGE failure at WCG. WCG Facebook page: https://facebook.com/197379135651/ |
Send message Joined: 30 Mar 20 Posts: 372 |
Yup, last update from WCG according to the timestamp of the tweet on Twitter, was March 10, at 19:19 UTC. Now, it's March 13, 18:44 UTC. |
Send message Joined: 30 Mar 20 Posts: 372 |
New WCG Update, 20 minutes ago: "The web pages and forums are back online, but the recovery process continues. As a result, performance is slower than usual, and not all functionality is there. Until we can restart the science database and BOINC, stats/contributions are not accurate. We will provide further updates as we progress. Thank you for your patience." Edit, added: Well, saying that the website is back was going a bit too far. It's not possible to log in. "System Error" or "503 Service Unavailable" is the response to any attempt to log in. |
Send message Joined: 3 Mar 23 Posts: 10 |
WCG website back and forums working (but, sadly, no official communication update yesterday) |
Send message Joined: 3 Mar 23 Posts: 10 |
WCG website back and forums working (but, sadly, no official communication update yesterday) It looks like I spoke too soon. Website and forums down once more with the "System Error" message again. This does not bode well for the overall recovery. |
Send message Joined: 25 Aug 08 Posts: 39 |
Our website is currently down and we are looking into the root cause and a method to fix it. We will post a follow-up when it has been resolved. The latest news on WCG from their Twitter feed. Seems to be going backwards! |
Send message Joined: 30 Mar 20 Posts: 372 |
New WCG Update, on FB and Twitter, 15 minutes ago: Our website is currently down and we are looking into the root cause and a method to fix it. We will post a follow-up when it has been resolved. |
Send message Joined: 30 Mar 20 Posts: 372 |
First the problem was with the RAID card. Then a borrowed card from the data centre was installed, and they managed to successfully rebuild the RAID array. That didn't help, so now they said the problem was the PCI bus. (How could they successfully rebuild the RAID array with a broken PCI bus?) So another storage system (DSS 7000) was installed by the data center, and the RAID array was rebuilt again. "The "new" system did recognize the data hardware RAIDs. All have been rebuilt, and the data center is attempting to repair the OS drives/RAID." Later on: "The storage server was revived yesterday late afternoon. Both database filesystems mounted as before, but the science filesystem did not. It needs a repair; erasing the old log first." So, yesterday the website came back, but then took a dive again some hours later. BOINC is still MIA, of course. I think they are chasing ghosts and looking in the wrong direction. As said before: how could they successfully rebuild the RAID array the first time (after they had changed only the RAID card) with a broken PCI bus? |
Send message Joined: 30 Mar 20 Posts: 372 |
New WCG Update, on FB and Twitter, 30 minutes ago: Update: The system error has been resolved and all users should regain access to the website. Thank you for your patience. I doubt the website will stay up for long. Still no BOINC.... |
Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document
under the terms of the GNU Free Documentation License,
Version 1.2 or any later version published by the Free Software Foundation.