Thread 'manager not responding'

Ralph

Joined: 30 Sep 05
Posts: 50
Message 16155 - Posted: 31 Mar 2008, 13:09:41 UTC

Hi,

I've had what I can only describe as an unusual BOINC event.

My usual practice is to turn off BOINC networking, and then enable it, to watch as uploading and downloading occurs.

Today, the messages scrolled, the files appeared and disappeared, and then the manager display blanked, as though all my tasks had vanished.

I shut down BOINC (running as service), restarted it, and saw my tasks show up briefly, then vanish again with (not responding) up in the title bar.

I restarted my Vista machine. Same result.
I removed BOINC, restarted, re-installed BOINC. Same result.

I'd like to hear that someone else has seen this, and knows how to help.

Ralph
ID: 16155 · Report as offensive
ProfileJord
Volunteer tester
Help desk expert
Joined: 29 Aug 05
Posts: 15573
Netherlands
Message 16156 - Posted: 31 Mar 2008, 13:15:00 UTC

Where have you installed BOINC to under Vista? The default C:\Program Files\BOINC directory, or one you made yourself? Which version of BOINC?
ID: 16156 · Report as offensive
Ralph

Joined: 30 Sep 05
Posts: 50
Message 16157 - Posted: 31 Mar 2008, 14:23:14 UTC

Hi,

I was running 5.10.45 in the default directory, under Vista Ultimate.
Since it won't run, it's currently uninstalled.

I did some tech support with Dell and HP, so if you'd like any other information, I'm sure I can scrounge it up.

None of my other applications seem to be affected, but just to be sure, I downloaded Superantispyware, scanned, and got nothing but cookies, which I normally delete before my AVG scan. Superantispyware has since been removed.
The system seems 'squeaky' clean.

The install was from my old download (March 14). I suppose I could try to get another, if there are no other suggestions.
ID: 16157 · Report as offensive
Ralph

Joined: 30 Sep 05
Posts: 50
Message 16160 - Posted: 31 Mar 2008, 15:02:26 UTC

OK - installed to C:\boinc.
The manager seems to be running.
Where do I get the WU data to move, so that the new location will find it?


ID: 16160 · Report as offensive
Ralph

Joined: 30 Sep 05
Posts: 50
Message 16161 - Posted: 31 Mar 2008, 15:14:34 UTC

I tried moving the projects folder to C:\boinc. It didn't work.
The BOINC manager only detects one project, with no tasks.
ID: 16161 · Report as offensive
Ralph

Joined: 30 Sep 05
Posts: 50
Message 16164 - Posted: 31 Mar 2008, 15:44:17 UTC

If I re-attach, won't I lose all the tasks that were running before the crash?

I thought I shouldn't, but I tried Einstein, and it promptly went and started downloading work like I was a new account.

I'll try again.
ID: 16164 · Report as offensive
Ralph

Joined: 30 Sep 05
Posts: 50
Message 16165 - Posted: 31 Mar 2008, 16:11:58 UTC

Hello,

In all the years I've run BOINC, and SETI for years before that, this is the first time it's been suggested that I lose the work units from all the projects I'm running.

I'm disappointed that an alternative wasn't found, choices given, or another help avenue suggested so that work wasn't lost.

BOINC has just gone downhill in my list of reliable software.

I have a simple policy:

If I get good service, I tell 10 friends.
If I get bad service, I tell 10 friends.

Sometimes, they even listen...
ID: 16165 · Report as offensive
Uffe F

Joined: 31 Mar 08
Posts: 2
Denmark
Message 16166 - Posted: 31 Mar 2008, 16:39:38 UTC
Last modified: 31 Mar 2008, 16:42:58 UTC

Today something similar happened to me on my XP machine. All my tasks vanished, but every time I restart the client, they are there for 2 seconds before they vanish again. Any suggestions? I'm using version 5.10.45 and Windows XP Service Pack 2.

After some time the manager comes up with a dialog saying: BOINC Manager can't connect to a BOINC client. Do you want to try to connect again?
Nothing happens when I press the reconnect button...

My projects are: LHC, ABC, SETI and Rosetta. I was working on an LHC workunit when it started.

I would hate to just reinstall, because I have around 30 hours of computation for ABC almost done. So if anyone has some suggestions, I'm all ears...
ID: 16166 · Report as offensive
ProfileJord
Volunteer tester
Help desk expert
Joined: 29 Aug 05
Posts: 15573
Netherlands
Message 16168 - Posted: 31 Mar 2008, 17:02:05 UTC - in response to Message 16165.  

Hello,

In all the years I've run BOINC, and SETI for years before that, this is the first time it's been suggested that I lose the work units from all the projects I'm running.

I'm disappointed that an alternative wasn't found, choices given, or another help avenue suggested so that work wasn't lost.

There was a step Dagorath missed.

The correct way to do it is:

1. Uninstall BOINC from Add/remove Programs.
2. Move the BOINC directory with all sub directories from the default c:\program files\BOINC to c:\BOINC (saying Yes to overwrite if applicable)
3. Install BOINC to c:\BOINC

That way you won't lose any work. You shouldn't need to re-attach to the new projects either as you moved all the necessary information over.
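
For anyone who would rather script step 2 than drag folders around in Explorer, here is a minimal Python sketch. It copies rather than moves, so the original folder stays in place as a backup until the new install is confirmed working. The function name relocate_boinc_dir is made up for illustration, and the paths in the usage comment are simply the two directories named in the steps above; adjust them if your install differs. It assumes Python 3.8 or later (for dirs_exist_ok) and may need to be run from an elevated prompt on Vista.

```python
import shutil
from pathlib import Path


def relocate_boinc_dir(src, dst):
    """Copy the whole BOINC directory tree from src into dst.

    Intended to be run after uninstalling BOINC (step 1) and before
    reinstalling it to the new location (step 3).
    Returns the number of files found under dst after the copy.
    """
    src, dst = Path(src), Path(dst)
    if not src.is_dir():
        raise FileNotFoundError(f"source directory not found: {src}")
    # dirs_exist_ok=True merges into an existing destination and overwrites
    # files of the same name, the scripted equivalent of answering "Yes"
    # to the overwrite prompt.
    shutil.copytree(src, dst, dirs_exist_ok=True)
    return sum(1 for p in dst.rglob("*") if p.is_file())


# Hypothetical usage with the directories from the steps above:
# relocate_boinc_dir(r"C:\Program Files\BOINC", r"C:\BOINC")
```

Once the reinstalled client shows the old tasks again, the original folder can be deleted by hand.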


ID: 16168 · Report as offensive
Ralph

Joined: 30 Sep 05
Posts: 50
Message 16173 - Posted: 31 Mar 2008, 18:00:15 UTC

Hi,

Back to square one -

I have a BOINC program that will look like it starts, and even shows the first screen sometimes, with projects or statistics.

Then it shows 'not responding' in the title bar, and all BOINC tabs are blank after that.

I re-downloaded, uninstalled, changed directories, and copied the files over OK. I also remembered the item I had forgotten: "run as administrator" on the install.

Same again.

I'll see if there are any posts tomorrow. I'm slowly getting frustrated.
ID: 16173 · Report as offensive
Nicolas

Joined: 19 Jan 07
Posts: 1179
Argentina
Message 16175 - Posted: 31 Mar 2008, 18:14:22 UTC - in response to Message 16173.  

Post the last few lines of stdoutdae.txt from your BOINC directory. Also stderrdae.txt, if it has anything in it (but look at the Dump Timestamp, as the logs may be very old).
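
If the log is long, a small script can grab just the tail for pasting here. This is only a sketch: the function name last_lines and the default of 20 lines are my own choices, and the path in the usage comment assumes the default install directory mentioned earlier in the thread.

```python
from collections import deque


def last_lines(path, count=20):
    """Return the last `count` lines of a text file as a list of strings."""
    # A bounded deque keeps only the newest `count` lines while the file
    # is read once from start to end.
    with open(path, "r", encoding="utf-8", errors="replace") as fh:
        return [line.rstrip("\n") for line in deque(fh, maxlen=count)]


# Hypothetical usage, assuming the default BOINC directory:
# for line in last_lines(r"C:\Program Files\BOINC\stdoutdae.txt"):
#     print(line)
```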
ID: 16175 · Report as offensive
genes
Joined: 14 Dec 06
Posts: 16
United States
Message 16190 - Posted: 31 Mar 2008, 21:01:54 UTC

I seem to have a similar problem, if not the same one. I just noticed it today, on one machine. I went to check the machine and found the BOINC manager disconnected. The status bar says "connecting to localhost". If I stop it and restart, I can momentarily see all my projects, then the status bar says "disconnected", then "connecting to localhost" again. I initially saw it with version 5.10.42. I upgraded to 5.10.45, and while it was doing the benchmarks it remained connected, but once those finished, it failed again.

Looking at stdoutdae.txt, the last thing it tries to do before disconnecting is to upload an lhc@home result:

31-Mar-2008 16:48:43 [---] Starting BOINC client version 5.10.45 for windows_intelx86
31-Mar-2008 16:48:43 [---] log flags: task, file_xfer, sched_ops
31-Mar-2008 16:48:43 [---] Libraries: libcurl/7.18.0 OpenSSL/0.9.8e zlib/1.2.3
31-Mar-2008 16:48:43 [---] Data directory: C:\Program Files\BOINC
31-Mar-2008 16:48:44 [---] Processor: 2 GenuineIntel Intel(R) Pentium(R) D CPU 3.00GHz [x86 Family 15 Model 4 Stepping 4]
31-Mar-2008 16:48:44 [---] Processor features: fpu tsc pae nx sse sse2 mmx
31-Mar-2008 16:48:44 [---] OS: Microsoft Windows XP: Professional Edition, Service Pack 2, (05.01.2600.00)
31-Mar-2008 16:48:44 [---] Memory: 2.00 GB physical, 3.85 GB virtual
31-Mar-2008 16:48:44 [---] Disk: 37.27 GB total, 23.10 GB free
31-Mar-2008 16:48:44 [---] Local time is UTC -4 hours
31-Mar-2008 16:48:44 [rosetta@home] URL: http://boinc.bakerlab.org/rosetta/; Computer ID: 24538; location: work; project prefs: default
31-Mar-2008 16:48:44 [climateprediction.net] URL: http://climateprediction.net/; Computer ID: 846; location: work; project prefs: work
31-Mar-2008 16:48:44 [Einstein@Home] URL: http://einstein.phys.uwm.edu/; Computer ID: 1021; location: work; project prefs: work
31-Mar-2008 16:48:44 [lhcathome] URL: http://lhcathome.cern.ch/lhcathome/; Computer ID: 2366328; location: home; project prefs: default
31-Mar-2008 16:48:44 [QMC@HOME] URL: http://qah.uni-muenster.de/; Computer ID: 23361; location: (none); project prefs: default
31-Mar-2008 16:48:44 [SETI@home] URL: http://setiathome.berkeley.edu/; Computer ID: 1157146; location: work; project prefs: work
31-Mar-2008 16:48:44 [SETI@home Beta Test] URL: http://setiweb.ssl.berkeley.edu/beta/; Computer ID: 2496; location: home; project prefs: home
31-Mar-2008 16:48:44 [Spinhenge@home] URL: http://spin.fh-bielefeld.de/; Computer ID: 5054; location: home; project prefs: default
31-Mar-2008 16:48:44 [malariacontrol.net beta] URL: http://www.malariacontrol.net/; Computer ID: 81939; location: (none); project prefs: default
31-Mar-2008 16:48:44 [---] General prefs: from http://www.worldcommunitygrid.org/ (last modified 04-Dec-2007 22:41:45)
31-Mar-2008 16:48:44 [---] Host location: work
31-Mar-2008 16:48:44 [---] General prefs: using separate prefs for work
31-Mar-2008 16:48:44 [---] Reading preferences override file
31-Mar-2008 16:48:44 [---] Preferences limit memory usage when active to 1023.21MB
31-Mar-2008 16:48:44 [---] Preferences limit memory usage when idle to 1841.78MB
31-Mar-2008 16:48:44 [---] Preferences limit disk usage to 18.63GB
31-Mar-2008 16:48:44 [---] Running CPU benchmarks
31-Mar-2008 16:48:44 [lhcathome] Started upload of wm72A_m72allA__18__64.276_59.306__4_6__6__54_1_sixvf_boinc342911_1_0

This pattern repeats every time I shut down and restart BOINC. I've also tried restarting the machine, and then shutting it down and restarting.
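
The log lines above all have the same shape (timestamp, project name in square brackets, message), so finding the last thing the client did before it died can be automated. The sketch below is based only on the lines shown in this post, not on any official description of the log format, and the names LOG_LINE, parse_line and last_entry are invented for the example.

```python
import re

# Pattern inferred from the log excerpt above:
#   DD-Mon-YYYY HH:MM:SS [project] message
LOG_LINE = re.compile(
    r"^(\d{2}-[A-Za-z]{3}-\d{4} \d{2}:\d{2}:\d{2}) \[([^\]]*)\] (.*)$"
)


def parse_line(line):
    """Split one log line into (timestamp, project, message), or None if it does not match."""
    m = LOG_LINE.match(line.strip())
    return m.groups() if m else None


def last_entry(lines):
    """Return the last line that parses as a log entry, or None if there is none."""
    last = None
    for line in lines:
        parsed = parse_line(line)
        if parsed is not None:
            last = parsed
    return last


sample = [
    "31-Mar-2008 16:48:44 [---] Running CPU benchmarks",
    "31-Mar-2008 16:48:44 [lhcathome] Started upload of "
    "wm72A_m72allA__18__64.276_59.306__4_6__6__54_1_sixvf_boinc342911_1_0",
]
timestamp, project, message = last_entry(sample)
print(project, "->", message)
```

Run against the full stdoutdae.txt after each failed start, this would show whether the final entry is always an upload for the same project.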

Looking at stderrdae.txt, it looks like a Dr.Watson log, but from the BOINC Runtime Debugger (5.10.45). The event seems to be an access violation of some sort. I have that available as well, though it's bigger...

Currently the machine is not running BOINC. Maybe I will try backing up the whole thing, then temporarily relocating each of the "slots" folders to see if one of those is causing the problem. The first one I'll try is the LHC.

ID: 16190 · Report as offensive
ProfileJord
Volunteer tester
Help desk expert
Joined: 29 Aug 05
Posts: 15573
Netherlands
Message 16194 - Posted: 31 Mar 2008, 21:08:05 UTC - in response to Message 16190.  

I seem to have a similar problem, if not the same one. I just noticed it today, on one machine.

Gene, do you have work for LHC that is finished and trying to upload?
If so, then that is most probably what is breaking BOINC at this time. If you're reading the Boinc_Dev/Boinc_Alpha email lists, see what JM7 posted about it today.
ID: 16194 · Report as offensive
ProfileGuy
Joined: 9 Feb 08
Posts: 54
United Kingdom
Message 16198 - Posted: 31 Mar 2008, 21:20:31 UTC

I can not connect to 'localhost' when boinc manager starts either... Also, there seems to be a long, blanked out password in the Connect to Computer dialogue too.

My CPID is the same for the three projects I am involved with. Is there an error on My Account page at BOINC? It has a different CPID. They used to be the same - I made a note.

What is wrong at LHC??

Guy.

ID: 16198 · Report as offensive
glennpat

Joined: 31 Mar 08
Posts: 1
United States
Message 16201 - Posted: 31 Mar 2008, 21:25:40 UTC

My BOINC stopped connecting on 11 of my 12 boxes today. I think the only thing common was they were running LHC@Home. I have several Vista, XP, and 4 Linux boxes.
ID: 16201 · Report as offensive
Les Bayliss
Help desk expert

Joined: 25 Nov 05
Posts: 1654
Australia
Message 16203 - Posted: 31 Mar 2008, 21:34:33 UTC

My CPID is the same for the three projects I am involved with. Is there an error on My Account page at BOINC? It has a different CPID. They used to be the same - I made a note.


Guy

For the cpids to synchronise, contact must be made with each of the projects, using a single computer. After a variable period of time, all of the projects will know what cpid each of the others has, and will change to that of the last attached project. (NOT the last project contacted for this syncing!)

As I posted elsewhere, (as well as Ageless), THIS forum is NOT a project.
So it CANNOT sync with any other.
And if your other projects have changed their cpids for your accounts on them, then your account on THIS forum will be out-of-sync. Permanently.

ID: 16203 · Report as offensive
genes
Joined: 14 Dec 06
Posts: 16
United States
Message 16205 - Posted: 31 Mar 2008, 21:39:00 UTC - in response to Message 16194.  
Last modified: 31 Mar 2008, 21:44:28 UTC

I seem to have a similar problem, if not the same one. I just noticed it today, on one machine.

Gene, do you have work for LHC that is finished and trying to upload?
If so, then that is most probably what is breaking BOINC at this time. if you're reading the Boinc_Dev/Boinc_Alpha email lists, see what JM7 posted about it today.


Yes, it is LHC, and I see this is also going on in the Core Client area as well in this thread

(no, I am not on those mailing lists)

Just noticed that LHC site is down for maintenance, so no answers there. Perhaps it is exactly this that is causing the problem, not something wrong with the LHC result.
ID: 16205 · Report as offensive
ProfileGuy
Joined: 9 Feb 08
Posts: 54
United Kingdom
Message 16209 - Posted: 31 Mar 2008, 21:47:47 UTC

Thanks Les.
Say, where and how do I read Boinc_Dev/Boinc_Alpha email lists mentioned in Message ID 16194?
Thanks,

ID: 16209 · Report as offensive
ProfileJord
Volunteer tester
Help desk expert
Joined: 29 Aug 05
Posts: 15573
Netherlands
Message 16210 - Posted: 31 Mar 2008, 21:51:29 UTC - in response to Message 16205.  

Just noticed that LHC site is down for maintenance, so no answers there. Perhaps it is exactly this that is causing the problem, not something wrong with the LHC result.

It IS something with the contact to the scheduler at LHC.

The problem is that the scheduler at http://lhcathome.cern.ch/lhcathome_cgi/cgi points to http://lhcathome.cern.ch/ ... while http://lhcathome.cern.ch/lhcathome is the correct website. It also reverts back to the other link.

Is anyone using a debug build of BOINC that returns a good stack trace on this problem? If it's there, the trace can be found in stderrdae.txt
ID: 16210 · Report as offensive
genes
Joined: 14 Dec 06
Posts: 16
United States
Message 16217 - Posted: 31 Mar 2008, 21:59:38 UTC

I'm using 5.10.45, not the debug version, but the stderrdae.txt log DOES have a callstack in it if that helps.
ID: 16217 · Report as offensive

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.