Message boards : BOINC client : segmentation violation
Author | Message |
---|---|
Send message Joined: 31 Mar 08 Posts: 7 |
when i came home today, my boinc manager told me that it wasn't connected to the client anymore. i then tried to restart the client manually, but it gave me the following error:
[iLuvatar@localhost BOINC]$ ./run_client
31-Mar-2008 18:48:19 [---] Starting BOINC client version 5.10.28 for x86_64-pc-linux-gnu
31-Mar-2008 18:48:19 [---] log flags: task, file_xfer, sched_ops
31-Mar-2008 18:48:19 [---] Libraries: libcurl/7.17.1 OpenSSL/0.9.8g zlib/1.2.3
31-Mar-2008 18:48:19 [---] Data directory: /home/iLuvatar/BOINC
31-Mar-2008 18:48:19 [---] Processor: 4 GenuineIntel Intel(R) Core(TM)2 Quad CPU @ 2.40GHz [Family 6 Model 15 Stepping 7]
31-Mar-2008 18:48:19 [---] Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx lm constant_tsc arch_perfmon pebs bts rep_good pni monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr lahf_lm
31-Mar-2008 18:48:19 [---] OS: Linux: 2.6.24.3-50.fc8
31-Mar-2008 18:48:19 [---] Memory: 3.87 GB physical, 2.42 GB virtual
31-Mar-2008 18:48:19 [---] Disk: 17.03 GB total, 6.72 GB free
31-Mar-2008 18:48:19 [---] Local time is UTC +2 hours
31-Mar-2008 18:48:19 [SETI@home] URL: http://setiathome.berkeley.edu/; Computer ID: 4110986; location: home; project prefs: default
31-Mar-2008 18:48:19 [lhcathome] URL: http://lhcathome.cern.ch/lhcathome/; Computer ID: 9657577; location: home; project prefs: default
31-Mar-2008 18:48:19 [Spinhenge@home] URL: http://spin.fh-bielefeld.de/; Computer ID: 83787; location: home; project prefs: default
31-Mar-2008 18:48:19 [Einstein@Home] URL: http://einstein.phys.uwm.edu/; Computer ID: 1083282; location: (none); project prefs: default
31-Mar-2008 18:48:19 [QMC@HOME] URL: http://qah.uni-muenster.de/; Computer ID: 85924; location: (none); project prefs: default
31-Mar-2008 18:48:19 [---] General prefs: from http://szdg.lpds.sztaki.hu/szdg/ (last modified 31-Mar-2006 13:01:33)
31-Mar-2008 18:48:19 [---] Host location: none
31-Mar-2008 18:48:19 [---] General prefs: using your defaults
31-Mar-2008 18:48:19 [---] Reading preferences override file
31-Mar-2008 18:48:19 [---] Preferences limit memory usage when active to 1983.36MB
31-Mar-2008 18:48:19 [---] Preferences limit memory usage when idle to 3570.04MB
31-Mar-2008 18:48:19 [---] Preferences limit disk usage to 0.47GB
31-Mar-2008 18:48:19 [lhcathome] Started upload of wm72A_m72allA__7__64.275_59.305__10_12__6__54_1_sixvf_boinc339019_4_0
31-Mar-2008 18:48:20 [Einstein@Home] Restarting task h1_0929.50_S5R3__484_S5R3b_1 using einstein_S5R3 version 438
31-Mar-2008 18:48:20 [Spinhenge@home] Restarting task 5_Fe30_map_1398_326_0 using metropolis version 312
31-Mar-2008 18:48:20 [Einstein@Home] Restarting task h1_0929.50_S5R3__473_S5R3b_0 using einstein_S5R3 version 438
31-Mar-2008 18:48:20 [lhcathome] Sending scheduler request: To fetch work. Requesting 2681 seconds of work, reporting 0 completed tasks
SIGSEGV: segmentation violation
Stack trace (9 frames):
./boinc[0x4487b9]
/lib64/libpthread.so.0[0x3bb500e540]
./boinc[0x459684]
./boinc[0x435fcc]
./boinc[0x436ae3]
./boinc[0x41267c]
./boinc[0x438df9]
/lib64/libc.so.6(__libc_start_main+0xf4)[0x3bb441e074]
./boinc(__gxx_personality_v0+0x1b9)[0x4056f9]
Exiting...
[iLuvatar@localhost BOINC]$
any suggestions what the problem might be? just a bad work unit or something serious? thx |
Send message Joined: 29 Aug 05 Posts: 15542 |
SIGSEGV: segmentation violation If it does it constantly, it sounds like something serious. With that in mind, it can be a memory error, hard drive (read or write) error or a CPU error. |
Send message Joined: 6 Jun 06 Posts: 12 |
I have the same problem running under Kubuntu 7.10. I issue: /etc/init.d/boinc-client start and ps shows sixtrack is running. After 10 or 20 seconds, this is written to /var/lib/boinc-client/stderrdae.txt:
UNRECOGNIZED: suspend_if_no_recent_input
SIGSEGV: segmentation violation
Stack trace (9 frames):
/usr/bin/boinc_client[0x44a759]
/lib/libpthread.so.0[0x2b9e865a3100]
/usr/lib/libcurl.so.4(curl_multi_remove_handle+0x44)[0x2b9e85231c24]
/usr/bin/boinc_client[0x43636c]
/usr/bin/boinc_client[0x4376dc]
/usr/bin/boinc_client[0x4127de]
/usr/bin/boinc_client[0x4394cd]
/lib/libc.so.6(__libc_start_main+0xf4)[0x2b9e86a4fb44]
/usr/bin/boinc_client(__gxx_personality_v0+0x179)[0x4049b9]
Exiting...
stdoutdae.txt has:
2008-03-31 19:00:25 [---] Starting BOINC client version 5.10.8 for x86_64-pc-linux-gnu
2008-03-31 19:00:25 [---] log flags: task, file_xfer, sched_ops
2008-03-31 19:00:25 [---] Libraries: libcurl/7.16.4 OpenSSL/0.9.8e zlib/1.2.3.3 libidn/1.0
2008-03-31 19:00:25 [---] Data directory: /var/lib/boinc-client
2008-03-31 19:00:25 [---] Processor: 2 GenuineIntel Intel(R) Core(TM)2 CPU 6300 @ 1.86GHz [Family 6 Model 15 Stepping 2]
2008-03-31 19:00:25 [---] Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm syscall nx lm constant_tsc pni monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr lahf_lm
2008-03-31 19:00:25 [---] Memory: 2.95 GB physical, 1.91 GB virtual
2008-03-31 19:00:25 [---] Disk: 45.84 GB total, 39.51 GB free
2008-03-31 19:00:25 [Einstein@Home] URL: http://einstein.phys.uwm.edu/; Computer ID: 962236; location: home; project prefs: default
2008-03-31 19:00:25 [climateprediction.net] URL: http://climateprediction.net/; Computer ID: 715024; location: home; project prefs: default
2008-03-31 19:00:25 [lhcathome] URL: http://lhcathome.cern.ch/lhcathome/; Computer ID: 9620369; location: (none); project prefs: default
2008-03-31 19:00:25 [SETI@home] URL: http://setiathome.berkeley.edu/; Computer ID: 3444335; location: home; project prefs: default
2008-03-31 19:00:25 [QMC@HOME] URL: http://qah.uni-muenster.de/; Computer ID: 52563; location: (none); project prefs: default
2008-03-31 19:00:25 [---] General prefs: from Einstein@Home (last modified 2008-01-23 17:43:14)
2008-03-31 19:00:25 [---] Host location: home
2008-03-31 19:00:25 [---] General prefs: no separate prefs for home; using your defaults
2008-03-31 19:00:25 [---] Reading preferences override file
2008-03-31 19:00:25 [---] Preferences limit memory usage when active to 1814.45MB
2008-03-31 19:00:25 [---] Preferences limit memory usage when idle to 2721.67MB
2008-03-31 19:00:25 [---] Preferences limit disk usage to 13.75GB
2008-03-31 19:00:25 [lhcathome] [file_xfer] Started upload of file wm72A_m72allA__17__64.281_59.311__6_8__6__18_1_sixvf_boinc342721_2_0
2008-03-31 19:00:25 [lhcathome] Restarting task wm72A_m72allA__4__64.284_59.314__14_16__6__18_1_sixvf_boinc338257_3 using sixtrack version 466
2008-03-31 19:00:25 [lhcathome] Restarting task wm72A_m72allA__19__64.283_59.313__4_6__6__72_1_sixvf_boinc343488_4 using sixtrack version 466
2008-03-31 19:00:25 [lhcathome] Sending scheduler request: To fetch work
2008-03-31 19:00:25 [lhcathome] Requesting 313588 seconds of new work
var/crash/_var_lib_boinc-client_projects_setiathome.berkeley.edu_setiathome-5.28.x86_64-pc-linux-gnu.112.crash was written this afternoon. The LHC@HOME Web site is currently down for maintenance. |
Send message Joined: 6 Jun 06 Posts: 12 |
5.10.8-1 is the version shown by Adept (APT front-end), and it's installed. I recall I had to do some manual work to get it going on this 64-bit system, though, so I'm confused. Off to do some research on the Web... |
Send message Joined: 6 Jun 06 Posts: 12 |
OK, so what I could do is uninstall the boinc-client (5.10.8-1) Ubuntu package and manually install the Linux X64 5.10.45 version. Will doing that delete my existing work units, though? In particular, I have a month or two invested in a climateprediction WU. :-( |
Send message Joined: 6 Jun 06 Posts: 12 |
Unfortunately, I still get a SIGSEGV with 5.10.45 when sixtrack restarts. |
Send message Joined: 31 Mar 08 Posts: 7 |
also tried 5.10.45, no success. i also ran a memory and cpu test recently, so they should be ok. i am, though, having some hd problems from time to time (kernel journal commit I/O error), but i don't know if that has anything to do with it. |
Send message Joined: 29 Aug 05 Posts: 15542 |
Hold on, everyone. Is everyone who has this problem attached to LHC, holding work for it, and possibly trying to upload work to it? |
Send message Joined: 27 Jun 06 Posts: 305 |
Here we have a _very_ similar one. I don't know which CC version, but it's certainly 64-bit. The log shows no LHC access attempts:
30-Mar-2008 23:34:47 [Poem@Home] [file_xfer] Throughput 17827 bytes/sec
30-Mar-2008 23:34:53 [Poem@Home] Sending scheduler request: To fetch work
30-Mar-2008 23:34:53 [Poem@Home] Requesting 476729 seconds of new work, and reporting 1 completed tasks
30-Mar-2008 23:35:10 [---] Project communication failed: attempting access to reference site
30-Mar-2008 23:35:10 [Poem@Home] Scheduler request failed: couldn't resolve host name
30-Mar-2008 23:35:10 [Poem@Home] Deferring communication for 1 min 0 sec
30-Mar-2008 23:35:10 [Poem@Home] Reason: scheduler request failed
30-Mar-2008 23:35:52 [Poem@Home] Task data_156_1206675990_815500093_0 exited with zero status but no 'finished' file
30-Mar-2008 23:35:52 [Poem@Home] If this happens repeatedly you may need to reset the project.
30-Mar-2008 23:35:52 [---] Access to reference site failed - check network connection or proxy configuration.
SIGSEGV: segmentation violation
Stack trace (10 frames):
/opt/BOINC/boinc[0x44e299]
/lib64/libc.so.6[0x3618e300b0]
/lib64/libc.so.6(memset+0x60)[0x3618e77440]
/opt/BOINC/boinc[0x413943]
/opt/BOINC/boinc[0x413e17]
/opt/BOINC/boinc[0x423147]
/opt/BOINC/boinc[0x41aa59]
/opt/BOINC/boinc[0x43d9d4]
/lib64/libc.so.6(__libc_start_main+0xf4)[0x3618e1d8a4]
/opt/BOINC/boinc(__gxx_personality_v0+0x1c9)[0x409409]
Exiting...
It seems to have crashed just when a network access occurred. The call stack looks different, but if it generated a signal that caused a dump, it can happen anywhere in a program. I pointed the "owner" of this sigsegv to this thread. |
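For anyone wanting to compare the stack traces posted in this thread, here is a rough Python sketch that splits a trace into frames. It assumes the `module(symbol+offset)[address]` layout seen in the traces above; the names `Frame`, `FRAME_RE` and `parse_frames` are made up for this example and are not part of BOINC.

```python
import re
from typing import List, NamedTuple, Optional


class Frame(NamedTuple):
    module: str            # binary or shared library the frame belongs to
    symbol: Optional[str]  # exported symbol name, if the trace shows one
    address: str           # return address as printed, e.g. "0x44e299"


# Assumed layout: module, optional "(symbol+0xoffset)", then "[0xaddress]".
FRAME_RE = re.compile(
    r"(?P<module>[^\s(\[]+)"
    r"(?:\((?P<symbol>[^)+]*)(?:\+0x[0-9a-fA-F]+)?\))?"
    r"\[(?P<address>0x[0-9a-fA-F]+)\]"
)


def parse_frames(trace: str) -> List[Frame]:
    """Extract frames from a pasted trace, whether one per line or all on one line."""
    return [
        Frame(m["module"], m["symbol"] or None, m["address"])
        for m in FRAME_RE.finditer(trace)
    ]


if __name__ == "__main__":
    # Three frames copied from the trace in the post above.
    sample = (
        "/opt/BOINC/boinc[0x44e299] "
        "/lib64/libc.so.6[0x3618e300b0] "
        "/lib64/libc.so.6(memset+0x60)[0x3618e77440]"
    )
    for frame in parse_frames(sample):
        print(frame.module, frame.symbol or "?", frame.address)
```

Frames inside the stripped boinc binary carry no symbol, so only the library frames (memset here, curl_multi_remove_handle in the Kubuntu trace) give a hint about where each crash happened.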
Send message Joined: 31 Mar 08 Posts: 7 |
seems like i am uploading to LHC... you think this could be the problem? |
Send message Joined: 29 Aug 05 Posts: 15542 |
Yes, it would seem to be the problem. It was reported earlier on the development email lists that it would crash Windows versions of BOINC 5.10.45, so I suspect it is the problem for your Linux version as well. I'm in contact with one of the developers at this moment. Can you check, if you're quick enough, what BOINC does if you disable its ability to connect to the network/internet? (Activity menu in BOINC Manager->Suspend network activity) |
Send message Joined: 31 Mar 08 Posts: 7 |
is it possible to change some preferences file, i don't think i manage to start the manager before the client crashes? |
Send message Joined: 14 Dec 06 Posts: 16 |
(Activity menu in BOINC Manager->Suspend network activity) I can't seem to get to the Activity settings before it crashes, and setting it after that is forgotten the next time it starts. |
Send message Joined: 6 Jun 06 Posts: 12 |
Can you check, if you're quick enough, what BOINC does if you disable its ability to connect to the network/internet? (Activity menu in BOINC Manager->Suspend network activity) I tried it a few times already, without success. This time, though, I shut down the network interface first with "ifdown eth0", then started the client and suspended its network activity, and finally restarted the interface with "ifup eth0". Now lhcathome is happily running, and one completed lhcathome WU is trying to upload. So, it seems the latter is the cause of the problem. |
Send message Joined: 29 Aug 05 Posts: 15542 |
iLuvatar wrote: is it possible to change some preferences file, i don't think i manage to start the manager before the client crashes? You can try to edit the client_state.xml file: scroll all the way down and change the network option so that it reads <user_network_request>3</user_network_request>. That value means you have suspended network activity. |
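A minimal Python sketch of the edit described above, for those who would rather script it. It assumes the client is stopped first (a running client may overwrite the file) and that the element already exists in client_state.xml. The function names and the `.bak` backup suffix are hypothetical; the value 3 is the one quoted in the post.

```python
import re
import shutil
from pathlib import Path

# Value quoted in the post above: 3 means network activity is suspended.
SUSPEND_VALUE = "3"

TAG_RE = re.compile(
    r"(<user_network_request>)\s*\d+\s*(</user_network_request>)"
)


def suspend_network(state_text: str) -> str:
    """Return state_text with <user_network_request> set to SUSPEND_VALUE.

    Raises ValueError if the element is missing instead of guessing
    where it should be added.
    """
    new_text, count = TAG_RE.subn(rf"\g<1>{SUSPEND_VALUE}\g<2>", state_text)
    if count == 0:
        raise ValueError("no <user_network_request> element found")
    return new_text


def patch_state_file(path: Path) -> None:
    """Keep a backup copy next to the state file, then rewrite it in place."""
    shutil.copy2(path, path.with_name(path.name + ".bak"))
    path.write_text(suspend_network(path.read_text()))
```

Usage would be something like `patch_state_file(Path("client_state.xml"))` run from the BOINC data directory, which is /home/iLuvatar/BOINC and /var/lib/boinc-client in the logs posted earlier. Working on the text with a regular expression rather than re-serialising the XML leaves the rest of the file byte-for-byte unchanged.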
Send message Joined: 14 Dec 06 Posts: 16 |
is it possible to change some preferences file, i don't think i manage to start the manager before the client crashes? That worked for me! It's now running. I have to leave now but I'll check it later in an hour or so. |
Send message Joined: 31 Mar 08 Posts: 7 |
yup, worked for me, too. so, what now? get rid of the wu? |
Send message Joined: 31 Mar 08 Posts: 59 |
Yeah, that worked for me too. |
Send message Joined: 29 Aug 05 Posts: 15542 |
iLuvatar wrote: yup, worked for me, too. so, what now? get rid of the wu? You can only do that by editing the client_state.xml file. In the other thread I'm offering to show people how to (temporarily) remove LHC from the client_state.xml file. Doing so wipes out any work from LHC that is trying to upload or report. |
Send message Joined: 31 Mar 08 Posts: 59 |
I'm in contact with one of the developers at this moment... Well, that's good that you're talking to "developers". Perhaps you can tell 'em that there IS a general issue with the BOINC manager specific to the LHC client. It causes repeated DLL init errors on my machine. There is a thread on the LHC forum regarding that issue that I initiated some time ago, with no resolution yet (despite having migrated BOINC to the most current one). It appears that LHC WU are successfully completing in any case. It's just that using my machine while LHC is running is virtually impossible. What's most annoying about this problem is that it causes my machine to hang for as long as two minutes. Even though the mouse cursor moves, it remains frozen with the "hand pointer" icon (nothing else is accessible on the desktop, although sometimes I can access the auto-hide taskbar). The time period between "freezes" is variable, as is the time period of "freezing". The system invariably resumes whatever it was doing when it "wakes up". Sometimes, but not always, there'll be a "DLL init" error message in the BOINC messages pane. There is NEVER a stderr.txt file in the LHC slot folder (and stderrdae.txt is empty also). No other BOINC client gives me this issue.
3/31/2008 6:29:09 PM|rosetta@home|URL: http://boinc.bakerlab.org/rosetta/; Computer ID: 609131; location: home; project prefs: home
3/31/2008 6:29:09 PM|boincsimap|URL: http://boinc.bio.wzw.tum.de/boincsimap/; Computer ID: 83721; location: home; project prefs: default
3/31/2008 6:29:09 PM|Einstein@Home|URL: http://einstein.phys.uwm.edu/; Computer ID: 1011976; location: home; project prefs: home
3/31/2008 6:29:09 PM|lhcathome|URL: http://lhcathome.cern.ch/lhcathome/; Computer ID: 9636853; location: home; project prefs: home
3/31/2008 6:29:09 PM|SETI@home|URL: http://setiathome.berkeley.edu/; Computer ID: 3829768; location: home; project prefs: home
3/31/2008 6:29:09 PM|Spinhenge@home|URL: http://spin.fh-bielefeld.de/; Computer ID: 100366; location: home; project prefs: default
3/31/2008 6:29:09 PM|uFluids|URL: http://www.ufluids.net/; Computer ID: 57580; location: home; project prefs: default
3/31/2008 6:29:09 PM|The Lattice Project|URL: http://boinc.umiacs.umd.edu/; Computer ID: 10196; location: home; project prefs: default
3/31/2008 6:29:09 PM|Leiden Classical|URL: http://boinc.gorlaeus.net/; Computer ID: 39374; location: home; project prefs: default
So dunno what to tell you, and it's a completely different issue than what started the thread, but now there's this new problem. |
Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document
under the terms of the GNU Free Documentation License,
Version 1.2 or any later version published by the Free Software Foundation.