Message boards :
BOINC client :
segmentation violation
Message board moderation
Author | Message |
---|---|
Send message Joined: 31 Mar 08 Posts: 7 ![]() |
when i came home today my boinc manager told me that it wasn't connected to client anymore. i then tried to restart the client manually, but it gave me the following error: [iLuvatar@localhost BOINC]$ ./run_client 31-Mar-2008 18:48:19 [---] Starting BOINC client version 5.10.28 for x86_64-pc-linux-gnu 31-Mar-2008 18:48:19 [---] log flags: task, file_xfer, sched_ops 31-Mar-2008 18:48:19 [---] Libraries: libcurl/7.17.1 OpenSSL/0.9.8g zlib/1.2.3 31-Mar-2008 18:48:19 [---] Data directory: /home/iLuvatar/BOINC 31-Mar-2008 18:48:19 [---] Processor: 4 GenuineIntel Intel(R) Core(TM)2 Quad CPU @ 2.40GHz [Family 6 Model 15 Stepping 7] 31-Mar-2008 18:48:19 [---] Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx lm constant_tsc arch_perfmon pebs bts rep_good pni monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr lahf_lm 31-Mar-2008 18:48:19 [---] OS: Linux: 2.6.24.3-50.fc8 31-Mar-2008 18:48:19 [---] Memory: 3.87 GB physical, 2.42 GB virtual 31-Mar-2008 18:48:19 [---] Disk: 17.03 GB total, 6.72 GB free 31-Mar-2008 18:48:19 [---] Local time is UTC +2 hours 31-Mar-2008 18:48:19 [SETI@home] URL: http://setiathome.berkeley.edu/; Computer ID: 4110986; location: home; project prefs: default 31-Mar-2008 18:48:19 [lhcathome] URL: http://lhcathome.cern.ch/lhcathome/; Computer ID: 9657577; location: home; project prefs: default 31-Mar-2008 18:48:19 [Spinhenge@home] URL: http://spin.fh-bielefeld.de/; Computer ID: 83787; location: home; project prefs: default 31-Mar-2008 18:48:19 [Einstein@Home] URL: http://einstein.phys.uwm.edu/; Computer ID: 1083282; location: (none); project prefs: default 31-Mar-2008 18:48:19 [QMC@HOME] URL: http://qah.uni-muenster.de/; Computer ID: 85924; location: (none); project prefs: default 31-Mar-2008 18:48:19 [---] General prefs: from http://szdg.lpds.sztaki.hu/szdg/ (last modified 31-Mar-2006 13:01:33) 31-Mar-2008 18:48:19 [---] Host location: none 31-Mar-2008 18:48:19 [---] General prefs: using your defaults 31-Mar-2008 18:48:19 [---] Reading preferences override file 31-Mar-2008 18:48:19 [---] Preferences limit memory usage when active to 1983.36MB 31-Mar-2008 18:48:19 [---] Preferences limit memory usage when idle to 3570.04MB 31-Mar-2008 18:48:19 [---] Preferences limit disk usage to 0.47GB 31-Mar-2008 18:48:19 [lhcathome] Started upload of wm72A_m72allA__7__64.275_59.305__10_12__6__54_1_sixvf_boinc339019_4_0 31-Mar-2008 18:48:20 [Einstein@Home] Restarting task h1_0929.50_S5R3__484_S5R3b_1 using einstein_S5R3 version 438 31-Mar-2008 18:48:20 [Spinhenge@home] Restarting task 5_Fe30_map_1398_326_0 using metropolis version 312 31-Mar-2008 18:48:20 [Einstein@Home] Restarting task h1_0929.50_S5R3__473_S5R3b_0 using einstein_S5R3 version 438 31-Mar-2008 18:48:20 [lhcathome] Sending scheduler request: To fetch work. Requesting 2681 seconds of work, reporting 0 completed tasks SIGSEGV: segmentation violation Stack trace (9 frames): ./boinc[0x4487b9] /lib64/libpthread.so.0[0x3bb500e540] ./boinc[0x459684] ./boinc[0x435fcc] ./boinc[0x436ae3] ./boinc[0x41267c] ./boinc[0x438df9] /lib64/libc.so.6(__libc_start_main+0xf4)[0x3bb441e074] ./boinc(__gxx_personality_v0+0x1b9)[0x4056f9] Exiting... [iLuvatar@localhost BOINC]$ any suggestions what the problem might be? just a bad work unit or something serious? thx |
![]() Send message Joined: 29 Aug 05 Posts: 15002 ![]() |
SIGSEGV: segmentation violation If it does it constantly, it sounds like something serious. With that in mind, it can be a memory error, hard drive (read or write) error or a CPU error. |
Send message Joined: 25 Aug 06 Posts: 1596 |
Twice reported at WCG and twice upgrading to the latest stable version resolved it. The last one was a 64bit version. Coelum Non Animum Mutant, Qui Trans Mare Currunt ![]() |
Send message Joined: 6 Jun 06 Posts: 12 ![]() |
I have the same problem running under Kubuntu 7.10. I issue: /etc/init.d/boinc-client start and ps shows sixtrack is running. After 10 or 20 seconds, this is written to /var/lib/boinc-client/stderrdae.txt: UNRECOGNIZED: suspend_if_no_recent_input SIGSEGV: segmentation violation Stack trace (9 frames): /usr/bin/boinc_client[0x44a759] /lib/libpthread.so.0[0x2b9e865a3100] /usr/lib/libcurl.so.4(curl_multi_remove_handle+0x44)[0x2b9e85231c24] /usr/bin/boinc_client[0x43636c] /usr/bin/boinc_client[0x4376dc] /usr/bin/boinc_client[0x4127de] /usr/bin/boinc_client[0x4394cd] /lib/libc.so.6(__libc_start_main+0xf4)[0x2b9e86a4fb44] /usr/bin/boinc_client(__gxx_personality_v0+0x179)[0x4049b9] Exiting... stdoutdae.txt has: 2008-03-31 19:00:25 [---] Starting BOINC client version 5.10.8 for x86_64-pc-linux-gnu 2008-03-31 19:00:25 [---] log flags: task, file_xfer, sched_ops 2008-03-31 19:00:25 [---] Libraries: libcurl/7.16.4 OpenSSL/0.9.8e zlib/1.2.3.3 libidn/1.0 2008-03-31 19:00:25 [---] Data directory: /var/lib/boinc-client 2008-03-31 19:00:25 [---] Processor: 2 GenuineIntel Intel(R) Core(TM)2 CPU 6300 @ 1.86GHz [Family 6 Model 15 Stepping 2] 2008-03-31 19:00:25 [---] Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm syscall nx lm constant_tsc pni monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr lahf_lm 2008-03-31 19:00:25 [---] Memory: 2.95 GB physical, 1.91 GB virtual 2008-03-31 19:00:25 [---] Disk: 45.84 GB total, 39.51 GB free 2008-03-31 19:00:25 [Einstein@Home] URL: http://einstein.phys.uwm.edu/; Computer ID: 962236; location: home; project prefs: default 2008-03-31 19:00:25 [climateprediction.net] URL: http://climateprediction.net/; Computer ID: 715024; location: home; project prefs: de fault 2008-03-31 19:00:25 [lhcathome] URL: http://lhcathome.cern.ch/lhcathome/; Computer ID: 9620369; location: (none); project prefs: defau lt 2008-03-31 19:00:25 [SETI@home] URL: http://setiathome.berkeley.edu/; Computer ID: 3444335; location: home; project prefs: default 2008-03-31 19:00:25 [QMC@HOME] URL: http://qah.uni-muenster.de/; Computer ID: 52563; location: (none); project prefs: default 2008-03-31 19:00:25 [---] General prefs: from Einstein@Home (last modified 2008-01-23 17:43:14) 2008-03-31 19:00:25 [---] Host location: home 2008-03-31 19:00:25 [---] General prefs: no separate prefs for home; using your defaults 2008-03-31 19:00:25 [---] Reading preferences override file 2008-03-31 19:00:25 [---] Preferences limit memory usage when active to 1814.45MB 2008-03-31 19:00:25 [---] Preferences limit memory usage when idle to 2721.67MB 2008-03-31 19:00:25 [---] Preferences limit disk usage to 13.75GB 2008-03-31 19:00:25 [lhcathome] [file_xfer] Started upload of file wm72A_m72allA__17__64.281_59.311__6_8__6__18_1_sixvf_boinc342721_2_ 0 2008-03-31 19:00:25 [lhcathome] Restarting task wm72A_m72allA__4__64.284_59.314__14_16__6__18_1_sixvf_boinc338257_3 using sixtrack ver sion 466 2008-03-31 19:00:25 [lhcathome] Restarting task wm72A_m72allA__19__64.283_59.313__4_6__6__72_1_sixvf_boinc343488_4 using sixtrack vers ion 466 2008-03-31 19:00:25 [lhcathome] Sending scheduler request: To fetch work 2008-03-31 19:00:25 [lhcathome] Requesting 313588 seconds of new work var/crash/_var_lib_boinc-client_projects_setiathome.berkeley.edu_setiathome-5.28.x86_64-pc-linux-gnu.112.crash was written this afternoon. The LHC@HOME Web site is currently down for maintenance. |
Send message Joined: 25 Aug 06 Posts: 1596 |
This 2008-03-31 19:00:25 [---] Starting BOINC client version 5.10.8 for x86_64-pc-linux-gnu See my previous post. This was one of the versions someone reported the problem for and upgrade resolved it. Coelum Non Animum Mutant, Qui Trans Mare Currunt ![]() |
Send message Joined: 6 Jun 06 Posts: 12 ![]() |
5.10.8-1 is the version shown by Adept (APT front-end), and it's installed. I recall I had to do some manual work to get it going on this 64-bit system, though, so I'm confused. Off to do some research on the Web... |
Send message Joined: 6 Jun 06 Posts: 12 ![]() |
OK, so what I could do is uninstall the boinc-client (5.10.8-1) Ubuntu package and manually install the Linux X64 5.10.45 version. Will doing that delete my existing work units, though? In particular, I have a month or two invested in a climateprediction WU. :-( |
Send message Joined: 6 Jun 06 Posts: 12 ![]() |
Unfortunately, I still get a SIGSEGV with 5.10.45 when sixtrack restarts. |
Send message Joined: 31 Mar 08 Posts: 7 ![]() |
also tried 5.10.45, no success i also made a memory and cpu test recently, so they should be ok. i am though having some hd problems from time to time (kernel journal commit I/O error), but i don't know if that has anything to do with it. |
![]() Send message Joined: 29 Aug 05 Posts: 15002 ![]() |
Hold on everyone. Is everyone who has this problem attached to, work for and possible trying to upload work to LHC? |
![]() Send message Joined: 27 Jun 06 Posts: 305 ![]() |
Here we have a _very_ similar one, I don't know which CC version but it's sure 64bit. The log shows no LHC access attempts : 30-Mar-2008 23:34:47 [Poem@Home] [file_xfer] Throughput 17827 bytes/sec 30-Mar-2008 23:34:53 [Poem@Home] Sending scheduler request: To fetch work 30-Mar-2008 23:34:53 [Poem@Home] Requesting 476729 seconds of new work, and reporting 1 completed tasks 30-Mar-2008 23:35:10 [---] Project communication failed: attempting access to reference site 30-Mar-2008 23:35:10 [Poem@Home] Scheduler request failed: couldn't resolve host name 30-Mar-2008 23:35:10 [Poem@Home] Deferring communication for 1 min 0 sec 30-Mar-2008 23:35:10 [Poem@Home] Reason: scheduler request failed 30-Mar-2008 23:35:52 [Poem@Home] Task data_156_1206675990_815500093_0 exited with zero status but no 'finished' file 30-Mar-2008 23:35:52 [Poem@Home] If this happens repeatedly you may need to reset the project. 30-Mar-2008 23:35:52 [---] Access to reference site failed - check network connection or proxy configuration. SIGSEGV: segmentation violation Stack trace (10 frames): /opt/BOINC/boinc[0x44e299] /lib64/libc.so.6[0x3618e300b0] /lib64/libc.so.6(memset+0x60)[0x3618e77440] /opt/BOINC/boinc[0x413943] /opt/BOINC/boinc[0x413e17] /opt/BOINC/boinc[0x423147] /opt/BOINC/boinc[0x41aa59] /opt/BOINC/boinc[0x43d9d4] /lib64/libc.so.6(__libc_start_main+0xf4)[0x3618e1d8a4] /opt/BOINC/boinc(__gxx_personality_v0+0x1c9)[0x409409] Exiting... It seems to have crashed just when a network access occured. The callstack looks different but if it generated a signal that caused a dump, it can happen anywhere in a program. I pointed the "owner" of this sigsegv to this thread. |
Send message Joined: 31 Mar 08 Posts: 7 ![]() |
seems like i am uploading to LHC... you think this could be the problem? |
![]() Send message Joined: 29 Aug 05 Posts: 15002 ![]() |
Yes, it would seem to be the problem. it was reported earlier on the development email lists that it would crash Windows versions of BOINC 5.10.45 So I am suspecting that it is the problem for your Linux version as well. I'm in contact with one of the developers at this moment. Can you check, if you're quick enough, what BOINC does if you disable its ability to connect to the network/internet? (Activity menu in BOINC Manager->Suspend network activity) |
Send message Joined: 31 Mar 08 Posts: 7 ![]() |
is it possible to change some preferences file, i don't think i manage to start the manager before the client crashes? |
![]() Send message Joined: 14 Dec 06 Posts: 16 ![]() |
(Activity menu in BOINC Manager->Suspend network activity) I can't seem to get to the Activity settings before it crashes, and setting it after that is forgotten the next time it starts. |
Send message Joined: 6 Jun 06 Posts: 12 ![]() |
Can you check, if you're quick enough, what BOINC does if you disable its ability to connect to the network/internet? (Activity menu in BOINC Manager->Suspend network activity) I tried it a few times already, without success. This time, though, I shut down the network interface first with "ifdown eth0", then started the client and suspended its network activity, and finally restarted the interface with "ifup eth0". Now lhcathome is happily running, and one completed lhcathome WU is trying to upload. So, it seems the latter is the cause of the problem. |
![]() Send message Joined: 29 Aug 05 Posts: 15002 ![]() |
is it possible to change some preferences file, i don't think i manage to start the manager before the client crashes? You can try to edit the client_state.xml file, scroll all the way down and edit the network option to show <user_network_request>3</user_network_request> That means you suspended network activity. |
![]() Send message Joined: 14 Dec 06 Posts: 16 ![]() |
is it possible to change some preferences file, i don't think i manage to start the manager before the client crashes? That worked for me! It's now running. I have to leave now but I'll check it later in an hour or so. |
Send message Joined: 31 Mar 08 Posts: 7 ![]() |
yup, worked for me, too. so, what now? get rid of the wu? |
Send message Joined: 31 Mar 08 Posts: 59 ![]() |
Yeah, that worked for me too. |
Copyright © 2022 University of California. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.