Posts by Radjin

1) Message boards : Questions and problems : Compute error - SIGSEGV: segmentation violation (Message 92920)
Posted 23 Sep 2019 by Radjin
Post:
And out of the blue, it starts working. I did nothing but fly out for a couple of days and return to the Server madly processing work units.

Thanks for all the suggestions and info.

Radjin~
2) Message boards : Questions and problems : Compute error - SIGSEGV: segmentation violation (Message 92894)
Posted 21 Sep 2019 by Radjin
Post:
Radjin

If you're going to try models from cpdn, you need to be aware that they are 32 bit, and sometimes needed libraries aren't installed by default.
This is the usual culprit:
libstdc++.so.6

If it's not there, the models will crash at about 6 seconds.


Thanks for that piece and of information. I added CPDN just to test the vsyscall on something other than seti@home; I haven’t downloaded any work units as of yet; hearing there was some bug with creating them for Linux and there may be some ready this coming week. Nothing just works...
3) Message boards : Questions and problems : Compute error - SIGSEGV: segmentation violation (Message 92891)
Posted 21 Sep 2019 by Radjin
Post:
What does
cat /usr/src/linux-headers-$(uname -r)/.config | grep VSYSCALL
tell me? I have seen this suggested in a number of posts where they received a reply of
cat /usr/src/linux-headers-$(uname -r)/.config | grep VSYSCALL 
CONFIG_GENERIC_TIME_VSYSCALL=y
CONFIG_X86_VSYSCALL_EMULATION=y
CONFIG_LEGACY_VSYSCALL_EMULATE=y
# CONFIG_LEGACY_VSYSCALL_NONE is not set
If you installed the kernel headers for your currently running kernel - $(uname -r) is the version string - this shows you how the kernel is configured regarding VSYSCALL. This is purely informational. In this example the vsyscall emulation is built in and enabled by default (VSYSCALL_EMULATE=y). For your kernel, it is built in and disabled by default (VSYSCALL_NONE=y). I suspect that's what the Seti application can't cope with, so you override it with the vsyscall=emulate boot parameter and then it's time for a test to see if we're on the right path.


Thanks. A prior post suggested there may be no way to check if the option was activated. When I run:
cat /usr/src/linux-headers-$(uname -r)/.config | grep
I get:
cat: /usr/src/linux-headers-4.19.0-6-amd64/.config: No such file or directory
even though I should have activated it in grub with:
GRUB_CMDLINE_LINUX_DEFAULT="vsyscall=emulate"
and:
sudo update-grub
Generating grub configuration file ...
Found linux image: /boot/vmlinuz-4.19.0-6-amd64
Found initrd image: /boot/initrd.img-4.19.0-6-amd64
Found linux image: /boot/vmlinuz-4.19.0-5-amd64
Found initrd image: /boot/initrd.img-4.19.0-5-amd64
Found memtest86+ image: /boot/memtest86+.bin
Found memtest86+ multiboot image: /boot/memtest86+_multiboot.bin
done
sudo reboot

This is likely a moot point as some pretty knowledgeable people have told me the issue I am trying to resolve is likely not with the vsyscall at all. I am just trying all options in order of complexity given I am pretty much a noob learning as I go along. I can’t be certain I have activated the option if I don’t get the expected output when I use the cat command.

I have always run Debian Linux via command line, always used apt, and never had to step into the realm of compiling or updating outside of apt. So every time I get a suggestion beyond that realm I spend hours reading what I am doing and what has happened to others who did it. I’m quite thankful there are others out there who can help even if we noobs irritate them.
4) Message boards : Questions and problems : Compute error - SIGSEGV: segmentation violation (Message 92888)
Posted 21 Sep 2019 by Radjin
Post:
OTHER SETTINGS
Run Linux app versions built with LIBC 2.15:
YESNO
This ensures compatibility with new Linux systems that have virtual syscalls disabled, but breaks compatibility with older systems with (G)LIBC prior to 2.15


This is a specific version of BOINC, or some sort of library I don’t already have? If I add/switch to this library, will it break the apt update process?
5) Message boards : Questions and problems : Compute error - SIGSEGV: segmentation violation (Message 92887)
Posted 21 Sep 2019 by Radjin
Post:
I added to grub: GRUB_CMDLINE_LINUX_DEFAULT="VSYSCALL=EMULATE"
Better make that "vsyscall=emulate". I wouldn't be surprised to see that the upper case version doesn't work.

Replaced with lower case.

The grub update seemed to go ok.
So it seems. Of course you rebooted after that?

Yes

For some reason I don’t get the confirmation of the emulation mode.
I'm not aware of a way to query the current mode. You could look at /proc/cmdline. And of course the best confirmation would be if your application didn't segfault any longer.

What does
cat /usr/src/linux-headers-$(uname -r)/.config | grep VSYSCALL
tell me? I have seen this suggested in a number of posts where they received a reply of
cat /usr/src/linux-headers-$(uname -r)/.config | grep VSYSCALL 
CONFIG_GENERIC_TIME_VSYSCALL=y
CONFIG_X86_VSYSCALL_EMULATION=y
CONFIG_LEGACY_VSYSCALL_EMULATE=y
# CONFIG_LEGACY_VSYSCALL_NONE is not set

Be aware that vsyscall is just a run time parameter, it overrides the kernel default but doesn't change it permanently.

Yes, thank you. I understand it is a temporary thing.
6) Message boards : Questions and problems : Compute error - SIGSEGV: segmentation violation (Message 92881)
Posted 21 Sep 2019 by Radjin
Post:
With CPDN, it stopped I think after a newer version of the particular model type so not a lot you can do at the user end. I can't comment on the Seti@home because I have never seen it with them.


I haven’t seen any work being downloaded but I will wait a week and see what happens.

Aside from trying different projects to see what happens, how can I test the possibility hardware issues? I see a memtest86+ but it comes with mixed reviews.
7) Message boards : Questions and problems : Compute error - SIGSEGV: segmentation violation (Message 92880)
Posted 21 Sep 2019 by Radjin
Post:
What I am asking is since this is a known issue, how does one diagnose the cause of the error, memory, OS, BOINC? How did others resolve the issue? I can find dozens of references to the issue, all with a BOINC project, but only two resolutions, where vsyscall was put into emulate mode.
So you have identified a possible (and IMO very likely) cause, and you know a workaround. But you don't mention that you have tried it, or an outcome. What about that?


This is another interesting conundrum.

I added to grub: GRUB_CMDLINE_LINUX_DEFAULT="VSYSCALL=EMULATE"
sudo update-grub

Generating grub configuration file ...
Found linux image: /boot/vmlinuz-4.19.0-6-amd64
Found initrd image: /boot/initrd.img-4.19.0-6-amd64
Found linux image: /boot/vmlinuz-4.19.0-5-amd64
Found initrd image: /boot/initrd.img-4.19.0-5-amd64
Found memtest86+ image: /boot/memtest86+.bin
Found memtest86+ multiboot image: /boot/memtest86+_multiboot.bin
done
The grub update seemed to go ok.

cat /usr/src/linux-headers-$(uname -r)/.config | grep 

cat: /usr/src/linux-headers-4.19.0-6-amd64/.config: No such file or directory
For some reason I don’t get the confirmation of the emulation mode.
8) Message boards : Questions and problems : Compute error - SIGSEGV: segmentation violation (Message 92874)
Posted 21 Sep 2019 by Radjin
Post:
Thanks for the suggestions, I will add the project tonight to see what happens.
9) Message boards : Questions and problems : Compute error - SIGSEGV: segmentation violation (Message 92871)
Posted 20 Sep 2019 by Radjin
Post:
I am not disputing that it is a memory issue either corruption or fixed memory location that is out of range; everything I have read says that, however it only happens with seti/BOINC. I have also read that with the disabled vsyscall in later kernels this error is a known issue with BOINC in posts as late as 2018.

What I am asking is since this is a known issue, how does one diagnose the cause of the error, memory, OS, BOINC? How did others resolve the issue? I can find dozens of references to the issue, all with a BOINC project, but only two resolutions, where vsyscall was put into emulate mode.

This is not a bash post, it’s a call to the experts to help explain and resolve a problem that affects a number of people. At the moment all I have gotten is the equivalent of dump your computer and build a new one, dump your OS and do a clean install of this OS, don’t use the stable repository to install BOINC as recommended by the BOINC literature, install this custom package instead(which gave me the same error). I’m open to try new things except run unstable software.

At this point the only time I get this error is with BOINC. Is the reason that everyone goes silent on solving this issue because it’s unsolvable?
10) Message boards : Questions and problems : Can’t connect to localhost (Message 92865)
Posted 20 Sep 2019 by Radjin
Post:
This thread was superseded by another post and can be considered resolved.
11) Message boards : Questions and problems : Compute error - SIGSEGV: segmentation violation (Message 92864)
Posted 20 Sep 2019 by Radjin
Post:
I am sure others are having this issue or have, but a search of the forum didn’t pop up any recent posts or resolutions.

I decided to add my web server into my computers as it sits mostly idle, it’s a Debian 10(Buster) Linux box<details below>. First I tried the repository install with apt. Then talking to people on the Seti@Home forums they suggested using a Berkeley version they claimed would be more efficient and include everything needed. Once I got that running I was back to the same errors. A pretty knowledgeable friend had me look at a number of things and convinced me to return to the repository version which I did and am currently running.

Any help to get past or at least understand this would be appreciated.

My skill level on Linux is just enough to be dangerous so please be a bit more -verbose in explanations or how to do something.

Radjin~
======
A typical error as listed on my accounts/computers/tasks page:

Task 8058757637
Name blc11_2bit_guppi_58692_04223_HIP79568_0125.25756.0.21.44.68.vlar_0
Workunit 3657386096
Created 18 Sep 2019, 11:11:31 UTC
Sent 18 Sep 2019, 16:48:51 UTC
Report deadline 21 Nov 2019, 3:42:30 UTC
Received 18 Sep 2019, 16:56:17 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 11 (0x0000000B) Unknown error code
Computer ID 8816958
Run time
CPU time
Validate state Invalid
Credit 0.00
Device peak FLOPS 4.14 GFLOPS
Application version SETI@home v8 v8.00
x86_64-pc-linux-gnu
Stderr output
<core_client_version>7.14.2</core_client_version>
<![CDATA[
<message>
process got signal 11</message>
<stderr_txt>
SIGSEGV: segmentation violation

</stderr_txt>
]]>
======
My computer:

CPU type GenuineIntel
Intel(R) Core(TM) i7-2700K CPU @ 3.50GHz [Family 6 Model 42 Stepping 7]
Number of processors 8
Coprocessors ---
Virtualization None
Operating System Linux Debian
Debian GNU/Linux 10 (buster) [4.19.0-6-amd64|libc 2.28 (Debian GLIBC 2.28-10)]
BOINC version 7.14.2
Memory 31.3 GB
Cache 8192 KB
Swap space 15.89 GB
Total disk space 884.49 GB
Free Disk Space 751.27 GB
Measured floating point speed 4.14 billion ops/sec
Measured integer speed 63.45 billion ops/sec
Average upload rate 147 KB/sec
Average download rate 2478.34 KB/sec
Average turnaround time 0 days
Application details Show
Tasks 307
Number of times client has contacted server 37
Last time contacted server 20 Sep 2019, 11:08:20 UTC
Fraction of time BOINC is running 98.97%
While BOINC is running, fraction of time computing is allowed 100.00%
While is BOINC running, fraction of time GPU computing is allowed 100.00%
Task duration correction factor 1
12) Message boards : Questions and problems : Can’t connect to localhost (Message 92822)
Posted 17 Sep 2019 by Radjin
Post:
I installed Boinc on my Debian Buster Linux box and had trouble getting it to work. Asking around the seti@home forums, as that is the project I run on other computers, I received a good suggestion to install a packaged setup optimized for my GPU and is totally within the /home directory. I apt removed, purged and autoremoved then installed the aforementioned package.

Currently my biggest problem is the one in the title. When I run “./boinccmd --project_attach http://setiathome.berkeley.edu <weak key>” I get the “cannot connect to localhost”. In fact, when I run ./boinccmd with any option I get that response. If I run ./boinccmd alone I get the help list meaning boinccmd is responding. If I run ./boinc I get:
/BOINC$ ./boinc
17-Sep-2019 05:36:31 [---] Starting BOINC client version 7.14.2 for x86_64-pc-linux-gnu
17-Sep-2019 05:36:31 [---] log flags: file_xfer, sched_ops, task, sched_op_debug
17-Sep-2019 05:36:31 [---] Libraries: libcurl/7.64.0 GnuTLS/3.6.7 zlib/1.2.11 libidn2/2.0.5 libpsl/0.20.2 (+libidn2/2.0.5) libssh2/1.8.0 nghttp2/1.36.0 librtmp/2.3
17-Sep-2019 05:36:31 [---] Data directory: /home/radjin/BOINC
17-Sep-2019 05:36:31 [---] No usable GPUs found
17-Sep-2019 05:36:31 [---] [libc detection] gathered: 2.28, Debian GLIBC 2.28-10
17-Sep-2019 05:36:31 [---] Host name: webserver
17-Sep-2019 05:36:31 [---] Processor: 8 GenuineIntel Intel(R) Core(TM) i7-2700K CPU @ 3.50GHz [Family 6 Model 42 Stepping 7]
17-Sep-2019 05:36:31 [---] Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm pcid sse4_1 sse4_2 popcnt tsc_deadline_timer aes xsave avx lahf_lm epb pti tpr_shadow vnmi flexpriority ept vpid xsaveopt dtherm ida arat pln pts
17-Sep-2019 05:36:31 [---] OS: Linux Debian: Debian GNU/Linux 10 (buster) [4.19.0-6-amd64|libc 2.28 (Debian GLIBC 2.28-10)]
17-Sep-2019 05:36:31 [---] Memory: 31.30 GB physical, 15.89 GB virtual
17-Sep-2019 05:36:31 [---] Disk: 884.49 GB total, 750.54 GB free
17-Sep-2019 05:36:31 [---] Local time is UTC -7 hours
17-Sep-2019 05:36:31 [---] Config: use all coprocessors
17-Sep-2019 05:36:31 [---] Last benchmark was 18156 days 12:36:31 ago
17-Sep-2019 05:36:31 [---] No general preferences found - using defaults
17-Sep-2019 05:36:31 [---] Preferences:
17-Sep-2019 05:36:31 [---] max memory usage when active: 16024.55 MB
17-Sep-2019 05:36:31 [---] max memory usage when idle: 28844.18 MB
17-Sep-2019 05:36:31 [---] max disk usage: 751.17 GB
17-Sep-2019 05:36:31 [---] don't use GPU while active
17-Sep-2019 05:36:31 [---] suspend work if non-BOINC CPU load exceeds 25%
17-Sep-2019 05:36:31 [---] (to change preferences, visit a project web site or select Preferences in the Manager)
17-Sep-2019 05:36:31 [---] Setting up project and slot directories
17-Sep-2019 05:36:31 [---] Checking active tasks
17-Sep-2019 05:36:31 [---] Setting up GUI RPC socket
17-Sep-2019 05:36:31 [---] Checking presence of 0 project files
17-Sep-2019 05:36:31 [---] This computer is not attached to any projects
17-Sep-2019 05:36:31 Initialization completed

It continues to run until I ctrl-c.

ldd shows all dependencies are installed

Any suggestions?

Thanks in advance for your help.

Radjin~




Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.