Message boards : Questions and problems : GPU detection failed. error code 139
Message board moderation
Author | Message |
---|---|
Send message Joined: 23 Jul 11 Posts: 14 |
I get this every time I start or restart Boinc. On a reboot or starting PC the same thing happens. I'm at a loss as to what to do or where to look next. Done the following: Checked logs reinstalled boinc and drivers checked driver files On a computer restart/boot Ubuntu throws up a message that boinc has crashed with seg fault. I suspect that the part of boinc that detects the gpu flips its lid here and bombs. Pertinent info Boinc version: 7.6.31 (x64) from Ubuntu reps OS: Ubuntu 16.04.2 LTS GPU: Radeon RX480 x 2 GPU driver: AMD amdgpu-pro 16.60 kernel: 4.4.0-85-generic x64 Boinc Log: Sun 16 Jul 2017 12:34:06 BST | | Starting BOINC client version 7.6.31 for x86_64-pc-linux-gnu Sun 16 Jul 2017 12:34:06 BST | | log flags: file_xfer, sched_ops, task Sun 16 Jul 2017 12:34:06 BST | | Libraries: libcurl/7.47.0 OpenSSL/1.0.2g zlib/1.2.8 libidn/1.32 librtmp/2.3 Sun 16 Jul 2017 12:34:06 BST | | Data directory: /var/lib/boinc-client Sun 16 Jul 2017 12:34:06 BST | | GPU detection failed. error code 139 Sun 16 Jul 2017 12:34:06 BST | | No usable GPUs found Sun 16 Jul 2017 12:34:06 BST | | app version refers to missing GPU type ATI Sun 16 Jul 2017 12:34:06 BST | Einstein@Home | Application uses missing ATI GPU Sun 16 Jul 2017 12:34:06 BST | Einstein@Home | Missing coprocessor for task LATeah0035L_828.0_0_0.0_6633930_0 Sun 16 Jul 2017 12:34:06 BST | Einstein@Home | Missing coprocessor for task LATeah0035L_836.0_0_0.0_426700_0 Sun 16 Jul 2017 12:34:06 BST | Einstein@Home | Missing coprocessor for task LATeah0035L_828.0_0_0.0_6628910_0 Sun 16 Jul 2017 12:34:06 BST | Einstein@Home | Missing coprocessor for task LATeah0035L_836.0_0_0.0_425445_0 Sun 16 Jul 2017 12:34:06 BST | Einstein@Home | Missing coprocessor for task LATeah0035L_828.0_0_0.0_6632675_1 . . . . . Sun 16 Jul 2017 12:34:06 BST | Einstein@Home | Missing coprocessor for task LATeah0035L_884.0_0_0.0_7689385_1 Sun 16 Jul 2017 12:34:06 BST | Einstein@Home | Missing coprocessor for task LATeah0035L_892.0_0_0.0_1143305_0 Sun 16 Jul 2017 12:34:06 BST | Einstein@Home | Missing coprocessor for task LATeah0035L_892.0_0_0.0_1153345_1 Sun 16 Jul 2017 12:34:06 BST | | Host name: orac002 Sun 16 Jul 2017 12:34:06 BST | | Processor: 8 AuthenticAMD AMD FX(tm)-8320 Eight-Core Processor [Family 21 Model 2 Stepping 0] Sun 16 Jul 2017 12:34:06 BST | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc extd_apicid aperfmperf pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 popcnt aes xsave avx f16c lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs xop skinit wdt lwp fma4 tce nodeid_msr tbm topoext perfctr_core perfctr_nb cpb hw_pstate vmmcall bmi1 arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold Sun 16 Jul 2017 12:34:06 BST | | OS: Linux: 4.4.0-85-generic Sun 16 Jul 2017 12:34:06 BST | | Memory: 31.34 GB physical, 119.21 GB virtual Sun 16 Jul 2017 12:34:06 BST | | Disk: 143.48 GB total, 133.17 GB free Sun 16 Jul 2017 12:34:06 BST | | Local time is UTC +1 hours Sun 16 Jul 2017 12:34:06 BST | | Config: GUI RPCs allowed from: Sun 16 Jul 2017 12:34:06 BST | Einstein@Home | URL http://einstein.phys.uwm.edu/; Computer ID 12492607; resource share 100 Sun 16 Jul 2017 12:34:06 BST | Einstein@Home | Your current settings do not allow tasks from this project. To fix this, you can change Project Preferences on the project's web site. Sun 16 Jul 2017 12:34:06 BST | Einstein@Home | General prefs: from Einstein@Home (last modified 02-Jul-2017 11:13:33) Sun 16 Jul 2017 12:34:06 BST | Einstein@Home | Host location: none Sun 16 Jul 2017 12:34:06 BST | Einstein@Home | General prefs: using your defaults Sun 16 Jul 2017 12:34:06 BST | | Reading preferences override file |
Send message Joined: 2 Jul 14 Posts: 186 |
https://forums.linuxmint.com/viewtopic.php?f=59&t=229229&start=20#p1271515 Update kernel to 4.8xxxx Latest AMDGPU-PRO driver is 17.10 http://support.amd.com/en-us/kb-articles/Pages/AMDGPU-PRO-Driver-for-Linux-Release-Notes.aspx |
Send message Joined: 23 Jul 11 Posts: 14 |
Thank you Richie. I'll try that. The one thing I forgot to mention was that everything was working fine then after an update everything stopped. What broke would be useful to find out!!!!!!! Was using 17.10 when the problem started. I deliberately downgraded to see if that would help and sadly it did not. |
Send message Joined: 30 May 15 Posts: 265 |
boinc need to be part of the video group, /etc/group should show a line like video:x:44:boinc,agentb code 139 is congruent to the segfault. a) # clinfo may be revealing along with coproc_debug event log flag and b) there are newer drivers at AMD (as mentioned by Ritchie) Good luck Edit: I am having no problem with 4.4 |
Send message Joined: 23 Jul 11 Posts: 14 |
Greetings Agentb. Thank you for the info. I have done the following:- jlneill@orac002:~$ more /etc/group | grep boinc video:x:44:jlneill,boinc boinc:x:1002: Looks like no problem there. however setting the coproc_debug event log flag and running sudo systemctl restart boinc-client.service produced the following log entries: Sun 16 Jul 2017 17:11:45 BST | | Starting BOINC client version 7.6.31 for x86_64-pc-linux-gnu Sun 16 Jul 2017 17:11:45 BST | | log flags: file_xfer, sched_ops, task, coproc_debug Sun 16 Jul 2017 17:11:45 BST | | Libraries: libcurl/7.47.0 OpenSSL/1.0.2g zlib/1.2.8 libidn/1.32 librtmp/2.3 Sun 16 Jul 2017 17:11:45 BST | | Data directory: /var/lib/boinc-client Sun 16 Jul 2017 17:11:45 BST | | [coproc] launching child process at /usr/bin/boinc Sun 16 Jul 2017 17:11:45 BST | | [coproc] relative to directory / Sun 16 Jul 2017 17:11:45 BST | | [coproc] with data directory /var/lib/boinc-client Sun 16 Jul 2017 17:11:45 BST | | GPU detection failed. error code 139 Sun 16 Jul 2017 17:11:45 BST | | [coproc] NVIDIA: libnvidia-fatbinaryloader.so.375.66: cannot open shared object file: No such file or directory Sun 16 Jul 2017 17:11:45 BST | | [coproc] ATI: libaticalrt.so: cannot open shared object file: No such file or directory Sun 16 Jul 2017 17:11:45 BST | | [coproc] Caught SIGSEGV in OpenCL detection Sun 16 Jul 2017 17:11:45 BST | | No usable GPUs found Sun 16 Jul 2017 17:11:45 BST | | app version refers to missing GPU type ATI Sun 16 Jul 2017 17:11:45 BST | Einstein@Home | Application uses missing ATI GPU Sun 16 Jul 2017 17:11:45 BST | Einstein@Home | Missing coprocessor for task LATeah0035L_828.0_0_0.0_6633930_0 Then running sudo find / -name libaticalrt.so find: ‘/run/user/1000/gvfs’: Permission denied jlneill@orac002:~$ Where is it????????????? Big oops! In the meantime I am going to upgrade the driver and repeat looking for the offending file! |
Send message Joined: 23 Jul 11 Posts: 14 |
Have done the driver upgrade and the file is still missing. sudo find / -name libaticalrt.so find: ‘/run/user/1000/gvfs’: Permission denied jlneill@orac002:~$ sudo find / -name libaticalrt* find: ‘/run/user/1000/gvfs’: Permission denied jlneill@orac002:~$ Am now "googling" libcaticalart.so |
Send message Joined: 30 May 15 Posts: 265 |
Have done the driver upgrade and the file is still missing.Don't do that. The messages indicate it can not find libraries for CAL and nVidia, and i don't think you have (or need) them. You should see something like Sun 16 Jul 2017 18:34:11 BST | | Starting BOINC client version 7.6.33 for x86_64-pc-linux-gnu Sun 16 Jul 2017 18:34:11 BST | | log flags: file_xfer, sched_ops, task, coproc_debug Sun 16 Jul 2017 18:34:11 BST | | Libraries: libcurl/7.47.0 OpenSSL/1.0.2g zlib/1.2.8 libidn/1.32 librtmp/2.3 Sun 16 Jul 2017 18:34:11 BST | | Data directory: /var/lib/boinc-client Sun 16 Jul 2017 18:34:11 BST | | [coproc] launching child process at /usr/bin/boinc Sun 16 Jul 2017 18:34:11 BST | | [coproc] relative to directory / Sun 16 Jul 2017 18:34:11 BST | | [coproc] with data directory /var/lib/boinc-client Sun 16 Jul 2017 18:34:11 BST | | OpenCL: AMD/ATI GPU 0: AMD Radeon (TM) RX 480 Graphics (driver version 2348.3, device version OpenCL 1.2 AMD-APP (2348.3), 5819MB, 5819MB available, 5935 GFLOPS peak) Sun 16 Jul 2017 18:34:11 BST | | [coproc] NVIDIA: libcuda.so: cannot open shared object file: No such file or directory Sun 16 Jul 2017 18:34:11 BST | | [coproc] ATI: libaticalrt.so: cannot open shared object file: No such file or directory Sun 16 Jul 2017 18:34:11 BST | | [coproc] clGetDeviceInfo failed to get CL_DEVICE_SIMD_PER_COMPUTE_UNIT_AMD for device 0 I'll ask again, could you run clinfo and post the output. |
Send message Joined: 23 Jul 11 Posts: 14 |
Part 1 Oops Sorry Agentb here is the info from clinfo Number of platforms 1 Platform Name AMD Accelerated Parallel Processing Platform Vendor Advanced Micro Devices, Inc. Platform Version OpenCL 2.0 AMD-APP (2348.3) Platform Profile FULL_PROFILE Platform Extensions cl_khr_icd cl_amd_event_callback cl_amd_offline_devices Platform Extensions function suffix AMD Platform Name AMD Accelerated Parallel Processing Number of devices 3 Device Name Ellesmere Device Vendor Advanced Micro Devices, Inc. Device Vendor ID 0x1002 Device Version OpenCL 1.2 AMD-APP (2348.3) Driver Version 2348.3 Device OpenCL C Version OpenCL C 1.2 Device Type GPU Device Profile FULL_PROFILE Device Board Name (AMD) AMD Radeon (TM) RX 480 Graphics Device Topology (AMD) PCI-E, 01:00.0 Max compute units 36 SIMD per compute unit (AMD) 4 SIMD width (AMD) 16 SIMD instruction width (AMD) 1 Max clock frequency 1310MHz Graphics IP (AMD) 8.0 Device Partition (core) Max number of sub-devices 36 Supported partition types none specified Max work item dimensions 3 Max work item sizes 256x256x256 Max work group size 256 Preferred work group size multiple 64 Wavefront width (AMD) 64 Preferred / native vector sizes char 4 / 4 short 2 / 2 int 1 / 1 long 1 / 1 half 1 / 1 (cl_khr_fp16) float 1 / 1 double 1 / 1 (cl_khr_fp64) Half-precision Floating-point support (cl_khr_fp16) Denormals No Infinity and NANs No Round to nearest No Round to zero No Round to infinity No IEEE754-2008 fused multiply-add No Support is emulated in software No Correctly-rounded divide and sqrt operations No Single-precision Floating-point support (core) Denormals No Infinity and NANs Yes Round to nearest Yes Round to zero Yes Round to infinity Yes IEEE754-2008 fused multiply-add Yes Support is emulated in software No Correctly-rounded divide and sqrt operations Yes Double-precision Floating-point support (cl_khr_fp64) Denormals Yes Infinity and NANs Yes Round to nearest Yes Round to zero Yes Round to infinity Yes IEEE754-2008 fused multiply-add Yes Support is emulated in software No Correctly-rounded divide and sqrt operations No Address bits 64, Little-Endian Global memory size 8059658240 (7.506GiB) Global free memory (AMD) 7841832 (7.479GiB) Global memory channels (AMD) 8 Global memory banks per channel (AMD) 16 Global memory bank width (AMD) 256 bytes Error Correction support No Max memory allocation 4244635648 (3.953GiB) Unified memory for Host and Device No Minimum alignment for any data type 128 bytes Alignment of base address 2048 bits (256 bytes) Global Memory cache type Read/Write Global Memory cache size 16384 Global Memory cache line 64 bytes Image support Yes Max number of samplers per kernel 16 Max size for 1D images from buffer 134217728 pixels Max 1D or 2D image array size 2048 images Base address alignment for 2D image buffers 256 bytes Pitch alignment for 2D image buffers 256 bytes Max 2D image size 16384x16384 pixels Max 3D image size 2048x2048x2048 pixels Max number of read image args 128 Max number of write image args 8 Local memory type Local Local memory size 32768 (32KiB) Local memory syze per CU (AMD) 65536 (64KiB) Local memory banks (AMD) 32 Max constant buffer size 4244635648 (3.953GiB) Max number of constant args 8 Max size of kernel argument 1024 Queue properties Out-of-order execution No Profiling Yes Prefer user sync for interop Yes Profiling timer resolution 1ns Profiling timer offset since Epoch (AMD) 1500222931398969873ns (Sun Jul 16 17:35:31 2017) Execution capabilities Run OpenCL kernels Yes Run native kernels No Thread trace supported (AMD) Yes SPIR versions 1.2 printf() buffer size 1048576 (1024KiB) Built-in kernels Device Available Yes Compiler Available Yes Linker Available Yes Device Extensions cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event Device Name Ellesmere Device Vendor Advanced Micro Devices, Inc. Device Vendor ID 0x1002 Device Version OpenCL 1.2 AMD-APP (2348.3) Driver Version 2348.3 Device OpenCL C Version OpenCL C 1.2 Device Type GPU Device Profile FULL_PROFILE Device Board Name (AMD) AMD Radeon (TM) RX 480 Graphics Device Topology (AMD) PCI-E, 06:00.0 Max compute units 36 SIMD per compute unit (AMD) 4 SIMD width (AMD) 16 SIMD instruction width (AMD) 1 Max clock frequency 1310MHz Graphics IP (AMD) 8.0 Device Partition (core) Max number of sub-devices 36 Supported partition types none specified Max work item dimensions 3 Max work item sizes 256x256x256 Max work group size 256 Preferred work group size multiple 64 Wavefront width (AMD) 64 Preferred / native vector sizes char 4 / 4 short 2 / 2 int 1 / 1 long 1 / 1 half 1 / 1 (cl_khr_fp16) float 1 / 1 double 1 / 1 (cl_khr_fp64) Half-precision Floating-point support (cl_khr_fp16) Denormals No Infinity and NANs No Round to nearest No Round to zero No Round to infinity No IEEE754-2008 fused multiply-add No Support is emulated in software No Correctly-rounded divide and sqrt operations No Single-precision Floating-point support (core) Denormals No Infinity and NANs Yes Round to nearest Yes Round to zero Yes Round to infinity Yes IEEE754-2008 fused multiply-add Yes Support is emulated in software No Correctly-rounded divide and sqrt operations Yes Double-precision Floating-point support (cl_khr_fp64) Denormals Yes Infinity and NANs Yes Round to nearest Yes Round to zero Yes Round to infinity Yes IEEE754-2008 fused multiply-add Yes Support is emulated in software No Correctly-rounded divide and sqrt operations No Address bits 64, Little-Endian Global memory size 8533544960 (7.947GiB) Global free memory (AMD) 8313956 (7.929GiB) Global memory channels (AMD) 8 Global memory banks per channel (AMD) 16 Global memory bank width (AMD) 256 bytes Error Correction support No Max memory allocation 4244635648 (3.953GiB) Unified memory for Host and Device No Minimum alignment for any data type 128 bytes Alignment of base address 2048 bits (256 bytes) Global Memory cache type Read/Write Global Memory cache size 16384 Global Memory cache line 64 bytes Image support Yes Max number of samplers per kernel 16 Max size for 1D images from buffer 134217728 pixels Max 1D or 2D image array size 2048 images Base address alignment for 2D image buffers 256 bytes Pitch alignment for 2D image buffers 256 bytes Max 2D image size 16384x16384 pixels Max 3D image size 2048x2048x2048 pixels Max number of read image args 128 Max number of write image args 8 Local memory type Local Local memory size 32768 (32KiB) Local memory syze per CU (AMD) 65536 (64KiB) Local memory banks (AMD) 32 Max constant buffer size 4244635648 (3.953GiB) Max number of constant args 8 Max size of kernel argument 1024 Queue properties Out-of-order execution No Profiling Yes Prefer user sync for interop Yes Profiling timer resolution 1ns Profiling timer offset since Epoch (AMD) 1500222931398969873ns (Sun Jul 16 17:35:31 2017) Execution capabilities Run OpenCL kernels Yes Run native kernels No Thread trace supported (AMD) Yes SPIR versions 1.2 printf() buffer size 1048576 (1024KiB) Built-in kernels Device Available Yes Compiler Available Yes Linker Available Yes Device Extensions cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event Device Name AMD FX(tm)-8320 Eight-Core Processor Device Vendor AuthenticAMD Device Vendor ID 0x1002 Device Version OpenCL 1.2 AMD-APP (2348.3) Driver Version 2348.3 (sse2,avx,fma4) Device OpenCL C Version OpenCL C 1.2 Device Type CPU Device Profile FULL_PROFILE Device Board Name (AMD) Device Topology (AMD) (n/a) Max compute units 8 Max clock frequency 3950MHz Device Partition (core, cl_ext_device_fission) Max number of sub-devices 8 Supported partition types equally, by counts, by affinity domain Supported affinity domains L3 cache, L2 cache, L1 cache, next partitionable Supported partition types (ext) equally, by counts, by affinity domain Supported affinity domains (ext) L3 cache, L2 cache, L1 cache, next fissionable Max work item dimensions 3 Max work item sizes 1024x1024x1024 Max work group size 1024 Preferred work group size multiple 1 Preferred / native vector sizes char 16 / 16 short 8 / 8 int 4 / 4 long 2 / 2 half 4 / 4 (n/a) float 8 / 8 double 4 / 4 (cl_khr_fp64) Half-precision Floating-point support (n/a) Single-precision Floating-point support (core) Denormals Yes Infinity and NANs Yes Round to nearest Yes Round to zero Yes Round to infinity Yes IEEE754-2008 fused multiply-add Yes Support is emulated in software No Correctly-rounded divide and sqrt operations Yes Double-precision Floating-point support (cl_khr_fp64) Denormals Yes Infinity and NANs Yes Round to nearest Yes Round to zero Yes Round to infinity Yes IEEE754-2008 fused multiply-add Yes Support is emulated in software No Correctly-rounded divide and sqrt operations No Address bits 64, Little-Endian Global memory size 33647460352 (31.34GiB) Error Correction support No Max memory allocation 8411865088 (7.834GiB) Unified memory for Host and Device Yes Minimum alignment for any data type 128 bytes Alignment of base address 1024 bits (128 bytes) Global Memory cache type Read/Write Global Memory cache size 16384 Global Memory cache line 64 bytes Image support Yes Max number of samplers per kernel 16 Max size for 1D images from buffer 65536 pixels Max 1D or 2D image array size 2048 images Max 2D image size 8192x8192 pixels Max 3D image size 2048x2048x2048 pixels Max number of read image args 128 Max number of write image args 64 Local memory type Global Local memory size 32768 (32KiB) Max constant buffer size 65536 (64KiB) Max number of constant args 8 Max size of kernel argument 4096 (4KiB) Queue properties Out-of-order execution No Profiling Yes Prefer user sync for interop Yes Profiling timer resolution 1ns Profiling timer offset since Epoch (AMD) 1500222931398969873ns (Sun Jul 16 17:35:31 2017) Execution capabilities Run OpenCL kernels Yes Run native kernels Yes SPIR versions 1.2 printf() buffer size 65536 (64KiB) Built-in kernels Device Available Yes Compiler Available Yes Linker Available Yes Device Extensions cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_device_fission cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_spir cl_khr_gl_event NULL platform behavior clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...) No platform clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...) No platform clCreateContext(NULL, ...) [default] No platform clCreateContext(NULL, ...) [other] Success [AMD] clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU) No platform clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU) No platform clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR) No platform clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM) No platform clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL) No platform |
Send message Joined: 23 Jul 11 Posts: 14 |
Part 2: From the boinc log: Sun 16 Jul 2017 18:48:32 BST | | Starting BOINC client version 7.6.31 for x86_64-pc-linux-gnu Sun 16 Jul 2017 18:48:32 BST | | log flags: file_xfer, sched_ops, task, coproc_debug Sun 16 Jul 2017 18:48:32 BST | | Libraries: libcurl/7.47.0 OpenSSL/1.0.2g zlib/1.2.8 libidn/1.32 librtmp/2.3 Sun 16 Jul 2017 18:48:32 BST | | Data directory: /var/lib/boinc-client Sun 16 Jul 2017 18:48:32 BST | | [coproc] launching child process at /usr/bin/boinc Sun 16 Jul 2017 18:48:32 BST | | [coproc] relative to directory / Sun 16 Jul 2017 18:48:32 BST | | [coproc] with data directory /var/lib/boinc-client Sun 16 Jul 2017 18:48:33 BST | | GPU detection failed. error code 139 Sun 16 Jul 2017 18:48:33 BST | | [coproc] NVIDIA: libnvidia-fatbinaryloader.so.375.66: cannot open shared object file: No such file or directory Sun 16 Jul 2017 18:48:33 BST | | [coproc] ATI: libaticalrt.so: cannot open shared object file: No such file or directory Sun 16 Jul 2017 18:48:33 BST | | [coproc] Caught SIGSEGV in OpenCL detection Sun 16 Jul 2017 18:48:33 BST | | No usable GPUs found Sun 16 Jul 2017 18:48:33 BST | | app version refers to missing GPU type ATI Sun 16 Jul 2017 18:48:33 BST | Einstein@Home | Application uses missing ATI GPU |
Send message Joined: 30 May 15 Posts: 265 |
How exactly did you install boinc? sudo apt-get install .... for some reason you have specific nVidia icds / libraries reporting check the icds are correct # cd /etc/OpenCL/vendors you should see only two files amdocl32.icd amdocl64.icd and the content of each file will be a library name eg libamdocl64.so in the second. check existence of these libraries in the library cache - you should see # ldconfig -p | grep amdocl libamdocl64.so (libc6,x86-64) => /opt/amdgpu-pro/lib/x86_64-linux-gnu/libamdocl64.so libamdocl32.so (libc6) => /opt/amdgpu-pro/lib/i386-linux-gnu/libamdocl32.so libamdocl12cl64.so (libc6,x86-64) => /opt/amdgpu-pro/lib/x86_64-linux-gnu/libamdocl12cl64.so libamdocl12cl32.so (libc6) => /opt/amdgpu-pro/lib/i386-linux-gnu/libamdocl12cl32.so |
Send message Joined: 23 Jul 11 Posts: 14 |
How exactly did you install boinc? I used the synaptic package manager check the icds are correct jlneill@orac002:/etc/OpenCL/vendors$ ls -ial total 8 2499241 drwxr-xr-x 1 root root 48 Jul 16 17:34 . 2499240 drwxr-xr-x 1 root root 14 Jun 9 19:43 .. 2924436 -rw-r--r-- 1 root root 15 Mar 15 20:53 amdocl32.icd 2924250 -rw-r--r-- 1 root root 15 Mar 15 20:53 amdocl64.icd and the content of each file will be a library name eg libamdocl64.so in the second. Yes, all present. And jlneill@orac002:/etc/OpenCL/vendors$ ldconfig -p | grep amdocl libamdocl64.so (libc6,x86-64) => /opt/amdgpu-pro/lib/x86_64-linux-gnu/libamdocl64.so libamdocl32.so (libc6) => /opt/amdgpu-pro/lib/i386-linux-gnu/libamdocl32.so libamdocl12cl64.so (libc6,x86-64) => /opt/amdgpu-pro/lib/x86_64-linux-gnu/libamdocl12cl64.so libamdocl12cl32.so (libc6) => /opt/amdgpu-pro/lib/i386-linux-gnu/libamdocl12cl32.so Would it be an idea to dig around for extraneous nvidia packages? |
Send message Joined: 30 May 15 Posts: 265 |
How exactly did you install boinc? OK that is what you used to do the installing, but exactly what did you install with synaptic? There are several boinc (and other stuff) things you could have installed with synaptic. I think you can view the logs through the synaptic gui. Would it be an idea to dig around for extraneous nvidia packages?yes - but this # ldconfig -p | grep -i nvidia should show nothing. If it shows a lot of libraries then you have installed an nVidia package and that needs removing, especially if it shows a libOpenCL.so echo $LD_LIBRARY_PATH should also show nothing. |
Send message Joined: 23 Jul 11 Posts: 14 |
I had a brainwave and removed VERY carefully all the extraneous NVIDIA packages and this was the result: Sun 16 Jul 2017 19:47:18 BST | | Starting BOINC client version 7.6.31 for x86_64-pc-linux-gnu Sun 16 Jul 2017 19:47:18 BST | | log flags: file_xfer, sched_ops, task, coproc_debug Sun 16 Jul 2017 19:47:18 BST | | Libraries: libcurl/7.47.0 OpenSSL/1.0.2g zlib/1.2.8 libidn/1.32 librtmp/2.3 Sun 16 Jul 2017 19:47:18 BST | | Data directory: /var/lib/boinc-client Sun 16 Jul 2017 19:47:18 BST | | [coproc] launching child process at /usr/bin/boinc Sun 16 Jul 2017 19:47:18 BST | | [coproc] relative to directory / Sun 16 Jul 2017 19:47:18 BST | | [coproc] with data directory /var/lib/boinc-client Sun 16 Jul 2017 19:47:18 BST | | OpenCL: AMD/ATI GPU 0: AMD Radeon (TM) RX 480 Graphics (driver version 2348.3, device version OpenCL 1.2 AMD-APP (2348.3), 7650MB, 7650MB available, 6036 GFLOPS peak) Sun 16 Jul 2017 19:47:18 BST | | OpenCL: AMD/ATI GPU 1: AMD Radeon (TM) RX 480 Graphics (driver version 2348.3, device version OpenCL 1.2 AMD-APP (2348.3), 8138MB, 8138MB available, 6036 GFLOPS peak) Sun 16 Jul 2017 19:47:18 BST | | [coproc] NVIDIA: libcuda.so: cannot open shared object file: No such file or directory Sun 16 Jul 2017 19:47:18 BST | | [coproc] ATI: libaticalrt.so: cannot open shared object file: No such file or directory Sun 16 Jul 2017 19:47:18 BST | | [coproc] clGetDeviceInfo failed to get CL_DEVICE_SIMD_PER_COMPUTE_UNIT_AMD for device 0 Sun 16 Jul 2017 19:47:18 BST | | Host name: orac002 Sun 16 Jul 2017 19:47:18 BST | | Processor: 8 AuthenticAMD AMD FX(tm)-8320 Eight-Core Processor [Family 21 Model 2 Stepping 0] Sun 16 Jul 2017 19:47:18 BST | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc extd_apicid aperfmperf pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 popcnt aes xsave avx f16c lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs xop skinit wdt lwp fma4 tce nodeid_msr tbm topoext perfctr_core perfctr_nb cpb hw_pstate vmmcall bmi1 arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold Sun 16 Jul 2017 19:47:18 BST | | OS: Linux: 4.4.0-85-generic Sun 16 Jul 2017 19:47:18 BST | | Memory: 31.34 GB physical, 119.21 GB virtual Sun 16 Jul 2017 19:47:18 BST | | Disk: 143.48 GB total, 133.37 GB free Sun 16 Jul 2017 19:47:18 BST | | Local time is UTC +1 hours Sun 16 Jul 2017 19:47:18 BST | | Config: GUI RPCs allowed from: Sun 16 Jul 2017 19:47:18 BST | Einstein@Home | URL http://einstein.phys.uwm.edu/; Computer ID 12492607; resource share 100 Sun 16 Jul 2017 19:47:18 BST | Einstein@Home | General prefs: from Einstein@Home (last modified 02-Jul-2017 11:13:33) Sun 16 Jul 2017 19:47:18 BST | Einstein@Home | Host location: none Sun 16 Jul 2017 19:47:18 BST | Einstein@Home | General prefs: using your defaults Sun 16 Jul 2017 19:47:18 BST | | Reading preferences override file Sun 16 Jul 2017 19:47:18 BST | | Preferences: Sun 16 Jul 2017 19:47:18 BST | | max memory usage when active: 32088.72MB Sun 16 Jul 2017 19:47:18 BST | | max memory usage when idle: 32088.72MB Sun 16 Jul 2017 19:47:18 BST | | max disk usage: 5.00GB Sun 16 Jul 2017 19:47:18 BST | | max download rate: 2200000 bytes/sec Sun 16 Jul 2017 19:47:18 BST | | max upload rate: 2200000 bytes/sec Sun 16 Jul 2017 19:47:18 BST | | (to change preferences, visit a project web site or select Preferences in the Manager) Sun 16 Jul 2017 19:47:18 BST | | gui_rpc_auth.cfg is empty - no GUI RPC password protection The problem me thinks is solved! |
Send message Joined: 23 Jul 11 Posts: 14 |
Agentb and Richie thank you very much for all your help! James |
Send message Joined: 30 May 15 Posts: 265 |
I had a brainwave and removed VERY carefully all the extraneous NVIDIA packages and this was the result: OK that's good news, but i'm curious why the nvidia packages were installed. |
Send message Joined: 23 Jul 11 Posts: 14 |
I would look like a red traffic light. The long and the short of it was that on July 4 I was trying to be clever; selected an nvidia package in a rush which pulled in some dependencies and the rest was history. The only problem being was that when I corrected the primary mistake the other packages were not removed and did not think enough. The offending package was boinc-client-nvidia-cuda. Moral of the story must be: Don't try being a smartass. Or like speaking engage brain before opening mouth!!!!!! |
Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document
under the terms of the GNU Free Documentation License,
Version 1.2 or any later version published by the Free Software Foundation.