Posts by Nikolay Polyarniy

InfoMessage
1) Message boards : GPUs : clinfo.exe wrong values for CL_DEVICE_ADDRESS_BITS
Message 75170
Posted 10 Jan 2017 by Nikolay Polyarniy
Ok, thanks!
2) Message boards : GPUs : clinfo.exe wrong values for CL_DEVICE_ADDRESS_BITS
Message 75162
Posted 9 Jan 2017 by Nikolay Polyarniy
I am sure, that driver for Intel HD 4600 is actual (I installed it today from my notebook vendor site).
Moreover - the problem exists for CPU too.
3) Message boards : GPUs : clinfo.exe wrong values for CL_DEVICE_ADDRESS_BITS
Message 75159
Posted 9 Jan 2017 by Nikolay Polyarniy
On 64-bit Windows clinfo.exe outputs strange values for "Address bits" (CL_DEVICE_ADDRES_BITS):

14757395255531667520 and 14757395255531667488

While expected: 64 and 32.

I notice, that:
14757395255531667520 = 0b1100110011001100110011001100110000000000000000000000000001000000
14757395255531667488 = 0b1100110011001100110011001100110000000000000000000000000000100000

Also google returns a lot of results with outputs with the same values.

So upper bits are seems to be uninitialized. May be your clinfo reads CL_DEVICE_ADDRES_BITS into size_t value? (While should read into cl_uint, according to documentation - https://www.khronos.org/registry/OpenCL/sdk/1.0/docs/man/xhtml/clGetDeviceInfo.html)

Full output:
Number of platforms:                             2
  Platform Profile:                              FULL_PROFILE
  Platform Version:                              OpenCL 1.1 CUDA 6.0.1
  Platform Name:                                 NVIDIA CUDA
  Platform Vendor:                               NVIDIA Corporation
  Platform Extensions:                           cl_khr_byte_addressable_store c
l_khr_icd cl_khr_gl_sharing cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_
sharing cl_nv_d3d11_sharing cl_nv_compiler_options cl_nv_device_attribute_query
cl_nv_pragma_unroll
  Platform Profile:                              FULL_PROFILE
  Platform Version:                              OpenCL 1.2
  Platform Name:                                 Intel(R) OpenCL
  Platform Vendor:                               Intel(R) Corporation
  Platform Extensions:                           cl_intel_dx9_media_sharing cl_k
hr_3d_image_writes cl_khr_byte_addressable_store cl_khr_d3d11_sharing cl_khr_dep
th_images cl_khr_dx9_media_sharing cl_khr_gl_sharing cl_khr_global_int32_base_at
omics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_local_int32_base_at
omics cl_khr_local_int32_extended_atomics cl_khr_spir


  Platform Name:                                 NVIDIA CUDA
Number of devices:                               1
  Device Type:                                   CL_DEVICE_TYPE_GPU
  Device ID:                                     4318
  Max compute units:                             2
  Max work items dimensions:                     3
    Max work items[0]:                           1024
    Max work items[1]:                           1024
    Max work items[2]:                           64
  Max work group size:                           1024
  Preferred vector width char:                   1
  Preferred vector width short:                  1
  Preferred vector width int:                    1
  Preferred vector width long:                   1
  Preferred vector width float:                  1
  Preferred vector width double:                 1
  Max clock frequency:                           705Mhz
  Address bits:                                  14757395255531667488
  Max memory allocation:                         536870912
  Image support:                                 Yes
  Max number of images read arguments:           256
  Max number of images write arguments:          16
  Max image 2D width:                            32768
  Max image 2D height:                           32768
  Max image 3D width:                            4096
  Max image 3D height:                           4096
  Max image 3D depth:                            4096
  Max samplers within kernel:                    32
  Max size of kernel argument:                   4352
  Alignment (bits) of base address:              4096
  Minimum alignment (bytes) for any datatype:    128
  Single precision floating point capability
    Denorms:                                     Yes
    Quiet NaNs:                                  Yes
    Round to nearest even:                       Yes
    Round to zero:                               Yes
    Round to +ve and infinity:                   Yes
    IEEE754-2008 fused multiply-add:             Yes
  Cache type:                                    Read/Write
  Cache line size:                               128
  Cache size:                                    32768
  Global memory size:                            2147483648
  Constant buffer size:                          65536
  Max number of constant args:                   9
  Local memory type:                             Scratchpad
  Local memory size:                             49152
  Error correction support:                      0
  Profiling timer resolution:                    1000
  Device endianess:                              Little
  Available:                                     Yes
  Compiler available:                            Yes
  Execution capabilities:
    Execute OpenCL kernels:                      Yes
    Execute native function:                     No
  Queue properties:
    Out-of-Order:                                Yes
    Profiling :                                  Yes
  Platform ID:                                   00811B38
  Name:                                          Quadro K1100M
  Vendor:                                        NVIDIA Corporation
  Driver version:                                331.65
  Profile:                                       FULL_PROFILE
  Version:                                       OpenCL 1.1 CUDA
  Extensions:                                    cl_khr_byte_addressable_store c
l_khr_icd cl_khr_gl_sharing cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_
sharing cl_nv_d3d11_sharing cl_nv_compiler_options cl_nv_device_attribute_query
cl_nv_pragma_unroll  cl_khr_global_int32_base_atomics cl_khr_global_int32_extend
ed_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics c
l_khr_fp64


  Platform Name:                                 Intel(R) OpenCL
Number of devices:                               2
  Device Type:                                   CL_DEVICE_TYPE_CPU
  Device ID:                                     32902
  Max compute units:                             8
  Max work items dimensions:                     3
    Max work items[0]:                           8192
    Max work items[1]:                           8192
    Max work items[2]:                           8192
  Max work group size:                           8192
  Preferred vector width char:                   1
  Preferred vector width short:                  1
  Preferred vector width int:                    1
  Preferred vector width long:                   1
  Preferred vector width float:                  1
  Preferred vector width double:                 1
  Max clock frequency:                           2200Mhz
  Address bits:                                  14757395255531667488
  Max memory allocation:                         536838144
  Image support:                                 Yes
  Max number of images read arguments:           480
  Max number of images write arguments:          480
  Max image 2D width:                            16384
  Max image 2D height:                           16384
  Max image 3D width:                            2048
  Max image 3D height:                           2048
  Max image 3D depth:                            2048
  Max samplers within kernel:                    480
  Max size of kernel argument:                   3840
  Alignment (bits) of base address:              1024
  Minimum alignment (bytes) for any datatype:    128
  Single precision floating point capability
    Denorms:                                     Yes
    Quiet NaNs:                                  Yes
    Round to nearest even:                       Yes
    Round to zero:                               No
    Round to +ve and infinity:                   No
    IEEE754-2008 fused multiply-add:             No
  Cache type:                                    Read/Write
  Cache line size:                               64
  Cache size:                                    262144
  Global memory size:                            2147352576
  Constant buffer size:                          131072
  Max number of constant args:                   480
  Local memory type:                             Global
  Local memory size:                             32768
  Error correction support:                      0
  Profiling timer resolution:                    466
  Device endianess:                              Little
  Available:                                     Yes
  Compiler available:                            Yes
  Execution capabilities:
    Execute OpenCL kernels:                      Yes
    Execute native function:                     Yes
  Queue properties:
    Out-of-Order:                                Yes
    Profiling :                                  Yes
  Platform ID:                                   00702CA8
  Name:                                          Intel(R) Core(TM) i7-4702HQ CPU
 @ 2.20GHz
  Vendor:                                        Intel(R) Corporation
  Driver version:                                4.2.0.148
  Profile:                                       FULL_PROFILE
  Version:                                       OpenCL 1.2 (Build 148)
  Extensions:                                    cl_khr_icd cl_khr_global_int32_
base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomic
s cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_spir
cl_intel_exec_by_local_thread cl_khr_depth_images cl_khr_3d_image_writes cl_khr_
fp64 cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d11_sharing cl
_khr_gl_sharing


  Device Type:                                   CL_DEVICE_TYPE_GPU
  Device ID:                                     32902
  Max compute units:                             20
  Max work items dimensions:                     3
    Max work items[0]:                           512
    Max work items[1]:                           512
    Max work items[2]:                           512
  Max work group size:                           512
  Preferred vector width char:                   1
  Preferred vector width short:                  1
  Preferred vector width int:                    1
  Preferred vector width long:                   1
  Preferred vector width float:                  1
  Preferred vector width double:                 0
  Max clock frequency:                           600Mhz
  Address bits:                                  14757395255531667520
  Max memory allocation:                         373712486
  Image support:                                 Yes
  Max number of images read arguments:           128
  Max number of images write arguments:          8
  Max image 2D width:                            16384
  Max image 2D height:                           16384
  Max image 3D width:                            2048
  Max image 3D height:                           2048
  Max image 3D depth:                            2048
  Max samplers within kernel:                    16
  Max size of kernel argument:                   1024
  Alignment (bits) of base address:              1024
  Minimum alignment (bytes) for any datatype:    128
  Single precision floating point capability
    Denorms:                                     No
    Quiet NaNs:                                  Yes
    Round to nearest even:                       Yes
    Round to zero:                               Yes
    Round to +ve and infinity:                   Yes
    IEEE754-2008 fused multiply-add:             No
  Cache type:                                    Read/Write
  Cache line size:                               64
  Cache size:                                    2097152
  Global memory size:                            1494849946
  Constant buffer size:                          65536
  Max number of constant args:                   8
  Local memory type:                             Scratchpad
  Local memory size:                             65536
  Error correction support:                      0
  Profiling timer resolution:                    80
  Device endianess:                              Little
  Available:                                     Yes
  Compiler available:                            Yes
  Execution capabilities:
    Execute OpenCL kernels:                      Yes
    Execute native function:                     No
  Queue properties:
    Out-of-Order:                                No
    Profiling :                                  Yes
  Platform ID:                                   00702CA8
  Name:                                          Intel(R) HD Graphics 4600
  Vendor:                                        Intel(R) Corporation
  Driver version:                                10.18.14.4112
  Profile:                                       FULL_PROFILE
  Version:                                       OpenCL 1.2
  Extensions:                                    cl_intel_accelerator cl_intel_a
dvanced_motion_estimation cl_intel_ctz cl_intel_d3d11_nv12_media_sharing cl_inte
l_dx9_media_sharing cl_intel_motion_estimation cl_intel_simultaneous_sharing cl_
intel_subgroups cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_d3d1
0_sharing cl_khr_d3d11_sharing cl_khr_depth_images cl_khr_dx9_media_sharing cl_k
hr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_khr_gl_sharing cl_k
hr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_
khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_exten
ded_atomics cl_khr_spir


P.S. I didn't found source code for clinfo on github - https://github.com/BOINC, may be you will upload it there?


Copyright © 2025 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.