DHEP computation errors - Debian amd64

Message boards : Questions and problems : DHEP computation errors - Debian amd64
Message board moderation

To post messages, you must log in.

AuthorMessage
hgnoel1980

Send message
Joined: 17 Feb 19
Posts: 3
United Kingdom
Message 90156 - Posted: 17 Feb 2019, 12:20:30 UTC

I have noticed all jobs are failing with Computation Error on my Debian Unstable amd64 machine.
These were running fine until recently and work fine under Win10.
I have reset the project however this does not appear to have resolved the probem.
The slots directory appears to be missing the jre - this appears in the Win10 boinc dir slots however not under the Linux ones.

Win10 slots contain - jre directory, boinc_lockfile, in, init_data.xml, ITClient.jar, java.policy, job.xml, srterr.ext, wrapper_dhe-custom-2R_windows-x86_64.exe
Linux slots contain - init_data.xml, job.xml, wrapper_dhe-custom-2_x86_64-pc-linux-gnu

The project folder for dhep on both Win10 and Linux contain the same files.

The only error other than a generic computation error to the WU is from the log which advise the below
"Distributed Hardware Evolution Project | [error] merge_info(): failed to change permissions of wrapper_dhe-custom-2_x86_64-pc-linux-gnu"
wrapper_dhe-custom-2_x86_64-pc-linux-gnu is rwx for owner, group and other.

Any help would be greatly appreciated.

Harry
ID: 90156 · Report as offensive
Juha
Volunteer developer
Volunteer tester
Help desk expert

Send message
Joined: 20 Nov 12
Posts: 801
Finland
Message 90164 - Posted: 17 Feb 2019, 21:01:25 UTC - in response to Message 90156.  
Last modified: 17 Feb 2019, 21:01:58 UTC

Could you enable <slot_debug> logging flag? At least we'll see what file BOINC is having problems with.

As for failing to change permissions of the wrapper. What permissions and owner project directory has?

And lastly. In the DHEP thread you wondered if any recent system update could have made the app fail. Well, has there been system updates since the last time the app worked?
ID: 90164 · Report as offensive
hgnoel1980

Send message
Joined: 17 Feb 19
Posts: 3
United Kingdom
Message 90227 - Posted: 23 Feb 2019, 12:41:44 UTC - in response to Message 90164.  

Apologies - I have been on annual leave for a couple of days away from my computer.

I have enabled <slot_debug> below is the output I receive.

Sat 23 Feb 2019 12:28:54 GMT | Distributed Hardware Evolution Project | Requesting new tasks for CPU
Sat 23 Feb 2019 12:29:06 GMT | Distributed Hardware Evolution Project | Scheduler request completed: got 2 new tasks
Sat 23 Feb 2019 12:29:06 GMT | Distributed Hardware Evolution Project | [error] merge_info(): failed to change permissions of wrapper_dhe-custom-2_x86_64-pc-linux-gnu
Sat 23 Feb 2019 12:29:06 GMT | | [slot] removed file projects/dhep.ga_boinc/wPskrnARlJrHX1Tot_0_r1113671262_0
Sat 23 Feb 2019 12:29:06 GMT | | [slot] removed file projects/dhep.ga_boinc/wPskrnARlJrHX1Tot_0_r1113671262_0.gz
Sat 23 Feb 2019 12:29:06 GMT | | [slot] removed file projects/dhep.ga_boinc/wPskrnARlJrHX1Tot_0_r1113671262_0.gzt
Sat 23 Feb 2019 12:29:06 GMT | | [slot] removed file projects/dhep.ga_boinc/wYAbrmaHZbd8UE1f4_0_r741950838_0
Sat 23 Feb 2019 12:29:06 GMT | | [slot] removed file projects/dhep.ga_boinc/wYAbrmaHZbd8UE1f4_0_r741950838_0.gz
Sat 23 Feb 2019 12:29:06 GMT | | [slot] removed file projects/dhep.ga_boinc/wYAbrmaHZbd8UE1f4_0_r741950838_0.gzt
Sat 23 Feb 2019 12:29:08 GMT | | [slot] cleaning out slots/10: get_free_slot()
Sat 23 Feb 2019 12:29:08 GMT | Distributed Hardware Evolution Project | [slot] assigning slot 10 to w5lnE8qmsq3Wewacy_0
Sat 23 Feb 2019 12:29:08 GMT | | [slot] cleaning out slots/11: get_free_slot()
Sat 23 Feb 2019 12:29:08 GMT | | [slot] removed file slots/11/init_data.xml
Sat 23 Feb 2019 12:29:08 GMT | | [slot] removed file slots/11/job.xml
Sat 23 Feb 2019 12:29:08 GMT | | [slot] removed file slots/11/wrapper_dhe-custom-2_x86_64-pc-linux-gnu
Sat 23 Feb 2019 12:29:08 GMT | Distributed Hardware Evolution Project | [slot] assigning slot 11 to wqlGOiB3NYlaAIfrD_0
Sat 23 Feb 2019 12:29:08 GMT | | [slot] removed file slots/10/init_data.xml
Sat 23 Feb 2019 12:29:08 GMT | Distributed Hardware Evolution Project | setup_file: projects/dhep.ga_boinc/wrapper_dhe-custom-2_x86_64-pc-linux-gnu (input)
Sat 23 Feb 2019 12:29:08 GMT | Distributed Hardware Evolution Project | [slot] linked ../../projects/dhep.ga_boinc/wrapper_dhe-custom-2_x86_64-pc-linux-gnu to slots/10/wrapper_dhe-custom-2_x86_64-pc-linux-gnu
Sat 23 Feb 2019 12:29:08 GMT | Distributed Hardware Evolution Project | setup_file: projects/dhep.ga_boinc/jreitc_job_2.08.xml (input)
Sat 23 Feb 2019 12:29:08 GMT | | [slot] removed file slots/11/init_data.xml
Sat 23 Feb 2019 12:29:08 GMT | Distributed Hardware Evolution Project | setup_file: projects/dhep.ga_boinc/wrapper_dhe-custom-2_x86_64-pc-linux-gnu (input)
Sat 23 Feb 2019 12:29:08 GMT | Distributed Hardware Evolution Project | [slot] linked ../../projects/dhep.ga_boinc/wrapper_dhe-custom-2_x86_64-pc-linux-gnu to slots/11/wrapper_dhe-custom-2_x86_64-pc-linux-gnu
Sat 23 Feb 2019 12:29:08 GMT | Distributed Hardware Evolution Project | setup_file: projects/dhep.ga_boinc/jreitc_job_2.08.xml (input)
Sat 23 Feb 2019 12:29:09 GMT | Distributed Hardware Evolution Project | Computation for task w5lnE8qmsq3Wewacy_0 finished
Sat 23 Feb 2019 12:29:09 GMT | Distributed Hardware Evolution Project | Computation for task wqlGOiB3NYlaAIfrD_0 finished
Sat 23 Feb 2019 12:29:09 GMT | | [slot] cleaning out slots/10: get_free_slot()
Sat 23 Feb 2019 12:29:09 GMT | | [slot] removed file slots/10/init_data.xml
Sat 23 Feb 2019 12:29:09 GMT | | [slot] removed file slots/10/job.xml
Sat 23 Feb 2019 12:29:09 GMT | | [slot] removed file slots/10/wrapper_dhe-custom-2_x86_64-pc-linux-gnu
Sat 23 Feb 2019 12:29:13 GMT | | [slot] removed file slots/4/init_data.xml
Sat 23 Feb 2019 12:29:13 GMT | | [slot] removed file slots/4/boinc_temporary_exit
Sat 23 Feb 2019 12:29:13 GMT | | [slot] removed file slots/10/init_data.xml
Sat 23 Feb 2019 12:29:13 GMT | | [slot] removed file slots/10/boinc_temporary_exit

Project directories and all BOINC directories are held on an NTFS HDD shared between Debian and Win10 - both OS have an individual BOINC directory.
All BOINC directories are rwx including the DHEP project directory, they are mounted with owner = ME group = USERS. My user and the BOINC user are both member of the USERS system group

I'm running unstable so there have been numerous updates and I tend to install when they become available - however I'm not sure exactly when I started getting errors for DHEP so am struggling to narrow it down to a selection of packages. Is there something specific that is a likely culprit? I am having no issues with any of the other projects I am running it only appears to be affecting DHEP.

Thanks for your help

Harry
ID: 90227 · Report as offensive
hgnoel1980

Send message
Joined: 17 Feb 19
Posts: 3
United Kingdom
Message 90229 - Posted: 23 Feb 2019, 15:50:17 UTC

I've had a play with the mounting options for the NTFS partition - this appears to have resolved the issue - not sure what had changed or why it was only affecting DHEP - however I now appear to have running WU's again.

I will let you know if I get any further issues.

KR

Harry
ID: 90229 · Report as offensive
Juha
Volunteer developer
Volunteer tester
Help desk expert

Send message
Joined: 20 Nov 12
Posts: 801
Finland
Message 90231 - Posted: 23 Feb 2019, 18:54:32 UTC - in response to Message 90229.  

Okay, NTFS probably explains the errors.

[error] merge_info(): failed to change permissions of wrapper_dhe-custom-2_x86_64-pc-linux-gnu


BOINC wanted this file to have S_IRUSR | S_IWUSR | S_IXUSR | S_IRGRP | S_IXGRP | S_IROTH | S_IXOTH permissions but the chmod() call failed. Even though the log message says 'error' it's really ignored.

setup_file: projects/dhep.ga_boinc/jreitc_job_2.08.xml (input)


I'm not entirely sure but setting up this file probably fails because chmod() fails. In this case the failure is not logged but it's a hard error.

You didn't say but I'm guessing you either told NTFS-3g to be silent on chmod() failures or to use Unix access control.
ID: 90231 · Report as offensive

Message boards : Questions and problems : DHEP computation errors - Debian amd64

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.