Posts by sprzyswa

1) Message boards : Questions and problems : boinc-client crash and reboot my machine (Message 107730)
Posted 5 Apr 2022 by sprzyswa
Post:
Nice conversation here on (water) cooling but completely off topic in this thread. Please start a thread outside this one on the subject and I'll happily move your earlier posts over. But for the topic, let's go back to sprzyswa and his problem(s).


Thanks, but BOINC is not the cause of my problem, so as far as I'm concerned this discussion is closed.

Sam.
2) Message boards : Questions and problems : boinc-client crash and reboot my machine (Message 107712)
Posted 4 Apr 2022 by sprzyswa
Post:
BOINC (the program) is highly unlikely to stress "4 CPUs are at more than 90% load", except for 60 seconds or less at startup during benchmarking. That level of CPU activity is more likely to be attributable to one or more science projects, running under the direction of BOINC. We haven't discussed projects yet in this thread.


I run Einstein@Home and LHC@home, and all 4 CPUs are often at more than 90% load; I had to install a liquid cooler to keep the processor temperature from rising too much.

Sam.
3) Message boards : Questions and problems : boinc-client crash and reboot my machine (Message 107708)
Posted 4 Apr 2022 by sprzyswa
Post:
An additional idea that came to my mind.

One of the first tests the BOINC client does is to launch a subprocess that checks the GPU capabilities.
If the system uses a wrong or somehow broken GPU driver this may be a possible source for the trouble.

This could be tested by starting BOINC without GPU support.
See options "<ignore_ati_dev>N</ignore_ati_dev>" ... "<ignore_nvidia_dev>N</ignore_nvidia_dev>":
https://boinc.berkeley.edu/wiki/Client_configuration
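
As a rough sketch of the test suggested above, the following Python snippet builds a cc_config.xml body containing the two ignore options. The function name build_cc_config and the device numbers are illustrative assumptions, not something given in the thread; the real device numbers are the ones the client reports at startup, and the resulting file would go in the BOINC data directory (for example /var/lib/boinc-client on the systems discussed here).

```python
import xml.etree.ElementTree as ET


def build_cc_config(nvidia_devs=(), ati_devs=()):
    """Return cc_config.xml text asking the client to ignore the given GPUs.

    The device numbers passed in are placeholders; use the numbers the
    client itself reports for the GPUs on the machine.
    """
    root = ET.Element("cc_config")
    options = ET.SubElement(root, "options")
    for dev in nvidia_devs:
        ET.SubElement(options, "ignore_nvidia_dev").text = str(dev)
    for dev in ati_devs:
        ET.SubElement(options, "ignore_ati_dev").text = str(dev)
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical example: ignore NVIDIA device 0 only.
    print(build_cc_config(nvidia_devs=[0]))
```

The client only picks up the file after a restart or after being told to re-read its config files.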


I am trying to recompile boinc-client, and I had the same problem while compiling on 3 CPUs that were working at more than 90%: the machine rebooted. So BOINC is not the cause; I think it is a hardware problem, either the motherboard or the processor. Besides, I also had this problem during memory tests using the 4 CPUs, which is the same situation as with BOINC, when the 4 CPUs are at more than 90% load. So sorry to have incriminated BOINC, which I think has nothing to do with it.

Sam.
4) Message boards : Questions and problems : boinc-client crash and reboot my machine (Message 107704)
Posted 4 Apr 2022 by sprzyswa
Post:
A (very long) while ago I had a power outage due to a heavy thunderstorm.
After that all my machines rebooted and seemed to work fine for a couple of weeks.
Then, all of a sudden, 1 machine crashed (rebooted) every time BOINC started a GPU task.

I finally could trace the error down to a corrupted filesystem (multiple sector allocation on the hard disk).
The solution was to
- back up all data
- reformat the disk
- restore all data
- force a reinstall of the OS, drivers and all applications

Since then the machine runs fine again.


I checked the RAID-1 disks with smartctl and e2fsck and they are clean, and I checked the RAM, which is clean too. I also reinstalled the whole system (Ubuntu 20.04). I think it is a hardware problem, motherboard or CPU, because even with the application on a bootable USB key I have the same problem.

With VirtualBox on my machine I got the same problem...

Sam.
5) Message boards : Questions and problems : boinc-client crash and reboot my machine (Message 107695)
Posted 3 Apr 2022 by sprzyswa
Post:
You know you can run BOINC without VirtualBox. It's only needed if the project in question only has that type of work unit. I'm still crunching for Rosetta although they have a "rosetta python projects" which uses vbox. I'm running their normal CPU work units only. VirtualBox allows you to run virtual machines. In the case of the various BOINC projects they supply work units which are a VM image.

The Linux versions of BOINC don't keep an old version of stdoutdae around. They use the standard Linux way of doing things. You can view the log by doing a "sudo journalctl --unit=boinc-client" command in a terminal or ssh session. The journal entries can go back quite some time (I have some from Sept 2021 on one machine).
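
A minimal sketch of reading that journal from a script, assuming Python is available on the machine. The helper names (journal_command, read_journal, filter_lines) are made up for illustration; only the journalctl options --unit, --no-pager and --since are real. The demo at the bottom does not call journalctl at all, it just filters a sample line of the kind quoted elsewhere in this thread.

```python
import subprocess


def journal_command(unit="boinc-client", since=None):
    """Build the journalctl argument list for one systemd unit."""
    cmd = ["journalctl", f"--unit={unit}", "--no-pager"]
    if since:
        cmd.append(f"--since={since}")
    return cmd


def read_journal(unit="boinc-client", since=None):
    """Run journalctl and return its output (may need sudo or group access)."""
    result = subprocess.run(
        journal_command(unit, since), capture_output=True, text=True
    )
    return result.stdout


def filter_lines(text, needle):
    """Keep only the lines containing `needle`, ignoring case."""
    return [line for line in text.splitlines() if needle.lower() in line.lower()]


if __name__ == "__main__":
    sample = (
        "Mar 30 00:00:48 jupiter boinc[11481]: dir_open: Could not open "
        "directory 'locale' from '/var/lib/boinc-client'."
    )
    for line in filter_lines(sample, "could not open"):
        print(line)
```

On a real system, filter_lines(read_journal(since="2022-03-28"), "error") would narrow the output to lines mentioning errors since that date.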

As for directory naming on Raspberry Pi OS, they are following the Debian standard. Debian used to use /var/lib/boinc-client and other Linux distros use /var/lib/boinc. The later BOINC releases have both folders and a symlink from /var/lib/boinc to /var/lib/boinc-client so they are basically the same directory.


I even reinstalled my machine to have a clean system, but after installing boinc-client I still get the same thing: a clean REBOOT, and that broke my RAID-1. I now have a disk resynchronizing...

Sam.
6) Message boards : Questions and problems : boinc-client crash and reboot my machine (Message 107692)
Posted 3 Apr 2022 by sprzyswa
Post:
You know you can run BOINC without VirtualBox. It's only needed if the project in question only has that type of work unit. I'm still crunching for Rosetta although they have a "rosetta python projects" which uses vbox. I'm running their normal CPU work units only. VirtualBox allows you to run virtual machines. In the case of the various BOINC projects they supply work units which are a VM image.

The Linux versions of BOINC don't keep an old version of stdoutdae around. They use the standard Linux way of doing things. You can view the log by doing a "sudo journalctl --unit=boinc-client" command in a terminal or ssh session. The journal entries can go back quite some time (I have some from Sept 2021 on one machine).

As for directory naming on Raspberry Pi OS, they are following the Debian standard. Debian used to use /var/lib/boinc-client and other Linux distros use /var/lib/boinc. The later BOINC releases have both folders and a symlink from /var/lib/boinc to /var/lib/boinc-client so they are basically the same directory.
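
Whether the two directory names really point at the same place on a given machine can be checked with a few lines of Python. This is only a sketch; the function name same_directory is an assumption, and the two paths are the ones mentioned in the paragraph above.

```python
import os


def same_directory(path_a, path_b):
    """True if both paths exist and resolve to the same directory.

    os.path.samefile follows symlinks, so a symlink and its target
    compare as equal.
    """
    if not (os.path.isdir(path_a) and os.path.isdir(path_b)):
        return False
    return os.path.samefile(path_a, path_b)


if __name__ == "__main__":
    a, b = "/var/lib/boinc", "/var/lib/boinc-client"
    # realpath shows where a symlink ends up (or the path itself if none).
    print(a, "->", os.path.realpath(a))
    print("same directory:", same_directory(a, b))
```

If either directory is missing, the function simply returns False rather than raising an error.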


Yes, I know all that. I installed boinc-client on Debian 9.13 without problems, but my machine runs Ubuntu 20.04, and even without VirtualBox installed the machine reboots after launching boinc-client...

Sam.
7) Message boards : Questions and problems : boinc-client crash and reboot my machine (Message 107689)
Posted 3 Apr 2022 by sprzyswa
Post:
Hi,

There seems to be a problem running boinc-client on Ubuntu 20.04. I think I will abandon this project...

Sam.
8) Message boards : Questions and problems : boinc-client crash and reboot my machine (Message 107679)
Posted 1 Apr 2022 by sprzyswa
Post:
Sorry but the problem came back and seems to come from VirtualBox...

Sam.
9) Message boards : Questions and problems : boinc-client crash and reboot my machine (Message 107661)
Posted 30 Mar 2022 by sprzyswa
Post:
Hi,

The problem is solved: my home directory was corrupted. I set my whole directory back to my username and group, restarted the machine, and everything is OK.

Thanks for your help.

Sam.
10) Message boards : Questions and problems : boinc-client crash and reboot my machine (Message 107659)
Posted 30 Mar 2022 by sprzyswa
Post:
Hi,

I'm going to have to abandon the BOINC project, because as soon as I launch boinc-client the machine reboots: no crash, but a clean reboot...

Sam.
11) Message boards : Questions and problems : boinc-client crash and reboot my machine (Message 107637)
Posted 29 Mar 2022 by sprzyswa
Post:
The log history for Linux systemd installations is kept in the system journal.

OK, thanks, but there is nothing wrong in syslog, just this:

Mar 30 00:00:48 jupiter boinc[11481]: dir_open: Could not open directory 'locale' from '/var/lib/boinc-client'.

Sam.
12) Message boards : Questions and problems : boinc-client crash and reboot my machine (Message 107634)
Posted 29 Mar 2022 by sprzyswa
Post:
You should have the Log backup file stdoutdae.old in your /var/lib/boinc-client directory

No, I have no stdoutdae.old file in my /var/lib/boinc-client directory!?

Sam.
13) Message boards : Questions and problems : boinc-client crash and reboot my machine (Message 107619)
Posted 28 Mar 2022 by sprzyswa
Post:
I also uninstalled boinc-client, deleted the /var/lib/boinc-client directory and reinstalled boinc-client, but it's still the same.
14) Message boards : Questions and problems : boinc-client crash and reboot my machine (Message 107618)
Posted 28 Mar 2022 by sprzyswa
Post:
Hi,

I use liquid cooling; the CPU temperature is around 50 °C and is permanently displayed on the desktop.

I've checked memory, disks and swap status; I'm at about 50% memory usage (32 GB). I've tested running ONLY boinc-client to be sure the problem did not come from another application.

Sam.
15) Message boards : Questions and problems : boinc-client crash and reboot my machine (Message 107616)
Posted 28 Mar 2022 by sprzyswa
Post:
Hi,
For the past few days, my boinc-client installation on Ubuntu 20.04 (5.4.0-105-generic) has been crashing and rebooting my machine after a few minutes, although it had been working perfectly for a year. Does anyone have an explanation?
Sam.
16) Message boards : Questions and problems : Memory usage by Boinc (Message 105177)
Posted 18 Aug 2021 by sprzyswa
Post:
You shouldn't need to do anything like that, as with each change of project applications are swapped out of memory and new ones start. At least at the default setting of "Leave applications in memory" set to No.

Even applications within the same project, when tasks end and new ones start, they do so with their own science application, leaving memory and starting anew.


I did that and added 16 GB of memory; everything works fine now, and the machine is more comfortable to work on.

Thanks again !

Sam.
17) Message boards : Questions and problems : Memory usage by Boinc (Message 105133)
Posted 14 Aug 2021 by sprzyswa
Post:
You shouldn't need to do anything like that, as with each change of project applications are swapped out of memory and new ones start. At least at the default setting of "Leave applications in memory" set to No.

Even applications within the same project, when tasks end and new ones start, they do so with their own science application, leaving memory and starting anew.


This seems to have solved the memory usage problem, after several task and project changes.

Thanks a lot !

Sam.
18) Message boards : Questions and problems : Memory usage by Boinc (Message 105127)
Posted 14 Aug 2021 by sprzyswa
Post:
You shouldn't need to do anything like that, as with each change of project applications are swapped out of memory and new ones start. At least at the default setting of "Leave applications in memory" set to No.

Even applications within the same project, when tasks end and new ones start, they do so with their own science application, leaving memory and starting anew.


OK, I modified this parameter in the preferences of each project and restarted BOINC to see. I am at 80% memory usage for the BOINC projects, and I have ordered 16 GB of additional memory.

Thanks again for your help.

Sam.
19) Message boards : Questions and problems : Memory usage by Boinc (Message 105119)
Posted 13 Aug 2021 by sprzyswa
Post:
Because that exits the tasks out of memory and restarts them, possibly from the beginning.


Apparently they start again where they were stopped...

The problem is that it would have to be done automatically at each change of project, and I do not know how to do that...
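
One possible way to automate this, shown purely as a sketch under assumptions: read /proc/meminfo, work out how much memory is in use, and only then suggest a restart. The 80% threshold, the names memory_used_percent, THRESHOLD_PERCENT and RESTART_COMMAND, and the use of systemctl to restart the boinc-client unit are all illustrative choices, not something established in this thread. The script only prints the suggested command, it does not run it.

```python
THRESHOLD_PERCENT = 80.0  # hypothetical limit, pick what suits the machine
RESTART_COMMAND = ["systemctl", "restart", "boinc-client"]  # assumed unit name


def memory_used_percent(meminfo_text):
    """Compute used memory in percent from /proc/meminfo-style text.

    "Used" here means MemTotal minus MemAvailable, both given in kB.
    """
    fields = {}
    for line in meminfo_text.splitlines():
        key, _, rest = line.partition(":")
        parts = rest.split()
        if parts:
            fields[key.strip()] = int(parts[0])
    total = fields["MemTotal"]
    available = fields["MemAvailable"]
    return 100.0 * (total - available) / total


if __name__ == "__main__":
    try:
        with open("/proc/meminfo") as f:
            used = memory_used_percent(f.read())
    except OSError:
        used = None  # /proc/meminfo not readable (e.g. not a Linux system)
    if used is not None:
        print(f"memory used: {used:.1f}%")
        if used > THRESHOLD_PERCENT:
            print("suggested:", " ".join(RESTART_COMMAND))
```

Run from cron or a systemd timer, something like this would cover the "at each change of project" case only indirectly, since it reacts to memory pressure rather than to project switches.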

Sam.
20) Message boards : Questions and problems : Memory usage by Boinc (Message 105117)
Posted 13 Aug 2021 by sprzyswa
Post:
What is your setup for Atlas tasks? How many CPU cores are you using per task? This can be set up on the LHC web site or by an app_config.xml. Are you using VirtualBox or are you running Atlas tasks as native Linux tasks? On the LHC website project preferences you can also limit how many tasks you have on your computer at one time. On BOINC preferences (locally with BOINC Manager or on the project website) you can limit how many CPU cores you allow BOINC to use at any time.
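
For the app_config.xml route mentioned above, here is a small sketch that generates a file limiting how many tasks run at once. The function name build_app_config and the application name EXAMPLE_APP are placeholders (the real application name has to be taken from the project itself); the file would sit in that project's folder inside the BOINC data directory.

```python
import xml.etree.ElementTree as ET


def build_app_config(app_name, max_concurrent, project_max_concurrent=None):
    """Return app_config.xml text limiting concurrent tasks.

    app_name is the project's short application name (placeholder here);
    project_max_concurrent optionally caps all tasks of the project.
    """
    root = ET.Element("app_config")
    app = ET.SubElement(root, "app")
    ET.SubElement(app, "name").text = app_name
    ET.SubElement(app, "max_concurrent").text = str(max_concurrent)
    if project_max_concurrent is not None:
        ET.SubElement(root, "project_max_concurrent").text = str(
            project_max_concurrent
        )
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical values: at most 1 task of EXAMPLE_APP, 2 for the project.
    print(build_app_config("EXAMPLE_APP", 1, project_max_concurrent=2))
```

This only limits the number of simultaneous tasks; the number of CPU cores per task is a separate setting.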


The problem occurs whatever the project; I have the same thing with Einstein@Home. The only way to correct it is to run "boinc-client restart" when it happens...

Sam.



Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.