New motherboard fried, not sure if BOINC is to blame

Message boards : Questions and problems : New motherboard fried, not sure if BOINC is to blame
Message board moderation

To post messages, you must log in.

AuthorMessage
tetamusha

Send message
Joined: 28 Jan 18
Posts: 1
Brazil
Message 84572 - Posted: 28 Jan 2018, 6:14:56 UTC

Hi guys. I've been a volunteer for a few years. Recently, I bought a Dell XPS 8920 with 16 GiB RAM, an i7-7700K CPU @ 4.20GHz and a GeForce GTX 1070. I decided to assign this machine to some GPU tasks, mostly from Collatz Conjecture and GPU Grid. I kept it on while sleeping and when at work, always crunching those tasks, leaving 100% of my CPUs and GPU available for use.

Less than a month after buying the computer, I came home one day to find it turned off and it would not turn back on. Since its warranty was still valid, a Dell technician came, diagnosed the problem as motherboard failure and replaced it.

Since the most hardware intensive software running on the machine at the time was BOINC, I started questioning whether this type of thing may happen because BOINC was running when the computer stopped. Maybe I configured BOINC wrong or something. I know that, in case the CPU overheats, the computer turns off before any possibility of hardware damage, but I never had a computer with a decent GPU and I don't know if the behavior is similar in this case. I also don't know if number crunching a whole day may cause my motherboard to fry.

I was wondering if someone could clarify whether it is possible for BOINC to cause this type of hardware failure or if I should blame a faulty motherboard and jsut install BOINC again. Thanks in advance.
ID: 84572 · Report as offensive
Profile Ageless
Volunteer moderator
Project administrator
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 29 Aug 05
Posts: 12718
Netherlands
Message 84574 - Posted: 28 Jan 2018, 7:33:04 UTC - in response to Message 84572.  

Welcome to the boards. Sorry to read about your motherboard, but I can say with great certainty that BOINC was not to blame. BOINC by itself is just a managing program, it in itself does not add any stress to your system. You can easily test that by installing BOINC and not adding any projects, then you can leave BOINC on the system ad nauseam and it will only fail again if the next motherboard is also bad.

In all my years of computing I only ever had one motherboard go bad (and when it went with a bang, it took my RAM and CPU along with it to the great beyond). This was due to bad capacitors, a thing that was a problem some years back. These days they're all military grade, and gold capped instead of cardboard.
Usually when a motherboard goes it came with micro-fractures already that increased over time due to heavy use (read: heat & cooling cycles).

The project applications will put a heavy stress onto the CPU & GPU, although that does depend on the project and whether they optimised their applications, and if so what kind of optimisation was used (integer mathematics, floating point maths, using simd, using just CAL/CUDA or OpenCL.)
Then it depends on how much cooling your system has, and the amount of maintenance you spend on it. A case with 40 fans will grind to a halt just as fast as a case with 4 fans if you never clean them in a dusty (lived in, animal rich) environment.

So go ahead and reinstall BOINC.
But before you restart your projects, check your maintenance and decide if a little less intense usage is just as well.
Jord
Please do not private message me for tech support. Use the forums for that. Tech PMs will be ignored.

quote: "I'm an apostrophe, I'm just a symbol to remind you that there's more to see"
ID: 84574 · Report as offensive
noderaser
Avatar

Send message
Joined: 2 Jan 14
Posts: 276
United States
Message 84607 - Posted: 31 Jan 2018, 4:32:54 UTC
Last modified: 31 Jan 2018, 4:33:14 UTC

My DVR has been crunching BOINC in the living room for about 8 years now, 24/7 and the only issue I've had with it is a hard drive and power supply. If you decide to come back to BOINC, I would monitor your CPU temperatures and make sure you're not sustaining high temperatures. You can usually get the maximum allowed temperature from the manufacturer's data (Intel Ark shows this as Tjunction or Tcase) and then I use a program like TThrottle to adjust BOINC on the fly to keep a set temperature. I usually set it 10-15 C below the max allowed temperature. You can also use that program to control unwanted fan noise.

Since you're not crunching 24/7, I would definitely make sure your CPU and case cooling is adequate, but a motherboard failure that soon in a computer's life is more likely just a manufacturing defect. Nice that you got it covered under warranty.
My Detailed BOINC Stats
ID: 84607 · Report as offensive
Bob Harder

Send message
Joined: 11 Oct 10
Posts: 12
United States
Message 84642 - Posted: 3 Feb 2018, 16:05:30 UTC
Last modified: 3 Feb 2018, 16:06:52 UTC

You could monitor your temperatures to make sure your system is "overloaded".

SpeedFan is good for the CPU and other system components. GPU-Z is good for the GPU.

Both programs can create a log file,
ID: 84642 · Report as offensive
whynot

Send message
Joined: 8 May 10
Posts: 82
Ukraine
Message 84645 - Posted: 3 Feb 2018, 18:10:10 UTC

Let me participate in this BOINC releated horor story sharing festival. Once I've rubbed through crappy DIMM. Indeed, it took 5.5 years, but, because I'm running 24x7, it is clearly fault of BOINC :D
I'm counting for science,
points just make me sick.
ID: 84645 · Report as offensive
Profile Gary Charpentier
Avatar

Send message
Joined: 23 Feb 08
Posts: 1626
United States
Message 84660 - Posted: 4 Feb 2018, 16:52:49 UTC
Last modified: 4 Feb 2018, 16:53:25 UTC

Boinc did terrible things to my system. Because it made it warm it forced me to open the box and blow out the dust on a regular basis. Then it dried out the faulty electrolytic capacitors on the motherboard. Took about 4 years too. After that it was toast.

Oh, on the other system, the brand new one, it so stressed the CPU it failed. That 10 days out of box. I know it was responsible for that. Never mind Intel making a bad CPU. PS the replacement CPU has been going 5 years now.
ID: 84660 · Report as offensive
Profile BeemerBiker
Avatar

Send message
Joined: 27 Jun 08
Posts: 257
United States
Message 84810 - Posted: 16 Feb 2018, 16:14:49 UTC - in response to Message 84572.  

Hi guys. I've been a volunteer for a few years. Recently, I bought a Dell XPS 8920 with 16 GiB RAM, an i7-7700K CPU @ 4.20GHz and a GeForce GTX 1070.


Dell makes good products and has good tech support and forums both "Owner forums" and Dell community.

That being said, I discovered that my "Sensor Control" program does not regulate and only reports temps on my 1070 gpus. It does a fine job with the cpu sensor, disk sensor, motherboard sensor etc. It automatically scales up or down all the fans speed BUT GPU NEEDS PRECISON X OR AFTERBURNER to set its fan speed. Within hours of posting on the AlienOwners forum I got responses advising to use MSI's Afterburner. Dell uses MSI as OEM for graphic cards supposidly.

Easiest thing to do for tech is test the power supply and if it is good, then simply replace the motherboard. You get to reinstall windows which probably fixes the original problem. They run a diagnostic on the motherboard and then sell it as refurbished.
ID: 84810 · Report as offensive

Message boards : Questions and problems : New motherboard fried, not sure if BOINC is to blame

Copyright © 2018 University of California. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.