Stalled downloads

Message boards : Questions and problems : Stalled downloads
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · Next

AuthorMessage
Mad_Max

Send message
Joined: 29 Apr 19
Posts: 19
Russia
Message 95952 - Posted: 19 Feb 2020, 12:13:53 UTC - in response to Message 95860.  

Unfortunately, the log doesn't show you cancelling the download - which is probably the log's fault, not yours.
..............

Will do. But restarting the machine (or presumably just the Boinc client) always clears the problem. My problem is it doesn't clear without restarting.

Same problem here.
Some downloads keeps stalling (= stop flow of work for R@H as BOINC stops getting new work from R@H and switch to backup project - WCG in my case) every few fays (~ one file stuck of few hundreds downloads clearly ).
Some of the cases, retry transfer does not help (is probable a R@H server bug), aborting stuck transfer usually work, but it some cases it does not help either so i need to restart BOINC completely to resume normal work flow.

There are also no any entries in BOINC event log about aborting transfer:
19-Feb-2020 13:32:36 [Einstein@Home] Requesting new tasks for AMD/ATI GPU
19-Feb-2020 13:32:43 [Einstein@Home] Scheduler request completed: got 1 new tasks
-------------here i have aborted stalled download, but it is not logged -----------
19-Feb-2020 13:33:24 [Rosetta@home] update requested by user
19-Feb-2020 13:33:27 [Rosetta@home] Sending scheduler request: Requested by user.
19-Feb-2020 13:33:27 [Rosetta@home] Reporting 1 completed tasks    - note: this task has stalled download and BOINC recognize it as aborted after stale download canceled, but refuse ask for more work   
19-Feb-2020 13:33:27 [Rosetta@home] Not requesting tasks: some download is stalled
19-Feb-2020 13:33:29 [Rosetta@home] Scheduler request completed
-------------------------------- i restart BOINC and getting new work resume  -------------------- 
19-Feb-2020 13:33:51 [---] Exiting
19-Feb-2020 13:34:44 [---] Starting BOINC client version 7.14.2 for windows_x86_64
19-Feb-2020 13:34:44 [---] log flags: file_xfer, sched_ops, task
19-Feb-2020 13:34:44 [---] Libraries: libcurl/7.47.1 OpenSSL/1.0.2g zlib/1.2.8
-------------- cut --------------------
19-Feb-2020 13:35:06 [Rosetta@home] Sending scheduler request: To fetch work.
19-Feb-2020 13:35:06 [Rosetta@home] Requesting new tasks for CPU
19-Feb-2020 13:35:09 [Rosetta@home] Scheduler request completed: got 7 new tasks
19-Feb-2020 13:35:11 [Rosetta@home] Started download of flags_rb_02_12_15913_15773__t000__2_C1_robetta
19-Feb-2020 13:35:11 [Rosetta@home] Started download of input_rb_02_12_15913_15773__t000__2_C1_robetta.zip
19-Feb-2020 13:35:11 [Rosetta@home] Started download of flags_rb_02_13_14003_15805__t000__2_C1_robetta
19-Feb-2020 13:35:13 [Rosetta@home] Finished download of flags_rb_02_12_15913_15773__t000__2_C1_robetta
19-Feb-2020 13:35:13 [Rosetta@home] Finished download of flags_rb_02_13_14003_15805__t000__2_C1_robetta
ID: 95952 · Report as offensive
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 5080
United Kingdom
Message 95954 - Posted: 19 Feb 2020, 12:36:55 UTC - in response to Message 95952.  

We won't be able to solve this unless somebody catches one in the act and can display some diagnostic test results. Then we'll at least know which bit of the system is broken - though even that doesn't guarantee we can fix it.
ID: 95954 · Report as offensive
Mad_Max

Send message
Joined: 29 Apr 19
Posts: 19
Russia
Message 95955 - Posted: 19 Feb 2020, 12:50:59 UTC
Last modified: 19 Feb 2020, 12:51:13 UTC

I can dig some data next time it happen.
I already added <http_debug>1</http_debug> in config.
But it will just show why downloads from R@H server is stalling, but it will not show why BOINC refuse to get new fork even after such stale download is already aborted. isn't it?
ID: 95955 · Report as offensive
Mad_Max

Send message
Joined: 29 Apr 19
Posts: 19
Russia
Message 96009 - Posted: 23 Feb 2020, 0:42:28 UTC

And i can not catch it. It does not repeats each day and looks like running BOINC with <http_debug>1</http_debug> cause problems to BOINC itself.
After some time it loses international connection (between BOINC and BOINC Manager - red dot on tray icon) and hangs.
Looks like it caused by too many messages in event log due to very high verbose level of <http_debug> and very high BOINC network activity on machine.
I checked stdoutdae.txt and it generate ~ 50 000 log lines per day.

Mat be will try later on less active machine.
ID: 96009 · Report as offensive
Profile Jord
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 29 Aug 05
Posts: 15480
Netherlands
Message 96011 - Posted: 23 Feb 2020, 0:48:20 UTC - in response to Message 96009.  

Looks like it caused by too many messages in event log due to very high verbose level of <http_debug> and very high BOINC network activity on machine.
It's not the total amount of lines that causes this but generating over 1,000 lines per second as the polling between the client and the manager happens once every second, which is 1,000 milliseconds, which is those 1,000 lines. Don't set multiple debug flags at the same time (like http_debug and work_fetch_debug for instance) as then you run into this problem.
ID: 96011 · Report as offensive
Mad_Max

Send message
Joined: 29 Apr 19
Posts: 19
Russia
Message 96013 - Posted: 23 Feb 2020, 1:13:41 UTC

Good to know. Thanks.
Although i highly doubt i actually ever hit 1000 lines per second.
I have run only one debug option for this time (http_debug only).
Machine is quite active (16 thread CPU + 2 GPU each running 2 tasks so 20 tasks is running in parallel) but i skim trough stdoutdae.txt and looks like 200-300 lines per second was max logging speed.

May be same can happen if something delay polling between the client and the manager?
So not > 1000 lines per second but > 1000 lines since last polling?

P.S.
Have found a workaround - looks like BOINC could apply http_debug flag without restart (via "read config files" after cc_config edit). So i will try this method next time if i see more stuck downloads from Rosetta @ Home.
ID: 96013 · Report as offensive
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 5080
United Kingdom
Message 96017 - Posted: 23 Feb 2020, 8:11:37 UTC

Another way of cutting down the log and making it more readable is to set

<dont_contact_ref_site>1</dont_contact_ref_site>

in cc_config.xml - that removes the debug clutter of the check-call to google.com after a failed BOINC connection.
ID: 96017 · Report as offensive
Profile vifov79255

Send message
Joined: 23 Feb 20
Posts: 1
Romania
Message 96020 - Posted: 23 Feb 2020, 17:11:59 UTC
Last modified: 23 Feb 2020, 17:12:10 UTC

journalctl --boot=-1 --unit=boinc-client
works for me
ID: 96020 · Report as offensive
Les Bayliss
Help desk expert

Send message
Joined: 25 Nov 05
Posts: 1654
Australia
Message 96105 - Posted: 25 Feb 2020, 23:44:19 UTC

Corona virus is close to being declared a pandemic.
But don't worry about it. it's only fatal to some people.
ID: 96105 · Report as offensive
Profile Jord
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 29 Aug 05
Posts: 15480
Netherlands
Message 96107 - Posted: 26 Feb 2020, 0:07:27 UTC - in response to Message 96105.  
Last modified: 26 Feb 2020, 0:08:01 UTC

Sorry for the off-topic post, but there's no other place to answer this.

But don't worry about it. it's only fatal to some people.
The flu is deadlier. If at least the death numbers are given correctly and governments aren't putting out fake numbers to make it look less problematic.

So far, the new coronavirus, dubbed COVID-19, has led to more than 75,000 illnesses and 2,000 deaths, primarily in mainland China. But that's nothing compared with the flu, also called influenza. In the U.S. alone, the flu has already caused an estimated 26 million illnesses, 250,000 hospitalizations and 14,000 deaths this season, according to the Centers for Disease Control and Prevention (CDC).

...

Typical flu symptoms include fever, cough, sore throat, muscle aches, headaches, runny or stuffy nose, fatigue and, sometimes, vomiting and diarrhea, according to the CDC. Flu symptoms often come on suddenly. Most people who get the flu will recover in less than two weeks. But in some people, the flu causes complications, including pneumonia. So far this flu season, about 1% of people in the United States have developed symptoms severe enough to be hospitalized, which is similar to the rate last season, according to data from the CDC.

With COVID-19, doctors are still trying to understand the full picture of disease symptoms and severity. In a small study of about 100 people with the virus, published Jan. 30 in the journal The Lancet, the most common symptoms were fever, cough and shortness of breath. Only about 5% of patients in that study reported sore throat and runny nose, and only 1-2% reported diarrhea, nausea and vomiting.
https://www.livescience.com/new-coronavirus-compare-with-flu.html
ID: 96107 · Report as offensive
Profile Dave
Help desk expert

Send message
Joined: 28 Jun 10
Posts: 2531
United Kingdom
Message 96127 - Posted: 26 Feb 2020, 6:51:49 UTC

How long before some smart *** names a computer virus corona or covid19?
ID: 96127 · Report as offensive
Profile Dave
Help desk expert

Send message
Joined: 28 Jun 10
Posts: 2531
United Kingdom
Message 96134 - Posted: 26 Feb 2020, 11:09:09 UTC - in response to Message 96133.  

And virus writers aren't what I would categorise as smart. I know the guy who wrote the raid virus. Sure, he can program, but he has no common sense, no sense of decency, etc.


I don't know the etiquette. Are they named by the people who write them or those who find them in the wild?
ID: 96134 · Report as offensive
Profile Jord
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 29 Aug 05
Posts: 15480
Netherlands
Message 96136 - Posted: 26 Feb 2020, 11:59:56 UTC - in response to Message 96133.  

You can say ass. That's a donkey. Just don't say arse :-)
But you can only say it once in this thread. Both.
Our rules give more leeway than those at project forums, even though the ones on the left are the same all over. I'm not enforcing them so strictly, unless you're a spammer. I can't speak for my moderators though, leave it up to them and their judgment to decide what to do.
Like someone guiding us back to the topic at hand, but maybe that the topic changed. :)
ID: 96136 · Report as offensive
Previous · 1 · 2 · 3 · Next

Message boards : Questions and problems : Stalled downloads

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.