Problem with "max concurrent" in app config.

Message boards : Questions and problems : Problem with "max concurrent" in app config.
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
robsmith
Volunteer tester
Help desk expert

Send message
Joined: 25 May 09
Posts: 1283
United Kingdom
Message 103427 - Posted: 6 Mar 2021, 15:23:35 UTC

I assume you are saying all other projects are complying with the restrictions you have set, that is x-concurrent tasks and a buffer of no more than 3 hours of work.
That being the case this would appear to be a project that has it's end of the process wrongly configured and is just sending out large amounts of data when only a small amount is requested. A quick look at the project's own forum indicates that there are other people suffering similar (not obviously identical) problems to the one you are seeing, indeed I see you have been quite active in one of the threads, so it may well be better to report it over there.
If this is affecting more projects than just kryptos@home then it could a BOINC problem.
ID: 103427 · Report as offensive
robsmith
Volunteer tester
Help desk expert

Send message
Joined: 25 May 09
Posts: 1283
United Kingdom
Message 103429 - Posted: 6 Mar 2021, 17:08:26 UTC - in response to Message 103428.  

Rather than trying to describe what is going wrong post your logs - Richard may be along soon to ask you to set some of the log debug flags so we can see what is being asked for.
One thing (or is it two things?), how long does a kryptos task take to run (really run) and what is the initial estimate of run time for tasks? This may help people understand where and what is going wrong. (Registration for new users is currently down, so it is hard to get any idea of these things.) In the past (and even today) there are projects for which the runtime guess was a gross underestimate, thus the servers would just keep pushing up excess work based on invalid runtime estimates.
ID: 103429 · Report as offensive
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 5077
United Kingdom
Message 103432 - Posted: 6 Mar 2021, 17:40:33 UTC

Sorry, I was out for a walk, fetching the newspaper.

The key 'starter for ten' is <sched_op_debug> - that's quiet enough to leave running all the time. The key thing to post here is the number of seconds of work requested, and the number of (estimated) seconds returned.

Depending on the answer, we might start asking questions about DCF, and even for a single (PLEASE - only one) cycle of <work_fetch_debug>. DON'T leave that one running!

I don't know the project, so I can only go on what you report.
ID: 103432 · Report as offensive
robsmith
Volunteer tester
Help desk expert

Send message
Joined: 25 May 09
Posts: 1283
United Kingdom
Message 103434 - Posted: 6 Mar 2021, 17:57:42 UTC - in response to Message 103433.  

You can get newspapers on t'internet nowadays lad!

There are some things one might find rather uncomfortable or unpleasant to do with an i-pad or the like......Wrapping up one's fish and chips being but one of them ;-)
ID: 103434 · Report as offensive
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 5077
United Kingdom
Message 103435 - Posted: 6 Mar 2021, 18:01:14 UTC - in response to Message 103434.  

You can get newspapers on t'internet nowadays lad!
There are some things one might find rather uncomfortable or unpleasant to do with an i-pad or the like......Wrapping up one's fish and chips being but one of them ;-)
And going for a walk on the internet is a bit wobbly.
ID: 103435 · Report as offensive
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 5077
United Kingdom
Message 103436 - Posted: 6 Mar 2021, 18:05:11 UTC - in response to Message 103433.  

Well, on those figures, Kryptos is always going to be the highest priority for work fetch - it'll always be under-represented compared to its resource share (and that won't take any notice of cc_config.xml, either).

Tweak your resource shares to more closely match what you actually want to run.
ID: 103436 · Report as offensive
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 5077
United Kingdom
Message 103440 - Posted: 6 Mar 2021, 20:05:24 UTC - in response to Message 103439.  

I can only assume the server is saying "er.... no, you have loads to do, you ain't getting any more".
The actual wording in the log would be "This computer has reached a limit of tasks in progress", after an attempted work fetch.

If the project has set such a thing.
ID: 103440 · Report as offensive
robsmith
Volunteer tester
Help desk expert

Send message
Joined: 25 May 09
Posts: 1283
United Kingdom
Message 103444 - Posted: 7 Mar 2021, 17:36:42 UTC - in response to Message 103428.  

Sorry, I forgot to say - this one looks to be a similar problem to yours (from the title)
https://www.kryptosathome.com/forum_thread.php?id=18
ID: 103444 · Report as offensive
robsmith
Volunteer tester
Help desk expert

Send message
Joined: 25 May 09
Posts: 1283
United Kingdom
Message 103446 - Posted: 7 Mar 2021, 18:11:41 UTC - in response to Message 103445.  

As I said - I just scanned thread titles - win some, loose some.
I did have a look in a couple of the other threads and someone a few months back said something about this project not having done a good job with the actual application development. I think your comments about not check-pointing correctly is a symptom of such :-(

I can't help thinking that the issues you are seeing with the excessive delivery of tasks may be related to having tried to do "something clever" with the server side of BOINC and haven't got everything right. I do know that other projects are no exhibiting the same over-supply issue when using the max-concurrent tag, but seeing the exact messages around a work-call and work-not-call would certainly help Richard understand what is going on and thus either report a bug correctly or point you in the direction of a solution.
ID: 103446 · Report as offensive
1 · 2 · Next

Message boards : Questions and problems : Problem with "max concurrent" in app config.

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.