Bug? - Shouldn't my climate prediction task be high priority?

Message boards : BOINC client : Bug? - Shouldn't my climate prediction task be high priority?
Message board moderation

To post messages, you must log in.

AuthorMessage
Cadeus

Send message
Joined: 2 Oct 12
Posts: 1
United States
Message 49913 - Posted: 20 Jul 2013, 4:23:23 UTC


I have what I think is a bug. I have a climate prediction task that estimates it will complete 2 hours before he deadline. I have boinc set so that no new tasks will be downloaded from any of my connected projects.

There are many SZTAKI tasks to be done (about 15) and 2 climate prediction tasks.

the laptop has a quad core. I have to suspend all the SZTAKI tasks except two in order to let the climate prediction tasks run. 1 climate prediction task has 12 days left and about 4 weeks until dead line. This one is running at high priority. The 2nd climate prediction task will finish 2 hours before the dead line at the current pace and it is not running at high priority. This 2nd climate prediction task does report a dead line warning.

If I have more than 3 SZTAKI tasks running, the 2nd cllimate prediction task goes to "waiting to run" state.
I expect the 2nd climate prediction should be running at high priority.
Why isn't it?

Here is the beginning of my output. I just rebooted so it is small. ugh and poorly formatted. This was copied from BoincView.


Host Project Date Message
bigLaptop --- 7/19/2013 8:55:21 PM No config file found - using defaults
bigLaptop --- 7/19/2013 8:55:21 PM Starting BOINC client version 7.0.64 for windows_x86_64
bigLaptop --- 7/19/2013 8:55:21 PM log flags: file_xfer, sched_ops, task
bigLaptop --- 7/19/2013 8:55:21 PM Libraries: libcurl/7.25.0 OpenSSL/1.0.1 zlib/1.2.6
bigLaptop --- 7/19/2013 8:55:21 PM Running as a daemon
bigLaptop --- 7/19/2013 8:55:21 PM Data directory: C:\ProgramData\BOINC
bigLaptop --- 7/19/2013 8:55:21 PM Running under account boinc_master
bigLaptop --- 7/19/2013 8:55:21 PM Processor: 4 GenuineIntel Intel(R) Core(TM) i3 CPU M 330 @ 2.13GHz [Family 6 Model 37 Stepping 2]
bigLaptop --- 7/19/2013 8:55:21 PM Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 sse4_2 popcnt syscall lm vmx tm2 pbe
bigLaptop --- 7/19/2013 8:55:21 PM OS: Microsoft Windows 7: Home Premium x64 Edition, Service Pack 1, (06.01.7601.00)
bigLaptop --- 7/19/2013 8:55:21 PM Memory: 3.68 GB physical, 7.36 GB virtual
bigLaptop --- 7/19/2013 8:55:21 PM Disk: 453.66 GB total, 401.76 GB free
bigLaptop --- 7/19/2013 8:55:21 PM Local time is UTC -7 hours
bigLaptop --- 7/19/2013 8:55:21 PM No usable GPUs found
bigLaptop --- 7/19/2013 8:55:21 PM Config: GUI RPCs allowed from:
bigLaptop --- 7/19/2013 8:55:21 PM princess
bigLaptop rosetta@home 7/19/2013 8:55:21 PM URL http://boinc.bakerlab.org/rosetta/; Computer ID 1489611; resource share 5
bigLaptop boincsimap 7/19/2013 8:55:21 PM URL http://boincsimap.org/boincsimap/; Computer ID 210517; resource share 5
bigLaptop climateprediction.net 7/19/2013 8:55:21 PM URL http://climateprediction.net/; Computer ID 1130955; resource share 75
bigLaptop Einstein@Home 7/19/2013 8:55:21 PM URL http://einstein.phys.uwm.edu/; Computer ID 3892017; resource share 75
bigLaptop LHC@home 1.0 7/19/2013 8:55:21 PM URL http://lhcathomeclassic.cern.ch/sixtrack/; Computer ID 9938902; resource share 5
bigLaptop Milkyway@Home 7/19/2013 8:55:21 PM URL http://milkyway.cs.rpi.edu/milkyway/; Computer ID 430394; resource share 5
bigLaptop SETI@home 7/19/2013 8:55:21 PM URL http://setiathome.berkeley.edu/; Computer ID 5804106; resource share 75
bigLaptop SZTAKI Desktop Grid 7/19/2013 8:55:21 PM URL http://szdg.lpds.sztaki.hu/szdg/; Computer ID 337479; resource share 750
bigLaptop SETI@home 7/19/2013 8:55:21 PM General prefs: from SETI@home (last modified 07-Jul-2013 20:30:15)
bigLaptop SETI@home 7/19/2013 8:55:21 PM Computer location: work
bigLaptop SETI@home 7/19/2013 8:55:21 PM General prefs: no separate prefs for work; using your defaults
bigLaptop --- 7/19/2013 8:55:21 PM Preferences:
bigLaptop --- 7/19/2013 8:55:21 PM max memory usage when active: 1883.39MB
bigLaptop --- 7/19/2013 8:55:21 PM max memory usage when idle: 3390.10MB
bigLaptop --- 7/19/2013 8:55:21 PM max disk usage: 100.00GB
bigLaptop --- 7/19/2013 8:55:21 PM don't compute while active
bigLaptop --- 7/19/2013 8:55:21 PM don't use GPU while active
bigLaptop --- 7/19/2013 8:55:21 PM suspend work if non-BOINC CPU load exceeds 26 %
bigLaptop --- 7/19/2013 8:55:21 PM (to change preferences, visit a project web site or select Preferences in the Manager)
bigLaptop --- 7/19/2013 8:55:21 PM Not using a proxy
bigLaptop --- 7/19/2013 8:55:22 PM Running CPU benchmarks
bigLaptop --- 7/19/2013 8:55:22 PM Suspending computation - CPU benchmarks in progress
bigLaptop --- 7/19/2013 8:55:54 PM Benchmark results:
bigLaptop --- 7/19/2013 8:55:54 PM Number of CPUs: 4
bigLaptop --- 7/19/2013 8:55:54 PM 1924 floating point MIPS (Whetstone) per CPU
bigLaptop --- 7/19/2013 8:55:54 PM 6543 integer MIPS (Dhrystone) per CPU
bigLaptop climateprediction.net 7/19/2013 8:55:55 PM Restarting task hadcm3n_o3je_1940_40_008382822_0 using hadcm3n version 607 in slot 4
bigLaptop climateprediction.net 7/19/2013 8:55:55 PM Restarting task hadcm3n_o3nx_1980_40_008384805_0 using hadcm3n version 607 in slot 1
bigLaptop SZTAKI Desktop Grid 7/19/2013 8:55:55 PM Restarting task aaaa0b60-d41c-41ea-8e5f-d1777c9dfd53_64a779d0-07b0-4c17-b6bb-5a6c08258378_359454_1 using search version 206 in slot 0
bigLaptop SZTAKI Desktop Grid 7/19/2013 8:55:55 PM Restarting task aaaa0b60-d41c-41ea-8e5f-d1777c9dfd53_4bc442ff-b5a7-4331-bef4-0fa2911216c3_359477_1 using search version 206 in slot 2


ID: 49913 · Report as offensive
Jim1348

Send message
Joined: 8 Nov 10
Posts: 310
United States
Message 49917 - Posted: 20 Jul 2013, 9:01:06 UTC - in response to Message 49913.  

I have no answer, but have one CPDN running along with two CEP2 jobs on three cores of an i5-3550. I had thought that the jobs ran in the order in which they were downloaded (except in case a deadline has to be met), but in fact if I don't limit the CEP2 jobs to only two running at a time with an app_config file, then three of them will run and the CPDN will sit idle, even though it is by far the oldest, having been downloaded 9 days ago.

I have never really tried figuring out BOINC's scheduling (now 7.2.5 x64 under Win7 64-bit for me, but probably 7.0.64 when the CPDN was downloaded), but good luck in finding an answer, if there is one.
ID: 49917 · Report as offensive
Profile Jord
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 29 Aug 05
Posts: 15480
Netherlands
Message 49921 - Posted: 20 Jul 2013, 17:55:14 UTC - in response to Message 49913.  

To see if it can run all work by deadline, make a cc_config.xml file in Notepad, save it into your BOINC Data directory (C:\ProgramData\BOINC) and add into it:

<cc_config>
   <log_flags>
       <cpu_sched_debug>1</cpu_sched_debug>
       <rr_simulation>1</rr_simulation>
       <sched_op_debug>1</sched_op_debug>
    </log_flags>
  <options>
  </options>
</cc_config> 


When saving it, make sure to make it an 'all files' option, ANSI format. After saving, double check that the file extension is only .xml, not also .txt. If it is also .txt, rename it so the extension is only .xml

Next open BOINC Manager, Advanced, Read config file(s).
The above debug flags will now add a lot more information into your Event Log.
Run for two or three iterations, then suspend BOINC, and change the lines in cc_config.xml to:

<cc_config>
   <log_flags>
       <cpu_sched_debug>0</cpu_sched_debug>
       <rr_simulation>0</rr_simulation>
       <sched_op_debug>0</sched_op_debug>
    </log_flags>
  <options>
  </options>
</cc_config> 

Save changes to the file.
Next open BOINC Manager, Advanced, Read config file(s).
You can now resume BOINC.

Then open the Event Log (CTRL+SHIFT+E), copy the two or three iterations of the debug flags from the log and post that here.
ID: 49921 · Report as offensive
SekeRob2

Send message
Joined: 6 Jul 10
Posts: 585
Italy
Message 49925 - Posted: 20 Jul 2013, 20:31:46 UTC - in response to Message 49917.  
Last modified: 20 Jul 2013, 20:34:58 UTC

I have no answer, but have one CPDN running along with two CEP2 jobs on three cores of an i5-3550. I had thought that the jobs ran in the order in which they were downloaded (except in case a deadline has to be met), but in fact if I don't limit the CEP2 jobs to only two running at a time with an app_config file, then three of them will run and the CPDN will sit idle, even though it is by far the oldest, having been downloaded 9 days ago.

I have never really tried figuring out BOINC's scheduling (now 7.2.5 x64 under Win7 64-bit for me, but probably 7.0.64 when the CPDN was downloaded), but good luck in finding an answer, if there is one.

You forget about timeshare which is tracked. In principle tasks run FIFO -per project- and EDF if any task is under deadline threat. Since the CPDN is not under deadline threat, I'm sure, normal alternation between projects applies, which is default I think 60 minutes [the switch application you can set to influence how long a task runs once started... not quite but it helps to complete tasks in least interruption]. Of course with CEP2, you have LAIM set, else if EDF kicks in, interrupted tasks get unloaded before the checkpoint is written, which is the normal switch application moment when time's up. With the large checkpoint intervals, particular at #2 and #11 (3rd and 12th really), you could be loosing hours progress. Of course that is then predicated on having enough memory, but normally if tasks are held in memory, the OS will eventually move them whole or part into virtual memory, if that's active and allowed to be used by BOINC [recommended].
Coelum Non Animum Mutant, Qui Trans Mare Currunt
ID: 49925 · Report as offensive
Jim1348

Send message
Joined: 8 Nov 10
Posts: 310
United States
Message 49926 - Posted: 20 Jul 2013, 21:06:52 UTC - in response to Message 49925.  

You forget about timeshare which is tracked. In principle tasks run FIFO -per project- and EDF if any task is under deadline threat. Since the CPDN is not under deadline threat, I'm sure, normal alternation between projects applies, which is default I think 60 minutes [the switch application you can set to influence how long a task runs once started... not quite but it helps to complete tasks in least interruption]. Of course with CEP2, you have LAIM set, else if EDF kicks in, interrupted tasks get unloaded before the checkpoint is written, which is the normal switch application moment when time's up. With the large checkpoint intervals, particular at #2 and #11 (3rd and 12th really), you could be loosing hours progress. Of course that is then predicated on having enough memory, but normally if tasks are held in memory, the OS will eventually move them whole or part into virtual memory, if that's active and allowed to be used by BOINC [recommended].

All good points, except that it seems to be that CEP2 always takes precedence, and I don't ever recall seeing the 60-minute switch. Yes, I always have LAIM set, but even more I run the BOINC folder out of Ramdisk (12 GB, with usually half of it free), so whatever happens should be going fast.

But fate has intervened; CPDN is out of work, and so I am going to CEP2 exclusively on that machine; problem solved.
ID: 49926 · Report as offensive
Jim1348

Send message
Joined: 8 Nov 10
Posts: 310
United States
Message 49943 - Posted: 22 Jul 2013, 15:43:12 UTC

In reading through this explanation by Claggy, it appears that the behavior I observe is probably due to the Recent Estimated Credit (REC):
http://boinc.berkeley.edu/dev/forum_thread.php?id=8513&postid=49939#49939

That is, since I had been running CPDN for several days on all three cores without any CEP2 jobs at all, it had built up a large REC for CPDN. Then, when I added the CEP2 work, it preferred that to the CPDN work. Eventually, it would have equalized out at the resource share I had chosen (equal for both projects in my case).

That is all very interesting, but beyond human comprehension in most cases. I think it would be better if BOINC just asked which projects you want to run on which cores (with options for secondary projects if the primary ones are out of work), and ditch the whole REC scheme, and whatever else they have. Fortunately, they did not ask my advice, since it may overlook something important. Or it may just be that they are trying to over-engineer the whole thing.

ID: 49943 · Report as offensive
SekeRob2

Send message
Joined: 6 Jul 10
Posts: 585
Italy
Message 49946 - Posted: 22 Jul 2013, 16:33:23 UTC - in response to Message 49943.  

Or you could stop watching the BOINC [which it hates] and let it run ;>)
Coelum Non Animum Mutant, Qui Trans Mare Currunt
ID: 49946 · Report as offensive
Jim1348

Send message
Joined: 8 Nov 10
Posts: 310
United States
Message 49953 - Posted: 22 Jul 2013, 17:52:18 UTC - in response to Message 49946.  
Last modified: 22 Jul 2013, 17:55:02 UTC

Or you could stop watching the BOINC [which it hates] and let it run ;>)

That has occurred to me. But then, what is the fun in it?

Perhaps more to the point, you don't know that your selections are working properly, since you get no feedback until days, if not weeks later. And then you may discover that something was misconfigured all along. I think it is just another attempt to "make things easy for you" that in fact introduces unnecessary complications. If this project isn't for techies, then who is it for? The average user won't bother with so many projects anyway (or at least shouldn't).
ID: 49953 · Report as offensive

Message boards : BOINC client : Bug? - Shouldn't my climate prediction task be high priority?

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.