Thread '5.8.8 - completion/upload of units'

Message boards : BOINC client : 5.8.8 - completion/upload of units
Message board moderation

To post messages, you must log in.

AuthorMessage
W-K ID 666

Send message
Joined: 30 Dec 05
Posts: 473
United Kingdom
Message 7957 - Posted: 3 Feb 2007, 20:55:33 UTC

Since I installed BOINC 5.8.8 there have been two ocassions, that I have seen, that Einstein units have reached 100% and the "to Completion" column is ---, but the Status is still "waiting to run".
It should have uploaded and the status column should be "waiting to Report".

Has anybody else noticed this. So far I have only noticed it on Einstein.
Part of the problem maybe that on this computer Einstein is on a very low resourse share, 4.71% (one hour/day) and usually when the Einstein unit has completed it switches to another project.

Andy
ID: 7957 · Report as offensive
Keck_Komputers
Avatar

Send message
Joined: 29 Aug 05
Posts: 304
United States
Message 7964 - Posted: 4 Feb 2007, 11:26:53 UTC

They checkpoint after processing is basically finished but before the clean up operations are done. Since the 5.8.8 version switches at checkpoints this can leave the app sitting at 100% but not finished. It is annoying but not really a bug. This was always possible, but switching at checkpoints makes it more likely.
BOINC WIKI

BOINCing since 2002/12/8
ID: 7964 · Report as offensive
Pepo
Avatar

Send message
Joined: 3 Apr 06
Posts: 547
Slovakia
Message 7999 - Posted: 5 Feb 2007, 21:06:26 UTC - in response to Message 7957.  

Since I installed BOINC 5.8.8 there have been two ocassions, that I have seen, that Einstein units have reached 100% and the "to Completion" column is ---, but the Status is still "waiting to run".
It should have uploaded and the status column should be "waiting to Report".

Has anybody else noticed this. So far I have only noticed it on Einstein.

Yes, Andy, it happens. In last few months I've observed and reported this for Rosetta (the devs fixed it for Ralph 5.41), WCG FAAH and recently Kathryn Marks (5.8.3) and me (5.8.4) reported it on Boinc alpha list for Windows einstein_S5RI version 424 (it was probably introduced in this version, it checkpoints 2-3 x in last few seconds at 100%).

Part of the problem maybe that on this computer Einstein is on a very low resourse share, 4.71% (one hour/day) and usually when the Einstein unit has completed it switches to another project.

This was also my problem. The mentioned apps (when with low share) build here large negative STD during crunching and when they are preempted at 100%, they often stay in memory for the next day or so.

Peter
ID: 7999 · Report as offensive
W-K ID 666

Send message
Joined: 30 Dec 05
Posts: 473
United Kingdom
Message 8021 - Posted: 6 Feb 2007, 10:08:01 UTC - in response to Message 7999.  
Last modified: 6 Feb 2007, 10:08:56 UTC

Since I installed BOINC 5.8.8 there have been two ocassions, that I have seen, that Einstein units have reached 100% and the "to Completion" column is ---, but the Status is still "waiting to run".
It should have uploaded and the status column should be "waiting to Report".

Has anybody else noticed this. So far I have only noticed it on Einstein.

Yes, Andy, it happens. In last few months I've observed and reported this for Rosetta (the devs fixed it for Ralph 5.41), WCG FAAH and recently Kathryn Marks (5.8.3) and me (5.8.4) reported it on Boinc alpha list for Windows einstein_S5RI version 424 (it was probably introduced in this version, it checkpoints 2-3 x in last few seconds at 100%).

Part of the problem maybe that on this computer Einstein is on a very low resourse share, 4.71% (one hour/day) and usually when the Einstein unit has completed it switches to another project.

This was also my problem. The mentioned apps (when with low share) build here large negative STD during crunching and when they are preempted at 100%, they often stay in memory for the next day or so.

Peter


I think your last paragraph is the significant point, about staying in memory. The fact it hasn't uploaded is not a problem, just an observation on my part, that I think should be fixed.
I did post same msg on Einstein so maybe someone will see it and act accordingly.

Andy
edit] or maybe 5.8.9 will fix, I see it is now available and I have downloaded. [/edit
ID: 8021 · Report as offensive
Pepo
Avatar

Send message
Joined: 3 Apr 06
Posts: 547
Slovakia
Message 8034 - Posted: 6 Feb 2007, 15:23:29 UTC - in response to Message 8021.  
Last modified: 6 Feb 2007, 15:40:18 UTC

The fact it hasn't uploaded is not a problem, just an observation on my part,

It checkpoints multiple times last (tens of) seconds prior to being finished. In between the last checkpoints, it also compresses some (I assume, it is at least reported by the Linux version) ouptput file, so it possibly finally issues some 1-2 redundant checkpoints in last few seconds of runtime. And then it gets catched by the cogent core client.

[edit]
John Keck seems to say approximately the same:
"The latest version of the Einstein app checkpoints after processing is basically finished but before the clean up operations begin. The 5.8.8 client sees a checkpoint event as an chance to change which app is running. So you will see Einstein tasks stopped at 100% but not finished more often with the new client."
[/edit]

that I think should be fixed. [...] or maybe 5.8.9 will fix, I see it is now available and I have downloaded.

After hearing the symptoms, Boinc devs always agreed on it being an app issue.

Peter
ID: 8034 · Report as offensive
W-K ID 666

Send message
Joined: 30 Dec 05
Posts: 473
United Kingdom
Message 8044 - Posted: 7 Feb 2007, 0:31:06 UTC - in response to Message 8034.  

[edit]
John Keck seems to say approximately the same:
"The latest version of the Einstein app checkpoints after processing is basically finished but before the clean up operations begin. The 5.8.8 client sees a checkpoint event as an chance to change which app is running. So you will see Einstein tasks stopped at 100% but not finished more often with the new client."
[/edit]

that I think should be fixed. [...] or maybe 5.8.9 will fix, I see it is now available and I have downloaded.

After hearing the symptoms, Boinc devs always agreed on it being an app issue.

Peter

Saw JK's post, I started that thread also.

And ref BOINC/Project. I get that all the time at work except there it is, software say Hardware problem see them. Go see hardware, Not us man, its those software geeks, go see them. and on and on it goes.

Andy
ID: 8044 · Report as offensive
Pepo
Avatar

Send message
Joined: 3 Apr 06
Posts: 547
Slovakia
Message 8049 - Posted: 7 Feb 2007, 17:49:41 UTC - in response to Message 8044.  


After hearing the symptoms, Boinc devs always agreed on it being an app issue.

And ref BOINC/Project. I get that all the time at work except there it is, software say Hardware problem see them. Go see hardware, Not us man, its those software geeks, go see them. and on and on it goes.

Sounds familiar :-)

(BTW the same happened when I reported it on WCG forum. Sekerob? ;-) no flame intended

Peter
ID: 8049 · Report as offensive
Pepo
Avatar

Send message
Joined: 3 Apr 06
Posts: 547
Slovakia
Message 8069 - Posted: 8 Feb 2007, 10:06:32 UTC
Last modified: 8 Feb 2007, 10:31:09 UTC

One spicery - yesterday my Linux host started to crunch Predictor's dTasser_test 0.02 unit. It checkpointed for the first time after using 3:06:50 CPU time and was immediately preempted. Now it's stuck at 99.875% and 0:00:14 ETC :-)

(As the project was out of work for some months, I've set it's ressource share low, now the debts are accordingly deep in hell and it'll take some time.....)

Peter
ID: 8069 · Report as offensive

Message boards : BOINC client : 5.8.8 - completion/upload of units

Copyright © 2025 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.