Rioja Science

Message boards : Projects : Rioja Science
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · Next

AuthorMessage
Félix

Send message
Joined: 6 May 12
Posts: 8
Spain
Message 44157 - Posted: 15 May 2012, 18:25:20 UTC - in response to Message 44113.  
Last modified: 15 May 2012, 18:26:53 UTC

There seems to be about an 8% inconclusive results returned on the tasks I've run so far. Is that normal and will they remain that way or be declared invalid?


Last week we had problems with the server, in particular with MySQL, that´s the reason why you got invalid results, we´re sorry.

We have already solved it.
ID: 44157 · Report as offensive
Félix

Send message
Joined: 6 May 12
Posts: 8
Spain
Message 44158 - Posted: 15 May 2012, 18:42:30 UTC - in response to Message 44146.  
Last modified: 15 May 2012, 18:43:53 UTC

I´ll talk to the Web Page´s Administrator to try to release the forum...

Yes, please! It would be very helpful if the project forum was active. Thanks!


We´re working on it.


Oh...And please state that it's okay to import and publish project statistics. Some stats site admins won't do that without express permission. :-)

Cheers,

MarkR



The statistics are already available, we hope that in the coming days the boinc stats sites publish them
ID: 44158 · Report as offensive
Profile ritterm
Avatar

Send message
Joined: 4 Jul 08
Posts: 82
United States
Message 44159 - Posted: 15 May 2012, 18:55:43 UTC

@Felix... Thanks for being so responsive. :-)
ID: 44159 · Report as offensive
Profile ChertseyAl
Avatar

Send message
Joined: 17 Jul 09
Posts: 107
United Kingdom
Message 44163 - Posted: 18 May 2012, 17:46:01 UTC

FWIW, there seems to be a problem validating linux versus windows machines. All of the 100+ pending WUs I've checked so far have been linux against windows. Not an uncommon problem.

Also, stats have not been updated since 16th May, but nonetheless I'm grateful that you are at least trying to export them :)

Oh, and the limit on WUs in progress is proving a real hindrance to getting any work done. Suggest a minimum of 2 WUs per core, but ideally a chance to get an hour or mores worth of work cached would help :) If the WUs took a bit (lot!) longer it would make the project easier to crunch, but I don't know how practical that is for you to fix. 100 times more work per WU would be nice.

Sadly still no message boards :(

Cheers,

Al.

p.s. Sorry for cluttering up the BOINC dev boards :)

ID: 44163 · Report as offensive
Profile ritterm
Avatar

Send message
Joined: 4 Jul 08
Posts: 82
United States
Message 44275 - Posted: 23 May 2012, 19:42:00 UTC - in response to Message 44163.  

Sadly still no message boards :(
...
p.s. Sorry for cluttering up the BOINC dev boards :)

My apologies for continuing to post here, but I feel I have no choice... The latest batch of work that I've got on my hosts seems to be never-ending. Tasks have been running for between 40-90 minutes and most show 100% progress, although one (a WinXP machine) shows 0% progress. All triqct processes are running and using a full core.
ID: 44275 · Report as offensive
Profile ritterm
Avatar

Send message
Joined: 4 Jul 08
Posts: 82
United States
Message 44276 - Posted: 23 May 2012, 20:51:02 UTC - in response to Message 44275.  
Last modified: 23 May 2012, 20:53:09 UTC

The latest batch of work that I've got on my hosts seems to be never-ending...

Okay, so maybe I should have been more patient. Four of these have since finished -- in about 2.5 hours on my faster hosts. Quite a change from the 60 second tasks in earlier batches. "Real" work perhaps? ;-)
ID: 44276 · Report as offensive
Profile ChertseyAl
Avatar

Send message
Joined: 17 Jul 09
Posts: 107
United Kingdom
Message 44277 - Posted: 23 May 2012, 21:45:55 UTC

Perhaps this thread should be locked.

The project's admins really must open their message boards.

There are so many issues that could be resolved with helpful crunchers if only there was a forum (sic) to communicate.

Looks like the WU length has increased, as suggested, by 100x. Looks like all of the win v linux WU validation problem WUS have been wiped (well, gee, thanks for that). Looks like there's still a limit on concurrent WUs.

Félix, get those message boards open. Now. Before we all lose interest in what is potentially an interesting project.

Cheers,

Al.

p.s. Someone please lock this thread!


ID: 44277 · Report as offensive
Profile Jord
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 29 Aug 05
Posts: 15480
Netherlands
Message 44278 - Posted: 23 May 2012, 23:51:29 UTC - in response to Message 44277.  

Perhaps this thread should be locked.

I disagree. Not until there are forums at the project. What if Félix is subscribed to this thread, then locking it would only be counterproductive.
ID: 44278 · Report as offensive
Profile ChertseyAl
Avatar

Send message
Joined: 17 Jul 09
Posts: 107
United Kingdom
Message 44282 - Posted: 24 May 2012, 17:51:46 UTC - in response to Message 44278.  

Perhaps this thread should be locked.

I disagree.


Fair enough.

Meanwhile, a few problems have come to light:

"Completed, validation inconclusive" - This is still happening with linux v Windows, but now more worryingly seems to happen between Win XP32 and Win7 64. Also, a 'tie-break' WU is not being sent, so the tasks presumably never get resolved.

The tasks don't relinquish control once running. Once they start, they run to completion, blocking other projects.

Checkpointing doesn't work, or isn't present (probably the same issue as above, it simply isn't implemented). Prior to a power failure I had some WUs running solidly for 5 hours. On restart they started from zero again.

For some reason, the IR is set to 3 although the MQ is 2. I thought this was generally frowned upon as a waste of resourses. However, on the bright side, only 2 replications are actually sent out, the third remains as 'unsent' (see validation issue above, probably related).

I'd suggest it's pretty important to sort out the validation issues as by my estimation about 10% of work is being wasted (seems to tie up with the 90/10 win/linux split that someone quoted someplace). Secondly, as the WUs are a decent length, checkpointing would be nice :)

Cheers,

Al.
ID: 44282 · Report as offensive
Félix

Send message
Joined: 6 May 12
Posts: 8
Spain
Message 44310 - Posted: 27 May 2012, 16:06:45 UTC

Hi all,

Due to external issues, we are not able to open the website forum so far. It will be available within one month.

Like Ritterm said, now are having real work from the chemistry department, the workunits last between two and three hours.

We are doing our best to fix problems such as the progress bar, Windows 7 64bits bugs, checkpoint and so on.

Last week we released the MacOs-Intel platform and the next week the application over Nvidia-Cuda.

We have a lot of work but because of the lack of human resources availability, we can´t be as efficient as we would like.

Please, be patient, don´t close the forum thread.

Thanks, the obtained results are so valuable for us.
Best Regards

ID: 44310 · Report as offensive
Profile ritterm
Avatar

Send message
Joined: 4 Jul 08
Posts: 82
United States
Message 44312 - Posted: 27 May 2012, 18:21:12 UTC - in response to Message 44310.  

Hi all,

Due to external issues, we are not able to open the website forum so far. It will be available within one month.

Like Ritterm said, now are having real work from the chemistry department, the workunits last between two and three hours.

We are doing our best to fix problems such as the progress bar, Windows 7 64bits bugs, checkpoint and so on.

Last week we released the MacOs-Intel platform and the next week the application over Nvidia-Cuda.

We have a lot of work but because of the lack of human resources availability, we can´t be as efficient as we would like.

Please, be patient, don´t close the forum thread.

Thanks, the obtained results are so valuable for us.
Best Regards


Thanks for the feedback, Felix. I don't speak for all of crunchers, of course, but I'm sure that many of us understand the problems that can arise from limited resources, both human and monetary.

Communication with volunteers is one of the most important things a project admin can do, so please keep us updated as best you can when there's news, changes, problems, etc.

Regards,

MarkR
ID: 44312 · Report as offensive
Profile ritterm
Avatar

Send message
Joined: 4 Jul 08
Posts: 82
United States
Message 44408 - Posted: 5 Jun 2012, 17:42:31 UTC

@Felix:

Can you give us an update on the status of the project? I've been getting this for the past several days...

6/5/2012 10:44:03 AM Rioja Science Message from server: Project is temporarily shut down for maintenance

Regards,

MarkR
ID: 44408 · Report as offensive
Profile skivelitis
Avatar

Send message
Joined: 9 Apr 09
Posts: 35
United States
Message 44437 - Posted: 9 Jun 2012, 8:00:49 UTC

Nice to see you fully back online. However, all credits that were pending at the start of the downtime have been marked as "completed can't validate".
ID: 44437 · Report as offensive
Profile ChertseyAl
Avatar

Send message
Joined: 17 Jul 09
Posts: 107
United Kingdom
Message 44439 - Posted: 9 Jun 2012, 17:44:04 UTC

The feeder hasn't been running for a while now.

I have work to report, there is work available to get, but someone needs to give the feeder a kick please :)

Cheers,

Al.
ID: 44439 · Report as offensive
Chris Granger
Avatar

Send message
Joined: 20 Nov 10
Posts: 33
Canada
Message 44543 - Posted: 18 Jun 2012, 23:26:30 UTC

Tasks are running in high priority mode on my Win32 machine even though they have plenty of time to finish well before their deadline. 40-minute tasks with a week deadline shouldn't do this, obviously.
ID: 44543 · Report as offensive
Profile ritterm
Avatar

Send message
Joined: 4 Jul 08
Posts: 82
United States
Message 44545 - Posted: 19 Jun 2012, 1:02:55 UTC - in response to Message 44543.  

Tasks are running in high priority mode on my Win32 machine even though they have plenty of time to finish well before their deadline. 40-minute tasks with a week deadline shouldn't do this, obviously.

Do you have outstanding work from other projects in your queue that could cause the BOINC manager to think everything won't finish on time?
ID: 44545 · Report as offensive
Chris Granger
Avatar

Send message
Joined: 20 Nov 10
Posts: 33
Canada
Message 44556 - Posted: 19 Jun 2012, 19:19:49 UTC - in response to Message 44545.  

Nope, just some WCG workunits that have plenty of time to finish as well. I cache only 0.25 days of work, and these also have a week deadline.
ID: 44556 · Report as offensive
Profile ritterm
Avatar

Send message
Joined: 4 Jul 08
Posts: 82
United States
Message 44567 - Posted: 20 Jun 2012, 13:43:43 UTC

There seems to be plenty of work available (at the time of this post ~182K tasks ready to send and decreasing), but I keep getting the following messages from the project:

6/20/2012 8:00:12 AM Rioja Science Message from server: No tasks sent
6/20/2012 8:00:12 AM Rioja Science Message from server: Tasks are committed to other platforms

This happens intermittently between periods of time that my hosts (all Windows) get work without any problems.
ID: 44567 · Report as offensive
BobCat13

Send message
Joined: 6 Dec 06
Posts: 118
United States
Message 44568 - Posted: 20 Jun 2012, 15:09:07 UTC - in response to Message 44567.  
Last modified: 20 Jun 2012, 15:10:19 UTC

There seems to be plenty of work available (at the time of this post ~182K tasks ready to send and decreasing), but I keep getting the following messages from the project:

6/20/2012 8:00:12 AM Rioja Science Message from server: No tasks sent
6/20/2012 8:00:12 AM Rioja Science Message from server: Tasks are committed to other platforms

This happens intermittently between periods of time that my hosts (all Windows) get work without any problems.

Same thing here on both Windows and Linux. Project is probably using coarse HR, which matches Windows, Linux, and Mac with the same OS. Looks like the queue is full of tasks waiting to be assigned to Mac, so we will have to wait for those to clear before getting work on Windows & Linux.
ID: 44568 · Report as offensive
Profile ChertseyAl
Avatar

Send message
Joined: 17 Jul 09
Posts: 107
United Kingdom
Message 44570 - Posted: 20 Jun 2012, 17:40:49 UTC - in response to Message 44568.  

Project is probably using coarse HR, which matches Windows, Linux, and Mac with the same OS.


I think that's true as I can't find any recent work where my XP machines have been paired with linux. Which is nice :) Still getting a few validation problems, but maybe only 1 WU per day, so not that worried.

HOWEVER ...

It would be nice to lift the limit on WUs in progress per host/CPU/whatever to a sensible level. My machines hit the limit, keep asking for work, back off for hours, and as a result machines are often dry :(

Cheers,

Al.

ID: 44570 · Report as offensive
Previous · 1 · 2 · 3 · Next

Message boards : Projects : Rioja Science

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.