Posts by mearvk

1) Message boards : Server programs : Maximum CPU time exceeded (Message 9076)
Posted 24 Mar 2007 by mearvk
Post:
I get this error on all my downloaded workunits from my own Boinc server which is running the "Uppercase" application:

3/24/2007 4:50:45 PM|Uppercase|[file_xfer] Finished download of file in__1174768406_5
3/24/2007 4:50:45 PM|Uppercase|[file_xfer] Throughput 20779 bytes/sec
3/24/2007 4:50:45 PM|Uppercase|[file_xfer] Finished download of file in__1174768406_6
3/24/2007 4:50:45 PM|Uppercase|[file_xfer] Throughput 24242 bytes/sec
3/24/2007 4:50:45 PM|Uppercase|[file_xfer] Started download of file in__1174768406_7
3/24/2007 4:50:45 PM|Uppercase|[file_xfer] Started download of file in__1174768406_8
3/24/2007 4:50:47 PM|Uppercase|[file_xfer] Finished download of file in__1174768406_7
3/24/2007 4:50:47 PM|Uppercase|[file_xfer] Throughput 18390 bytes/sec
3/24/2007 4:50:47 PM|Uppercase|[file_xfer] Finished download of file in__1174768406_8
3/24/2007 4:50:47 PM|Uppercase|[file_xfer] Throughput 21333 bytes/sec
3/24/2007 4:50:47 PM|Uppercase|[file_xfer] Started download of file in__1174768406_9
3/24/2007 4:50:47 PM|Uppercase|[file_xfer] Started download of file in__1174768406_10
3/24/2007 4:50:47 PM|Uppercase|Starting wu_1174768406_5_0
3/24/2007 4:50:47 PM|Uppercase|Starting task wu_1174768406_5_0 using ucase version 500
3/24/2007 4:50:48 PM|Uppercase|[file_xfer] Finished download of file in__1174768406_9
3/24/2007 4:50:48 PM|Uppercase|[file_xfer] Throughput 23529 bytes/sec
3/24/2007 4:50:48 PM|Uppercase|[file_xfer] Finished download of file in__1174768406_10
3/24/2007 4:50:48 PM|Uppercase|[file_xfer] Throughput 16494 bytes/sec
3/24/2007 4:50:53 PM|Uppercase|Aborting task wu_1174768406_5_0: exceeded CPU time limit 4.098307
3/24/2007 4:50:53 PM|Uppercase|Deferring communication for 1 min 0 sec
3/24/2007 4:50:53 PM|Uppercase|Reason: Unrecoverable error for result wu_1174768406_5_0 (Maximum CPU time exceeded)
3/24/2007 4:50:58 PM|Uppercase|Computation for task wu_1174768406_5_0 finished
3/24/2007 4:50:58 PM|Uppercase|Starting wu_1174768406_6_0
3/24/2007 4:50:58 PM|Uppercase|Starting task wu_1174768406_6_0 using ucase version 500
3/24/2007 4:51:00 PM|Uppercase|[file_xfer] Started upload of file wu_1174768406_5_0_0
3/24/2007 4:51:02 PM|Uppercase|[file_xfer] Finished upload of file wu_1174768406_5_0_0
3/24/2007 4:51:02 PM|Uppercase|[file_xfer] Throughput 29097 bytes/sec
3/24/2007 4:51:05 PM|Uppercase|Aborting task wu_1174768406_6_0: exceeded CPU time limit 4.098307
3/24/2007 4:51:05 PM|Uppercase|Deferring communication for 1 min 0 sec
3/24/2007 4:51:05 PM|Uppercase|Reason: Unrecoverable error for result wu_1174768406_6_0 (Maximum CPU time exceeded)
3/24/2007 4:51:10 PM|Uppercase|Computation for task wu_1174768406_6_0 finished
3/24/2007 4:51:11 PM|Uppercase|Starting wu_1174768406_7_0
3/24/2007 4:51:11 PM|Uppercase|Starting task wu_1174768406_7_0 using ucase version 500
3/24/2007 4:51:13 PM|Uppercase|[file_xfer] Started upload of file wu_1174768406_6_0_0
3/24/2007 4:51:15 PM|Uppercase|[file_xfer] Finished upload of file wu_1174768406_6_0_0
3/24/2007 4:51:15 PM|Uppercase|[file_xfer] Throughput 26219 bytes/sec
3/24/2007 4:51:18 PM|Uppercase|Aborting task wu_1174768406_7_0: exceeded CPU time limit 4.098307
3/24/2007 4:51:18 PM|Uppercase|Deferring communication for 1 min 0 sec
3/24/2007 4:51:18 PM|Uppercase|Reason: Unrecoverable error for result wu_1174768406_7_0 (Maximum CPU time exceeded)
3/24/2007 4:51:23 PM|Uppercase|Computation for task wu_1174768406_7_0 finished
3/24/2007 4:51:23 PM|Uppercase|Starting wu_1174768406_8_0
3/24/2007 4:51:23 PM|Uppercase|Starting task wu_1174768406_8_0 using ucase version 500


The correct results are uploaded to /upload but the database "results" table says they failed with exit code: -177 (0xffffffffffffff4f) ERR_RSC_LIMIT_EXCEEDED.

Keep in my this is a trivial application which only reads in text and converts it to all upper case. So, I am a bit confused as to how my CPU can be "too slow" to run it.

Server is running on RedHat 9 and client is running on Vista.

On an OpenSuse client I get this sort error (but the correct result is uploaded):

Sat 24 Mar 2007 05:00:50 PM EDT|Uppercase|Deferring communication for 1 min 0 sec
Sat 24 Mar 2007 05:00:50 PM EDT|Uppercase|Reason: Unrecoverable error for result wu_1174768406_11_1 (process exited with code 22 (0x16))
Sat 24 Mar 2007 05:00:50 PM EDT|Uppercase|Computation for task wu_1174768406_11_1 finished
Sat 24 Mar 2007 05:00:50 PM EDT|Uppercase|Output file wu_1174768406_11_1_0 for task wu_1174768406_11_1 absent
Sat 24 Mar 2007 05:00:50 PM EDT|Uppercase|Starting wu_1174768406_12_1
Sat 24 Mar 2007 05:00:50 PM EDT|Uppercase|Starting task wu_1174768406_12_1 using ucase version 500
Sat 24 Mar 2007 05:00:51 PM EDT|Uppercase|Deferring communication for 1 min 0 sec
Sat 24 Mar 2007 05:00:51 PM EDT|Uppercase|Reason: Unrecoverable error for result wu_1174768406_12_1 (process exited with code 22 (0x16))


Thanks and help, as always, is appreciated.

Max
2) Message boards : Server programs : Workunits do not download (Message 8617)
Posted 8 Mar 2007 by mearvk
Post:
When I attach a new client to my project I get:

[file_xfer] Started download of (executable)
[file_xfer] Finished download of (executable)
[file_xfer] Throughput ....

After which I get on every client some message like:

Requesting 8640 seconds of new work
Scheduler RPC succeeded [server version 508]
Message from server: No work sent
Reason: requested by project

There are workunits and results in the database and boincadm and wwwrun (apache) both have access to the database. This is incredibly annoying. I have been banging at Boinc for days and I can't get past this error. If you don't know how to fix it or debug it quickly list some of the possible things that could be causing it and ways to check them.


I have already subscribed to the boinc_dev and boinc_projects mailing lists.

Thank you,

Max
3) Message boards : Server programs : Project encountered internal error: shared memory (Message 8424)
Posted 25 Feb 2007 by mearvk
Post:
Still trying to get my home brewed boinc server to work. I get the following at my Windows box while trying to connect over a LAN to my Linux (OpenSuse 10.2) box.


2/25/2007 12:55:35 AM|Upper Case Convertor|Sending scheduler request: Requested by user
2/25/2007 12:55:35 AM|Upper Case Convertor|Requesting 8640 seconds of new work
2/25/2007 12:55:40 AM|Upper Case Convertor|Scheduler RPC succeeded
2/25/2007 12:55:40 AM|Upper Case Convertor|Message from server: Project encountered internal error: shared memory
2/25/2007 12:55:40 AM|Upper Case Convertor|Deferring communication for 1 hr 0 min 0 sec
2/25/2007 12:55:40 AM|Upper Case Convertor|Reason: project is down

I've googled around for a solution but can't find anything substanital, mostly clients complaining and nothing on server admin stuff.

Help!
4) Message boards : Server programs : Server Side Issues (Message 8421)
Posted 25 Feb 2007 by mearvk
Post:
So I found one problem with my input in config.xml:

sample_dummy_validator -d 3 -app uppercase

SHOULD HAVE BEEN

sample_dummy_assimilator -d 3 -app uppercase

but I still can't get "transitioner -d 3" to run...

And the database isn't getting workunits inserted still.
5) Message boards : Server programs : Translation.inc file has an error in it (Message 8420)
Posted 25 Feb 2007 by mearvk
Post:
Line 329 should have the ".$file." syntax instead of " $file " ...

It's on the stable download so you guys might want to fix it.

6) Message boards : Server programs : Server Side Issues (Message 8418)
Posted 25 Feb 2007 by mearvk
Post:
bin/start yields:

DAEMON pid status lockfile disabled commandline
1 5159 running locked no feeder -d 3
2 5161 NOT RUNNING UNLOCKED no transitioner -d 3
3 5164 running locked no file_deleter -d 3
4 5166 running locked no make_work -wu_name easywu -cusion 5
5 5170 running locked no sample_trivial_validator -d 3 -app uppercase
6 5173 NOT RUNNING UNLOCKED no sample_dummy_validator -d 3 -app uppercase

So, somehow a couple of these daemons aren't starting. Which may bring us to my next problem: the workunits aren't being created in the database via the cron daemon. I have a single workunit in the DB but it simply isn't adding new ones.

Finally, what should the $PROJDIR/apps/ subdirectory structure look like? I've seen a couple suggestions:

/boinc/projects/uppercase/apps/uppercase/uppercase_5.9_i686-pc-linux-gnu/uppercase_5.9_i686-pc-linux-gnu
/boinc/projects/uppercase/apps/appname_version_platform

Should the executable be directly under apps or have its own subdirectory structure like the first line?

Thanks,

Mearvk


Config.xml looks like:

<?xml version="1.0" ?>
<boinc>
<config>
<upload_dir>
/boinc/projects/uppercase/upload
</upload_dir>
<long_name>
Upper Case Convertor
</long_name>
<cgi_url>
http://192.168.1.3/uppercase_cgi/
</cgi_url>
<sched_debug_level>
3
</sched_debug_level>
<disable_account_creation>
0
</disable_account_creation>
<download_url>
http://192.168.1.3/uppercase/download
</download_url>
<db_user>
root
</db_user>
<app_dir>
/boinc/projects/uppercase/apps
</app_dir>
<download_dir>
/boinc/projects/uppercase/download
</download_dir>
<fuh_debug_level>
3
</fuh_debug_level>
<master_url>
http://192.168.1.3/uppercase/
</master_url>
<host>
linux-mnuu
</host>
<db_name>
uppercase
</db_name>
<shmem_key>
0x11111375
</shmem_key>
<show_results>
1
</show_results>
<key_dir>
/boinc/projects/uppercase/keys/
</key_dir>
<upload_url>
http://192.168.1.3/uppercase_cgi/file_upload_handler
</upload_url>
<db_passwd>




</db_passwd>
<min_sendwork_interval>
6
</min_sendwork_interval>
<db_host>




</db_host>
<daily_result_quota>
500
</daily_result_quota>
<one_result_per_user_per_wu>
1
</one_result_per_user_per_wu>
<max_wus_to_send>
50
</max_wus_to_send>
</config>
<tasks/>
<daemons>
<daemon>
<cmd>
feeder -d 3
</cmd>
</daemon>
<daemon>
<cmd>
transitioner -d 3
</cmd>
</daemon>
<daemon>
<cmd>
file_deleter -d 3
</cmd>
</daemon>
<daemon>
<cmd>
make_work -wu_name easywu -cusion 5
</cmd>
</daemon>
<daemon>
<cmd>
sample_trivial_validator -d 3 -app uppercase
</cmd>
</daemon>
<daemon>
<cmd>
sample_dummy_validator -d 3 -app uppercase
</cmd>
</daemon>
</daemons>
</boinc>



Project.xml looks like:

<boinc>
<platform>
<name>windows_intelx86</name>
<user_friendly_name>Microsoft Windows (98 or later) running on an Intel x86-compatible CPU</user_friendly_name>
</platform>
<platform>
<name>i686-pc-linux-gnu</name>
<user_friendly_name>Linux running on an Intel x86-compatible CPU</user_friendly_name>
</platform>
<platform>
<name>x86_64-pc-linux-gnu</name>
<user_friendly_name>Linux running on an AMD x86_64 or Intel EM64T CPU</user_friendly_name>
</platform>
<platform>
<name>powerpc-apple-darwin</name>
<user_friendly_name>Mac OS X 10.3 or later running on Motorola PowerPC</user_friendly_name>
</platform>
<platform>
<name>i686-apple-darwin</name>
<user_friendly_name>Mac OS 10.4 or later running on Intel</user_friendly_name>
</platform>
<platform>
<name>sparc-sun-solaris2.7</name>
<user_friendly_name>Solaris 2.7 running on a SPARC-compatible CPU</user_friendly_name>
</platform>
<platform>
<name>sparc-sun-solaris</name>
<user_friendly_name>Solaris 2.8 or later running on a SPARC-compatible CPU</user_friendly_name>
</platform>
<platform>
<name>sparc64-sun-solaris</name>
<user_friendly_name>Solaris 2.8 or later running on a SPARC 64-bit CPU</user_friendly_name>
</platform>
<platform>
<name>anonymous</name>
<user_friendly_name>anonymous</user_friendly_name>
</platform>
<app>
<name>uppercase</name>
<user_friendly_name>Upper Case Convertor</user_friendly_name>
</app>
</boinc>





Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.