Machine Learning Model Training A.I. Projects, How?

Message boards : Questions and problems : Machine Learning Model Training A.I. Projects, How?
Message board moderation

To post messages, you must log in.

AuthorMessage
Immortality

Send message
Joined: 3 Feb 24
Posts: 3
Message 113538 - Posted: 3 Feb 2024, 13:19:02 UTC
Last modified: 3 Feb 2024, 13:19:20 UTC

Hi, how would machine learning projects A.I. model training be capable of utilizing BOINC?

Does BOINC want to integrate or is a third party cluster management software used.

For instance, currently...

1. Use a utility with BOINC to manage the cluster, map and grade the network.
2. Setup a webpage to have model training jobs submitted to a task scheduler utility.
3. Alter the code to add the current cluster info, for instance tf.cluster on tensor and check for tf.distribute.
4. Execute the script on the master node utilizing the cluster directly.

Check all nodes on the same version of tensor and python and working.

This largely bypasses BOINC except for the cluster management and recruitment.

Does BOINC need to integrate the above or be upgraded in some way, or is the above the current situation?

Thanks in advance.
ID: 113538 · Report as offensive     Reply Quote
Dr Who Fan
Avatar

Send message
Joined: 10 May 07
Posts: 1354
United States
Message 113551 - Posted: 4 Feb 2024, 21:48:38 UTC - in response to Message 113538.  

See my reply in link below to your question posted today.
https://boinc.berkeley.edu/forum_thread.php?id=15199
ID: 113551 · Report as offensive     Reply Quote
Immortality

Send message
Joined: 3 Feb 24
Posts: 3
Message 113552 - Posted: 5 Feb 2024, 13:15:24 UTC - in response to Message 113551.  

I've emailed them.
From what I can gather. I do not think they trained the models in a distributed way, and I suspect BOINC needs some additional development to distribute model training.
ID: 113552 · Report as offensive     Reply Quote

Message boards : Questions and problems : Machine Learning Model Training A.I. Projects, How?

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.