Ticket #911 (closed Defect: fixed)

Opened 6 months ago

Last modified 3 months ago

BOINC core hangs when there is active CUDA tasks in queue but CUDA anabled GPU removed

Reported by: Raistmer Assigned to: davea
Priority: Blocker Milestone: Undetermined
Component: Client - Daemon Version: 6.6.20
Keywords: Cc:

Description (Last modified by Ageless)

Please, see this thread for full diagnostic of this issue.

In short, BOINC core hangs badly when there are active CUDA tasks (anonymous platform used) but no CUDA GPU. It does this after changing CUDA task status from "suspended" to "waiting to run". BOINC manager then restarts core from prev saved state with CUDA task disabled.

Note, thet BOINC correctly determined that there is no CUDA GPUs in system. Moreover, using <no_gpu> tag leads to exactly the same behavior.

There are some crash dumps in logs files.

Attachments

BOINC_hung_after_GPU_replacement_logs.rar (12.1 kB) - added by Raistmer on 06/03/09 14:30:28.
BOINC log files, renamed and RARed
BOINC_hung_after_GPU_replacement_logs.zip (14.9 kB) - added by Nicolas on 06/03/09 14:59:02.
Reuploaded logs in free format
BOINC_core_hang_bug_additional_logging_enabled.rar (42.0 kB) - added by Raistmer on 06/03/09 15:05:02.
Additional logging enabled by cc_config as recommended in related thread

Change History

06/03/09 14:30:28 changed by Raistmer

  • attachment BOINC_hung_after_GPU_replacement_logs.rar added.

BOINC log files, renamed and RARed

06/03/09 14:45:47 changed by Ageless

  • description changed.

Reformatted link.

06/03/09 14:46:13 changed by Ageless

  • description changed.

Added space.

06/03/09 14:59:02 changed by Nicolas

  • attachment BOINC_hung_after_GPU_replacement_logs.zip added.

Reuploaded logs in free format

06/03/09 15:05:02 changed by Raistmer

  • attachment BOINC_core_hang_bug_additional_logging_enabled.rar added.

Additional logging enabled by cc_config as recommended in related thread

06/03/09 15:29:55 changed by davea

  • status changed from new to closed.
  • resolution set to fixed.

(In [18284]) - client: when parsing app_info.xml, make sure coprocessors are present.

fixes #911

06/04/09 10:44:15 changed by romw

(In [18300]) - client: when parsing app_info.xml, make sure coprocessors are present.

fixes #911

client/

cs_statefile.cpp


If this page is incomplete or incorrect, please edit it or add it to the wiki to-do list. To do this, you must be logged in; click Login or Register above.