Postponed: VM job unmanageable, restarting later.

milesrf

Joined: 7 Aug 21
Posts: 2
United States
Message 104986 - Posted: 8 Aug 2021, 0:06:54 UTC

I'm often seeing this error on the only VM task I am running:

Postponed: VM job unmanageable, restarting later.

This task has been running for about two months, and looks likely to run for another six months before it finishes. A few dozen other tasks for the same workunit have already failed for other users. However, a similar workunit not using VirtualBox completed years ago. This project is known to have tasks run for months or years after their original deadline, and still get credit.

I've seen no sign that it restarts without manual intervention. Shutting down BOINC and then restarting it helps, but not much.

Shutting down BOINC, rebooting Windows, and then restarting BOINC helps more - it's then typically about a day before I see this problem again.

Is there a way to make VirtualBox produce a log file showing where the last few commands it received came from?

Part of the event log for BOINC startup:

8/7/2021 6:24:38 PM | | Starting BOINC client version 7.16.11 for windows_x86_64
8/7/2021 6:24:38 PM | | log flags: file_xfer, sched_ops, task
8/7/2021 6:24:38 PM | | Libraries: libcurl/7.47.1 OpenSSL/1.0.2s zlib/1.2.8
8/7/2021 6:24:38 PM | | Data directory: C:\ProgramData\BOINC
8/7/2021 6:24:38 PM | | Running under account *****
8/7/2021 6:24:39 PM | | CUDA: NVIDIA GPU 0: NVIDIA GeForce RTX 2070 (driver version 471.41, CUDA version 11.4, compute capability 7.5, 4096MB, 3468MB available, 7880 GFLOPS peak)
8/7/2021 6:24:39 PM | | OpenCL: NVIDIA GPU 0: NVIDIA GeForce RTX 2070 (driver version 471.41, device version OpenCL 3.0 CUDA, 8192MB, 3468MB available, 7880 GFLOPS peak)
8/7/2021 6:24:39 PM | | Windows processor group 0: 16 processors
8/7/2021 6:24:39 PM | | Host name: Nathan-PC
8/7/2021 6:24:39 PM | | Processor: 16 GenuineIntel Intel(R) Core(TM) i7-5960X CPU @ 3.00GHz [Family 6 Model 63 Stepping 2]
8/7/2021 6:24:39 PM | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 fma cx16 sse4_1 sse4_2 movebe popcnt aes f16c rdrandsyscall nx lm avx avx2 vmx tm2 dca pbe fsgsbase bmi1 smep bmi2
8/7/2021 6:24:39 PM | | OS: Microsoft Windows 10: Professional x64 Edition, (10.00.19043.00)
8/7/2021 6:24:39 PM | | Memory: 31.90 GB physical, 36.65 GB virtual
8/7/2021 6:24:39 PM | | Disk: 463.23 GB total, 261.45 GB free
8/7/2021 6:24:39 PM | | Local time is UTC -5 hours
8/7/2021 6:24:39 PM | | No WSL found.
8/7/2021 6:24:39 PM | | VirtualBox version: 6.1.12
8/7/2021 6:24:39 PM | | Config: don't use GPUs while FahCore_21.exe is running
8/7/2021 6:24:39 PM | | Config: don't use GPUs while FahCore_22.exe is running
8/7/2021 6:24:39 PM | World Community Grid | General prefs: from World Community Grid (last modified 22-Feb-2021 14:46:44)
8/7/2021 6:24:39 PM | World Community Grid | Computer location: work
8/7/2021 6:24:39 PM | | General prefs: using separate prefs for work
8/7/2021 6:24:39 PM | | Reading preferences override file
8/7/2021 6:24:39 PM | | Preferences:
8/7/2021 6:24:39 PM | | max memory usage when active: 26131.03 MB
8/7/2021 6:24:39 PM | | max memory usage when idle: 26131.03 MB
8/7/2021 6:24:42 PM | | max disk usage: 46.32 GB
8/7/2021 6:24:42 PM | | max CPUs used: 14
8/7/2021 6:24:42 PM | | (to change preferences, visit a project web site or select Preferences in the Manager)
8/7/2021 6:24:42 PM | | Setting up project and slot directories
8/7/2021 6:24:42 PM | | Checking active tasks
8/7/2021 6:24:42 PM | RNA World | Task cmsvm_GA-p[e20-30MB_Lin64f]_1_Oryzias-latipes-(Japanese-medaka)_DG000021.lin.EMBL_RF00028_Intron_gpI_1349111823_80976_80 is 40.41 days overdue; you may not get credit for it. Consider aborting it.

*****

8/7/2021 6:24:42 PM | | Setting up GUI RPC socket
8/7/2021 6:24:42 PM | | Checking presence of 976 project files
8/7/2021 6:24:42 PM | | Suspending computation - user request
8/7/2021 6:24:43 PM | RNA World | Sending scheduler request: Requested by project.
8/7/2021 6:24:43 PM | RNA World | Not requesting tasks: don't need (CPU: not highest priority project; NVIDIA GPU: not highest priority project)
8/7/2021 6:24:45 PM | RNA World | Scheduler request completed
8/7/2021 6:24:45 PM | RNA World | Project requested delay of 3636 seconds
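
In case it helps anyone comparing logs: a minimal Python sketch for filtering an event log like the one above by project. It assumes the log was saved as plain text in the same "timestamp | project | message" layout pasted here, with US-style dates. The names LogEntry, parse_line and entries_for are made up for illustration and are not part of BOINC.

```python
from datetime import datetime
from typing import Iterable, List, NamedTuple, Optional


class LogEntry(NamedTuple):
    when: datetime
    project: str  # empty string for client-level messages
    message: str


def parse_line(line: str) -> Optional[LogEntry]:
    """Parse one 'timestamp | project | message' line; return None if it does not fit."""
    parts = line.split("|", 2)  # at most 3 fields, so a '|' inside the message survives
    if len(parts) != 3:
        return None
    stamp, project, message = (p.strip() for p in parts)
    try:
        # Assumed layout: month/day/year and a 12-hour clock, as in the pasted log
        when = datetime.strptime(stamp, "%m/%d/%Y %I:%M:%S %p")
    except ValueError:
        return None
    return LogEntry(when, project, message)


def entries_for(lines: Iterable[str], project: str) -> List[LogEntry]:
    """Keep only the parsed entries that belong to the given project."""
    parsed = (parse_line(line) for line in lines)
    return [e for e in parsed if e is not None and e.project == project]


# Three lines taken from the log above, including a separator that is skipped
SAMPLE = [
    "8/7/2021 6:24:42 PM | | Suspending computation - user request",
    "*****",
    "8/7/2021 6:24:45 PM | RNA World | Project requested delay of 3636 seconds",
]

for entry in entries_for(SAMPLE, "RNA World"):
    print(entry.when.isoformat(), entry.message)
```

Lines that do not match the layout (such as the ***** separators) are skipped rather than raising an error, so a partly edited paste can still be filtered.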

Copyright © 2021 University of California. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.