Rosetta@home computation error

WzR

Joined: 24 Mar 20
Posts: 5
Finland
Message 97028 - Posted: 24 Mar 2020, 16:03:20 UTC

Hi,

Client: 7.14.2, Win10

I started using the client yesterday. I'm also running Folding@home in parallel (GPU only while Rosetta is active). CPU is a Ryzen, no OC. The first batch of WUs was processed and submitted, and it is currently crunching other WUs; however, one failed with a computation error at 100.000%. The logs show:

24.3.2020 5:37:10 PM | Rosetta@home | Computation for task 8ft7da9j_3h3_design3_COVID-19_SAVE_ALL_OUT_902898_3_0 finished
24.3.2020 5:37:10 PM | Rosetta@home | Output file 8ft7da9j_3h3_design3_COVID-19_SAVE_ALL_OUT_902898_3_0_r1505128294_0 for task 8ft7da9j_3h3_design3_COVID-19_SAVE_ALL_OUT_902898_3_0 absent

so it did finish successfully but failed to write the output file and submit or...?

Thank you!
ID: 97028
Richard Haselgrove
Volunteer tester
Help desk expert

Joined: 5 Oct 06
Posts: 5077
United Kingdom
Message 97029 - Posted: 24 Mar 2020, 16:08:22 UTC - in response to Message 97028.  

Usually, it's the other way round. If an application crashes, or otherwise ends abnormally, then it doesn't reach the code that writes the output file.

'Output file absent' is a sign that the app finished, but didn't finish normally - most often, a crash.
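If you want to pick such tasks out of a longer event log, a minimal sketch follows. It assumes the log has been saved as plain text in the same "date | project | message" layout as the lines quoted above; the function name `tasks_with_absent_output` is made up for this example and is not part of BOINC.

```python
import re

# Matches the message text seen in the event log quoted in this thread:
# "Output file <file> for task <task> absent"
ABSENT_RE = re.compile(r"Output file (\S+) for task (\S+) absent")


def tasks_with_absent_output(log_text: str) -> list[str]:
    """Return task names (in order, without repeats) whose output file was reported absent."""
    tasks = []
    for line in log_text.splitlines():
        match = ABSENT_RE.search(line)
        if match and match.group(2) not in tasks:
            tasks.append(match.group(2))
    return tasks
```

Each name it returns can then be pasted into the project's task name search to look up the exit status.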
ID: 97029
WzR

Joined: 24 Mar 20
Posts: 5
Finland
Message 97030 - Posted: 24 Mar 2020, 16:14:47 UTC - in response to Message 97029.  
Last modified: 24 Mar 2020, 16:21:22 UTC

Em, no, it didn't crash IMHO. I'm checking it periodically and would have noticed if the client had disappeared. Also, would the logs indicate a crash?
UPDATE: Event Viewer -> Windows Logs -> Application - no entries for that period of time
ID: 97030
WzR

Joined: 24 Mar 20
Posts: 5
Finland
Message 97031 - Posted: 24 Mar 2020, 16:28:55 UTC

Oh, I think you meant that the separate process responsible for that specific task crashed. Okay, how do I resolve it then? I tried suspending and resuming that task from the client, but nothing happens. It has 6:04 hours of elapsed time; sad if that's wasted.
ID: 97031
Richard Haselgrove
Volunteer tester
Help desk expert

Joined: 5 Oct 06
Posts: 5077
United Kingdom
Message 97032 - Posted: 24 Mar 2020, 16:56:03 UTC

The first line of the event log that you posted contains a task name. Rosetta has the 'task name search' feature enabled, which led me to Rosetta task 1131821072. That has the finish reason

Exit status	-1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION
- which is actually a Windows code, not Rosetta or BOINC.
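In case the two numbers look unrelated: they are the same 32-bit value, shown once as a signed decimal and once as unsigned hex. A small sketch of the conversion (the helper name `exit_status_to_hex` is just for illustration):

```python
def exit_status_to_hex(status: int) -> str:
    """Reinterpret a signed 32-bit exit status as an unsigned 32-bit hex string."""
    # Masking with 0xFFFFFFFF maps negative values onto their two's-complement form.
    return f"0x{status & 0xFFFFFFFF:08X}"


print(exit_status_to_hex(-1073741819))  # prints 0xC0000005
```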

There is a brief report from the 'Windows Runtime Debugger', but it's much shorter than most I've seen - the fuller ones have a 'stack trace' towards the end, which is often the most useful part. Yours is missing.

I think you may have to write this one off to experience. You can look around the Rosetta forums to see if anyone else has reported this kind of problem recently. You can also ask there for advice - quote that task ID number above. Finally, keep an eye on Rosetta workunit 1019438559 - a duplicate copy of your task has been sent out to another volunteer. It sometimes happens that a data error causes a task to crash, and if that happens, all the other copies fail as well. At least that would give you reassurance that it wasn't your - or your machine's - fault.
ID: 97032
WzR

Joined: 24 Mar 20
Posts: 5
Finland
Message 97033 - Posted: 24 Mar 2020, 17:11:04 UTC - in response to Message 97032.  

Just looked at the client and couldn't find it, that task is nowhere to be seen, I think it disappeared:

24.3.2020 5:37:10 PM | Rosetta@home | Computation for task 8ft7da9j_3h3_design3_COVID-19_SAVE_ALL_OUT_902898_3_0 finished
24.3.2020 5:37:10 PM | Rosetta@home | Output file 8ft7da9j_3h3_design3_COVID-19_SAVE_ALL_OUT_902898_3_0_r1505128294_0 for task 8ft7da9j_3h3_design3_COVID-19_SAVE_ALL_OUT_902898_3_0 absent
24.3.2020 5:37:13 PM | Rosetta@home | Starting task 9fe6ot7b_3h3_design3_COVID-19_SAVE_ALL_OUT_902898_3_0
24.3.2020 6:00:21 PM | Rosetta@home | task 8ft7da9j_3h3_design3_COVID-19_SAVE_ALL_OUT_902898_3_0 suspended by user
24.3.2020 6:00:22 PM | Rosetta@home | task 8ft7da9j_3h3_design3_COVID-19_SAVE_ALL_OUT_902898_3_0 resumed by user
vanished here?
24.3.2020 6:37:48 PM | Rosetta@home | Sending scheduler request: To report completed tasks.
24.3.2020 6:37:48 PM | Rosetta@home | Reporting 1 completed tasks
24.3.2020 6:37:48 PM | Rosetta@home | Requesting new tasks for CPU
...
...
---
well, anyway, it's gone now, hopefully won't happen again.

Thank you!
ID: 97033
Jord
Volunteer tester
Help desk expert

Joined: 29 Aug 05
Posts: 15477
Netherlands
Message 97034 - Posted: 24 Mar 2020, 17:34:33 UTC - in response to Message 97033.  

When a task finishes in the normal manner, it will upload the result of the calculations to the Rosetta project server and BOINC will report the task as being finished and finished correctly.
In your case the Rosetta science application crashed before it could finalize the result of the calculations, thus it had nothing to upload (output file absent), and BOINC then only reported that the task had finished in a faulty manner.

Once the task in question has been reported, it is no longer on your system: it gets deleted.
ID: 97034
WzR

Joined: 24 Mar 20
Posts: 5
Finland
Message 97036 - Posted: 24 Mar 2020, 17:46:24 UTC - in response to Message 97034.  

Thank you for the clarification! I'm new to this and jumped into running those tasks without much background research :)
ID: 97036
Did

Joined: 9 Apr 20
Posts: 1
Message 97396 - Posted: 9 Apr 2020, 9:54:00 UTC - in response to Message 97036.  

Hello, I have the same problem with the same message.

Yesterday I installed the client on two Windows 10 PCs for Rosetta@home. One is doing the job; the other is not completing anything. I get computation errors with file problems. I found in a very old message that it may be because the Visual C++ redistributable is missing, so I installed it, but that was not it.

Did you solve your problem?
ID: 97396
reboot81

Joined: 15 Apr 20
Posts: 3
Sweden
Message 97696 - Posted: 15 Apr 2020, 14:32:55 UTC - in response to Message 97396.  

Received the same error on a Docker image. Tracked down the issue to BOINC not having permission to write to disk.
Could this be a lead? "absent" meaning "I can't save to disk or send it anywhere, something is stopping me!" (Like a firewall, insufficient disk space, or permission errors?)
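A rough way to test the disk side of that theory is sketched below. It should be run as the same user the BOINC client runs as. The path in the usage line is a placeholder (substitute your own BOINC data directory), and both the `check_data_dir` name and the 1 GB free-space threshold are assumptions made for this example, not BOINC settings.

```python
import os
import shutil
import tempfile


def check_data_dir(path: str, min_free_bytes: int = 1_000_000_000) -> dict:
    """Report whether `path` exists, accepts new files, and has min_free_bytes free."""
    result = {"exists": os.path.isdir(path), "writable": False, "enough_space": False}
    if not result["exists"]:
        return result
    try:
        # Create and remove a real file rather than trusting permission bits alone.
        with tempfile.NamedTemporaryFile(dir=path):
            pass
        result["writable"] = True
    except OSError:
        result["writable"] = False
    result["enough_space"] = shutil.disk_usage(path).free >= min_free_bytes
    return result


if __name__ == "__main__":
    # Hypothetical path: replace with the data directory your container mounts.
    print(check_data_dir("/path/to/boinc/data"))
```

This only covers permissions and free space; it says nothing about a firewall, and it would not explain a crash like the access violation reported earlier in the thread.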
ID: 97696


Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.