Ticket #366 (new Defect)

Opened 9 months ago

Last modified 9 months ago

Client needs to cancel transfer after server abort

Reported by: Richard Haselgrove Assigned to: davea
Priority: Major Milestone: 5.10
Component: Client - Work Fetch Policy Version:
Keywords: Cc:

Description

SETI@home has implemented 'Redundant result - Cancelled by server' technology - if a quorum is filled and work validated, messages are sent to host(s) with the same WU 'abort if not started'.

During outage recovery on a busy/short WU project like SETI, it is possible that a WU can be validated and assimilated while the third quorum member (me, in this case!) is still trying to download the datafile through congestion/backoffs.

After assimilation, the datafile is quite properly deleted from the server download fanout, and the host's transfer is doomed to fail forever with 'file not found' retrys.

Once the host gets into this state, it seems to inhibit all work fetch for that project, until the offending transfer is aborted by user intervention. It will rapidly run dry.

When the server abort is issued and acted upon, presumably there are two actions:

1) Remove from task list after report 2) Delete datafile from disk

There needs to be a third:

3) Cancel any pending transfer for the datafile

Change History

08/24/07 11:59:53 changed by jason_gee


This happened to one of my hosts. Project aborted result, leaving it in the download queue. Endless server retries resulted, and refusal to get more work until transfer manually aborted.
jason_gee



If this page is incomplete or incorrect, please edit it or add it to the wiki to-do list. To do this, you must be logged in; click Login or Register above.