ws-tx message

Subject: Re: [ws-tx] Issue 052 - WS-AT: Replay message generates protocolerrors
From: Mark Little <mark.little@jboss.com>
To: Ram Jeyaraman <Ram.Jeyaraman@microsoft.com>
Date: Fri, 26 May 2006 09:30:26 +0100
My (further) 2 cents worth: I am slightly less tied to Replay from a 
technical perspective: I think an argument for its utility can be made. 
However, from an implementation perspective, I am completely tied to it. 
There have been several discussions centred around the impact of changes 
on existing implementations. I have not checked to see if that aspect 
has swayed votes, but I do believe it is important to take into 
consideration when errors are not at the heart of the discussion. 
Unfortunately there is one big error in this current issue that almost 
makes me want to drop Replay: the fact that the state table does not 
reflect the text and clearly states that transactions MUST roll back if 
Replay is used. I think that is wrong (to put it mildly!) and we need to 
fix the state table so that it reflects the text, which is correct IMO.

If we fix the state table as I indicated above, then I'm happy to keep 
Replay. Otherwise, let's drop Replay and update all of our 
implementations (and interoperability scenario documents).

Mark.


Ram Jeyaraman wrote:
>
> Alastair,
>
> My intent is to show that the replay message is quite valid.
>
> Case 1: A recovered participant in “Prepared Success” state may send a 
> “Replay” message to a coordinator that is in “Preparing” or “None” state.
>
> A participant could have successfully prepared (commit decision had 
> been logged), and yet may not be able to successfully send the 
> “Prepared” message, because of network failure of some sort. Such a 
> participant would likely, after multiple unsuccessful attempts to send 
> the “Prepared” message to the coordinator, give up and abort the 
> transaction; that is, it may send out a “Rollback” message to all its 
> resources. The participant could fail at this time, and upon recovery, 
> will notice that it had indeed successfully prepared, and will send 
> out a “Replay” message to the coordinator to find out the outcome. The 
> coordinator, which may be in “Preparing” or “None” state (if the 
> coordinator had aborted) would send a “Rollback” message back, since 
> the coordinator never entered the “Prepared Success” state.
>
> Case 2: A recovered participant in “Prepared Success” state may send a 
> “Replay” message to a coordinator that in “Prepared Success”, 
> “Committing”, or “Aborting” state.
>
> A participant that has successfully prepared (commit decision had been 
> logged), successfully sends out a “Prepared” message to its 
> coordinator, and then crashes. The coordinator upon receipt of the 
> “Prepared” message moves to the “Prepared Success” state. The 
> participant upon recovery will notice that it had indeed successfully 
> prepared, and will send out a “Replay” message to the coordinator to 
> find out the outcome. The coordinator, if in the “Prepared Success” 
> state, would ignore this message; and the participant would resend the 
> replay message. The coordinator when it makes a “Commit or Rollback 
> decision” would move to a “Committing” or “Aborting” state. When the 
> replay message arrives at the coordinator that is in a “Committing” or 
> “Aborting” state, it would send a “Commit” or “Rollback” message to 
> the recovered participant, depending on the final outcome.
>
> Analysis:
>
> It is only appropriate for a recovered participant to send a “Replay” 
> message to find out the final outcome; resending a “Prepared” message 
> may result in an erroneous situation. For example, in case 1 above, 
> the recovered participant should not send a “Prepared” message to the 
> coordinator, since it may have already aborted the transaction 
> locally. Hence, the “Replay” message has a role and a place in the 
> protocol.
>
> ------------------------------------------------------------------------
>
> *From:* Alastair Green [mailto:alastair.green@choreology.com]
> *Sent:* Wednesday, May 24, 2006 4:24 AM
> *To:* Ram Jeyaraman
> *Cc:* Mark Little; Peter Furniss; ws-tx@lists.oasis-open.org
> *Subject:* Re: [ws-tx] Issue 052 - WS-AT: Replay message generates 
> protocol errors
>
> Ram,
>
> Thanks for your comments. I think they help to clarify why Replay 
> should go.
>
> The "resend" definition of Replay is perfectly acceptable in 
> principle: it would be better worded as "interpret the message as a 
> resend of the last protocol message received by the Coordinator View". 
> It means: Replay is an algebraic quantity, to be valued by the 
> Coordinator View, given its state, as equivalent to the last message 
> received from the Participant.
>
> If the Coordinator View is Active, then Replay can be interpreted as 
> "the last received message was Register, I must replay my Register 
> Response".
>
> [In practice, using Replay to mean Register is going to cause an 
> almighty tangle. Because of the decision to use request-response for 
> WS-C messages, and to use [source endpoint] for notification messages 
> like Replay, it is not possible to make Replay equal Register. There 
> is a related issue as to which EPR a Replay-as-Register should be sent 
> to: is it the Coordinator View EPR, or the Registration Service EPR?. 
> I think we should stop this happening altogether.]
>
> If the Coordinator View is Preparing (awaiting a Prepared), then 
> Replay can safely be interpreted as "the last received message was 
> Register; I must replay my Prepare message".
>
> /In neither of these cases is there any reason to abort the 
> transaction/. The conversation can simply be resumed. That is issue 052.
>
> If the Coordinator View is Prepared Success, then Replay can safely be 
> interpreted as "the last message received was Prepared: I must send 
> the outcome", irrespective of the state of CV.
>
> Now let us consider how these cases can arise.
>
> There are three circumstances where some kind of replay or resend may 
> be needed or appropriate:
>
> 1. Known communication failures (comms time out)
> 2. Unknown failures of the interlocutor (it has crashed, its last 
> message has not been delivered, it is so slow as to appear to have failed)
> 3. Known failures of oneself (crash recovery from a persistent record).
>
> The correct reaction to known-to-sender comms failures is to permit 
> retry (according to any retry strategy that the implementer finds 
> useful). There is no need to say more: but we do need to make that 
> statement, and make it completely general, for all messages. Not 
> mandate, but permit.
>
> Unknown failures manifest themselves as "slow progress" -- e.g. CV 
> expected an answer to Prepare but hasn't received one.
>
> The sender is again free to retry at will. The only thing we know for 
> certain is that an implementation that /never/ retries cannot 
> guarantee to successfully complete the protocol: it depends on retries 
> to achieve guaranteed delivery. There are well-known adaptive retry 
> strategies to avoid network swamping, and we could do with a message 
> (at least in BA) that warns that slow progress is to be expected, to 
> help refine such strategies. But the strategy chosen is an 
> implementation issue.
>
> Again, we need the same general statement: that any message may be 
> resent at any time for any reason, with the qualification that 
> complete failure to attempt retries will produce a broken 
> implementation. I have already raised an issue that we should make 
> such a statement, and remove the comms time out line in the tables as 
> redundant/overly narrow.
>
> The state tables are designed to cope with such duplications, and will 
> also cope with transport-induced duplications.
>
> Known self-failure (crash and recover) will only occur when the party 
> concerned has logged its state. It is an unspoken assumption (perhaps 
> it needs, once again, to be spoken) that an AT Participant will /only/ 
> log when the state tables say it /must/ log, i.e. when it goes 
> prepared. If we follow that assumption, then we don't need to worry 
> about Replay arising when the Coordinator is Active: it can only occur 
> when the Participant has Prepared Success (written its log), and that 
> can't happen till the CV has moved to Preparing (has sent Prepare to P).
>
> [This would argue for making Replay in Active N/A, incidentally. I 
> think at present there is nothing to stop a P logging early (e.g. 
> before or after registering): if it did that then we are wandering 
> into the world of replaying Registers (which I think we should debate 
> and clarify).]
>
> If the Participant has moved to Aborting or Committing (volatile 
> state) then recovery will lead it to revert to the logged state 
> Prepared Success. If the Participant has moved to None then it will 
> not recover (or at least, if it does, then it has retained a durable 
> memory of the outcome, and has no need to Replay). Therefore, we can 
> concentrate solely on the case of recovery of P in Prepared Success.
>
> *Recovering in the Prepared Success state, P sends Replay, and 
> /always/ means by that: "Prepared".
> *
> The CV, on receiving that message, is either Preparing (i.e. awaiting 
> the decision of the hidden overall coordinator C that you don't want 
> us to define), or is Prepared Success (the hidden overall coordinator 
> has decided to log its decision, but has not yet written the log), or 
> is Committing/Aborting (has already sent the Commit or Rollback 
> message), or is None.
>
> The CV row Prepared defines the appropriate reaction of the CV in all 
> of these cases (plus or minus separate, orthogonal arguments about how 
> to react in None).
>
> Therefore:
>
> 1) There is no reason for the CV to react differently than it would if 
> it received Prepared, in any of these states. On reception Replay 
> equals the semantic "prepared", is exactly like the message Prepared.
>
> 2) There is no reason for comms time out to induce sending Prepared, 
> but crash recovery to induce sending Replay.
>
> 3) P can always replay Prepared (instead of sending Replay). Replay is 
> a redundant mechanism for communicating the semantic "prepared".
>
> 4) If you want to replay Register, the registering service can replay 
> Register.
>
> /5) There is no reason to have Replay/: it is simply a synonym for 
> Prepared (and possibly, maybe, if you really want to do this, for 
> Register). Issue 053.
>
> Redefining the unnecessary seems ... unnecessary.
>
> Alastair
>
>
> Ram Jeyaraman wrote:
>
> Alastair,
>
> Sorry for the delayed response.
>
> Part of the problem here is the current definition of Replay: “resend 
> the last appropriate protocol notification”. The replay message does 
> not necessarily indicate that the participant has successfully 
> prepared; it is possible that the participant is still in a preparing 
> state. Hence, the CV state table states that the coordinator should 
> send a rollback and move to Aborting state; that is, the coordinator 
> does not really know the possible conditions in which a participant 
> may send such a message.
>
> If the replay message were to be redefined, so it offers the guarantee 
> that the participant had indeed successfully prepared, the coordinator 
> may be able to treat it differently; that is, send the transaction 
> outcome.
>
> ------------------------------------------------------------------------
>
> *From:* Alastair Green [mailto:alastair.green@choreology.com]
> *Sent:* Friday, May 19, 2006 3:09 AM
> *To:* Ram Jeyaraman
> *Cc:* Mark Little; Peter Furniss; ws-tx@lists.oasis-open.org 
> <mailto:ws-tx@lists.oasis-open.org>
> *Subject:* Re: [ws-tx] Issue 052 - WS-AT: Replay message generates 
> protocol errors
>
> Ram,
>
> I very much appreciate your responsiveness, but I'm afraid that it 
> seems you are arguing simply by assertion (that which is, shall be).
>
> 1) Why should the coordinator abort in some circumstances purely 
> because of participant crash and recovery, despite the fact that the 
> participant has recovered in an identical state? That is the subject 
> of this issue 052. No-one has attempted to justify this behaviour thus 
> far. *Please can someone explain why this should be so?*
>
> 2) Anything that leaves the participant in doubt as to the outcome 
> requires it to communicate (again) with the C. If this occurs as a 
> result of detecting comms failure this leads to resend of Prepared. In 
> undefined circumstances, we are supposed to send Replay. The spec is 
> entirely silent on which circumstances, incidentally: it merely says 
> that if it is received then it is after "after recoverable failure". 
> If we resolve that aggressive abort is wrong (unjustified) then there 
> will be no reason for having two messages. *Can someone please provide 
> an /argument/ for having two messages, if they carry identical 
> effective semantics?
>
> *3) There is nothing in the spec to prevent replaying of Prepared. 
> Where is this prohibited, defined, circumscribed etc? Why should it 
> be? Is it dangerous or slow?
>
> Alastair
>
> Ram Jeyaraman wrote:
>
> Alastair,
>
> As we discussed below, the replay message is typically sent by a 
> participant that is in an in-doubt situation. It should not be used 
> for replaying a previous protocol message as the specification 
> currently states.
>
> The definition of Replay message should read along these lines:
>
> “Upon receipt of this notification, the coordinator may assume the 
> participant has suffered a recoverable failure. It should resend the 
> transaction outcome (commit or rollback protocol notification) to the 
> in-doubt participant.”
>
> ------------------------------------------------------------------------
>
> *From:* Alastair Green [mailto:alastair.green@choreology.com]
> *Sent:* Thursday, May 18, 2006 4:14 AM
> *To:* Ram Jeyaraman
> *Cc:* Mark Little; Peter Furniss; ws-tx@lists.oasis-open.org 
> <mailto:ws-tx@lists.oasis-open.org>
> *Subject:* Re: [ws-tx] Issue 052 - WS-AT: Replay message generates 
> protocol errors
>
> Ram,
>
> Absolutely. So if Replay is pure synonym for a resent Prepared (i.e. 
> carries no additional semantic relevant to the outcome, e.g. does not 
> imply that P is aborting) then there is no reason for C to abort the 
> transaction on receiving the message. Put another way, there is no 
> reason for Replay to induce a different behaviour from a resent 
> Prepared (which would not cause abortion).
>
> If Replay/Preparing is corrected to become identical to 
> Prepared/Preparing, then the unnecessary abort problem goes away (this 
> issue).
>
> If the two rows become identical, then there is no need to have a 
> separate Replay message (it is redundant) -- the related issue.
>
> Alastair
>
> Ram Jeyaraman wrote:
>
> Alastair,
>  
> A participant that has successfully prepared when it sends out a replay
> (after a crash) genuinely wants to know what the outcome is, so it can
> complete the in-doubt transaction during its recovery.
>  
> -----Original Message-----
> From: Alastair Green [mailto:alastair.green@choreology.com] 
> Sent: Wednesday, May 10, 2006 4:15 AM
> To: Mark Little
> Cc: Peter Furniss; Ram Jeyaraman; ws-tx@lists.oasis-open.org <mailto:ws-tx@lists.oasis-open.org>
> Subject: Re: [ws-tx] Issue 052 - WS-AT: Replay message generates
> protocol errors
>  
> Ram, Mark --
>  
> I've got a quite a few thoughts on this, but I want to check with the TC
>  
> on a couple of premises, in case I am misunderstanding some unwritten 
> piece of design intent.
>  
> 1. The text at l.221 of the spec defines the Replay message thus:
>  
> "Upon receipt of this notification, the coordinator may assume the 
> participant has suffered a recoverable failure. It should resend the 
> last appropriate protocol notification."
>  
> Does a Replay message for a Participant that crashed in the Prepared 
> Success and then recovered, carry the semantic:
>  
> a) "Have recovered, am in good state to proceed, i.e. am still
> prepared", or
> b) "Have recovered, was prepared, but am now aborting", or
> c) "Have recovered, and may be prepared successfully, or may be 
> aborting", or
> d) some other semantic, that I haven't thought of?
>  
> 2. Is a Participant which crashed in the Prepared Success state, has 
> recovered from a failure and is still prepared (i.e. is in the same 
> state as it was prior to crash recovery) allowed to re-send Prepared? Or
>  
> better, can its decision to do so damage the consistency of the 
> transaction outcome, or slow down arriving at the outcome decision?
>  
> Alastair
>  
>  
> Mark Little wrote:
>   
>>  
>> Peter Furniss wrote:
>>     
>>> I think it is likely the state table is being misinterpreted. I'm not
>>> sure by who :-)
>>>  
>>> If you treat the state as referring to just one participant, you
>>>       
> either
>   
>>> get some very convoluted definitions of the internal events (c.f.
>>>       
> issue
>   
>>> 048 - but more convoluted that the ones proposed there) or you
>>>       
> violate
>   
>>> atomicity.
>>>   
>>>       
>> We already agreed prior to the last f2f (in telecons) and at the last 
>> f2f (during the meeting) that the state table is not referring to just
>>     
>  
>   
>> one participant.
>>  
>>     
>>> Receiving a 'Prepared' message doesn't move the state to
>>>       
> PreparedSuccess
>   
>>> - that's done by "Commit Decision", and until then 'Replay' would
>>>       
> cause
>   
>>> an abort. You could define "Commit Decision" as meaning "receipt of
>>>       
> ok
>   
>>> vote for just this one participant", and take the state for this
>>> participant to PreparedSuccess. But the only way to leave
>>> PreparedSuccess is from "WriteDone" or "WriteFailed". Since a
>>>       
> 'Aborted'
>   
>>> from another participant should certainly cause this participant to
>>>       
> be
>   
>>> rolled back, that 'Aborted' will have to trigger "WriteFailed", which
>>>       
> is
>   
>>> not an obvious interpretation.
>>>  
>>>  
>>> But I think this issue, with 053 (eliminate Replay) is more about
>>> whether Replay need ever force an abort. We may be looking at a
>>> carry-over from connection-centric protocols, where it made sense to
>>> force an abort if the connection broke before commit-time. In those
>>> worlds (more or less all transaction protocols that weren't using xml
>>> and/or web-services, I think), receipt of a recovery message before
>>>       
> the
>   
>>> connection was observed to break could only mean the connection break
>>> was about to happen. But with WS-AT (especially because we have said
>>>       
> all
>   
>>> messages go on the underlying request) there is no connection to be
>>> monitored anyway. The coordinator hasn't noticed that participant was
>>> out of communication for a while, and now the participant says it is
>>> ready for the commit. Why *require* the coordinator to abort ?
>>>   
>>>       
>> Agreed.
>>  
>>     
>>> Of course that's not to say the coordinator cannot *choose* to abort
>>>       
> by
>   
>>> implementation option if replay is received (or any other
>>>       
> circumstance
>   
>>> that leads the coordinator to suspect a failure somewhere). It can
>>> always do that if it hasn't progressed too far - it would appear in
>>>       
> the
>   
>>> tables as a User Rollback or Write Failed.   
>>>       
>> Yes, I'd like to see this as an implementation specific choice.
>>  
>> Mark.
>>  
>>     
>>> Peter
>>>  
>>> -----Original Message-----
>>> From: Ram Jeyaraman [mailto:Ram.Jeyaraman@microsoft.com] Sent: 06 May
>>>       
>  
>   
>>> 2006 02:09
>>> To: ws-tx@lists.oasis-open.org <mailto:ws-tx@lists.oasis-open.org>
>>> Subject: RE: [ws-tx] Issue 052 - WS-AT: Replay message generates
>>> protocol errors
>>> Section 10 (AT specification) states "These tables present the view
>>>       
> of a
>   
>>> coordinator or participant with respect to a single partner".  Thus,
>>>       
> the
>   
>>> coordinator states correspond to interactions with a single
>>>       
> participant.
>   
>>> The receipt of a participant vote "PreparedSuccess" triggers the
>>> coordinator state to "PreparedSuccess" with respect to that
>>>       
> particular
>   
>>> participant, even though the coordinator may not have completed the
>>> prepare phase for the rest of the participants.
>>>  
>>> Is it possible that the state table is likely being misinterpreted?
>>>  
>>> -----Original Message-----
>>> From: Ram Jeyaraman [mailto:Ram.Jeyaraman@microsoft.com]
>>> Sent: Thursday, April 06, 2006 10:50 AM
>>> To: ws-tx@lists.oasis-open.org <mailto:ws-tx@lists.oasis-open.org>
>>> Subject: [ws-tx] Issue 052 - WS-AT: Replay message generates protocol
>>> errors
>>> This is identified as WS-TX issue 052.
>>>  
>>> Please ensure follow-ups have a subject line starting "Issue 052 -
>>> WS-AT: Replay message generates protocol errors ".
>>>  
>>> -----Original Message-----
>>> From: Alastair Green [mailto:alastair.green@choreology.com]
>>> Sent: Wednesday, April 05, 2006 5:07 PM
>>> To: ws-tx@lists.oasis-open.org <mailto:ws-tx@lists.oasis-open.org>
>>> Subject: [ws-tx] New Issue: WS-AT: Replay message generates protocol
>>> errors
>>> Issue name -- WS-AT: Replay message generates protocol errors
>>>  
>>> PLEASE DO NOT REPLY TO THIS EMAIL OR START A DISCUSSISON THREAD UNTIL
>>> THE ISSUE IS ASSIGNED A NUMBER.
>>>  
>>> The issues coordinators will notify the list when that has occurred.
>>>  
>>> Target document and draft:
>>>  
>>> Protocol:  WS-AT
>>>  
>>> Artifact:  spec
>>>  
>>> Draft:
>>>  
>>> WS-AT CD 0.1 uploaded
>>>  
>>> Link to the document referenced:
>>>  
>>>  
>>>       
> http://www.oasis-open.org/apps/org/workgroup/ws-tx/download.php/17325/ws
>   
>>> tx-wsat-1.1-spec-cd-01.pdf
>>>  
>>> Section and PDF line number:
>>>  
>>> Coordinator View State Table, after l. 503
>>>  
>>>  
>>> Issue type:
>>>  
>>> Design
>>>  
>>>  
>>> Related issues:
>>>  
>>> New issue: WS-AT: Eliminate Replay message. New issue: WS-AT: Is 
>>> logging mandatory?
>>>  
>>>  
>>> Issue Description:
>>>  
>>> Replay reactions defined in current CV state table will cause
>>> unnecessary transaction aborts.
>>>  
>>>  
>>> Issue Details:
>>>  
>>> The cells in row (Inbound Messages) Replay, columns (States) Active 
>>> and Preparing read:
>>>  
>>> Active: Send Rollback --> Aborting
>>> Preparing: Send Rollback --> Aborting
>>>  
>>> Replay message means: "play it again Sam", not "demolish the piano".
>>>  
>>> Case A. If the last thing they sent was Prepared, and it got through 
>>> (we're Preparing and we've recorded their vote), and they've 
>>> recovered, and they're waiting for a Commit or a Rollback, then we 
>>> need to Ignore the Replay (just like if they send it when we've done 
>>> our own housekeeping, and moved to Prepared Success).
>>>  
>>> Case B. If the message didn't get through, and we've processed User 
>>> Commit then we could be in the Preparing state, but have no record of
>>>       
>  
>   
>>> their vote. In that case we'd have to replay Prepare to indicate to 
>>> them, send us your vote again.
>>>  
>>> Case C. If the last thing we received was Register, and we haven't 
>>> processed User Commit, then we're still Active and they won't have 
>>> logged. Replay won't happen on crash recovery (no log record to 
>>> recover off), but it could be used to say to the coordinator "Are you
>>>       
>  
>   
>>> still there? Should I crap out?" (i.e., because of impatience). We 
>>> can't stop them using Replay in that fashion. Our only sensible 
>>> response would have
>>>  
>>> to be: silence (we don't have a blank ack to a ping), i.e. to Ignore.
>>>       
>  
>   
>>> There is no harm in them doing this, even though it is pointless. You
>>>       
>  
>   
>>> could argue that this should be a N/A but that seems heavy-handed.
>>>  
>>>  
>>> Proposed Resolution:
>>>  
>>> As the state tables do not differentiate between Preparing/no vote 
>>> recorded and Preparing/vote recorded, it seems easiest to always 
>>> resend Prepare in the Preparing state. Therefore:
>>>  
>>> Replace the current text in the cells in row (Inbound Messages) 
>>> Replay, columns (States) Active and Preparing with:
>>>  
>>> Active: Ignore --> Active
>>> Preparing: Resend Prepare --> Preparing
>>>  
>>>  
>>>   
>>>       
>  
>
References:
- RE: [ws-tx] Issue 052 - WS-AT: Replay message generates protocol errors
  - From: "Ram Jeyaraman" <Ram.Jeyaraman@microsoft.com>