cover the allowed call sequences on Subscriber in a spec rule #202

rkuhn · 2015-01-16T07:51:21Z

Currently it is only implicit that legal sequences start with onError or onSubscribe and end with onComplete or onError (or not at all), and it is not regulated within the spec rules that onComplete is illegal before onSubscribe.

I see two ways to approach this:

we leave the protocol sequence definition outside of the spec and add the missing rule about onComplete
we make the sequence definition explicit within the rules

The second approach would lead to duplication, since legal sequence grammars are almost perfectly constrained already, so I am leaning towards adding a new rule that orders onComplete and onSubscribe.

The text was updated successfully, but these errors were encountered:

drewhk · 2015-01-16T08:12:46Z

onComplete is currently legal before onSubscribe. I don't know how much you would gain by ordering them, given that onError can come before onSubscribe anyway. In my experience the only real help to implementors would be if onSubscribe is always the first message, even if it is immediately followed by onComplete or onError. Less state variations to maintain. I am not sure we want that though.

rkuhn · 2015-01-16T08:25:17Z

According to the docs the legal status of “onComplete first” is at least dubious, and #199 introduces a test that fails in this case, so we should definitely fix this either way.

ktoso · 2015-01-16T09:08:38Z

The spec (the rules) does not mandate if onComplete can come first or not currently, as mentioned in #199 and for more details see this comment

It is a bit accidental that current behaviour is "onComplete can not come first", as such is the diagram in section https://github.com/reactive-streams/reactive-streams#api-components which isn't really a spec rule.

drewhk · 2015-01-16T09:51:53Z

In the Akka impl we definitely treated onComplete as valid response instead of onSubscribe, in fact we have testkit methods like: expectCompletedOrSubscriptionFollowedByComplete()

drewhk · 2015-01-16T09:56:24Z

So the current language accepted by Akka Subscribers is:

(onSubscribe ~ (onNext)*)? ~ (onError | onComplete)

(Kleene star here allows infinite words)

viktorklang · 2015-01-16T10:53:33Z

@drewhk onError definitely needs to be able to be sent before onSubscribe if the Publisher is unable to, for some reason, create a Subscription.

drewhk · 2015-01-16T10:55:12Z

Well, you can always send a dummy subscription, but in general I agree. I don't see though then why cannot we send an onComplete instead of an onSubscribe. Since we need to handle the onError case the onComplete case is not that much to add (from an implementors perspective).

viktorklang · 2015-01-16T10:55:13Z

@rkuhn I agree with "so I am leaning towards adding a new rule that orders onComplete and onSubscribe.".

The question is if the symmetry of being able to send either onError or onComplete before onSubscribe has merit and should be instated (I'd think not, since it is ultimately racy for hot publishers)

drewhk · 2015-01-16T10:56:55Z

@viktorklang so you propose the following language?

(onSubscribe ~ (onNext)* ~ (onError | onComplete)) | onError

viktorklang · 2015-01-16T10:57:27Z

@drewhk Since it is verboten by the spec to call any methods on the Subscription from within onError or onComplete there should not be any technical issues with allowing both onError and onComplete without a preceeding onSubscribe.

Current semantics as present in the README.md: onError | (onSubscribe onNext* (onError | onComplete)?)

viktorklang · 2015-01-16T10:58:35Z

@reactive-streams/contributors Thoughts?

drewhk · 2015-01-16T11:00:39Z

My point (experienced in Akka Streams) is that if you have to handle onError without onSubscribe then it is not that hard to add onComplete handling there, too. I personally prefer what we use now:

(onSubscribe ~ (onNext)*)? ~ (onError | onComplete)

Not because I don't want to rewrite it, but because it makes sense. The simplest implementation wise of course would be:

onSubscribe ~ (onNext)* ~ (onError | onComplete)

but that would mean that the Publisher must send dummy Subscriptions even when it knows that it is already empty or in error state.

Edited

viktorklang · 2015-01-16T11:03:28Z

@drewhk Given 2.3 (Subscriber.onComplete() and Subscriber.onError(Throwable t) MUST NOT call any methods on the Subscription or the Publisher) I think changing to (onSubscribe ~ (onNext)*)? ~ (onError | onComplete) would be fine, but it IS a last-minute, non-trivial spec change for what we know right now so I definitely think we need to make sure we get a majority vote if we want/need to change it.

drewhk · 2015-01-16T11:08:52Z

What rule does exclude now sending an onComplete instead of a Subscription? Because if there is such rule I am pretty sure we have some impls that violate it. i.e. I am not sure if that rule is tested then by the TCK, or we had classes that has been not verified.

viktorklang · 2015-01-16T11:22:00Z

@drewhk This Issue is about making that clearer in the spec, and then of course make sure that the TCK properly verifies it, then we need to make sure that all Akka impls are properly TCKd :)

drewhk · 2015-01-16T11:25:50Z

Maybe it helps, I created a regex like (non-complete) spec of the language that can be seen by a man-in-the-middle observer that orders concurrent events arbitrarily:

// ** means infinte long string allowed
// * means normal Kleene star

subscribe ~ earlyTermination | activeSubscription
earlyTermination := (onComplete | onError) // or onError only?
activeSubscription := (onSubscribe ~ conversation ~ (cancellation | termination))

conversation := (request* ~ onNext*)** 
cancellation := cancel ~ producerRunOver
termination := (onComplete | onError) ~ consumerRunOver

// due to concurrency some stray messages are allowed
producerRunOver := onNext* ~ (onComplete | onError)?
consumerRunOver := request*

// unfolded:
subscribe ~ (onComplete | onError) | (onSubscribe ~ (request* ~ onNext*)** ~ ((cancel ~ onNext* ~ (onComplete | onError)?) | ((onComplete | onError) ~ request*)))

(Maybe it would be more digestable using a drawing. Beware of bugs.)

viktorklang · 2015-01-28T00:29:52Z

Status here?

viktorklang · 2015-02-04T14:17:54Z

Ping @rkuhn,
if we want to address this before 1.0.0.final we need to get it into the next RC

benjchristensen · 2015-02-04T17:40:24Z

I think I'm okay with onComplete being called immediately. If the data source is empty is there a reason to require this path -> onSubscribe/request/onComplete as opposed to just called onComplete directly the same way onError can be?

In other words, a Publisher should be able to call onSubscribe, onError or onComplete, but can only call onNext after onSubscribe/request.

viktorklang · 2015-02-05T15:21:12Z

@benjchristensen I think that is right. Given 2.3:

Subscriber.onComplete() and Subscriber.onError(Throwable t) MUST NOT call any methods on the Subscription or the Publisher.

So it doesn't matter if onSubscribe has been called before onError or onComplete since they shouldn't mess around with the Subscription anyway.

The question though is how we make it clear in the spec that it is:

(onSubscribe ~ onNext*)? ~ (onError | onComplete)

@rkuhn & @drewhk @DougLea Wdyt?

DougLea · 2015-02-05T16:18:21Z

My initial reading of this (and how I implemented) is that a subscription cannot be "complete" if it never began, so I force onSubscribe before onComplete when publisher is already closed. I still think this is a good policy, but I now don't see any wording forcing this.

viktorklang · 2015-02-05T16:37:36Z

@DougLea Your intuition is right. The existing intent is to only allow onComplete after an onSubscribe.

So there are 2 questions here:

If we keep the existing intent, how do we change the spec to make that clear
If we should change so that onComplete is symmetric to onError in that it can be sent without an onSubscribe, how do we change the spec to make this clear

I'm all ears on solutions to this :)

drewhk · 2015-02-05T16:45:59Z

I prefer onComplete and onError to be symmetric, they can either come without onSubscribe, or they must come after onSubscribe. (onError can be made to always come after onSubscribe, since it is always possible to send a dummy subscription)

benjchristensen · 2015-02-06T07:14:40Z

I prefer onComplete and onError to be symmetric,

I like this, and prefer not having to send dummy subscriptions.

viktorklang · 2015-02-06T10:12:32Z

@benjchristensen Great, so you're in the (onSubscribe ~ onNext*)? ~ (onError | onComplete)-camp with me then :)

rkuhn · 2015-02-06T10:53:33Z

Count me in that camp as well—with the small pedantic fix of adding a final ? because otherwise never-ending Publishers would strictly speaking not match the grammar that is expressed (not that it makes a huge difference).

viktorklang · 2015-02-06T10:59:01Z

@rkuhn Never-ending Publishers would still send an onSubscribe, though, because otherwise there is no association happening?

viktorklang · 2015-02-06T13:42:32Z

@DougLea I can understand this point (always requiring onSubscribe) and the cost of propagating a "dummy" subscription is small indeed. Perhaps this is a case where simplicity should win over performance.

rkuhn · 2015-02-06T13:45:46Z

If we go down the route of the simplest possible grammar (as Doug proposes) then we should also include a DummySubscription in the reactive streams artifact because that will then be needed in many cases:

final public class DummySubscription implements Subscription {
  public static final Subscription instance = new DummySubscription;

  @Override public void request(long n) {
    if (n <= 0) throw new IllegalArgumentException("...");
  }
  @Override public void cancel() {}
}

viktorklang · 2015-02-06T13:47:12Z

if (n <= 0) throw new IllegalArgumentException("...");

is not legal though.

DougLea · 2015-02-06T13:47:36Z

@viktorklang I'd be surprised if there is even a performance advantage -- in the simpler version, most clients need fewer special-case checks that would only rarely trigger.

drewhk · 2015-02-06T13:50:50Z

I don't think I can add anything more to the discussion, so I summarize my opinion (a.k.a vote) and leave it to the others to decide:

I strongly prefer onError and onComplete to be symmetric
I slightly prefer onSubscribe required to be the first, but no strong preference here

rkuhn · 2015-02-06T13:52:03Z

d’oh, of course; signaling that exception correctly would mean allocating a DummySubscription for each such subscription :-(

drewhk · 2015-02-06T13:55:09Z

You don't need to signal anything, the relation is already "terminated" it is just being in a race with the faulty request, but you can always arbitrarily define the order and say that the onComplete or onError that was already scheduled "won" (by definition, not by reality).

rkuhn · 2015-02-06T13:59:28Z

good point, thanks; so the corrected code is

final public class AlreadyCompletedSubscription implements Subscription {
  public static final Subscription instance = new AlreadyCompletedSubscription;

  @Override public void request(long n) {}
  @Override public void cancel() {}
}

benjchristensen · 2015-02-06T18:22:00Z

we should also include a DummySubscription in the reactive streams artifact

I don't like adding things like this. I strongly prefer keeping it as just interface definitions.

prefer onError and onComplete to be symmetric

I also prefer this.

(onSubscribe ~ onNext*)? ~ (onError | onComplete)?

The simplicity of this for me is that onError/onComplete terminal events can be sent whenever, but onNext must always be preceded by onSubscribe since onNext must obey the request behavior of the Subscription.

benjchristensen · 2015-02-06T18:24:27Z

Is there a link that describes the protocol syntax/grammar so we can link to it like we do to https://www.ietf.org/rfc/rfc2119.txt in the README?

viktorklang · 2015-02-06T18:31:30Z

@benjchristensen

I, too, would like the simplicity along the lines of: (onSubscribe ~ onNext*)? ~ (onError | onComplete)?
To me the open question RE that definition is valid:

class P[T] extends Publisher[T] { override def subscribe(s: Subscriber[_ >: T]) = () }

(I'd guess we'd have to have a clause in the rules that'd prevent it from being legal)
And an open question is if it matters?

So, I guess my stance right now is:

I don't like the status quo, I think it should be symmetric w.r.t onError and onComplete. I'm OK with requiring to pass in a Subscription that is already cancelled into onSubscribe but it seems like a code smell to me so I tend to lean a bit towards making onComplete be signallable before onSubscribe.

I think I need to experiment with the example implementations to see what makes implementations more or less ugly.

benjchristensen · 2015-02-06T18:37:34Z

I don't think that should be prevented. In fact, RxJava legitimately has a never() factory method that creates an Observable that never does anything. It actually does have some usecases, such as doing nothing between user event sequences in a switchLatest that switches between streams.

Most of the time a stream that never does anything is undesirable, but it's not illegal. And practically what's the difference between a stream that never emits and one that will emit in 4 hours if the consumer wanted it in 100ms? Async consumers needs to choose to protect themselves by stating their assumptions with timeouts and/or consumption limits (like take(n)) if they do not know what the source stream can provide to them.

viktorklang · 2015-02-06T18:45:19Z

@benjchristensen Now that is a compelling argument!

drewhk · 2015-02-06T18:58:40Z

Hm, I have to disagree here. I agree that a stream that never does anything is sometimes useful, but that has nothing to do with onSubscribe being required or not per se. As an analogy, a TCP connection is also completely fine doing nothing, but the three-way handshake is still required at the beginning. I don't say that sending onError/onComplete any time has no merit (this is what we implemented anyway, so it is even less work), all I want to say that you can have streams that do nothing while still establishing a proper subscribe-onSubscribe handshake. There is something satisfying about that a "never" like operation is not implicit but explicit by having a clear handshake that proves the linkage between the "never" element and its downstream. I don't have a stong opinion though.

viktorklang · 2015-02-06T19:08:19Z

Alright, so I think we all agree that the current asymmetric definition should be changed.

I'll try to find time to experiment with the impact of either of the suggestions (onSubscribe always && onComplete without preceding onSubscribe) on the example Publisher, I think that would probably convince myself what direction I'll vote.

viktorklang · 2015-02-06T19:12:44Z

So the choice is between:

(onSubscribe ~ onNext*)? ~ (onError | onComplete)?

and

onSubscribe ~ onNext* ~ (onError | onComplete)?

benjchristensen · 2015-02-06T19:50:48Z

all I want to say that you can have streams that do nothing while still establishing a proper subscribe-onSubscribe handshake

I'm okay with that. We just shouldn't declare it illegal to never emit onNext/onError/onComplete. So we would say that one MUST either emit onSubscribe or onError/onComplete.

rkuhn · 2015-02-06T20:11:42Z

Agreed on allowing “silent” Publishers; the remaining question then is whether onSubscribe should be mandatory or whether “one of onSubscribe/onError/onComplete” should be mandatory. While implementing the spec as well as while writing our own tests we encountered extra effort due to the uncertainty of what the first invocation will be, requiring an initial onSubscribe would make the logic more regular. Just as Endre I have a preference for the second choice presented by Viktor an hour ago (read: Endre’s argument convinced me).

benjchristensen · 2015-02-06T20:50:48Z

I like “one of onSubscribe/onError/onComplete should be mandatory” but both can work.

rkuhn · 2015-02-06T21:05:40Z

@tmontgomery are there other considerations that we have not yet included?

smaldini · 2015-02-08T14:41:37Z

Agree with Roland, implementations are slightly confused by this. Mandatory OnSubscribe is slightly more verbose but we can deal more efficiently with this by providing wrappers in Publisher factories.
+1 on mandatory onSubscribe, but I'm biased having experimenting a few issues with this.

Sent from my iPhone

On 6 Feb 2015, at 9:05 pm, Roland Kuhn notifications@github.com wrote:

@tmontgomery are there other considerations that we have not yet included?

—
Reply to this email directly or view it on GitHub.

benjchristensen · 2015-02-10T22:58:05Z

extra effort due to the uncertainty of what the first invocation will be, requiring an initial onSubscribe would make the logic more regular

I don't understand the extra effort for handling an onComplete terminal event as onError must already be handled.

viktorklang · 2015-02-11T10:37:55Z

My take on it:

Having onSusbcribe always come first is an invariant which is simpler to
encode as onComplete and onError will then only ever be emitted after it
(already having extra code to deal with 'early errors' is extra code). Any
publisher that isn't permanently failed or complete will have to pass down
Subscriptions of its own, so it is only an optimization to pass a
nop-subscription if already known to be failed/complete.

For the permanently failed and completed publishers the cost of the
overhead is more significant (at most 50%) but given up to hundreds of
millions of signals per second the impact will be hard to notice.

Spec-wise I found it simpler and more straightforward to amend in the
proposed direction, and (I suspect) it will be easier for implementers to
follow.

I think it will be an improvement over what's currently in master, and any
implementation bridge doing early onError or onComplete can always make
sure that a nop-subscription is passed to onSubscribe before issuing the
early onError/onComplete.
On 10 Feb 2015 23:58, "Ben Christensen" notifications@github.com wrote:

extra effort due to the uncertainty of what the first invocation will be,
requiring an initial onSubscribe would make the logic more regular

I don't understand the extra effort for handling an onComplete terminal
event as onError must already be handled.

—
Reply to this email directly or view it on GitHub
#202 (comment)
.

rkuhn added the bug label Jan 16, 2015

rkuhn added this to the 1.0.0.RC2 milestone Jan 16, 2015

rkuhn mentioned this issue Jan 16, 2015

TCK PublisherVerification does not support publishers of empty streams #198

Closed

viktorklang mentioned this issue Feb 10, 2015

Attempt to clarify the signalling sequence in the spec #212

Merged

viktorklang added enhancement and removed bug labels Feb 11, 2015

viktorklang self-assigned this Feb 11, 2015

viktorklang closed this as completed in #212 Feb 13, 2015

drewhk mentioned this issue May 19, 2015

Each incoming connection leaks one ActorInterpreter akka/akka#17494

Closed

kjkrum mentioned this issue May 1, 2017

What should a Publisher do if it detects a duplicate subscription? #364

Closed

cover the allowed call sequences on Subscriber in a spec rule #202

cover the allowed call sequences on Subscriber in a spec rule #202

Comments

rkuhn commented Jan 16, 2015

drewhk commented Jan 16, 2015

rkuhn commented Jan 16, 2015

ktoso commented Jan 16, 2015

drewhk commented Jan 16, 2015

drewhk commented Jan 16, 2015

viktorklang commented Jan 16, 2015

drewhk commented Jan 16, 2015

viktorklang commented Jan 16, 2015

drewhk commented Jan 16, 2015

viktorklang commented Jan 16, 2015

viktorklang commented Jan 16, 2015

drewhk commented Jan 16, 2015

viktorklang commented Jan 16, 2015

drewhk commented Jan 16, 2015

viktorklang commented Jan 16, 2015

drewhk commented Jan 16, 2015

viktorklang commented Jan 28, 2015

viktorklang commented Feb 4, 2015

benjchristensen commented Feb 4, 2015

viktorklang commented Feb 5, 2015

DougLea commented Feb 5, 2015

viktorklang commented Feb 5, 2015

drewhk commented Feb 5, 2015

benjchristensen commented Feb 6, 2015

viktorklang commented Feb 6, 2015

rkuhn commented Feb 6, 2015

viktorklang commented Feb 6, 2015

viktorklang commented Feb 6, 2015

rkuhn commented Feb 6, 2015

viktorklang commented Feb 6, 2015

DougLea commented Feb 6, 2015

drewhk commented Feb 6, 2015

rkuhn commented Feb 6, 2015

drewhk commented Feb 6, 2015

rkuhn commented Feb 6, 2015

benjchristensen commented Feb 6, 2015

benjchristensen commented Feb 6, 2015

viktorklang commented Feb 6, 2015

benjchristensen commented Feb 6, 2015

viktorklang commented Feb 6, 2015

drewhk commented Feb 6, 2015

viktorklang commented Feb 6, 2015

viktorklang commented Feb 6, 2015

benjchristensen commented Feb 6, 2015

rkuhn commented Feb 6, 2015

benjchristensen commented Feb 6, 2015

rkuhn commented Feb 6, 2015

smaldini commented Feb 8, 2015

benjchristensen commented Feb 10, 2015

viktorklang commented Feb 11, 2015