PolarStreams compared to NATS JetStream #104
-
@pavelnikolov nice to meet you! Thanks for posting the question here. I'm familiar with NATS core and I think it's a great lightweight solution for at-most-once delivery use cases. I have no experience with NATS JetStream, so my response comes from what I was able to test locally today and read in their docs. These are the main differences I can see between PolarStreams and NATS JetStream:

**Ordering**

In JetStream, messages are ordered per 'publisher' (producer instance); with multiple producers there's no way to guarantee ordering. Having multiple producer instances is fairly common, i.e. multiple service instances producing messages. The following use case comes to mind: Account A is created and the event is produced by service instance 1; after some time (seconds, minutes) Account A is deleted and the event is produced by service instance 2. Is there any guarantee that the deletion event will be received by consumers after (and only after) the creation event? Similar to Kafka, PolarStreams guarantees strict ordering of events within the same partition key.

**Performance & Resource Usage**

NATS Core has nice performance for non-durable events. On the other hand, JetStream throughput for durable events (replication=3) doesn't seem very good, even with more hardware resources than PolarStreams. Additionally, the NATS server uses resources in proportion to the load, so you would have to model the hardware requirements (memory assigned to each pod) based on the load you expect.
PolarStreams uses a bounded amount of memory per topic and consumer group, allocating buffers in advance and reusing them. This simplifies capacity planning greatly: no matter the number of producers and consumers, or the load, we will get the same memory consumption. Furthermore, NATS JetStream uses regular buffered I/O, which makes it a "bad Kubernetes neighbour" as it will pollute the page cache. The Linux page cache is a shared resource in K8s at the node level. Using the page cache extensively also makes resource/capacity planning very hard, as the page data is included in the Working Set Size but we can't control it. PolarStreams uses Direct I/O and a series of techniques that make it lightweight and fast.

**API**

JetStream was created as a persistence layer on top of NATS, which can be a good way to leverage the existing NATS ecosystem, but it's not realistic to say that things that work on NATS (with at-most-once guarantees) will continue working with JetStream (at-least-once), especially considering multiple producers and consumers. As far as I can tell, there are several knobs and settings we would have to touch in different parts to make it work as we want (e.g. "use pull if you want to scale consumers"), making it very easy for new users to shoot themselves in the foot by applying a different pattern (and thinking they are getting a guarantee that in production they are not). In contrast, PolarStreams provides a REST API where producing is a simple call (the user can be sure the message is durably stored) and consuming only requires setting the "group" the consumer belongs to. I personally found some NATS concepts hard to grasp ("source, subjects, streams, consumers, publishers, ...") as opposed to Kafka/PolarStreams' "topics, producers and consumers", but maybe that's just me :)
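To illustrate the kind of API surface described above, here is a minimal Go sketch of producing and polling over HTTP. The host, ports, endpoint paths and query parameters are hypothetical placeholders, not the actual PolarStreams routes (those should be taken from its documentation); the point is only that a produce is a single HTTP call acknowledged after durable storage, and that a consumer only needs to state its group.

```go
package main

import (
	"bytes"
	"fmt"
	"log"
	"net/http"
)

func main() {
	// Hypothetical producing endpoint: one HTTP call per event, and the
	// response arrives once the message has been durably stored. The path,
	// port and "partitionKey" parameter are placeholders.
	body := bytes.NewBufferString(`{"accountId": "A", "event": "created"}`)
	resp, err := http.Post(
		"http://polar.example:9251/v1/topic/accounts/messages?partitionKey=A",
		"application/json", body)
	if err != nil {
		log.Fatal(err)
	}
	defer resp.Body.Close()
	fmt.Println("produce status:", resp.Status)

	// Hypothetical consuming endpoint: the only required piece of state is
	// the consumer group name; polling returns the next batch for that group.
	resp, err = http.Get("http://polar.example:9252/v1/consumer/poll?group=billing")
	if err != nil {
		log.Fatal(err)
	}
	defer resp.Body.Close()
	fmt.Println("poll status:", resp.Status)
}
```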
-
Thanks @pavelnikolov for posing the question. @jorgebay I am looking forward to learning more about this project, but wanted to address a few of the points you made about JetStream. As a general preface, when people compare a Kafka-like system to NATS it is very common that some of the concepts don't translate. The two have fundamentally different origin stories, so that is to be expected, and the design decisions within JetStream are not attempting to copy those of Kafka.

**Ordering**
As you noted, when multiple clients publish messages that are received and written to the same stream, the publishes are concurrent and, by default, the order is dictated by how the server serializes them. This is true for any concurrent-writer situation unless you declare an expectation of ordering. In NATS, when a stream is created, it logically acts as a service that binds one or more subjects. At publish time, there are opt-in optimistic-concurrency control options you can provide in the form of headers on the published message.
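As far as I know, the relevant headers are `Nats-Msg-Id`, `Nats-Expected-Stream`, `Nats-Expected-Last-Sequence`, `Nats-Expected-Last-Subject-Sequence` and `Nats-Expected-Last-Msg-Id`. Below is a minimal sketch, assuming the nats.go client, of creating a stream bound to a set of subjects and publishing with a per-subject sequence expectation; the stream name and subjects are made up for the example.

```go
package main

import (
	"log"

	"github.com/nats-io/nats.go"
)

func main() {
	nc, err := nats.Connect(nats.DefaultURL)
	if err != nil {
		log.Fatal(err)
	}
	defer nc.Close()

	js, err := nc.JetStream()
	if err != nil {
		log.Fatal(err)
	}

	// A stream logically binds one or more subjects; name and subjects here
	// are illustrative.
	if _, err := js.AddStream(&nats.StreamConfig{
		Name:     "ACCOUNTS",
		Subjects: []string{"accounts.>"},
		Replicas: 3,
	}); err != nil {
		log.Fatal(err)
	}

	// Plain publish: the returned ack carries the sequence the server assigned.
	ack, err := js.Publish("accounts.A", []byte(`{"event":"created"}`))
	if err != nil {
		log.Fatal(err)
	}

	// Opt-in OCC: reject the publish unless the last sequence seen for this
	// subject is still the one we expect (this sets the
	// Nats-Expected-Last-Subject-Sequence header under the hood).
	_, err = js.Publish("accounts.A", []byte(`{"event":"deleted"}`),
		nats.ExpectLastSequencePerSubject(ack.Sequence))
	if err != nil {
		log.Fatal(err) // fails if another publisher got in between
	}
}
```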
This means that a client publishing to that stream can include one of those headers to ensure the stream or subject sequence has not changed in the meantime. In general, the subject-level sequence check is preferred, since it provides more granular OCC control. But this is how you can solve the concurrent publish-time ordering concern.

One additional contrasting point is that with a NATS stream there are no partitions by default; everything is multiplexed onto the same stream. Any number of consumers (single workers or queue groups) can be created without any constraints based on the number of partitions (since there aren't any). There is a pattern for introducing deterministic partitioning using subject mapping, which, again, is opt-in for those that need it.

**Performance & Resource Usage**

These are good call-outs and favorable design decisions that Barco has optimized for up front. With JetStream, the extreme performance/scale that Kafka/Redpanda/Barco can achieve today, given the use cases they focus on, has not been as high a priority. NATS is used for a spectrum of use cases which the Kafka-like systems are not well suited for. NATS is much more focused on being a connective technology, which manifests in properties like being able to be deployed on edge devices and connected into a supercluster that can span the globe with full location transparency, etc.

That said, we do often get compared to Kafka and/or asked if we can replace a Kafka setup, since NATS was one of the early projects that provided a self-contained, zero-dependency binary that can be deployed anywhere. For use cases that need persistence but not extreme scale-out (today), we are a great solution, and it drops a huge dependency and the cost of additional infrastructure, especially for teams already using core NATS for messaging. Likewise, we have a key-value layer built on top of the stream layer which folks are adopting as a lightweight alternative to basic Redis KV.

Recent NATS versions are capable of reaching around 300k messages per second of throughput for a single stream with a replication factor of 3. This would be comparable to a single Kafka partition, since both are totally ordered. I don't want to get into the weeds of benchmarks since that is a nuanced topic, but I'm calling it out since the SO post is quite old in terms of how quickly JetStream has been evolving.

Regarding k8s, NATS was not initially optimized/focused on k8s, since the server was written over a decade ago. That said, we have many users deploying NATS into k8s environments, and quite a bit of work has gone into tuning and documenting the necessary resource limits and Go runtime hints (e.g. GOMEMLIMIT) to optimize performance in this environment. But the points you call out are useful for further optimization. The bottom line is that each system makes different trade-offs, and it boils down to whether a technology fits the needs of the user/use case. NATS has a very feature-rich CLI that makes it straightforward to test virtually every feature of the server as well as benchmark it.

**API**
I would summarize this comment as "the API may not be as intuitive as it could be", which is a fair criticism. The NATS team is aware that some things can be simplified, the defaults can be improved, and less client-side magic can occur. However, this learning curve is fairly short-lived, and once people start building for production, the set of knobs can be useful. Virtually no configuration needs to be set by default for streams or consumers, so it is all opt-in. The feature set has been driven largely by user requests to satisfy certain use cases.
Calling out a REST API vs the NATS protocol is not a useful comparison here, and guarantees around the durability of a stored message are completely orthogonal to the protocol. A client publishing a message to a stream receives an acknowledgement from the server once the message is persisted.
Without looking at the Barco consumer API in detail, creating consumers with options to control replicas, inactivity thresholds, the persistence medium, etc., is very straightforward with the client SDKs. As noted above, we are aware of further simplifications for pure beginners, but it is not a huge hurdle once you get the concepts.
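As a rough sketch of what that looks like with the nats.go client: the stream name, durable name and specific values below are illustrative, and the exact set of available fields depends on the server and client versions.

```go
package main

import (
	"log"
	"time"

	"github.com/nats-io/nats.go"
)

func main() {
	nc, err := nats.Connect(nats.DefaultURL)
	if err != nil {
		log.Fatal(err)
	}
	defer nc.Close()

	js, err := nc.JetStream()
	if err != nil {
		log.Fatal(err)
	}

	// Illustrative durable consumer on the ACCOUNTS stream. Everything beyond
	// the name is opt-in tuning.
	_, err = js.AddConsumer("ACCOUNTS", &nats.ConsumerConfig{
		Durable:           "billing",
		FilterSubject:     "accounts.>",           // only receive matching subjects
		AckPolicy:         nats.AckExplicitPolicy, // require an explicit ack per message
		Replicas:          3,                      // replication of consumer state
		InactiveThreshold: 24 * time.Hour,         // clean up if unused for a day
		MemoryStorage:     false,                  // keep consumer state on disk
	})
	if err != nil {
		log.Fatal(err)
	}
}
```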
A natural bias when you are familiar/comfortable with one technology vs. another 😄
-
This is not quite right. Messages can be published from any number of publishers and received by a stream in the server. By default, the messages are ordered as they are received, but as noted, OCC can be applied to ensure that the stream-level or subject-level (per key) sequence has the expected order.

On the consumption side, by default a consumer can be created on a stream and will receive all messages in order. This can be for all messages in a stream or for a subset using a subject filter. Given either one of these consumers, if you pull messages (fetching batches at a time), you effectively have control over the number of messages in flight for that consumer that you are responsible for consuming and acking. While processing the batch in order, you ack each message, and if an ack fails, you can either retry or bail out of consuming (unsubscribe). On resubscribe, the consumer will simply restart from the last unack'ed message and deliver in order from there (effectively a rewind). There is also a consumer setting called "max-ack-pending"; set it to one and only one message for that consumer will be in flight at any given time.

Often, when people get surprised by out-of-order messages, it is because they are using a push consumer (where the server proactively pushes messages into the client's buffer) and they forget to ack messages. But as I said above, with a pull consumer it is easier to control the flow and get the desired behavior.
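A minimal sketch of that pull flow, assuming the nats.go client and the illustrative stream and durable names from the earlier example; the subject filter, batch size and timeout are made up.

```go
package main

import (
	"fmt"
	"log"
	"time"

	"github.com/nats-io/nats.go"
)

// process stands in for the application's handling of one message body.
func process(data []byte) error {
	fmt.Printf("handled: %s\n", data)
	return nil
}

func main() {
	nc, err := nats.Connect(nats.DefaultURL)
	if err != nil {
		log.Fatal(err)
	}
	defer nc.Close()

	js, err := nc.JetStream()
	if err != nil {
		log.Fatal(err)
	}

	// Pull consumer bound to the durable; with pull, the client decides how
	// many messages to have in flight by choosing the batch size. To force
	// strictly one in-flight message, the consumer could instead be created
	// with nats.MaxAckPending(1).
	sub, err := js.PullSubscribe("accounts.>", "billing")
	if err != nil {
		log.Fatal(err)
	}

	for {
		// Fetch a batch; this times out if nothing is pending.
		msgs, err := sub.Fetch(10, nats.MaxWait(5*time.Second))
		if err != nil {
			continue
		}
		for _, msg := range msgs {
			if err := process(msg.Data); err != nil {
				// Leave the message unacked and stop: it will be redelivered,
				// so consumption effectively rewinds to this point.
				log.Println("processing failed, will retry:", err)
				break
			}
			msg.Ack()
		}
	}
}
```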
Just to reinforce: there are no separate message sequences. All messages across the subjects bound to a stream are written in order in the persistence layer and sequenced together (1, 2, 3, etc.). So from the consumer side, it is just one stream where each message could have a different concrete subject, but they have a total order.
Thanks! Yes, it is a nice project. We are always learning and improving and looking forward to seeing what we can learn from Barco.
-
This is an awesome project. I heard about it during KCD Spain - thank you for the awesome presentation!
To me it seems that PolarStreams is designed to achieve the same goal as NATS JetStream. I've been using event-based microservices for a while and I'm really interested in the differences between the two projects. It is true that NATS was designed with pub/sub in mind, however JetStream is included in the same binary and enabled with the `-js` command line flag (e.g. `nats-server -js`). NATS JetStream allows storing event streams on disk. It is already distributed, requires no brokers, is extremely lightweight, easy to deploy to Kubernetes, multi-tenanted, and supports security using TLS and JWT tokens. I'm not familiar with PolarStreams, but NATS JetStream seems to cover all the event streaming needs. I would like to know why I would choose PolarStreams instead of NATS JetStream.

EDIT (jorge): edited to reflect the name change from Barco to PolarStreams.