Skip to content

Commit

Permalink
iox-#2128 Draft design for named segments
Browse files Browse the repository at this point in the history
  • Loading branch information
Graham Palmer committed Jan 11, 2024
1 parent 8a5e083 commit 1e8a562
Showing 1 changed file with 228 additions and 0 deletions.
228 changes: 228 additions & 0 deletions doc/design/draft/named-segments.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,228 @@
# Named Segments

## Summary and problem description

Shared memory configuration, as handled by the RouDi config and described in detail in
[the configuration guide](../../website/advanced/configuration-guide.md) currently works by
declaring one or more **segments** backed each by one or more **memory pools**.

Segments currently support configuring access controls, allowing one to specify which processes are allowed to read from/write to them.
These access controls are also used to determine which segment a publisher may use to publish messages, and
from which segments a subscriber is allowed to receive messages. This coupling presents a number of implications:

* The name of shared memory segments created under `/dev/shm` is automatically deduced as the name of the
POSIX user group that has write access to it.
* The above means that there can only be one segment that any given group has write access to.
* In order to determine which segment a publisher shall publish to, the permissions of the containing process are
matched against the permissions of each segment to identify a *unique match*.
* It is therefore not possible to have a single publisher access multiple segments.
* It is also not possible to have multiple publishers in the same process which access different segments

Because of the last point, it is impractical to split topics across several segments - doing so requires publishers
of different topics to be run in processes under different users with different group membership. Instead, applications
must rely on configuring differently sized memory pools to support topic data of different sizes.
This has a number of drawbacks, including:

* Processes communicating over otherwise independent topics share the same memory pool.
If any one process loans too many chunks, or fails to release chunks, it can cause starvation of ALL processes
relying on this memory pool.
* Topic data allocation is assigned to memory pools via "bucketing" - or finding the memory pool with the smallest
chunk size to support the loan request. This means that even if one attempts to cater a memory pool to a specific topic,
any other topics with the same size requirement will be forced to use the same memory pool.
* Faulty publishers which e.g. write past the end of their loaned message may cause faults in any process communicating
over the same shared memory segment. These faults will be difficult to trace, especially because the faulty publisher
may never write past the end of the mapped shared memory segment, and therefore may never trigger a segfault.

The above points demonstrate how the current design violates the principal of [Freedom From Interference](https://heicon-ulm.de/en/iso26262-freedom-from-interference-what-is-that/).

### Proposal

In order to overcome the above limitations, shared memory segments shall be mapped by
name instead of by access control. This will increase flexibility while still granting all the benefits of the current access control model.

## Requirements

* The RouDi configuration must support specifying names for shared memory segments.
* If a name is not configured, the fallback behavior must be the same as it is currently - assigning the name of the current process user.
* Communication endpoints should be able to specify which segments they intend to use.
* Publishers may specify a single segment into which they will publish.
* Subscribers may specify one or more segments from which they may receive messages.
* Clients and Servers may specify a single segment into which they send requests/responses, and multiple segments from which they receive responses/requests.
* When no segment is specified, the fallback behavior must be the same as it is currently
* Publishers will select the *unique* segment they have write access to.
* Subscribers will receive messages from all segments they have read access to.
* Clients and Servers follow the same rules as Publishers for data they send and the same rules as Subscribers for data they receive.
* Creation will fail if a communication endpoint requests to use a segment to which it does not
have the proper access rights.

### Optional additional requirements

* When initializing the runtime, a list of segment names may be provided. In this case, rather than mapping all shared memory segments available in the system, only those that are named shall be mapped.
* When a communication endpoint requests a segment that has not been mapped, this may be configured to either result in a failure or in mapping the requested segment on the fly.

## Design

### Updated RouDi Config

Supporting named segments will require add an additional `name` field to the RouDi config under the `[[segment]]` heading, as well as updating the config version Example:

```
[general]
version = 2
[[segment]]
name = "foo"
[[segment.mempool]]
size = 32
count = 10000
```

Updating the version will be necessary for older versions of RouDi to fail informatively when presented with a newer config. Otherwise RouDi can support version 1 configs as if they are version 2 configs with no `name` fields specified.

### Requesting Segments

#### When creating a Publisher

A new field will be added to the [PublisherOptions struct](../../../iceoryx_posh/include/iceoryx_posh/popo/publisher_options.hpp) as follows:

```
struct PublisherOptions
{
...
ShmName_t segmentName{""};
```

#### When creating a Subscriber

A similar field will be added to the [SubscriberOptions struct](../../../iceoryx_posh/include/iceoryx_posh/popo/subscriber_options.hpp), except that it will support multiple elements

```
struct SubscriberOptions
{
...
vector<ShmName_t, MAX_SUBSCRIBER_SEGMENTS> segmentNames{};
```

`MAX_SUBSCRIBER_SEGMENTS` can be set to some reasonably small value to start with since explicitly allowing many but not all read-access segments is an unlikely use case.

If this assumption turns out to be wrong however, we can always update it to be `MAX_SHM_SEGMENTS` instead.

#### When creating Clients and Servers

Clients and Servers will have similar fields, distinguished by request/response:

```
struct ClientOptions
{
...
ShmName_t requestSegmentName{""};
vector<ShmName_t, MAX_RESPONSE_SEGMENTS> responseSegmentNames{};
```

```
struct ServerOptions
{
...
vector<ShmName_t, MAX_REQUEST_SEGMENTS> requestSegmentNames{};
ShmName_t responseSegmentName{""};
```

#### When initializing the runtime

We may wish to also additionally support requesting segments upon runtime initialization. The motivation here is that if we know we will only need to use certain segments, we can avoid expending unecessary resources mapping the ones we will not use. Additionally, this can be used as an error mitigation technique to prevent communication endpoints from requesting an unsupported segment.

Currently initialization of the runtime happens in the constructor, and any errors that occur result in a contract violation and program termination.

While it is not directly in scope of this design, it would be beneficial to refactor the Runtime implementations to use the builder pattern such that any errors may be handled through returning errors types.

Tying this together with requesting segments, this could look something like:
```
class DefaultRuntimeBuilder
{
...
IOX_BUILDER_PARAMETER(vector<ShmName_t>, shmSegmentNames, {});
IOX_BUILDER_PARAMETER(bool, allowUnmappedSegments, false);
...
public:
expected<DefaultRuntime, RuntimeBuilderError> create() noexcept;
```

The `create` method of this builder would then return a custom error if a
segment was requested that either does not exist or the current process does not have access to.

Additionally, when a publisher or subscriber is created, the `allowUnmappedSegments` would determine whether or not we fail or try to map the newly requested segment on the fly.

### Segment Matching Algorithm

Ensuring backwards compatibility means carefully crafting how we register segments. The following pseudocode demonstrates how this should work in different scenarios

#### Endpoints requesting write access (Publisher, Client Request, Server Response)

In RouDi, given:
* `userGroups` - A list of POSIX user groups a publishing process belongs to called
* `segmentName` - The (possibly empty) name of a shared memory segment specified in the publisher options call
* `mappedSegments` - A list of shared memory segments

1. If `segmentName` is not empty
1. Find the segment in `mappedSegments` matching the `segmentName`
2. If the containing process has write access to the segment via one of its `userGroups`, return the corresponding segment information
3. Otherwise return an error indicating the publisher does not have write access to the requested segment
2. If it is empty:
1. Iterate over all process `userGroups`
2. Determine if the user group name matches the name of one of the segments
3. If it does:
1. If a match has already been previously made, return an error indicating that the publisher must only have write access to one segment
2. If not, verify the process has access to the segment and record the match
4. At the end of iteration, if a matching segment has been found, return the segment information
5. Otherwise return an error indicating that no matching segment has been found

#### Endpoints requesting read access (Subscriber, Client Response, Server Request)

In RouDi, when handling a port request for a new subscriber (and WLOG for clients and servers), given:
* PublisherOptions for a single publisher publishing on the requested topic
* SubscriberOptions for the requested subscriber

We determine if the endpoints are compatible as follows (and we repeat this for each publisher if there are several):
1. To determine if the subscriber and publisher segments match:
1. If the list of segment names provided by the subscriber is non-empty, check if any name matches the name provided by the publisher (an empty name is a non-match)
2. If the list of segment names provided by the subscriber is empty, then it is always a match
2. Determine if blocking policies are compatible
3. Determine if history request is compatible
4. Return true if all cases are met

If no publisher is compatible with a subscriber, then RouDi will refuse to provide a port, as is the current behavior when subscribers have incompatible blocking policies.

#### Runtime requesting segments to map

In the client process, while initializing the runtime, given:
* `userGroups` - A list of POSIX user groups a publishing process belongs to called
* `segmentContainer` - A list of shared memory segment information
* `segmentFilter` - A (possibly empty) list of segment names to filter against

In order to determine which segments to map:

1. If `segmentFilter` is empty then
1. Determine which segments the process has access rights (read or write) to.
2. Map all of them.
2. If `segmentFilter` is not empty
1. Iterate over each name in the filter
2. If there is a segment that matches the name in the filter
1. If the process has access rights to that segment, map it
2. If not, return an error indicating that a segment was requested the runtime does not have access to.
3. If there is no segment that matches the name in the filter, return an error indicating that there is no segment matching the one requested to be mapped.

## Development Roadmap

- [ ] Extend the RouDi config to support specifying segment names
- [ ] Add the name to the `MePooSegment` data structure and populate it based on the RouDi config
- [ ] Update communication endpoint options structs to include segment names
- [ ] Update the publisher, client request, and server response segment selection logic to take the segment name into account
- [ ] Update the subscriber, client response, and server request compatibility check to check for a segment name match

### Optional

- [ ] Refactor runtime initialization to use builder pattern
- [ ] Add segment filter and apply it to segment mapping during runtime initialization.
- [ ] Add flag to specify whether endpoints requesting non-mapped segments should fail or whether segments should be created dynamically

0 comments on commit 1e8a562

Please sign in to comment.