The network operator consists of a controller loop and a rendering system. The basic flow is:
- The controller loop detects a configuration change.
- The configuration is preprocessed:
  - validity is checked.
  - unspecified values are defaulted or carried forward from the previous configuration.
  - safety of the proposed change is checked.
- The configuration is rendered into a set of Kubernetes objects (e.g. DaemonSets).
- The desired objects are reconciled against the API server, being created or updated as necessary.
- The applied configuration is stored separately, for later comparison.
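The sketch below restates that flow in Go. It is purely illustrative: the type names and helper functions are stand-ins, not the operator's actual code.

```go
// Illustrative sketch of the controller flow; none of these names are the
// operator's real API.
package sketch

import "fmt"

// NetworkConfig stands in for the operator's network configuration type.
// Pointer fields distinguish "unspecified" (nil) from an explicit value.
type NetworkConfig struct {
	VXLANPort *uint32
	MTU       *uint32
}

// Object stands in for a rendered Kubernetes object (DaemonSet, ConfigMap, ...).
type Object struct {
	Kind, Namespace, Name string
}

// Placeholder helpers; the interesting part is the order of operations below.
func validate(c *NetworkConfig) error { return nil }
func fillDefaults(next, prev *NetworkConfig) {}
func isChangeSafe(prev, next *NetworkConfig) error { return nil }
func render(c *NetworkConfig) []Object { return nil }
func apply(o Object) error { return nil }
func storeApplied(c *NetworkConfig) {}

// reconcile runs whenever the controller loop detects a configuration change.
func reconcile(prev, next *NetworkConfig) error {
	// Preprocess: check validity, fill defaults / carry values forward,
	// then reject unsafe changes relative to the previously applied config.
	if err := validate(next); err != nil {
		return fmt.Errorf("invalid configuration: %v", err)
	}
	fillDefaults(next, prev)
	if prev != nil {
		if err := isChangeSafe(prev, next); err != nil {
			return fmt.Errorf("unsafe configuration change: %v", err)
		}
	}

	// Render into Kubernetes objects and reconcile them against the API server.
	for _, obj := range render(next) {
		if err := apply(obj); err != nil {
			return err
		}
	}

	// Store the fully-expressed applied configuration for later comparison.
	storeApplied(next)
	return nil
}
```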
Because most of the operator's configuration parameters are not changeable, it is important that the applied configuration is stable across upgrades. This has two implications:

1. All defaults must be made explicit.

   The network configuration is transformed internally into a fully-expressed struct. All optional values must have their defaults filled in. For example, if the `vxlanPort` is not specified, the default of 4789 is chosen and applied to the `OpenShiftSDNConfig`.

   Making all defaults explicit makes it possible to prevent unsafe changes when a newer version of the operator changes a default value.

2. Some values must be carried forward from the previous configuration.

   Some values are computed at run-time but can never be changed. For example, the MTU of the overlay network is determined from the node's interfaces. Changing this is unsafe, so it must always be carried forward (see the sketch below).

Note that the fully-expressed configuration is not persisted back to the API server. Instead, it is saved only in the stored applied configuration. An alternative would be to inject these values via a mutating webhook, but that requires a running service network, which does not exist until after the operator has run.
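The following sketch illustrates both implications, explicit defaults and carry-forward, in one simplified `fillDefaults`. The type, field, and constant names are stand-ins rather than the operator's real ones.

```go
// Simplified defaulting/carry-forward logic; types and field names are
// illustrative stand-ins for the operator's real configuration structs.
package sketch

// defaultVXLANPort is the value used when vxlanPort is left unspecified.
const defaultVXLANPort uint32 = 4789

type OpenShiftSDNConfig struct {
	VXLANPort *uint32 // nil means "not specified by the user"
	MTU       *uint32 // computed at run-time on first install, never changed
}

// fillDefaults produces the fully-expressed configuration: every optional
// value is made explicit, and run-time-computed values are carried forward
// from the previously applied configuration rather than re-detected.
func fillDefaults(conf, previous *OpenShiftSDNConfig) {
	if conf.VXLANPort == nil {
		port := defaultVXLANPort
		conf.VXLANPort = &port
	}
	if conf.MTU == nil && previous != nil && previous.MTU != nil {
		mtu := *previous.MTU // carry forward; changing the MTU is unsafe
		conf.MTU = &mtu
	}
}
```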
Each network provider is responsible for validating its view of the configuration. For example, the OpenShiftSDN provider validates that the `vxlanPort` is a number between 1 and 65535, that the MTU is sane, etc. No validation is provided via the CRD itself.
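Here is a sketch of that style of provider-level validation, reusing the `OpenShiftSDNConfig` stand-in from the previous sketch; the exact bounds and function names in the real provider may differ.

```go
// Illustrative provider-level validation; the operator's real checks and
// MTU bounds may differ.
package sketch

import "fmt"

// validateOpenShiftSDN collects every problem it finds rather than stopping
// at the first one, so the user sees all errors at once.
func validateOpenShiftSDN(conf *OpenShiftSDNConfig) []error {
	var errs []error
	if conf.VXLANPort != nil && (*conf.VXLANPort < 1 || *conf.VXLANPort > 65535) {
		errs = append(errs, fmt.Errorf("invalid vxlanPort %d: must be between 1 and 65535", *conf.VXLANPort))
	}
	// Hypothetical sanity bounds for the MTU; the real plugin defines its own limits.
	if conf.MTU != nil && (*conf.MTU < 576 || *conf.MTU > 65536) {
		errs = append(errs, fmt.Errorf("invalid MTU %d", *conf.MTU))
	}
	return errs
}
```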
Build binaries manually with
./hack/build-go.sh
There is a special mode, the renderer, that emulates what the operator would apply, given a fresh cluster. You can execute it manually:
_output/linux/amd64/cluster-network-renderer --config sample-config.yaml --out out.yaml
By default, podman is used to build images.
./hack/build-image.sh
You might need sudo:
BUILDCMD="sudo podman build" ./hack/build-image.sh
Or you could use a docker that supports multi-stage builds:
BUILDCMD="docker build" ./hack/build-image.sh
If you want to run a local build with the installer, you need to do a bit of hacking. There is a script that will do most of the dirty work for you. It will modify the installer so the network operator doesn't run, and spin up a local copy of the network operator. This script will either create a new cluster and run openshift-install for you, or attach to an existing cluster and restart the network operator with overrides.
This requires `oc` and `kubectl` in your `$PATH`.

The OpenShift installer is available from https://github.com/openshift/installer and may be built from git. This mode also requires `openshift-install` in your `$PATH`, or explicitly passed to `hack/run-locally.sh` with the `-i <installer binary>` option.
- If you haven't already, build the cluster-network-operator with `hack/build-go.sh`.
- Run `hack/run-locally.sh`, optionally passing an existing install-config.yaml (`-f` option), the network plugin to use (`-n` option), a network plugin container development image (`-m` option), and a request to wait after creating manifests (`-w` option) for additional manual overrides. Available network plugins are `sdn`/`OpenShiftSDN` and `ovn`/`OVNKubernetes`.
export CLUSTER_DIR=<your cluster directory>
hack/run-locally.sh -n OpenShiftSDN -m "docker.io/myawesome/origin-sdn:latest"
It will print status messages as it waits for the installer to make progress.
Passing the `-w` option will wait after creating manifests to allow for additional manual overrides before the cluster is started.
If you have an already-up cluster you would like to take over, then you can instead run:
export CLUSTER_DIR=<your cluster directory>
hack/run-locally.sh
It will stop the deployed operator, set up the environment variables, and execute your local build of the operator. A network plugin container development image (`-m` option) may also be given.
After destroying your cluster, you should delete your `$CLUSTER_DIR`.
If you're using libvirt, you can use `./scripts/maintenance/virsh-cleanup.sh` in the installer repository to quickly delete the virtual machines instead of waiting for Terraform.
This runs the operator with the `admin` role, so it will have full permissions. When run by the real installer, this will not be the case. Created objects, e.g. DaemonSets, will still have RBAC, though. In fact, no changes from `manifests/` will be picked up except image references.
If you're sick of answering all the questions over and over, you can shortcut the installer. Just do
openshift-install --dir=$CLUSTER_DIR create install-config
cp $CLUSTER_DIR/install-config.yaml ~/.cache/openshift-install/install-config.yaml
Then, for subsequent installer runs, you can do
rm -r $CLUSTER_DIR; mkdir -p $CLUSTER_DIR
cp ~/.cache/openshift-install/install-config.yaml $CLUSTER_DIR/
To reduce memory usage, you can remove some non-essential components from the cluster. This will cause some things (most notably, Prometheus) to be skipped. The installer process may or may not succeed, but you should be left with a functional control plane. This is fine for rapid network development. Just set:
export HACK_MINIMIZE=1
Origin images are difficult to build correctly. If you want to test some binary changes with openshift-sdn, there's an easier way.
Set up your API credentials:
- Log on to https://api.ci.openshift.org/ with your GitHub credentials
- On the top right, click *Copy Login Command*. Execute the copied command in a terminal
- Execute `oc registry login`, which will install credentials for your local podman and docker clients
- Pick a registry. You can use the OpenShift CI one if you are a member of the organization.
export REGISTRY=registry.svc.ci.openshift.org/<YOUR-GITHUB-USERNAME>
- Build and push the sdn image
podman build -t ${REGISTRY}/origin-sdn:latest -f images/sdn/Dockerfile.fedora .
podman push ${REGISTRY}/origin-sdn:latest
- Follow the steps above, using the `-m` option to override the image:
hack/run-locally.sh -m ${REGISTRY}/origin-sdn:latest