Getting started

Installation

With the following commands, you can quickly install an ACK Distro cluster in an offline environment and get the same user experience as ACK without connecting to the public cloud. You can also check out Sealer Get-Started for a more comprehensive guide on how to use the cluster.

Install cluster quickly

First of all, check that your machines meet the requirements.

Get sealer:

ARCH=amd64 # or arm64
wget http://ack-a-aecp.oss-cn-hangzhou.aliyuncs.com/ack-distro/sealer/sealer-0.9.4-beta2-linux-${ARCH}.tar.gz -O sealer-latest-linux-${ARCH}.tar.gz && \
      tar -xvf sealer-latest-linux-${ARCH}.tar.gz -C /usr/bin
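# (Optional) To confirm the binary is installed and on your PATH, print its version,
# assuming your sealer release provides the version subcommand (recent releases do):
sealer version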

Use sealer to get ACK Distro artifacts and create clusters:

sealer run ack-agility-registry.cn-shanghai.cr.aliyuncs.com/ecp_builder/ackdistro:v1-22-15-ack-10 -m ${master_ip1}[,${master_ip2},${master_ip3}] [ -n ${worker_ip1}...] -p password
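
# For example, a minimal sketch of creating a single-master cluster with two workers
# (the IPs and password below are placeholders):
sealer run ack-agility-registry.cn-shanghai.cr.aliyuncs.com/ecp_builder/ackdistro:v1-22-15-ack-10 \
  -m 192.168.0.2 \
  -n 192.168.0.3,192.168.0.4 \
  -p MyPassword123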

If you want to install ACK-D in an air-gapped environment, you can:

##########################################################
# The following commands should be run on a machine with internet access
# Use sealer to pull the ACK-D cluster image.
# You can use --platform to specify the architecture of the cluster image to pull; to pull multiple architectures, join them with ',', for example: --platform linux/amd64,linux/arm64
sealer --platform linux/amd64 pull ack-agility-registry.cn-shanghai.cr.aliyuncs.com/ecp_builder/ackdistro:v1-22-15-ack-10

# Save the cluster image as a tar archive
sealer save ack-agility-registry.cn-shanghai.cr.aliyuncs.com/ecp_builder/ackdistro:v1-22-15-ack-10 -o ackdistro.tar

##########################################################
# The following commands should be run on the target air-gapped machine
# Copy ackdistro.tar to the air-gapped machine, then load it:

sealer load -i ackdistro.tar

# Then execute sealer run as usual
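
# (Optional) Before running, you can confirm the image was loaded locally,
# assuming your sealer release provides an image-listing subcommand:
sealer images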

Check cluster status:

kubectl get cs
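
# You can also verify that all nodes are Ready and that the system pods are running:
kubectl get nodes -o wide
kubectl get pods -A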

[Advanced] Install with production-level configuration

ACK Distro has extensive production-level cluster management experience, and we currently provide the following production-level features.

1) Automatically manage disk capacity for K8s daemons

Partition isolation and capacity management for K8s control components can improve cluster performance and stability.

If you want ACK Distro to better manage the disks it uses, prepare raw data disks as needed (no partitioning or mounting required):

  • EtcdDevice: the disk allocated to etcd; it must be larger than 20GiB with IOPS > 3300, and is only required on Master nodes
  • StorageDevice: the disk allocated to docker and kubelet
  • DockerRunDiskSize, KubeletRunDiskSize: the size of the disk partitions allocated to docker and kubelet, 100GiB each by default; ACK-D uses LVM to manage disk partitions, so you can extend them later during operation and maintenance
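
Before filling in the ClusterFile, you may want to confirm that the candidate disks are raw block devices with no partitions or mount points; a quick check (the device names are placeholders) could be:

# FSTYPE and MOUNTPOINT should be empty for a raw, unmounted disk
lsblk -f /dev/vdb /dev/vdc /dev/vdd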

Configure your ClusterFile:

apiVersion: sealer.cloud/v2
kind: Cluster
metadata:
  name: my-cluster # must be my-cluster
spec:
  image: ack-agility-registry.cn-shanghai.cr.aliyuncs.com/ecp_builder/ackdistro:v1-22-15-ack-10
  env:
    - EtcdDevice=/dev/vdb # EtcdDevice is device for etcd, default is "", which will use system disk
    - StorageDevice=/dev/vdc # StorageDevice is device for kubelet and container daemon, default is "", which will use system disk
    - YodaDevice=/dev/vdd # YodaDevice is device for open-local, if not specified, open local can't provision pv
    - DockerRunDiskSize=100 # unit is GiB, capacity for /var/lib/docker, default is 100
    - KubeletRunDiskSize=100 # unit is GiB, capacity for /var/lib/kubelet, default is 100
  ssh:
    passwd: "password"
  hosts:
    - ips: # support ipv6
        - 1.1.1.1
        - 2.2.2.2
        - 3.3.3.3
      roles: [ master ] # add role field to specify the node role
      env: # none of these env values are required; set them here only if some nodes need a different env config
        - EtcdDevice=/dev/vdb
        - StorageDevice=/dev/vde
    - ips: # support ipv6
        - 4.4.4.4
        - 5.5.5.5
      roles: [ node ]
# install
sealer run -f ClusterFile

2) Flexible configuration of ssh access channel

  • Supports ssh private keys: you need to distribute the corresponding public key to all nodes in the cluster via ssh-copy-id or another method (see the sketch after this list)
  • Supports configuring different ssh access methods for different node groups
  • Supports sudo user mode, which requires the user to have password-free sudo permissions
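
If you use the private-key option (pk), one common way to generate a key pair and distribute the public key to every node is sketched below (the host IPs are placeholders):

# generate a key pair on the machine that runs sealer (skip if one already exists)
ssh-keygen -t rsa -f /root/.ssh/id_rsa -N ""

# copy the public key to every node in the cluster
for host in 1.1.1.1 4.4.4.4; do
  ssh-copy-id -i /root/.ssh/id_rsa.pub root@${host}
done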
apiVersion: sealer.cloud/v2
kind: Cluster
metadata:
  name: my-cluster
spec:
  ...
  ssh:
    passwd: "password"
    #user: root # default is root
    #port: "22" # default is 22
    #pk: /root/.ssh/id_rsa
    #pkPasswd: xxx
  hosts:
    - ips:
        - 1.1.1.1
      ssh:
        user: root # sudo user, default is root
        passwd: passwd
        port: "22"
      ...
    - ips: # use default ssh
        - 4.4.4.4
      roles: [ node ]
      ...
  ...

3) Use a lite image

In versions after v1-22-15-ack-4 and v1-20-11-ack-17, ACK-D provides cluster images in both full and lite modes. The lite-mode tag is "${full_mode_tag}-lite", for example v1-22-15-ack-4-lite.

The lite image (about 0.4GiB) is much smaller than the full image (about 1.4GiB), mainly because it does not bundle all the container images that ACK-D depends on. Therefore, using the lite image requires meeting the following two requirements:

  1. All nodes of the cluster to be deployed must be able to access ack-agility-registry.cn-shanghai.cr.aliyuncs.com
  2. Configure .spec.registry as follows. NOTE: both externalRegistry and localRegistry are required
apiVersion: sealer.cloud/v2
kind: Cluster
metadata:
  name: my-cluster
spec:
  ...
  registry:
    externalRegistry: # external registry configuration
      domain: ack-agility-registry.cn-shanghai.cr.aliyuncs.com # if use lite mode image, externalRegistry must be set as this
    localRegistry: # local registry configuration
      domain: sea.hub # domain for local registry, default is sea.hub
      port: 5000 # port for local registry, default is 5000
  ...
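
Because the lite image pulls its container images at deploy time, you may want to verify beforehand that every node can reach the external registry; one simple reachability check (the /v2/ endpoint is the standard registry API path, shown here purely as an illustration) is:

# run on every node; any HTTP response code (even 401) means the registry is reachable
curl -sk -o /dev/null -w "%{http_code}\n" https://ack-agility-registry.cn-shanghai.cr.aliyuncs.com/v2/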

4) Use preflight

Before cluster deployment, the preflight tool can detect hidden risks that may affect cluster stability.

# The cluster precheck tool runs by default when deploying a cluster. If a precheck reports an error ErrorX that you believe can be ignored, do the following:
# specify IgnoreErrors=ErrorX[,ErrorY] in .spec.env of ClusterFile, and run again
sealer run -f ClusterFile

# You can also ignore all errors

# specify SkipPreflight=true in .spec.env of ClusterFile, and run again
sealer run -f ClusterFile

5) Use health check

The cluster health check tool checks whether the cluster is healthy with one command. After cluster deployment completes, a health check is triggered by default; if the check fails, an error is reported directly. After that, the health check runs periodically.

# You can query the health check result of the last run
trident health-check

# You can also trigger a new check
trident health-check --trigger-all

# For more options, see
trident health-check --help

6) Configure container runtime

Currently, ACK-D provides both the docker and containerd container runtimes; docker support will be dropped starting with Kubernetes 1.24. You can specify the container runtime with the following configuration:

apiVersion: sealer.cloud/v2
kind: Cluster
metadata:
  name: my-cluster
spec:
  ...
  containerRuntime:
    type: docker # which container runtime, support containerd/docker, default is docker
  ...
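
After deployment you can confirm which runtime each node actually reports, for example:

# the CONTAINER-RUNTIME column shows docker:// or containerd:// for each node
kubectl get nodes -o wide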

7) IPv6 dual stack

This section covers IPv6 dual-stack configuration; if you only need IPv6, use the method described in the previous sections.

How to configure for IPv6 dual stack mode:

  1. Node IP: the IP family of all node addresses passed in must be consistent, either IPv4 or IPv6. ACK-Distro always enables dual-stack mode, so it will try to find an additional IP address on each node, corresponding to the default route of the other address family, to use as the second host IP
  2. SvcCIDR (optional; if not set, the default value is used): two service network segments (an IPv4 segment and an IPv6 segment) must be passed, separated by ','. The IP family of the first service CIDR must be consistent with the IP family of the node addresses
  3. PodCIDR (optional; if not set, the default value is used): same as SvcCIDR
  4. The control plane pods will use IPs assigned from the first PodCIDR
apiVersion: sealer.cloud/v2
kind: Cluster
metadata:
  name: my-cluster
spec:
  ...
  env:
    - PodCIDR=5408:4003:10bb:6a01:83b9:6360:c66d:0000/112,101.64.0.0/16
    - SvcCIDR=6408:4003:10bb:6a01:83b9:6360:c66d:0000/112,11.96.0.0/16
  hosts:
    - ips:
        - 2408:4003:10bb:6a01:83b9:6360:c66d:ed57
        - 2408:4003:10bb:6a01:83b9:6360:c66d:ed58
      roles: [ master ] # add role field to specify the node role
    - ips:
        - 2408:4003:10bb:6a01:83b9:6360:c66d:ed59
      roles: [ node ]
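
Once the cluster is up, you can check that pods really receive addresses from both families, for example (the pod and namespace names are placeholders):

# for a dual-stack pod, podIPs should list one address per IP family
kubectl get pod ${pod_name} -n ${namespace} -o jsonpath='{.status.podIPs}'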

8) Cluster Scales

You can configure different cluster specifications through ClusterScale in .spec.env; the number of pods must be <= 60 * the number of nodes:

  • small: 0-50 nodes
  • medium: 50-100 nodes
  • large: 100-1000 nodes
  • xlarge: 1000-3000 nodes

For specific specification parameters, please refer to the appendix.
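
For example, a minimal sketch of selecting a scale before deploying (the value comes from the list above):

# add ClusterScale to .spec.env of your ClusterFile, e.g.:
#   env:
#     - ClusterScale=medium
# then deploy as usual
sealer run -f ClusterFile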

9) Other configs

In the ClusterFile's .spec.env, you can also modify the following configurations:

  • Addons: additional components that need to be deployed; currently supports ack-node-problem-detector, paralb, and kube-prometheus-crds; the default is empty
  • Network: the network plugin to use; currently supports hybridnet and calico; the default is hybridnet
  • DNSDomain: Kubernetes Service Domain suffix, can be customized, the default is "cluster.local"
  • ServiceNodePortRange: Kubernetes NodePort Service port range, which can be customized, the default is 30000-32767
  • EnableLocalDNSCache: Whether to enable the Local DNS Cache, the default is false
  • RemoveMasterTaint: Whether to automatically remove the master taint, the default is false
  • CertSANs: The domain name/IP that needs to be additionally issued for the APIServer certificate
  • IgnoreErrors: Preflight error items to ignore
  • TrustedRegistry: The registry domain that needs to be trusted by the container runtime
  • UseIPasNodeName: Whether to use the node IP as the NodeName, default is false
  • DefaultIPRetain: Whether to enable the network plug-in IP retention function, default is true
  • DockerVersion: which docker version to install; supported values include 19.03.15 and 20.10.6; the default is 19.03.15
  • ContainerDataRoot: docker data root path, default is /var/lib/docker

Operation and maintenance cluster

Unless otherwise specified, all operations can be performed on any Master, provided that the Master node has the sealer binary and the correct version of the cluster image.

Scale-up node

sealer join -m ${master_ip1}[,${master_ip2},${master_ip3}] [ -n ${worker_ip1}...]

Scale-down node

sealer delete -m ${master_ip1}[,${master_ip2},${master_ip3}] [ -n ${worker_ip1}...]

For a node that was not added successfully, you may get the following prompt after executing the above command: "both master and node need to be deleted all not in current cluster, skip delete". If you need to forcibly clean up the node, execute the following command:

sealer delete -m ${master_ip1}[,${master_ip2},${master_ip3}] [ -n ${worker_ip1}...] -f /root/.sealer/Clusterfile

Add new SANs for APIServer

sealer cert --alt-names ${new_apiserver_ip},${new_apiserver_domain}
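
You can then verify that the new names appear in the serving certificate, for example (replace the address with one of your master IPs; 6443 is the default APIServer port):

# inspect the SANs of the kube-apiserver serving certificate
echo | openssl s_client -connect ${master_ip1}:6443 2>/dev/null \
  | openssl x509 -noout -text | grep -A1 'Subject Alternative Name'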

Scale-up open-local pool

# if the following file does not exist, run: cp /root/.sealer/Clusterfile .
vim Clusterfile

---
apiVersion: sealer.cloud/v2
kind: Cluster
spec:
  image: ack-agility-registry.cn-shanghai.cr.aliyuncs.com/ecp_builder/ackdistro:v1-22-15-ack-10
  env:
    # add the new device /dev/vde for open-local; if there is more than one, join them with ','
    - YodaDevice=/dev/vde

trident on-sealer -f Clusterfile  --sealer

etcd cron backup

No configuration is required. By default, etcd is backed up at 2 am every day. If you need to restore etcd, you can check the /backup/etcd/snapshots/ directory of any master node to obtain the backup file, and then follow Etcd Recovery for recovery.
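
For example, to list the snapshots available on a master node:

# backups are written under /backup/etcd/snapshots/ on the master nodes
ls -lh /backup/etcd/snapshots/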

K8s cluster audit log

No configuration is required. By default, only write operations are recorded. For a cluster with 3 masters and 3 workers (3m+3w), 1GiB of space can record about 72h of cluster write operations. Audit log files are stored in /var/log/kubernetes/audit.log on all Master nodes.
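
Each audit entry is typically written as a JSON line, so you can filter the log with standard tools; for example, to list recent delete requests (assuming jq is installed on the master):

# on a master node: show the user, verb and target of recent delete requests
tail -n 1000 /var/log/kubernetes/audit.log \
  | jq -r 'select(.verb=="delete") | "\(.user.username) \(.verb) \(.objectRef.resource)/\(.objectRef.name)"'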

Cleanup cluster

sealer delete -a

Usage

Kubernetes Usage

Please refer to Alibaba Help Center and Kubernetes community for the usage of standard Kubernetes:

ACK Distro’s recommendation on how to use the network plugin:

Hybridnet user manual

ACK-Distro’s recommendation on how to use the storage plugin:

Open-local local storage plug-in user manual

Use ACK to manage ACK-Distro cluster:

https://help.aliyun.com/document_detail/121053.html

Appendix

Scales

| Scale | Component | Configs |
| --- | --- | --- |
| small | etcd | requests.cpu=100m<br>requests.mem=100Mi |
| small | kube-apiserver | requests.cpu=250m |
| small | kube-scheduler | requests.cpu=100m |
| small | kube-controller-manager | requests.cpu=200m |
| small | coredns | requests.cpu=100m<br>requests.mem=70Mi<br>limits.mem=170Mi |
| small | metrics-server | requests.cpu=125m<br>requests.mem=500Mi<br>limits.cpu=2<br>limits.mem=2Gi |
| small | node-problem-detector | requests.cpu=0<br>requests.mem=200Mi<br>limits.cpu=100m<br>limits.mem=200Mi |
| small | yoda-agent | requests.cpu=0<br>requests.mem=64Mi<br>limits.cpu=200m<br>limits.mem=256Mi |
| small | yoda-controller | requests.cpu=300m<br>requests.mem=768Mi<br>limits.cpu=2.5<br>limits.mem=2816Mi |
| medium | etcd | requests.cpu=1<br>requests.mem=2Gi<br>quota-backend-bytes: '8589934592'<br>max-request-bytes: '33554432'<br>experimental-compaction-batch-limit: '1000'<br>auto-compaction-retention: 5m<br>backend-batch-interval: 10ms<br>backend-batch-limit: '100' |
| medium | kube-apiserver | requests.cpu=4<br>requests.mem=8Gi<br>max-requests-inflight: '600'<br>max-mutating-requests-inflight: '300' |
| medium | kube-scheduler | requests.cpu=4<br>requests.mem=8Gi<br>kube-api-qps: '1000'<br>kube-api-burst: '1000' |
| medium | kube-controller-manager | requests.cpu=4<br>requests.mem=8Gi<br>kube-api-qps: '500'<br>kube-api-burst: '500' |
| medium | coredns | requests.cpu=100m<br>requests.mem=70Mi<br>limits.mem=2Gi |
| medium | metrics-server | requests.cpu=125m<br>requests.mem=500Mi<br>limits.cpu=2<br>limits.mem=4Gi |
| medium | node-problem-detector | requests.cpu=0<br>requests.mem=200Mi<br>limits.cpu=4<br>limits.mem=8Gi |
| medium | yoda-agent | requests.cpu=0<br>requests.mem=64Mi<br>limits.cpu=1<br>limits.mem=2Gi |
| medium | yoda-controller | requests.cpu=300m<br>requests.mem=768Mi<br>limits.cpu=14<br>limits.mem=28Gi |
| large | etcd | requests.cpu=1<br>requests.mem=2Gi<br>quota-backend-bytes: '8589934592'<br>max-request-bytes: '33554432'<br>experimental-compaction-batch-limit: '1000'<br>auto-compaction-retention: 5m<br>backend-batch-interval: 10ms<br>backend-batch-limit: '100' |
| large | kube-apiserver | requests.cpu=4<br>requests.mem=8Gi<br>max-requests-inflight: '600'<br>max-mutating-requests-inflight: '300' |
| large | kube-scheduler | requests.cpu=4<br>requests.mem=8Gi<br>kube-api-qps: '1000'<br>kube-api-burst: '1000' |
| large | kube-controller-manager | requests.cpu=4<br>requests.mem=8Gi<br>kube-api-qps: '500'<br>kube-api-burst: '500' |
| large | coredns | requests.cpu=100m<br>requests.mem=70Mi<br>limits.mem=2Gi |
| large | metrics-server | requests.cpu=125m<br>requests.mem=500Mi<br>limits.cpu=2<br>limits.mem=4Gi |
| large | node-problem-detector | requests.cpu=0<br>requests.mem=200Mi<br>limits.cpu=4<br>limits.mem=8Gi |
| large | yoda-agent | requests.cpu=0<br>requests.mem=64Mi<br>limits.cpu=1<br>limits.mem=2Gi |
| large | yoda-controller | requests.cpu=300m<br>requests.mem=768Mi<br>limits.cpu=14<br>limits.mem=28Gi |
| xlarge | etcd | requests.cpu=8<br>requests.mem=16Gi<br>quota-backend-bytes: '8589934592'<br>max-request-bytes: '33554432'<br>experimental-compaction-batch-limit: '1000'<br>auto-compaction-retention: 5m<br>backend-batch-interval: 10ms<br>backend-batch-limit: '100' |
| xlarge | kube-apiserver | requests.cpu=16<br>requests.mem=128Gi<br>max-requests-inflight: '600'<br>max-mutating-requests-inflight: '300' |
| xlarge | kube-scheduler | requests.cpu=8<br>requests.mem=32Gi<br>kube-api-qps: '1000'<br>kube-api-burst: '1000' |
| xlarge | kube-controller-manager | requests.cpu=8<br>requests.mem=32Gi<br>kube-api-qps: '500'<br>kube-api-burst: '500' |
| xlarge | coredns | requests.cpu=100m<br>requests.mem=70Mi<br>limits.cpu=3<br>limits.mem=6Gi |
| xlarge | metrics-server | requests.cpu=125m<br>requests.mem=500Mi<br>limits.cpu=2<br>limits.mem=4Gi |
| xlarge | node-problem-detector | requests.cpu=0<br>requests.mem=200Mi<br>limits.cpu=4<br>limits.mem=8Gi |
| xlarge | yoda-agent | requests.cpu=0<br>requests.mem=64Mi<br>limits.cpu=1<br>limits.mem=2Gi |
| xlarge | yoda-controller | requests.cpu=300m<br>requests.mem=768Mi<br>limits.cpu=14<br>limits.mem=28Gi |