See the DSRI documentation to deploy it on the Data Science Research Infrastructure (OpenShift cluster at Maastricht University).
The deployment is based on the official Flink Kubernetes documentation. Images are built from apache/flink.
More docs:
- more details on running jobs using the CLI or Kubernetes native execution.
- more Flink docs for Kubernetes deployment
You need the anyuid security context constraint (SCC) enabled to start this application 🔒
oc apply -f https://raw.githubusercontent.com/vemonet/flink-on-openshift/master/template-flink.yml
- Flink home folder inside the pods is /opt/flink
- Persistent storage is shared in /mnt inside the pods
Use this template for the Data Science Research Infrastructure:
oc apply -f https://raw.githubusercontent.com/vemonet/flink-on-openshift/master/template-flink-dsri.yml
Use this command to get the Flink JobManager pod ID, then use that ID to copy files to the pod.
oc get pod --selector app=flink,component=jobmanager --no-headers -o=custom-columns=NAME:.metadata.name
# Example creating the workspace folder and copying the RMLStreamer.jar to the pod
oc exec <pod_id> -- mkdir -p /mnt/workspace/resources
oc cp folder-to-copy/ <pod_id>:/mnt/
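The lookup and copy steps above can be chained so the pod ID does not have to be pasted by hand. The following is a minimal sketch, not part of the template: `flink_jobmanager_pod` and `copy_to_flink` are hypothetical helper names, and it assumes the JobManager pod carries the labels `app=flink` and `component=jobmanager`, combined here into a single comma-separated selector so that both labels are applied.

```shell
# Sketch only: helper functions around the oc commands shown above.

# Print the name of the first pod matching the JobManager labels.
# Assumption: the pod is labelled app=flink and component=jobmanager.
flink_jobmanager_pod() {
  oc get pod --selector app=flink,component=jobmanager \
    --no-headers -o=custom-columns=NAME:.metadata.name | head -n 1
}

# Create the workspace folder, then copy a local folder to /mnt in the pod.
# $1 is the local folder to copy (for example folder-to-copy/).
copy_to_flink() {
  flink_src="$1"
  flink_pod="$(flink_jobmanager_pod)"
  if [ -z "$flink_pod" ]; then
    echo "No JobManager pod found" >&2
    return 1
  fi
  oc exec "$flink_pod" -- mkdir -p /mnt/workspace/resources
  oc cp "$flink_src" "$flink_pod:/mnt/"
}
```

After sourcing these functions, `copy_to_flink folder-to-copy/` would run the same three commands as the manual example.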
Delete the deployed services:
oc delete all,secret,configmaps,serviceaccount,rolebinding --selector app=flink-cluster
Replace flink-cluster with the name you gave to the application.
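To avoid editing the selector by hand, the delete command can be wrapped in a small function that takes the application name as an argument. This is a sketch under stated assumptions: `delete_flink_app` is a hypothetical helper name, and the default value flink-cluster is taken from the command above.

```shell
# Sketch only: delete every resource labelled with the given application name.
# $1 is the application name; it defaults to flink-cluster when omitted.
delete_flink_app() {
  flink_app_name="${1:-flink-cluster}"
  oc delete all,secret,configmaps,serviceaccount,rolebinding \
    --selector "app=$flink_app_name"
}
```

For example, `delete_flink_app my-flink` would target resources labelled `app=my-flink`.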