Skip to content

Commit

Permalink
Merge pull request #300 from chaitanya1731/playbook
Browse files Browse the repository at this point in the history
one_click: Updated Gaudi provisioning details
  • Loading branch information
uMartinXu authored Sep 5, 2024
2 parents 4062f27 + e69dea7 commit c6b765e
Show file tree
Hide file tree
Showing 2 changed files with 13 additions and 13 deletions.
6 changes: 3 additions & 3 deletions one_click/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -27,8 +27,8 @@ Execute below single command to provision Intel Data Center GPU:
$ ansible-playbook gpu_provisioning_playbook.yaml
```

## Reference Playbook – Habana Gaudi Provisioning
This playbook demonstrates the one-click provisioning of Habana Gaudi Accelerator on an RHOCP cluster. The steps involved are installation and configuration of general Operators including Node Feature Discovery (NFD) operator, Kernel Module Management (KMM) operator, and the HabanaAI Operator. The playbook also creates the Gaudi `DeviceConfig` CR which deploys the Gaudi Out-of-Tree drivers, Gaudi device plugins, Habana container runtime and Habana node metrics.
## Reference Playbook – Intel Gaudi Provisioning
This playbook demonstrates the one-click provisioning of Intel Gaudi AI Accelerator on an RHOCP cluster. The steps involved are installation and configuration of general Operators including Node Feature Discovery (NFD) operator, Kernel Module Management (KMM) operator, and the Intel Gaudi Base Operator. The playbook also creates the Gaudi `DeviceConfig` CR which deploys the Gaudi Out-of-Tree drivers, Gaudi device plugins, Habana container runtime and Habana node metrics.

### Prerequisite
Before running the playbook, ensure the following prerequisites are met:
Expand All @@ -42,7 +42,7 @@ To run the ansible playbook, clone this repository to your RHEL system. Navigate
$ git clone https://github.com/intel/intel-technology-enabling-for-openshift.git
$ cd intel-technology-enabling-for-openshift/one_click
```
Execute below single command to provision Habana Gaudi Accelerator:
Execute below single command to provision Intel Gaudi Accelerator:
```
$ ansible-playbook gaudi_provisioning_playbook.yaml
```
20 changes: 10 additions & 10 deletions one_click/gaudi_provisioning_playbook.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -112,16 +112,16 @@
type: Available
status: 'True'
reason: MinimumReplicasAvailable
- name: Install HabanaAI Operator
- name: Install Intel Gaudi Base Operator
tags:
- habana-ai
- intel-gaudi
block:
- name: Install HabanaAI Operator
- name: Install Intel Gaudi Base Operator
k8s:
state: present
src: "../gaudi/gaudi_install_operator.yaml"
wait: yes
- name: Wait until the Habana controller-manager Deployment is available
- name: Wait until the Intel Gaudi controller-manager Deployment is available
k8s_info:
kind: Deployment
wait: yes
Expand All @@ -133,19 +133,19 @@
reason: MinimumReplicasAvailable
- name: NFD - Install NFD CRs
block:
- name: NFD - Create NFD discovery instance for Habana Gaudi
- name: NFD - Create NFD discovery instance for Intel Gaudi
k8s:
state: present
src: "../gaudi/gaudi_nfd_instance_openshift.yaml"
wait: yes
- name: Install Habana Gaudi DeviceConfig CR
- name: Install Intel Gaudi DeviceConfig CR
block:
- name: Create Habana Gaudi DeviceConfig
- name: Create Intel Gaudi DeviceConfig
k8s:
state: present
src: "../gaudi/gaudi_device_config.yaml"
wait: yes
- name: Verify Habana Gaudi Resources
- name: Verify Intel Gaudi Resources
tags:
- gaudi_resource_test
block:
Expand All @@ -154,8 +154,8 @@
api: v1
kind: Node
label_selectors:
- "kmm.node.kubernetes.io/habana-ai-operator.habana-ai-module.device-plugin-ready="
- "kmm.node.kubernetes.io/habana-ai-operator.habana-ai-module.ready="
- "kmm.node.kubernetes.io/habana-ai-operator.intel-gaudi-module.device-plugin-ready="
- "kmm.node.kubernetes.io/habana-ai-operator.intel-gaudi-module.ready="
wait: yes
wait_timeout: 120
register: cluster_nodes_info
Expand Down

0 comments on commit c6b765e

Please sign in to comment.