cpu-inference #16

Summary
Jobs
- unit-tests
Run details
- Usage
- Workflow file

Workflow file for this run

.github/workflows/cpu-inference.yml at ade7149

	name: cpu-inference

	on:
	workflow_dispatch:
	pull_request:
	paths:
	- '.github/workflows/cpu-inference.yml'
	- 'requirements/**'
	- 'deepspeed/__init__.py'
	- 'deepspeed/inference/**'
	- '!deepspeed/inference/v2/**' # exclude v2 dir
	- 'tests/unit/inference/**'
	- '!tests/unit/inference/v2/**' # exclude v2 tests dir
	merge_group:
	branches: [ master ]
	schedule:
	- cron: "0 0 * * 0"

	concurrency:
	group: ${{ github.workflow }}-${{ github.ref }}
	cancel-in-progress: true

	jobs:
	unit-tests:
	runs-on: [self-hosted, cpu]

	env: {ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION: true} # Allow using Node16 actions

	steps:
	- uses: actions/checkout@v3

	- id: setup-venv
	uses: ./.github/workflows/setup-venv

	- name: Install gcc-9
	run: \|
	sudo add-apt-repository -u ppa:ubuntu-toolchain-r/test
	sudo apt install -y gcc-9 g++-9
	# set gcc-9 and g++9 to default
	sudo update-alternatives --install /usr/bin/gcc gcc /usr/bin/gcc-9 99
	sudo update-alternatives --install /usr/bin/g++ g++ /usr/bin/g++-9 99

	- name: Check gcc version
	run: \|
	# Get gcc version
	gcc --version
	g++ --version

	- name: Detect instruction sets on instance
	run: \|
	lscpu

	- name: Install numactl
	run: \|
	sudo apt-get install -y numactl

	- name: Install dependencies
	run: \|
	pip install torch
	# check installed version
	pip list \|grep \\\<torch\\\>

	- name: Install oneCCL
	run: \|
	pip install cmake
	git clone https://github.com/oneapi-src/oneCCL
	cd oneCCL
	mkdir build
	cd build
	cmake ..
	make -j install

	- name: Install transformers
	run: \|
	git clone https://github.com/huggingface/transformers
	cd transformers
	git rev-parse --short HEAD
	pip install .

	- name: Install deepspeed
	run: \|
	# check why the host does not have AVX2 support
	pip install .[dev,1bit,autotuning,inf]
	ds_report

	- name: Python environment check
	run: \|
	pip list
	source oneCCL/build/_install/env/setvars.sh
	export LD_PRELOAD=/usr/lib/x86_64-linux-gnu/libstdc++.so.6
	# check whether the environment is properly setup
	python -c "import deepspeed;from deepspeed.accelerator import get_accelerator;print(get_accelerator().device_name());print(get_accelerator().is_available())"

	- name: Unit tests
	run: \|
	# prep oneCCL for CCLBackend comm ops building
	source oneCCL/build/_install/env/setvars.sh
	export LD_PRELOAD=/usr/lib/x86_64-linux-gnu/libstdc++.so.6
	unset TORCH_CUDA_ARCH_LIST # only jit compile for current arch
	cd tests
	# LOCAL_SIZE=2 enforce CPU to report 2 devices, this helps run the test on github default runner
	LOCAL_SIZE=2 COLUMNS=240 HF_HOME=~/tmp/hf_home/ TORCH_EXTENSIONS_DIR=./torch-extensions pytest -m 'seq_inference' unit/
	LOCAL_SIZE=2 COLUMNS=240 HF_HOME=~/tmp/hf_home/ TORCH_EXTENSIONS_DIR=./torch-extensions pytest -m 'inference_ops' -m 'inference' unit/

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cpu-inference #16

Workflow file

cpu-inference #16

Jobs

Run details

Workflow file for this run