GitHub - AidenPearce-ZYQ/NeutronOrch: code for back up

NeutronOrch is a system for sample-based GNN training that incorporates a layer-based task orchestrating method and ensures balanced utilization of the CPU and GPU. NeutronOrch distinguishes itself from other GNN training frameworks with the following properties:

NeutronOrch uses a hotness-aware, layer-based task orchestrating method that effectively leverages the computation and memory resources of the GPU-CPU heterogeneous system.
NeutronOrch integrates libtorch (the PyTorch automatic differentiation library) and TensorFlow to support automatic differentiation (automatic backpropagation) across workers.
NeutronOrch uses super-batch pipelined training, which fully overlaps different tasks on heterogeneous resources while strictly guaranteeing bounded staleness.

NeutronOrch is currently being refactored. We will release all features of NeutronOrch soon.

Quick Start

A compiler supporting OpenMP and C++11 features (e.g. lambda expressions, multi-threading, etc.) is required.

CMake >= 3.14.3

MPI for inter-process communication.

CUDA > 11.0 for GPU-based graph operations.

TBB for C++17 features.

libnuma for NUMA-aware memory allocation.

sudo apt install libnuma-dev

libtorch version > 1.11 with GPU support for NN computation.

Unzip the libtorch package in the root directory of NeutronOrch and change CMAKE_PREFIX_PATH in "CMakeLists.txt" to your own path.
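Before editing CMAKE_PREFIX_PATH, it can help to confirm that the path really points at an unpacked libtorch. The sketch below is not part of NeutronOrch; `check_libtorch` is a hypothetical helper and the `./libtorch` default location is an assumption. It relies only on the fact that libtorch distributions ship their CMake package file at `share/cmake/Torch/TorchConfig.cmake`.

```shell
# check_libtorch DIR
# Hypothetical helper: succeeds if DIR looks like an unpacked libtorch,
# i.e. it contains the CMake package file that find_package(Torch) needs.
check_libtorch() {
  if [ -f "$1/share/cmake/Torch/TorchConfig.cmake" ]; then
    echo "libtorch found at $1"
    return 0
  fi
  echo "TorchConfig.cmake not found under $1" >&2
  return 1
}

# Assumption: libtorch was unzipped to ./libtorch in the NeutronOrch root.
# "|| true" keeps the script going when the directory is absent.
check_libtorch "${LIBTORCH_DIR:-$PWD/libtorch}" || true
```

If the check passes, the directory it reports is the value to use for CMAKE_PREFIX_PATH.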

Configure PATH and LD_LIBRARY_PATH for CUDA and MPI:

export CUDA_HOME=/usr/local/cuda
export MPI_HOME=/path/to/your/mpi
export LD_LIBRARY_PATH=$CUDA_HOME/lib64:$LD_LIBRARY_PATH
export PATH=$MPI_HOME/bin:$CUDA_HOME/bin:$PATH
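After exporting these variables, a quick check that the required tools resolve on PATH can catch a wrong MPI_HOME or CUDA_HOME early. This is a minimal sketch, not a script shipped with NeutronOrch: `missing_tools` is a hypothetical helper, and the tool names checked (`cmake`, `mpirun`, `nvcc`) are assumptions drawn from the requirements listed above.

```shell
# missing_tools TOOL...
# Hypothetical helper: prints the name of every tool that cannot be
# resolved on PATH, one per line. Prints nothing if all are found.
missing_tools() {
  for tool in "$@"; do
    # "command -v" succeeds only when the tool resolves in this shell
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Report any of the assumed build prerequisites that are not on PATH.
missing_tools cmake mpirun nvcc
```

An empty result means all three tools were found. Any name printed points to a PATH entry that still needs fixing.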

clang-format is optional for auto-formatting:

sudo apt install clang-format

To build:

mkdir build
cd build
cmake ..
make -j4

To run NeutronOrch:

./nto_run.sh ${cfg_file}

ENGINE TYPE: We list several examples in the root directory for your reference. GCN: gcn_reddit_sample.cfg, gcn_cora_sample.cfg
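To try several sample configurations in one go, the run command above can be wrapped in a small loop. The following is only a sketch under stated assumptions: `run_configs` is a hypothetical helper, and the `RUNNER` and `DRY_RUN` variables are illustrative names that nto_run.sh does not define. It assumes nto_run.sh takes a single config file argument, as shown above.

```shell
# run_configs CFG...
# Hypothetical helper: runs the launcher once per config file that exists.
# RUNNER  (assumed name) launcher to call, defaults to ./nto_run.sh
# DRY_RUN (assumed name) if set to 1, only print the commands
run_configs() {
  for cfg in "$@"; do
    if [ ! -f "$cfg" ]; then
      echo "skip (missing): $cfg"
      continue
    fi
    if [ "${DRY_RUN:-0}" = "1" ]; then
      echo "${RUNNER:-./nto_run.sh} $cfg"
    else
      # Keep going with the remaining configs if one run fails
      "${RUNNER:-./nto_run.sh}" "$cfg" || echo "failed: $cfg"
    fi
  done
}

# Dry run over the two GCN samples named above: prints commands only.
DRY_RUN=1
run_configs gcn_cora_sample.cfg gcn_reddit_sample.cfg
```

Setting DRY_RUN to 0 (or leaving it unset) runs the launcher for real. Run the loop from the repository root so the relative config paths resolve.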

