A tutorial on making slow TensorFlow training faster
THIS CODE ONLY WORKS ON NVIDIA GPUS
Assuming the dataset is effectively infinite in length, inline preprocessing can cause a CPU bottleneck that reduces training throughput.
These code samples show an unoptimized and an optimized TensorFlow workflow, as sketched below.
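For illustration only, here is a minimal tf.data sketch of the difference (this is not the repo's DALI-based pipeline; `data/images` is a hypothetical path). In the unoptimized variant the GPU idles while the CPU decodes one image at a time; the optimized variant overlaps preprocessing with training:

```python
import tensorflow as tf

# Hypothetical list of JPEG paths; replace with your own files.
file_paths = tf.io.gfile.glob("data/images/*.jpg")

def decode_and_resize(path):
    image = tf.io.decode_jpeg(tf.io.read_file(path), channels=3)
    return tf.image.resize(image, (224, 224))

# Unoptimized: sequential, inline preprocessing on the CPU.
slow_ds = (tf.data.Dataset.from_tensor_slices(file_paths)
           .map(decode_and_resize)
           .batch(32))

# Optimized: parallel decoding plus prefetching overlaps input
# preparation with the training step running on the GPU.
fast_ds = (tf.data.Dataset.from_tensor_slices(file_paths)
           .map(decode_and_resize, num_parallel_calls=tf.data.AUTOTUNE)
           .batch(32)
           .prefetch(tf.data.AUTOTUNE))
```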
Requirements:
- x86-64 (AMD64) CPU
- RAM >= 8GiB
- NVIDIA GPU with Compute Capability 7.0 or higher
- GPU memory > 12GiB for the default batch size
Test environment:
- CPU: Intel(R) Xeon(R) Gold 5218R
- GPU: 2x A100 80GB PCIe
- RAM: 255GiB
Optimizations used:
- NVIDIA DALI - GPU-accelerated data loading (first sketch below)
- Mixed precision - better MMA (Matrix Multiply-Accumulate) throughput than TF32 (second sketch below)
- XLA - JIT-compiles and fuses operators for more efficient execution on GPUs (third sketch below)
- (Optional) Multi-GPU training - use more than one GPU for training (fourth sketch below)
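A minimal sketch of the DALI approach, assuming a hypothetical `data/images` directory with one subfolder per class; the exact `output_shapes`/`output_dtypes` must match what your own pipeline produces:

```python
import tensorflow as tf
import nvidia.dali.fn as fn
import nvidia.dali.types as types
from nvidia.dali import pipeline_def
from nvidia.dali.plugin.tf import DALIDataset

BATCH = 32

@pipeline_def(batch_size=BATCH, num_threads=4, device_id=0)
def image_pipeline():
    # Read file names on the CPU; decode JPEGs on the GPU via nvJPEG ("mixed").
    jpegs, labels = fn.readers.file(file_root="data/images", random_shuffle=True)
    images = fn.decoders.image(jpegs, device="mixed", output_type=types.RGB)
    images = fn.resize(images, resize_x=224, resize_y=224)
    images = fn.cast(images, dtype=types.FLOAT) / 255.0
    return images, labels.gpu()

# Expose the DALI pipeline to TensorFlow as a tf.data-compatible dataset.
train_ds = DALIDataset(
    pipeline=image_pipeline(),
    batch_size=BATCH,
    output_shapes=((BATCH, 224, 224, 3), (BATCH, 1)),
    output_dtypes=(tf.float32, tf.int32),
    device_id=0)
```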
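Enabling mixed precision in Keras is a one-line policy change; a minimal sketch (the model here is a placeholder, not the repo's model):

```python
import tensorflow as tf

# Run matmuls and convolutions in float16 on Tensor Cores while
# keeping variables (weights) in float32 for numerical stability.
tf.keras.mixed_precision.set_global_policy("mixed_float16")

model = tf.keras.Sequential([
    tf.keras.layers.Flatten(input_shape=(224, 224, 3)),
    tf.keras.layers.Dense(256, activation="relu"),
    tf.keras.layers.Dense(1000),
    # Keep the final softmax in float32 so the loss stays numerically stable.
    tf.keras.layers.Activation("softmax", dtype="float32"),
])
```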
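A minimal XLA sketch; `fused_op` is a hypothetical function used only to show the effect of `jit_compile=True`:

```python
import tensorflow as tf

# jit_compile=True asks XLA to compile the function, fusing the
# elementwise ops below into a single GPU kernel and reducing
# kernel-launch overhead.
@tf.function(jit_compile=True)
def fused_op(x):
    return tf.nn.relu(x * 2.0 + 1.0)

print(fused_op(tf.random.normal((1024, 1024))).shape)

# Recent TensorFlow versions also accept jit_compile in Keras:
# model.compile(optimizer="adam", loss=..., jit_compile=True)
```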
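A minimal multi-GPU sketch using `tf.distribute.MirroredStrategy`, the standard Keras approach (the model is a placeholder):

```python
import tensorflow as tf

# MirroredStrategy replicates the model on every visible GPU and
# all-reduces gradients after each training step.
strategy = tf.distribute.MirroredStrategy()
print("Number of replicas:", strategy.num_replicas_in_sync)

with strategy.scope():
    # Variables created inside the scope are mirrored across GPUs.
    model = tf.keras.Sequential([
        tf.keras.layers.Dense(256, activation="relu"),
        tf.keras.layers.Dense(10, activation="softmax"),
    ])
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy")
```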
How to use:
- Clone this repo with its submodules:
```bash
git clone --recursive https://github.com/ReturnToFirst/FastTFWorkflow.git
```
- Compare the performance of the unoptimized and optimized workflows by running the notebooks.
after_optimization_multi.ipynb shows the training process with multiple GPUs.
Disclaimer:
Depending on the devices in your machine, performance may decrease.
This optimized code will not necessarily show the best possible performance.
Multi-GPU training does not work in the test environment.
There may be incorrect descriptions or code.