Evolutionary Algorithms in Reinforcement Learning - Multi-objective Optimization in Inventory Management

Project

Motivation: Strike a balance between the financial gains and the environmental impact of transportation in supply chain operations
Goal: Identify the trade-off solutions (Pareto front)
Key library: pymoo
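
As a minimal sketch of what "trade-off solutions" means here, the snippet below filters a set of candidate outcomes down to its non-dominated subset. The objective pairs are hypothetical numbers, not results from this project, and both objectives are written in minimization form (negative profit, emissions).

```python
from typing import List, Tuple

Point = Tuple[float, float]

def dominates(a: Point, b: Point) -> bool:
    """True if a is no worse than b in every objective and strictly better in at least one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def pareto_front(points: List[Point]) -> List[Point]:
    """Keep only the points that no other point dominates (minimization)."""
    return [p for p in points if not any(dominates(q, p) for q in points)]

# Hypothetical (negative profit, emissions) pairs, for illustration only.
candidates = [(-100.0, 50.0), (-80.0, 30.0), (-90.0, 60.0), (-60.0, 20.0), (-70.0, 35.0)]
front = pareto_front(candidates)
print(front)
```

Each point left in `front` can only improve one objective by giving up some of the other, which is the trade-off the project aims to map out.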

Supply Chain Network in this problem

Methodology

Apply a reinforcement learning framework
Use multi-objective evolutionary algorithms (MOEAs) to optimize the policy network
The MOEAs are: (1) NSGA-II (classic), (2) AGE-MOEA (state-of-the-art).
Use Bayesian optimization (BO) to tune the hyperparameters of the MOEAs
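
The steps above can be sketched as follows: the MOEA treats the policy parameters as a flat decision vector and scores each candidate by simulating an episode. This is not the repository's code. The single-node environment, the linear "policy net", the cost coefficients, and every name below are assumptions chosen to keep the example small.

```python
import random
from typing import Sequence, Tuple

def policy(weights: Sequence[float], state: Sequence[float], max_order: float = 20.0) -> float:
    """Hypothetical linear policy: weighted sum of state features plus a bias, clipped to [0, max_order]."""
    *w, bias = weights
    raw = sum(wi * si for wi, si in zip(w, state)) + bias
    return min(max(raw, 0.0), max_order)

def evaluate(weights: Sequence[float], horizon: int = 30, seed: int = 0) -> Tuple[float, float]:
    """Simulate one episode and return (negative profit, emissions), both to be minimized."""
    rng = random.Random(seed)  # fixed seed so every candidate faces the same demand
    inventory, backlog, pipeline = 10.0, 0.0, 0.0
    profit, emissions = 0.0, 0.0
    for _ in range(horizon):
        # Assumed state: inventory level, backlog, order placed but not yet received.
        order = policy(weights, (inventory, backlog, pipeline))
        inventory += pipeline            # previous order arrives (assumed lead time of 1)
        pipeline = order
        emissions += 0.5 * order         # assumed emissions per unit transported
        demand = rng.randint(5, 15) + backlog
        sold = min(inventory, demand)
        inventory -= sold
        backlog = demand - sold
        # Assumed prices and costs: revenue, ordering, holding, backlog penalty.
        profit += 3.0 * sold - 1.0 * order - 0.1 * inventory - 0.5 * backlog
    return -profit, emissions

# Two hypothetical candidates: never order vs. always order 10 units.
print(evaluate((0.0, 0.0, 0.0, 0.0)))
print(evaluate((0.0, 0.0, 0.0, 10.0)))
```

An MOEA such as NSGA-II or AGE-MOEA would call a function like `evaluate` once per individual and use the two returned values for non-dominated sorting and survival selection.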

Result

Case 1: State formulation - Inventory level, backlog, unfulfilled order

Converges within the evaluation budget
Well-defined Pareto front

Case 2 (when agent knows more): State formulation - Inventory level, backlog, unfulfilled order + Previous customer demand

The Pareto front shows better diversity when the agent has more information about the environment.
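
A small sketch of how the two state formulations differ as policy inputs. The function name, argument names, and the linear-policy parameter count are assumptions for illustration; the repository's own encoding may differ.

```python
from typing import List, Optional

def build_state(inventory: float, backlog: float, unfulfilled_order: float,
                previous_demand: Optional[float] = None) -> List[float]:
    """Case 1 uses three features; Case 2 appends the previous customer demand."""
    state = [inventory, backlog, unfulfilled_order]
    if previous_demand is not None:
        state.append(previous_demand)
    return state

case1 = build_state(12.0, 0.0, 3.0)
case2 = build_state(12.0, 0.0, 3.0, previous_demand=9.0)

# For a hypothetical linear policy (one weight per feature plus a bias),
# the extra feature also enlarges the decision vector the MOEA searches over.
n_params_case1 = len(case1) + 1
n_params_case2 = len(case2) + 1
print(n_params_case1, n_params_case2)
```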

Investigation of NSGA-II hyperparameter:

(1) Ratio of the number of offspring to the population size
(2) Ratio of the population size to the number of generations

Investigation of AGE-MOEA hyperparameter:

Ratio of the population size to the number of generations
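
To make the two ratios concrete, here is a rough sketch of how they could translate into algorithm settings under a fixed evaluation budget. The budget model (the initial population plus one batch of offspring per later generation), the function name, and the example numbers are assumptions, not values taken from this project.

```python
from typing import Optional, Tuple

def settings_for_budget(budget: int, offspring_ratio: float,
                        pop_gen_ratio: float) -> Optional[Tuple[int, int, int, int]]:
    """Return the largest (pop_size, n_offspring, n_gen, evaluations) that fits the budget.

    offspring_ratio = n_offspring / pop_size
    pop_gen_ratio   = pop_size / n_gen
    Assumed cost: pop_size + n_offspring * (n_gen - 1) evaluations.
    """
    best = None
    n_gen = 1
    while True:
        pop_size = max(1, round(pop_gen_ratio * n_gen))
        n_offspring = max(1, round(offspring_ratio * pop_size))
        evals = pop_size + n_offspring * (n_gen - 1)
        if evals > budget:
            return best  # None if even a single generation exceeds the budget
        best = (pop_size, n_offspring, n_gen, evals)
        n_gen += 1

# Hypothetical budget of 1000 evaluations.
print(settings_for_budget(1000, offspring_ratio=0.5, pop_gen_ratio=2.0))
```

With the budget held fixed, changing either ratio trades population breadth against the number of generations, which is the trade-off the investigation explores.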

The hyperparameter ratios obtained by BO are the best (with the highest hypervolume).
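
For readers unfamiliar with hypervolume, the sketch below computes it for a two-objective minimization front: the area dominated by the front and bounded by a reference point. The fronts and reference point are hypothetical. pymoo ships its own hypervolume indicator, and this hand-written version only illustrates the quantity being compared.

```python
from typing import Iterable, Tuple

Point = Tuple[float, float]

def hypervolume_2d(front: Iterable[Point], ref: Point) -> float:
    """Area dominated by `front` and bounded by `ref`, with both objectives minimized."""
    # Ignore points that do not improve on the reference point, then sweep by first objective.
    pts = sorted(p for p in front if p[0] < ref[0] and p[1] < ref[1])
    hv, best_f2 = 0.0, ref[1]
    for f1, f2 in pts:
        if f2 < best_f2:  # skip dominated points
            hv += (ref[0] - f1) * (best_f2 - f2)
            best_f2 = f2
    return hv

ref = (4.0, 4.0)
hv_dense = hypervolume_2d([(1.0, 3.0), (2.0, 2.0), (3.0, 1.0)], ref)
hv_sparse = hypervolume_2d([(1.0, 3.0), (3.0, 1.0)], ref)
print(hv_dense, hv_sparse)  # 6.0 5.0
```

A front that covers the trade-off region more completely dominates a larger area, so a higher hypervolume indicates better convergence, better diversity, or both.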

Summary

The novel methodology works for this multi-objective optimization (MOO) problem of inventory management; it is the first to combine RL and MOO.
BO can successfully tune the hyperparameters of the MOEAs.
More work remains on the methodological front and on the supply chain environment setting.

Files

README.md
a10_bayesian.py
a6_re_env.py
a7_NN.py
a9b_.py

yueqiu2/Multi-objective_SCM
