norpac-ai

a test neural network for the board game Northern Pacific

NOTE: this is a bit outdated as of April 2024, since i'm working on an AlphaZero-style MCTS network. run mcts.py and it should start training with its configured parameters, but you can't play against it yet

requirements

numpy
pytorch
pygame (optional)

for the older stuff you will also need numba

how to use

run pytorchtest.py to start training. hyperparameters are constants near the top of the file. checkpoints are saved to checkpoints/ every 20 generations by default

the network structure is semi-rainbow DQN

to see progress, ui.py lets you play against the AI. you might need to figure this one out yourself for now, since the setup is janky.

there are three kinds of opponents:

normal AI: takes the top-valued action
distrib AI: samples an action using the values as a distribution
top5random AI: takes a random action from the top 5 highest-valued actions
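a rough sketch of how the three opponent types could pick an action from a vector of action values. this is not the code in ui.py: the function names are hypothetical, turning values into a distribution with a softmax is an assumption (the README doesn't say how it's done), and masking of illegal moves is left out.

```python
import numpy as np


def pick_greedy(values):
    """normal AI: index of the highest-valued action."""
    values = np.asarray(values, dtype=float)
    return int(np.argmax(values))


def pick_from_distribution(values, rng):
    """distrib AI: sample an action index, weighting by value.

    assumption: values are converted to probabilities with a softmax.
    """
    values = np.asarray(values, dtype=float)
    shifted = values - np.max(values)  # subtract max for numerical stability
    probs = np.exp(shifted)
    probs /= probs.sum()
    return int(rng.choice(len(values), p=probs))


def pick_top_k_random(values, rng, k=5):
    """top5random AI: uniform random choice among the k highest-valued actions."""
    values = np.asarray(values, dtype=float)
    k = min(k, len(values))  # handle fewer than k available actions
    top_indices = np.argsort(values)[-k:]
    return int(rng.choice(top_indices))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    example_values = [0.1, 2.0, -1.0, 0.5, 1.5, 0.3, 0.9]
    print("greedy:", pick_greedy(example_values))
    print("distrib:", pick_from_distribution(example_values, rng))
    print("top5random:", pick_top_k_random(example_values, rng))
```

the greedy opponent is the strongest but always plays the same way from a given position; the other two trade some strength for variety.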

todo

clean up code
document a bit more
implement distributional DQN
implement noise layer
add more checkpoints
decouple game logic from AIs
optimize code
add cuda alternative for pytorchtest or make it work for both
make all the variable/function names more comprehensible to anyone that's not me
revise reward function; probably don't penalize losing
let ui.py track & save an experience buffer for further training
