[feature request] callback returning wins (or winrate) vs another agent #100

kevinu3d · 2021-10-14T21:56:20Z

When training with different reward functions it's hard to compare 2 bots. A callback capable of running n games between current agent and another would prove useful to measure progress.

I will look into it but if someone knows how to do that help is welcome.

The text was updated successfully, but these errors were encountered:

glmcdona · 2021-10-16T04:41:48Z

Good idea.

This here is probably the closest existing code to it:

LuxPythonEnvGym/luxai2021/env/lux_env.py

Line 14 in c5d4c16

class SaveReplayAndModelCallback(BaseCallback):

And example usage:

LuxPythonEnvGym/examples/train.py

Line 114 in c5d4c16

SaveReplayAndModelCallback(

You can also look at the built-in eval callback:

LuxPythonEnvGym/examples/train.py

Line 136 in c5d4c16

EvalCallback(env_eval, best_model_save_path=f'./logs_{run_id}/',

One last thing to look at is this old example code that logged custom game information to the tensorboard. It's was a bit of a hackjob because of the agent having been reset before tensorboard grabs data: PR #86.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[feature request] callback returning wins (or winrate) vs another agent #100

[feature request] callback returning wins (or winrate) vs another agent #100

kevinu3d commented Oct 14, 2021

glmcdona commented Oct 16, 2021 •

edited

Loading

[feature request] callback returning wins (or winrate) vs another agent #100

[feature request] callback returning wins (or winrate) vs another agent #100

Comments

kevinu3d commented Oct 14, 2021

glmcdona commented Oct 16, 2021 • edited Loading

glmcdona commented Oct 16, 2021 •

edited

Loading