You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When training with different reward functions it's hard to compare 2 bots. A callback capable of running n games between current agent and another would prove useful to measure progress.
I will look into it but if someone knows how to do that help is welcome.
The text was updated successfully, but these errors were encountered:
One last thing to look at is this old example code that logged custom game information to the tensorboard. It's was a bit of a hackjob because of the agent having been reset before tensorboard grabs data: PR #86.
When training with different reward functions it's hard to compare 2 bots. A
callback
capable of runningn
games between current agent and another would prove useful to measure progress.I will look into it but if someone knows how to do that help is welcome.
The text was updated successfully, but these errors were encountered: