thoughts about AI difficulty #45
Replies: 1 comment
-
I believe that's because the level (sm_level) effectively starts at 1, so setting m_max_ply <= 1 will result in it always using the "QAI". You need to start at ply 2 to use the BMAI.
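In effect, the selection boils down to something like this (illustrative Python, not the actual C++ source; in the engine the check involves sm_level and m_max_ply):

```python
def choose_engine(max_ply: int) -> str:
    # The level counter effectively starts at 1, so a max ply of 0 or 1
    # never gets past the naive scorer -- which is why ply0 and ply1
    # behave identically.
    level = 1
    if max_ply <= level:
        return "QAI"   # naive one-move scoring only
    return "BMAI"      # simulation-based search, from ply 2 upward
```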
Maybe not significant, but theoretically it should result in an improvement. However, there could be a tradeoff if you need to reduce the number of simulations when going to ply 4, such that you get better results with ply 3 and more simulations. That said, compute speeds are much better than they used to be, so it is worth trying ply 4.
Absolutely - basic parameters to try are the 4 you list: ply, min_sims, max_sims and max_branch. I was trying different values a while ago, and "Notes.txt" has some of the results.
That (admittedly very old) data suggested that ply 3 with a large max_branch gave the best performance, doing better than ply 4 with a small max_branch. However, it would be interesting to rerun the experiment (and also track a time variable).
I like the idea of running multiple bots to try different parameters (or AIs). However, as noted above, ply 1 is probably just running the QAI, which is naive. It would still be interesting to see the performance data, though, as a baseline for how a naive strategy does.

ASIDE: check out BMC_Parser::PlayFairGames(), which tests different basic AI strategies (command "playfair [games] [mode] [p]").

Running ply 2/3/4, I suspect you'll find that you get the best improvements by tweaking things like max_branch or max_sims (or even min_sims).
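If you do rerun the sweep, a small harness along these lines makes it easy to record the win rate and wall time for each setting. This is only a sketch: apart from --ply, the flag names and the result parsing are placeholders and would need to match whatever the binary actually accepts.

```python
import itertools
import subprocess
import time

BMAI_BIN = "./bmai"  # path to the BMAI binary (placeholder)

# Parameter grid to sweep; flag names other than --ply are hypothetical.
GRID = {
    "--ply": [2, 3, 4],
    "--max-branch": [400, 3000],
    "--max-sims": [100, 500],
}

def run_trial(settings):
    args = [BMAI_BIN]
    for flag, value in settings.items():
        args += [flag, str(value)]
    start = time.time()
    proc = subprocess.run(args, capture_output=True, text=True)
    elapsed = time.time() - start
    # Parsing a win rate out of stdout is left as a placeholder, since the
    # output format isn't shown in this thread.
    return {"settings": settings, "seconds": elapsed, "stdout": proc.stdout}

if __name__ == "__main__":
    keys = list(GRID)
    for combo in itertools.product(*GRID.values()):
        result = run_trial(dict(zip(keys, combo)))
        print(result["settings"], f"{result['seconds']:.1f}s")
```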
The best AI would be to replace the QAI (used for scoring positions) with a machine learning model that takes the game state as input and outputs a probability of winning for each move. We could still use Monte Carlo simulations, since "lower level" states are going to be more reliable.
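As a very rough sketch of that idea (the state encoding, the model, and the way it would plug into the search are all placeholders; nothing like this exists in the repo):

```python
import numpy as np

class LearnedEvaluator:
    """Stand-in for a trained model mapping an encoded game state to a win
    probability. In practice this would be a network trained on completed
    games; here it is just a fixed logistic scorer."""

    def __init__(self, n_features: int):
        self.weights = np.zeros(n_features)  # would come from training

    def win_probability(self, state_features: np.ndarray) -> float:
        return 1.0 / (1.0 + np.exp(-float(self.weights @ state_features)))

def score_moves(evaluator, features_per_move):
    """Replace the QAI's heuristic score with the model's win probability for
    the position reached after each candidate move. Shallow Monte Carlo
    rollouts could still be layered on top where the model is uncertain."""
    return [evaluator.win_probability(f) for f in features_per_move]

# Example usage with dummy encodings for three candidate moves.
if __name__ == "__main__":
    ev = LearnedEvaluator(n_features=8)
    print(score_moves(ev, [np.random.rand(8) for _ in range(3)]))
```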
-
The username I use on the Button Weavers website is BMAIBagels. I use ply 3, but have just today started tinkering with ply 4. Here are the settings used on most games up until now: I have the BMAIBagels Python bridge run the BMAI binary for a max of 1 hour. If it can't find a solution in that time, I rerun with a lower --ply and keep looping that logic until ply=0. BMAIBagels has completed 1,374 games with a win rate of 62%.
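In pseudocode terms the loop is something like this (simplified; the binary path and the way the game state is passed in are placeholders, not my actual bridge code):

```python
import subprocess

BMAI_BIN = "./bmai"     # path to the BMAI binary (placeholder)
TIME_LIMIT = 60 * 60    # one hour per attempt

def solve_with_fallback(start_ply=4):
    """Try the strongest search first; on timeout, drop the ply and retry
    until ply reaches 0."""
    for ply in range(start_ply, -1, -1):
        try:
            proc = subprocess.run(
                [BMAI_BIN, "--ply", str(ply)],
                capture_output=True, text=True, timeout=TIME_LIMIT,
            )
            return proc.stdout
        except subprocess.TimeoutExpired:
            continue  # no answer within the hour; fall back to a lower ply
    return None
```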
Could increasing from ply 3 to ply 4 have a significant impact on win rate?
Is there a difference between ply 0 and ply 1? They seem to output the same logs and the same results.
I'm thinking of running a second bot that is maybe easier, BMAIDonuts (softer and sweeter). Do you think a ply 0 or ply 1 bot would be significantly less optimized, and as a result more likely to be beaten? I'm wondering how viable it is to have an easier AI running on the site that can calculate moves much faster, but also have a ply 4 bot running that takes longer yet finds those better long-term moves and ekes out more wins.
Any thoughts you have on the matter would be helpful, @pappde. Maybe ply isn't the only place I should be adjusting numbers? Maybe a different AI altogether.