Wrong IS-MCTS value #9

WhiffleFish · 2023-05-05T15:16:49Z

The current IS-MCTS implementation takes the average of all samples, which yields consistently low exploitability estimates.

Averaging was chosen because the root node may not be an action node for the exploiting player.

For example, in Kuhn Poker, the first node is a chance node, so $\max Q$ can't be calculated at the root. Instead, we need to recursively search the tree for the first nodes belonging to the exploiting player, take $\max Q$ there, and weight the results by reach probabilities.
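
A rough Python sketch of the proposed fix, not the package's actual code. The `Node` type, the `frontier_reach` and `root_value` helpers, and the Q values are all hypothetical and only illustrate the idea. The sketch assumes Q estimates are stored per information set and that every path from the root reaches an exploiting-player node before terminating.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    # Hypothetical tree node: player is "chance", "opponent", or "exploiter".
    player: str
    infoset: str = ""  # information-set key, used only for exploiter nodes
    children: list = field(default_factory=list)  # (probability, Node) pairs


def frontier_reach(node, reach=1.0, out=None):
    """Sum the reach probability of each first exploiter infoset below node."""
    if out is None:
        out = {}
    if node.player == "exploiter":
        # Stop at the first exploiter node on this path and record its reach.
        out[node.infoset] = out.get(node.infoset, 0.0) + reach
        return out
    for prob, child in node.children:
        # Chance and opponent nodes only scale the reach probability.
        frontier_reach(child, reach * prob, out)
    return out


def root_value(root, q):
    """Reach-weighted sum of max_a Q(I, a) over the frontier infosets."""
    return sum(r * max(q[i].values()) for i, r in frontier_reach(root).items())


# Toy Kuhn-like root: chance deals J, Q, or K to the exploiter uniformly.
root = Node("chance", children=[
    (1 / 3, Node("exploiter", infoset=card)) for card in ("J", "Q", "K")
])

# Made-up Q estimates per infoset and action, for illustration only.
q = {
    "J": {"check": -1.0, "bet": -0.5},
    "Q": {"check": 0.0, "bet": -0.2},
    "K": {"check": 0.5, "bet": 1.0},
}

value = root_value(root, q)
naive = sum(v for qa in q.values() for v in qa.values()) / 6  # plain average
```

With these made-up numbers, the plain average over all Q entries comes out lower than the reach-weighted maximum, which matches the direction of the bias described above.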
