GameTheory_CFR

Counterfactual Regret Minimization

1. cfr_practice

This is a python version of some examples code from An Introduction to Counterfactual Regret Minimization. Including Rock-Paper-Scissors(RPS), Kuhn Poker, Liar’s Dice.

1.1 Paper outline

Regret Matching and Minimization
- Worked example: RPS regret minimization (Fixed opponent strategy)
- Exercise: RPS equilibrium, Colonel Blotto
Counterfactual Regret Minimization (CFR)
- Worked example: Kuhn Poker equilibrium
- Exercise: 1-die-versus-1-die Dudo
Fixed-Strategy Iteration Counterfactual Regret Minimization (FSICFR)
- Worked example: Liar Die
- Exercise: 1-die-versus-1-die Dudo with 3 claim memory limit
Exploiting Mistakes (Opponent)
- Exercise: Perturbed Liar Die

1.2 RPS

Rock-Scissors-Paper (RPS) is a two-player game where players each simultaneously make one of three gestures: rock (a closed fist), paper (an open face-down palm), or scissors (exactly two fingers extended).

	R	P	S
R	0, 0	-1,1	1,-1
P	1,-1	0, 0	-1,1
S	-1,1	1,-1	0, 0

1.3 Kuhn Poker

Kuhn Poker is a simple 3-card poker game by Harold E. Kuhn. Two players each ante 1 chip, i.e. bet 1 chip blind into the pot before the deal. Three cards, marked with numbers 1, 2, and 3, are shuffled, and one card is dealt to each player and held as private information.

Sequential Actions: Player 1 $\rightarrow$ Player 2 $\rightarrow$ Player 1

Player 1	Player 2	Player 1	Payoff
pass	pass		+1 to player with higher card
pass	bet	pass	+1 to player 2
pass	bet	bet	+2 to player with higher card
bet	pass		+1 to player 1
bet	bet		+2 to player with higher card

Kuhn Poker Node

graph TD
R((root)) --> |1:2 or 1:3| A{1}
R((root)) --> |2:1 or 2:3| B{2}
R((root)) --> |3:1 or 3:2| C{3}

A --> |P| A1{1P}
A --> |B| A2{1B}
B --> |P| B1{2P}
B --> |B| B2{2B}
C --> |P| C1{3P}
C --> |B| C2{3B}

A1 --> |P| A11( -1)
A1 --> |B| A12{1PB}
A2 --> |P| A21(1)
A2 --> |B| A22( -2)
A12 --> |P| A121( -1)
A12 --> |B| A122( -2)

B1 --> |P| B11(1/-1)
B1 --> |B| B12{2PB}
B2 --> |P| B21(1)
B2 --> |B| B22(2/-2)
B12 --> |P| B121( -1)
B12 --> |B| B122(2/-2)

C1 --> |P| C11(1)
C1 --> |B| C12{3PB}
C2 --> |P| C21( -1)
C2 --> |B| C22(2)
C12 --> |P| C121(1)
C12 --> |B| C122(2)

1.4 Liar Die

Dudo is a bluffing dice game thought to originate from the Inca Empire circa 15th century. Many variations exist in both folk and commercial forms. The ruleset we use from is perhaps the simplest representative form, and is thus most easily accessible to both players and researchers. Liar's Dice, Bluff, Call My Bluff, Perudo, Cacho, Cachito are names of variations. For detailed rules, please read Liar's Dice Rules.

1-Die-Versus-1-Die Dudo (2 players with 1 die each) We are limited to two 6-sided dice, yielding 13 possible actions: 12 claims of 1 or 2 of 6 different ranks, plus the doubting "dudo" action. Let us index claims in increasing order of strength starting at 0, and let the "dudo" action have index 12.

Strength $s(n,r)$ 0 1 2 3 4 5 6 7 8 9 10 11

Claim $n × r$ $1×2$ $1×3$ $1×4$ $1×5$ $1×6$ $1×1$ $2×2$ $2×3$ $2×4$ $2×5$ $2×6$ $2×1$

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
cfr_practice		cfr_practice
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GameTheory_CFR

1. cfr_practice

1.1 Paper outline

1.2 RPS

1.3 Kuhn Poker

1.4 Liar Die

About

Releases

Packages

Languages

License

yanxinyi620/GameTheory_CFR

Folders and files

Latest commit

History

Repository files navigation

GameTheory_CFR

1. cfr_practice

1.1 Paper outline

1.2 RPS

1.3 Kuhn Poker

1.4 Liar Die

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages