[OSPP-Week2] Check the kuhn_nfsp experiment and read papers
#342
Replies: 3 comments 4 replies
-
Besides, as we mentioned before, the Julia version of KuhnPokerEnv may need to implement the |
-
For now, nash_conv can be used in the kuhn_nfsp experiment, although its result looks bad. This week, I will do the following:
|
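For context on what nash_conv measures, here is a minimal sketch in Python. It is not the RL.jl implementation, which works on the extensive-form game; it assumes a two-player normal-form game given as payoff matrices, and the function names are hypothetical. NashConv sums, over both players, how much each could gain by switching to a best response while the opponent's strategy stays fixed, so a value of 0 means the strategy profile is a Nash equilibrium and a large value is what a "bad" result would look like.

```python
def expected_payoff(payoff, row_strategy, col_strategy):
    """Expected payoff of one player under the given mixed strategies."""
    return sum(
        row_strategy[i] * col_strategy[j] * payoff[i][j]
        for i in range(len(payoff))
        for j in range(len(payoff[0]))
    )


def nash_conv(row_payoff, col_payoff, row_strategy, col_strategy):
    """Sum of each player's gain from deviating to a best response."""
    n_rows, n_cols = len(row_payoff), len(row_payoff[0])

    # Value each player currently gets under the joint strategy.
    row_value = expected_payoff(row_payoff, row_strategy, col_strategy)
    col_value = expected_payoff(col_payoff, row_strategy, col_strategy)

    # Best pure response of the row player against the fixed column strategy.
    best_row = max(
        sum(col_strategy[j] * row_payoff[i][j] for j in range(n_cols))
        for i in range(n_rows)
    )
    # Best pure response of the column player against the fixed row strategy.
    best_col = max(
        sum(row_strategy[i] * col_payoff[i][j] for i in range(n_rows))
        for j in range(n_cols)
    )
    return (best_row - row_value) + (best_col - col_value)


# Matching pennies: the row player wins on a match, the column player otherwise.
ROW_PAYOFF = [[1.0, -1.0], [-1.0, 1.0]]
COL_PAYOFF = [[-1.0, 1.0], [1.0, -1.0]]

uniform_gap = nash_conv(ROW_PAYOFF, COL_PAYOFF, [0.5, 0.5], [0.5, 0.5])
pure_gap = nash_conv(ROW_PAYOFF, COL_PAYOFF, [1.0, 0.0], [1.0, 0.0])
```

In this toy game the uniform strategies form the equilibrium, while a profile where both players always pick the first action leaves the column player a profitable deviation.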
-
Task list for adding the NFSPAgent structure:
New progress is in #402.
-
@findmyway
Hi, I am a student in the OSPP project "Implement Multi-Agent Reinforcement Learning Algorithms in Julia". This summer, my main task is to implement some multi-agent RL algorithms (such as NFSP and PSRO) in RLZoo and add related experiments in RLExperiment. Below are my current progress and this week's plan.
Current Progress
I have implemented a rough version of the Neural Fictitious Self-Play (NFSP) algorithm and tested it on the Kuhn Poker game (KuhnPokerEnv). See my project for more details.
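As a rough illustration of the core NFSP step, the sketch below shows how an agent picks between its two policies on each move. It is written in Python as a language-neutral sketch and is not the code in my project; the names nfsp_act, eta and epsilon are hypothetical. With probability eta (the anticipatory parameter) the agent follows its epsilon-greedy best-response policy derived from Q-values, otherwise it samples from its average policy.

```python
import random


def epsilon_greedy(q_values, epsilon, rng):
    """Best-response policy: explore uniformly with probability epsilon, else act greedily."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


def sample_from(probs, rng):
    """Draw one action index from a probability vector."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def nfsp_act(q_values, avg_policy_probs, eta, epsilon, rng):
    """Return (action, used_best_response).

    The flag matters for training: in NFSP only the actions chosen by the
    best-response policy are stored for supervised learning of the average
    policy, while every transition goes to the RL replay memory.
    """
    if rng.random() < eta:
        return epsilon_greedy(q_values, epsilon, rng), True
    return sample_from(avg_policy_probs, rng), False


rng = random.Random(0)
# eta = 1 forces the best-response branch; epsilon = 0 makes it purely greedy.
greedy_action, used_br = nfsp_act([0.2, 0.9, 0.1], [1.0, 0.0, 0.0], 1.0, 0.0, rng)
# eta = 0 forces the average-policy branch.
avg_action, used_br_avg = nfsp_act([0.2, 0.9, 0.1], [0.0, 1.0, 0.0], 0.0, 0.0, rng)
```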
This Week's Plan
Use nash_conv to evaluate the experiment: I need to add a prob function to QBasedPolicy for some specific stages. For now, nash_conv is not available for QBasedPolicy.
Organize the current code and add it to the corresponding directories in RL.jl.
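To make the first item more concrete, here is a minimal sketch of what such a prob function could return. It is written in Python, it is not the RL.jl API, and it assumes the Q-based policy explores with epsilon-greedy over the legal actions; epsilon_greedy_probs and its parameters are hypothetical names. Nash-conv style evaluation needs the full action distribution at each state rather than a single sampled action, which is why the Q-values have to be turned into probabilities.

```python
def epsilon_greedy_probs(q_values, legal_actions, epsilon):
    """Action probabilities of an epsilon-greedy policy restricted to legal actions.

    Each legal action receives epsilon / n from uniform exploration, and the
    legal action with the highest Q-value receives the remaining 1 - epsilon.
    Illegal actions keep probability 0.
    """
    probs = [0.0] * len(q_values)
    share = epsilon / len(legal_actions)
    for action in legal_actions:
        probs[action] = share
    best = max(legal_actions, key=lambda a: q_values[a])
    probs[best] += 1.0 - epsilon
    return probs


# Action 1 has the highest Q-value but is illegal, so action 2 is the greedy choice.
example_probs = epsilon_greedy_probs([1.0, 3.0, 2.0], [0, 2], 0.1)
```

Setting epsilon to 0 gives the purely greedy distribution, which may be the more natural choice when evaluating a trained policy.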