Add num_rounds and num_chips to poker envs #1109

jjshoots · 2023-09-27T18:00:08Z

Description

This allows the poker envs to run multiple rounds per episode and to keep a persistent chip count across the whole episode.

This is accomplished through the `num_rounds` and `num_chips` parameters, and bootstraps off the MultiEpisodeWrappers.
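As a rough illustration of the idea (not the code in this PR), the sketch below carries a chip count across several rounds. `ChipLedger` and `finish_round` are hypothetical names made up for this example; only `num_rounds` and `num_chips` come from the description above, and the per-round rewards are assumed to be expressed directly in chips.

```python
from dataclasses import dataclass, field


@dataclass
class ChipLedger:
    """Hypothetical helper: carries each agent's chips across rounds."""

    agents: list
    num_chips: int   # chips every agent starts the episode with
    num_rounds: int  # rounds played before the episode ends
    rounds_played: int = 0
    chips: dict = field(init=False)

    def __post_init__(self):
        # Every agent starts with the same stack.
        self.chips = {agent: self.num_chips for agent in self.agents}

    def finish_round(self, rewards):
        # Assumption: each round's reward is a chip delta for that agent.
        for agent, reward in rewards.items():
            self.chips[agent] += reward
        self.rounds_played += 1

    @property
    def done(self):
        # The episode ends once the requested number of rounds has been played.
        return self.rounds_played >= self.num_rounds


ledger = ChipLedger(agents=["player_0", "player_1"], num_chips=100, num_rounds=2)
ledger.finish_round({"player_0": 10, "player_1": -10})
ledger.finish_round({"player_0": -4, "player_1": 4})
```

After the two rounds above, `ledger.done` is true and the stacks reflect the sum of both rounds rather than being reset between them.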

pettingzoo/classic/rlcard_envs/texas_holdem.py

elliottower · 2023-09-27T18:08:22Z

pettingzoo/utils/wrappers/multi_episode_env.py

@@ -13,21 +13,30 @@ class MultiEpisodeEnv(BaseWrapper):
    When there are no more valid agents in the underlying environment, the environment is automatically reset.
    After `num_episodes` have been run internally, the environment terminates normally.
    The result of this wrapper is that the environment is no longer Markovian around the environment reset.
+
+    When `starting_utility` is used, all agents start with a base amount of health points (think of this as poker chips).


This is an interesting way to do it, but I'm not sure this is the best name for it. I can't really think of anything else besides `starting_reward` or something similar. Maybe it could be `total_rewards`, to indicate that it makes the rewards carry over between resets, or `tally_rewards`. `starting_utility` doesn't mean anything to me unless I already know how it works. It would make sense and be simple if `total_rewards` were added to or subtracted from each round.

I think utility makes more sense: it is a pretty standard way of saying "starting amount of meaningful substance". `starting_reward` could be confused with a reward given to the agent for starting a new episode, while `tally_rewards` sounds like it should be a boolean.

pettingzoo/utils/wrappers/multi_episode_env.py

elliottower

Looks mostly good, but I will test it locally as well to see what the values are. I believe that since you start with 100 chips, and losing a round after going all in gives a reward of -100, the reward should simply represent the number of chips you lost. But I'm not 100% sure that's how the rewards for Texas Hold'em work in our envs.

elliottower · 2023-09-27T18:14:58Z

OK, so it looks like the rewards are +(raised chips)/2 for the winner and -(raised chips)/2 for the loser. I'm not sure where the divide by 2 comes from. I'm assuming raised chips is the amount that you raised, or maybe it's the total pot divided by 2? What if there are more than 2 players? Maybe you could look into this and see whether it makes sense, or whether RLCard's paper or website discusses it.
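To make the concern concrete, here is a small arithmetic sketch of why the divide by 2 matters for a persistent chip count. `chips_after_round` and `chips_per_reward_unit` are hypothetical names for this example, and the halved reward is only the behaviour observed above, not a confirmed description of how RLCard computes payoffs.

```python
def chips_after_round(num_chips, reward, chips_per_reward_unit=1.0):
    """Hypothetical helper: apply one round's reward to a chip stack.

    chips_per_reward_unit is an assumed conversion factor from reward
    units back to chips (1.0 means the reward already is a chip count).
    """
    return num_chips + reward * chips_per_reward_unit


# Case 1: the reward equals the chips lost, so an all-in loss empties the stack.
direct = chips_after_round(100, -100)

# Case 2: the reward is (raised chips) / 2 and is applied unscaled,
# so the tracked stack is off by the same factor.
halved = chips_after_round(100, -50)

# Case 3: the same halved reward, scaled back up before it is applied.
rescaled = chips_after_round(100, -50, chips_per_reward_unit=2)
```

If the rewards really are halved, the chip tally would need a conversion like the third case, otherwise a player who lost everything would still appear to hold half a stack.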

jjshoots · 2023-09-27T20:31:38Z

Welp this was a dumb idea, closing.

elliottower · 2023-09-27T20:36:11Z

(We’re going to do this specific to poker instead, as it sort of breaks other envs when you remove agents on reset)

jjshoots and others added 19 commits May 6, 2023 20:13

ew

fad28c3

remove printline

ee51fb4

precommit

f0cd4b2

change inline to multiline

4f6545d

I have no idea why this works

02cf1d6

Remove rogue print

d4e2028

Merge branch 'Farama-Foundation:master' into master

4e06b0f

Merge branch 'Farama-Foundation:master' into master

cb9671c

add multi episode thing

53d15fd

fix doc

93036cb

update doc

96cfac9

futures

88817b1

precommit

1fd262e

more futures

8506bdc

add back todo

c53b8df

Remove unused 'legal_moves' key in info for rlcard envs

77de93e

Change action space seeding so that each round goes to the river and …

e58ccb6

…includes betting (can be sure rendering works)

Merge branch 'Farama-Foundation:master' into master

a273289

add number of chips for number of rounds for poker

cb1825a

elliottower reviewed Sep 27, 2023

pettingzoo/classic/rlcard_envs/texas_holdem.py Outdated

elliottower reviewed Sep 27, 2023

pettingzoo/utils/wrappers/multi_episode_env.py Outdated

1.24.2

e3520ee

elliottower reviewed Sep 27, 2023

pettingzoo/utils/wrappers/multi_episode_env.py Outdated

optinal and change sign

2370135

elliottower approved these changes Sep 27, 2023

jjshoots closed this Sep 27, 2023
