Noise Benchmarking, LSTM sb3_zoo hyperpar, Issue #1 #9

felimomo · 2024-03-05T19:31:35Z

In this branch:

I added the ability to manually tune the system noise (Asm().sigma) with a script input, added a bash script to train agents for several values of noise simultaneously.
I added RecurrentPPO hyperparams from sb3_zoo, and added the fn sb3_train_v2 which is tailored to reading these hyperparams from yaml.
I fixed Issue#1
I updated the observation and harvest processes to be relative to vulnerable populations rather than to the entire population.

Next steps: sb3_train_v2 is more general than sb3_train, however its syntax is a bit more involved and might need refactoring in order to reach sb3_train's level of clarity. I'll leave this for the next pull request -- the code is fully functional right now, although in a 'quick-n-dirty' state in some parts.

…nor changes to envs

…py now accepts new yaml syntax & old (TBD: re-standardize)

…sheries into benchmarking

felimomo · 2024-03-05T23:42:32Z

Converted to draft to solve Issue #1

…r Asm2o.observe()

felimomo · 2024-03-06T21:42:04Z

Updated based on our meeting! Also, side note: now self.vulb is updated at the observation using the new (post-harvest, post-growth) system state.

cboettig

@felimomo fantastic work here! A few comments in line. Basically I think let's just add the harvest_vul in now as well, since it looks odd to have something explicitly called survey_vul being used inside the harvest function, and then merge this.

I think we're due for a refactor of the Asm code, so that it's easier to read and easier to customize. (most importantly, asm_2o should re-use all the code except for an override observation method, and similarly for escapement with an override step method I think). but let's tackle that in a separate PR.

hyperpars/ppo-asm-v0-1.yml

hyperpars/ppo-asm2o-v0-1.yml

hyperpars/rppo-asm2o.yml

scripts/benchmark_noise.sh

cboettig · 2024-03-06T23:42:16Z

src/rl4fisheries/envs/asm.py

@@ -174,7 +173,7 @@ def initialize_population(self):

        # leading array calculations to get vul-at-age, wt-at-age, etc.
        for a in range(0, p["n_age"], 1):
-            vul[a] = 1 / (1 + np.exp(-p["asl"] * (p["ages"][a] - p["ahv"])))
+            survey_vul[a] = 1 / (1 + np.exp(-p["asl"] * (p["ages"][a] - p["ahv"])))


once we have a harvest_vul, it's going to need to be initialized here too. Presumably it's identical but has different values for p["asl"] and p["ahv"]. (We'll need @CarlJwalters or @ChrisFishCahill to figure out some reasonable choices there)

src/rl4fisheries/envs/asm.py

cboettig · 2024-03-06T23:43:49Z

src/rl4fisheries/envs/asm.py

-            self.abar = sum(p["vul"] * np.array(p["ages"]) * n) / sum(n)
-            self.wbar = sum(p["vul"] * n * p["wt"]) / sum(n * p["wt"])
+            self.abar = sum(p["survey_vul"] * np.array(p["ages"]) * n) / sum(n)
+            self.wbar = sum(p["survey_vul"] * n * p["wt"]) / sum(n * p["wt"])


same as above, eventually this should be harvest_vul

src/rl4fisheries/envs/asm_2o.py

src/rl4fisheries/utils/sb3.py

felimomo and others added 9 commits March 4, 2024 23:41

handier customization

439450a

updated yamls, added noise benchmarking scripts, updated train fn, mi…

5e97bb6

…nor changes to envs

minibugs

17a9364

script

55a64a3

new sb3_train util for LSTMs, minichanges to LSTM yaml (RPPO), train.…

e4eb9cd

…py now accepts new yaml syntax & old (TBD: re-standardize)

tensorboard log inside algo_config now

8406b73

Merge branch 'main' into benchmarking

b25ad63

cleaned up yaml

107586f

Merge branch 'benchmarking' of https://github.com/boettiger-lab/rl4fi…

abda692

…sheries into benchmarking

felimomo requested a review from cboettig March 5, 2024 19:49

felimomo added 4 commits March 5, 2024 21:42

added more hyperparams sets for lstms

ad1b25c

mini

b1e0ea5

mini

68b353e

mini

4d30d3d

felimomo marked this pull request as draft March 5, 2024 23:41

felimomo added 2 commits March 5, 2024 23:45

resolving issue #1

872e53f

typos

b44b35b

felimomo marked this pull request as ready for review March 5, 2024 23:51

felimomo mentioned this pull request Mar 5, 2024

Bug in vulnerable biomass equation #8

Closed

felimomo changed the title ~~Noise Benchmarking, adding LSTM training with sb3_zoo hyperparams~~ Noise Benchmarking, LSTM sb3_zoo hyperpar, Issue #1 Mar 5, 2024

added hyperparam-associated ids in rppo yaml

c629100

felimomo marked this pull request as draft March 6, 2024 21:01

felimomo added 3 commits March 6, 2024 21:01

rppo yaml

2a5cc70

vulnerable population observations

bdba0e9

vulnerable biomass updated at Env.observe(), included nicer syntax fo…

87f2972

…r Asm2o.observe()

felimomo added 3 commits March 6, 2024 21:45

minibug

b89a51e

minibug

630a911

vul -> survey_vul (in anticipation of having a separate fishing_vul)

f568682

cboettig requested changes Mar 6, 2024

View reviewed changes

felimomo added 2 commits March 7, 2024 00:09

paths in yaml files

be9b990

harvest vul

2c2284c

cboettig marked this pull request as ready for review March 7, 2024 00:19

cboettig approved these changes Mar 7, 2024

View reviewed changes

cboettig merged commit bc0a45d into main Mar 7, 2024
2 checks passed

felimomo deleted the benchmarking branch March 18, 2024 19:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Noise Benchmarking, LSTM sb3_zoo hyperpar, Issue #1 #9

Noise Benchmarking, LSTM sb3_zoo hyperpar, Issue #1 #9

felimomo commented Mar 5, 2024 •

edited

Loading

felimomo commented Mar 5, 2024

felimomo commented Mar 6, 2024

cboettig left a comment

cboettig Mar 6, 2024

cboettig Mar 6, 2024

Noise Benchmarking, LSTM sb3_zoo hyperpar, Issue #1 #9

Noise Benchmarking, LSTM sb3_zoo hyperpar, Issue #1 #9

Conversation

felimomo commented Mar 5, 2024 • edited Loading

felimomo commented Mar 5, 2024

felimomo commented Mar 6, 2024

cboettig left a comment

Choose a reason for hiding this comment

cboettig Mar 6, 2024

Choose a reason for hiding this comment

cboettig Mar 6, 2024

Choose a reason for hiding this comment

felimomo commented Mar 5, 2024 •

edited

Loading