Release 0.3.0 · st-tech/zr-obp

This release enhances the OBP package in the following ways.

allowing evaluation policy to be stochastic, which makes the package more consistent with the formulation of OPE
adding some advanced estimation techniques such as cross-fitting and doubly robust with shrinkage
modifying examples to evaluate offline bandit policies (not online ones), which again makes the package more consistent with the formulation of OPE: https://github.com/st-tech/zr-obp/tree/master/examples
adding some slides: https://github.com/st-tech/zr-obp/tree/master/slides

Provide feedback