BestPolicyIdentificationMDP Python implementation of algorithms for Best Policy Identification in Markov Decision Processes From the papers: https://arxiv.org/pdf/2106.02847.pdf http://proceedings.mlr.press/v139/marjani21a/marjani21a.pdf Requirements To use the library you need the following libraries: cvxpy, jax, numpy. License MIT License