davidcotton/bandits

# Bandits Playground

Taking inspiration from Sutton & Barto, this repo is a place to experiment with bandit algorithms.

## Included Bandit Algorithms

See `policies.py`:

- Random
- Greedy
- Epsilon Greedy
- Softmax
- UCB1
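The policies differ only in how they pick an arm from the current value estimates. As a rough sketch of two of them (function names and signatures here are illustrative, not the actual interface in `policies.py`):

```python
import math
import random

def epsilon_greedy(q_values, epsilon=0.1):
    """With probability epsilon explore a random arm, otherwise exploit
    the arm with the highest estimated value."""
    if random.random() < epsilon:
        return random.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])

def ucb1(q_values, counts, t):
    """Pick the arm maximising q + sqrt(2 ln t / n), trying each
    untried arm once first."""
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    return max(range(len(q_values)),
               key=lambda a: q_values[a] + math.sqrt(2 * math.log(t) / counts[a]))
```

Epsilon-greedy explores at a fixed rate, while UCB1's bonus term shrinks as an arm is pulled more often, so exploration concentrates on uncertain arms.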

## Requirements

- Python 3.10
- Poetry

## Installation

1. Use Poetry to install the dependencies:

   ```shell
   poetry install
   ```

## Usage

Run the experimenter from `main.py` with command-line arguments:

```shell
poetry run python main.py {options}
```

For example:

```shell
poetry run python main.py --nb_bandits=100 --bandit_type=gaussian --steps=500
```

## Options

| Flag | Parameters | Description | Required | Default Value |
|------|------------|-------------|----------|---------------|
| `nb_bandits` | int | The number of bandit arms | N | 10 |
| `bandit_type` | {bernoulli, gaussian} | How the bandit distributes rewards | N | bernoulli |
| `steps` | int | How many steps to train for | N | 1000 |
| `trials` | int | How many times to repeat the experiment | N | 5 |
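The `bandit_type` flag selects the reward distribution of each arm: a Bernoulli arm pays 1 with some fixed probability, while a Gaussian arm draws a real-valued reward from a normal distribution. A minimal sketch of the two (the class names here are illustrative, not the repo's actual API):

```python
import random

class BernoulliArm:
    """Pays 1 with probability p, otherwise 0."""
    def __init__(self, p):
        self.p = p
    def pull(self):
        return 1 if random.random() < self.p else 0

class GaussianArm:
    """Pays a reward drawn from a normal distribution N(mu, sigma^2)."""
    def __init__(self, mu, sigma=1.0):
        self.mu = mu
        self.sigma = sigma
    def pull(self):
        return random.gauss(self.mu, self.sigma)
```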
