A collection of implementations of the bandit problem (school project, Reinforcement Learning class, 2018).
- Normal multi-armed bandits
- Bernouilli multi-armed bandits
- Random bandit algorithm
- Greedy bandit algorithm
- Epsilon greedy bandit algorithm
- UCB bandit algorithm
- Thompson sampling algorithm
See the notebook for examples of usage.
The notebook is from Valentin Emiya.