This repository has been archived by the owner on Feb 27, 2020. It is now read-only.

swasun / BanditProblem Public archive


A collection of implementations of the bandit problem.

A collection of implementations of the bandit problem (school project, Reinforcement Learning class, 2018).

Features

Bandits

Normal (Gaussian) multi-armed bandits
Bernoulli multi-armed bandits
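
To make the two bandit types concrete, here is a minimal sketch of what such environments might look like. It is not the code from this repository: the class names `BernoulliBandit` and `NormalBandit`, the `pull` method, and the `seed` and `sigma` parameters are all hypothetical, and the assumption that every normal arm shares one standard deviation is made only to keep the example short.

```python
import random


class BernoulliBandit:
    """Hypothetical K-armed bandit: arm i pays 1.0 with probability probs[i], else 0.0."""

    def __init__(self, probs, seed=None):
        if not all(0.0 <= p <= 1.0 for p in probs):
            raise ValueError("each probability must lie in [0, 1]")
        self.probs = list(probs)
        self.rng = random.Random(seed)  # private generator, so runs are reproducible

    @property
    def n_arms(self):
        return len(self.probs)

    def pull(self, arm):
        # Draw a single 0/1 reward from the chosen arm.
        return 1.0 if self.rng.random() < self.probs[arm] else 0.0


class NormalBandit:
    """Hypothetical K-armed bandit: arm i pays a draw from Normal(means[i], sigma)."""

    def __init__(self, means, sigma=1.0, seed=None):
        if sigma < 0:
            raise ValueError("sigma must be non-negative")
        self.means = list(means)
        self.sigma = sigma  # assumption: one shared standard deviation for all arms
        self.rng = random.Random(seed)

    @property
    def n_arms(self):
        return len(self.means)

    def pull(self, arm):
        # Draw a single real-valued reward from the chosen arm.
        return self.rng.gauss(self.means[arm], self.sigma)


if __name__ == "__main__":
    coins = BernoulliBandit([0.2, 0.5, 0.8], seed=42)
    gauss = NormalBandit([0.0, 1.0, 2.0], sigma=1.0, seed=42)
    print([coins.pull(2) for _ in range(5)])
    print([round(gauss.pull(2), 3) for _ in range(5)])
```

Both classes expose the same `n_arms` / `pull` interface, so an algorithm written against one can be run against the other, as long as it does not assume 0/1 rewards.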

Algorithms

Random bandit algorithm
Greedy bandit algorithm
Epsilon-greedy bandit algorithm
UCB (upper confidence bound) bandit algorithm
Thompson sampling algorithm

Using the code

See the notebook for examples of usage.

Credits

The notebook is from Valentin Emiya.
