Multi-Armed Bandit

Base

Bayesian Bandit

Bernoulli Bandit

Espilon Greedy

Gaussian

Gradient

Thmopson Sampling

Upper Confidence Bound