latest
User Guide
About
Tutorials
API
Agents
A2C
DDPG
DQN
PPO1
VPG
TD3
SAC
Q-Learning
SARSA
Contextual Bandit
Multi-Armed Bandit
Base
Bayesian Bandit
Bernoulli Bandit
Espilon Greedy
Gaussian
Gradient
Thmopson Sampling
Upper Confidence Bound
Common
Environments
Utilities
Trainers
Core
Torchmm
Docs
»
Agents
»
Multi-Armed Bandit
Edit on GitHub
Multi-Armed Bandit
¶
Base
¶
Bayesian Bandit
¶
Bernoulli Bandit
¶
Espilon Greedy
¶
Gaussian
¶
Gradient
¶
Thmopson Sampling
¶
Upper Confidence Bound
¶
Read the Docs
v: latest
Versions
latest
stable
Downloads
pdf
html
epub
On Read the Docs
Project Home
Builds
Free document hosting provided by
Read the Docs
.