Agents
Deep
A2C
DDPG
DQN
PPO1
VPG
TD3
SAC
Classical
Q-Learning
SARSA
Bandit
Contextual Bandit
Base
Bootstrap Neural
Fixed
Linear Posterior
Neural Greedy
Neural Linear Posterior
Neural Noise Sampling
Variational
Multi-Armed Bandit
Base
Bayesian Bandit
Bernoulli Bandit
Epsilon Greedy
Gaussian
Gradient
Thompson Sampling
Upper Confidence Bound