Logo
latest

User Guide

  • About
  • Tutorials

API

  • Agents
    • A2C
    • DDPG
    • DQN
    • PPO1
    • VPG
    • TD3
    • SAC
    • Q-Learning
    • SARSA
    • Contextual Bandit
    • Multi-Armed Bandit
  • Common
  • Environments
  • Utilities
  • Trainers
  • Core
Torchmm
  • Docs »
  • Agents
  • Edit on GitHub

AgentsΒΆ

Deep

  • A2C
  • DDPG
  • DQN
  • PPO1
  • VPG
  • TD3
  • SAC

Classical

  • Q-Learning
  • SARSA

Bandit

  • Contextual Bandit
    • Base
    • Bootstrap Neural
    • Fixed
    • Linear Posterior
    • Neural Greedy
    • Neural Linear Posterior
    • Neural Noise Sampling
    • Variational
  • Multi-Armed Bandit
    • Base
    • Bayesian Bandit
    • Bernoulli Bandit
    • Espilon Greedy
    • Gaussian
    • Gradient
    • Thmopson Sampling
    • Upper Confidence Bound
Next Previous

© Copyright 2020, torchmm Revision 8b6f61ff.

Built with Sphinx using a theme provided by Read the Docs.