SimpleDistributedRL

Contents

  • Installation
  • How To Use
  • Distributed Learning (Online)

Custom

  • Making a Custom environment
  • Making a Custom algorithm
  • Detailed Framework

API

  • EnvConfig
  • RLConfig
  • RLConfig Parameters
  • Runner(Base)
  • Runner

Algorithms

  • Q-Learning
  • Deep Q-Networks
  • Rainbow
  • Agent57
  • Agent57 light
  • PPO (Proximal Policy Optimization)
  • DDPG (Deep Deterministic Policy Gradient)
  • SAC (Soft Actor-Critic)
  • SND (Self-supervised Network Distillation)
  • Monte Carlo tree search
  • AlphaZero
  • MuZero
  • DreamerV3

© Copyright 2022, poco.
