SimpleDistributedRL

Contents

  • Installation
  • How To Use
  • Yaml Training
  • Distributed Learning (Online)

Custom

  • Making a Custom environment
  • Making a Custom algorithm
  • Detailed Framework

API

  • EnvConfig
  • RLConfig
  • Context
  • Runner(Base)
  • Runner

Algorithm Config

  • Memory
  • RLConfigComponentFramework
  • InputBlock
  • InputValueBlock
  • InputImageBlock
  • HiddenBlock
  • DuelingNetwork
  • Scheduler
  • LRSchaduler

Algorithms

  • Q-Learning
  • Deep Q-Networks
  • Rainbow
  • Agent57
  • Agent57 light
  • PPO(Proximal Policy Optimization)
  • DDPG(Deep Deterministic Policy Gradient)
  • SAC(Soft-Actor-Critic)
  • SND(Self-supervised Network Distillation)
  • Monte Carlo tree search
  • AlphaZero
  • MuZero
  • DreamerV3
  • No Target Deep Q-Networks
SimpleDistributedRL
  • 検索


© Copyright 2022, poco.

Built with Sphinx using a theme provided by Read the Docs.