SimpleDistributedRL

Contents

Installation
How To Use
Distributed Learning (Online)

Custom

Making a Custom environment
Making a Custom algorithm
Detailed Framework

API

EnvConfig
RLConfig
RLConfig Parameters
Runner(Base)
Runner

Algorithms

Q-Learning
Deep Q-Networks
Rainbow
Agent57
Agent57 light
PPO(Proximal Policy Optimization)
DDPG(Deep Deterministic Policy Gradient)
SAC(Soft-Actor-Critic)
SND(Self-supervised Network Distillation)
Monte Carlo tree search
AlphaZero
MuZero
DreamerV3

SimpleDistributedRL

Welcome to SimpleDistributedRL's documentation!
View page source

Welcome to SimpleDistributedRL's documentation!

Contents

Installation
How To Use
Distributed Learning (Online)

Custom

Making a Custom environment
Making a Custom algorithm
Detailed Framework

API

EnvConfig
- EnvConfig
RLConfig
- RLConfig
RLConfig Parameters
Runner(Base)
- RunnerBase
Runner
- Runner

Algorithms

Q-Learning
- Config
Deep Q-Networks
- Config
Rainbow
- Config
Agent57
- Config
Agent57 light
- Config
PPO(Proximal Policy Optimization)
- Config
DDPG(Deep Deterministic Policy Gradient)
- Config
SAC(Soft-Actor-Critic)
- Config
SND(Self-supervised Network Distillation)
- Config
Monte Carlo tree search
- Config
AlphaZero
- Config
MuZero
- Config
DreamerV3
- Config

Indices and tables

索引
モジュール索引
検索ページ

Next

© Copyright 2022, poco.

Built with Sphinx using a theme provided by Read the Docs.