Detailed Framework
Play flow

・1 Episode loop

Multiplay flow

Distributed flow

Class diagram
RL

Env

Run

Interface Type
SpaceBase(srl.base.spaces)
| Class | Type | SpaceType |
| --- | --- | --- |
| DiscreteSpace | int | DISCRETE |
| ArrayDiscreteSpace | list[int] | DISCRETE |
| ContinuousSpace | float | CONTINUOUS |
| ArrayContinuousSpace | list[float] | CONTINUOUS |
| BoxSpace | NDArray[AnyType] | srl.base.define.SpaceTypes |
| MultiSpace | list[SpaceBase] | MULTI |

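
The Class / Type / SpaceType mapping above can be mirrored as a plain lookup, which may help when checking which space classes share a space type. This is only an illustrative sketch: it does not import `srl`, and `SpaceKind`, `SPACE_TABLE`, and `classes_of_kind` are hypothetical names introduced here, not part of the library. BoxSpace is mapped to `None` because the table lists `srl.base.define.SpaceTypes` for it rather than one fixed value.

```python
from enum import Enum, auto


class SpaceKind(Enum):
    """Hypothetical stand-in for the SpaceType column (not srl.base.define.SpaceTypes)."""

    DISCRETE = auto()
    CONTINUOUS = auto()
    MULTI = auto()


# Class name -> (value type as written in the table, SpaceType column).
# BoxSpace has no single fixed entry in the table, so it is recorded as None.
SPACE_TABLE = {
    "DiscreteSpace": ("int", SpaceKind.DISCRETE),
    "ArrayDiscreteSpace": ("list[int]", SpaceKind.DISCRETE),
    "ContinuousSpace": ("float", SpaceKind.CONTINUOUS),
    "ArrayContinuousSpace": ("list[float]", SpaceKind.CONTINUOUS),
    "BoxSpace": ("NDArray[AnyType]", None),
    "MultiSpace": ("list[SpaceBase]", SpaceKind.MULTI),
}


def classes_of_kind(kind):
    """Return the class names whose SpaceType column equals `kind`, in table order."""
    return [name for name, (_, k) in SPACE_TABLE.items() if k is kind]


if __name__ == "__main__":
    print(classes_of_kind(SpaceKind.DISCRETE))
```

For example, `classes_of_kind(SpaceKind.CONTINUOUS)` lists the two continuous classes from the table.
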
RL type
|  | Action | Observation | Observation window |
| --- | --- | --- | --- |
| Discrete | int (DiscreteSpace) | list[int] (ArrayDiscreteSpace) | list[int] (ArrayDiscreteSpace) |
| Continuous | list[float] (ArrayContinuousSpace) | NDArray[np.float32] (BoxSpace) | NDArray[np.float32] (BoxSpace) |
| Image | NDArray[np.uint8] (BoxSpace) | NDArray[np.float32] (BoxSpace) | NDArray[np.float32] (BoxSpace) |
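
The RL type table can likewise be read as a two-key lookup: an RL type (Discrete, Continuous, Image) and a role (action, observation, observation window) give a value type and a space class. The sketch below encodes exactly the cells of the table as strings. It is an assumption-laden illustration, not library code: `Cell`, `RL_TYPE_TABLE`, and `describe` are hypothetical names, and nothing here calls into `srl`.

```python
from typing import NamedTuple


class Cell(NamedTuple):
    """One table cell: the value type and the space class, both as written in the table."""

    value_type: str
    space: str


# Hypothetical encoding of the RL type table: RL type -> role -> Cell.
RL_TYPE_TABLE = {
    "Discrete": {
        "action": Cell("int", "DiscreteSpace"),
        "observation": Cell("list[int]", "ArrayDiscreteSpace"),
        "observation_window": Cell("list[int]", "ArrayDiscreteSpace"),
    },
    "Continuous": {
        "action": Cell("list[float]", "ArrayContinuousSpace"),
        "observation": Cell("NDArray[np.float32]", "BoxSpace"),
        "observation_window": Cell("NDArray[np.float32]", "BoxSpace"),
    },
    "Image": {
        "action": Cell("NDArray[np.uint8]", "BoxSpace"),
        "observation": Cell("NDArray[np.float32]", "BoxSpace"),
        "observation_window": Cell("NDArray[np.float32]", "BoxSpace"),
    },
}


def describe(rl_type, role):
    """Format one cell of the table as 'value type (space class)'."""
    cell = RL_TYPE_TABLE[rl_type][role]
    return f"{cell.value_type} ({cell.space})"


if __name__ == "__main__":
    for rl_type in RL_TYPE_TABLE:
        print(rl_type, "action:", describe(rl_type, "action"))
```

Keeping the cells as strings avoids a NumPy dependency in the sketch; a real implementation would presumably hold the actual space objects instead.
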