tf_agents.environments.ParallelPyEnvironment

Batch together environments and simulate them in external processes.

Inherits From: PyEnvironment

Main aliases

tf_agents.environments.parallel_py_environment.ParallelPyEnvironment

tf_agents.environments.ParallelPyEnvironment(
    env_constructors: Sequence[tf_agents.environments.parallel_py_environment.EnvConstructor],
    start_serially: bool = True,
    blocking: bool = False,
    flatten: bool = False
)

The environments are created in external processes by calling the provided callables. Each callable can be an environment class or a function that creates the environment and optionally wraps it. The returned environment should not access global variables.
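As a rough illustration of what "provided callables" can look like, the sketch below builds a list of zero-argument constructors with `functools.partial`. `CountingEnv` and `make_env_constructors` are hypothetical names introduced only for this example; in real use the class would be a `PyEnvironment` subclass.

```python
import functools


class CountingEnv:
    """Hypothetical stand-in for a PyEnvironment subclass (not part of TF-Agents)."""

    def __init__(self, max_steps=10):
        # Configuration arrives through constructor arguments, not globals.
        self.max_steps = max_steps


def make_env_constructors(env_factory, num_envs, **kwargs):
    """Return a list of zero-argument callables, one per environment."""
    return [functools.partial(env_factory, **kwargs) for _ in range(num_envs)]


env_constructors = make_env_constructors(CountingEnv, num_envs=4, max_steps=25)

# With TF-Agents installed and a real PyEnvironment subclass, this list would be
# passed as the first argument of the constructor documented above:
#   parallel_env = tf_agents.environments.ParallelPyEnvironment(env_constructors)
```

Binding configuration with `functools.partial` keeps each callable self-contained, which fits the requirement that the created environment not depend on global variables.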

Args
`env_constructors`	List of callables that create environments.
`start_serially`	Whether to start environments serially or in parallel.
`blocking`	Whether to step environments one after another.
`flatten`	Whether to flatten actions and time steps during communication to reduce overhead.
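To make the idea behind `flatten` concrete, here is a minimal pure-Python sketch of flattening a nested action into a flat list of leaves and rebuilding it on the other side. The helpers `flatten_structure` and `pack_like` are assumptions made for this example and are not TF-Agents functions; the library's own mechanism may differ.

```python
def flatten_structure(structure):
    """Flatten nested dicts/lists/tuples into a flat list of leaves.

    Dict entries are visited in sorted key order so both sides agree on it.
    """
    if isinstance(structure, dict):
        return [leaf for key in sorted(structure)
                for leaf in flatten_structure(structure[key])]
    if isinstance(structure, (list, tuple)):
        return [leaf for item in structure for leaf in flatten_structure(item)]
    return [structure]


def pack_like(template, flat):
    """Rebuild a structure shaped like `template` from the leaves in `flat`."""
    leaves = iter(flat)

    def build(node):
        if isinstance(node, dict):
            return {key: build(node[key]) for key in sorted(node)}
        if isinstance(node, (list, tuple)):
            return type(node)(build(item) for item in node)
        return next(leaves)

    return build(template)


# Hypothetical nested action used only for illustration.
action = {"move": [1, 2], "jump": 0}
flat_action = flatten_structure(action)      # what would cross the process boundary
restored = pack_like(action, flat_action)    # what the receiving side rebuilds
```

Only the leaves travel between processes in this sketch; the nesting is recovered from a template that both sides already know.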

Raises
`ValueError`	If the action or observation specs don't match.
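The kind of consistency check that leads to this `ValueError` can be sketched as follows. This is not the library's implementation: `check_specs_match` is a hypothetical helper, and specs are modeled as plain `(shape, dtype)` tuples for illustration only.

```python
def check_specs_match(specs, kind):
    """Raise ValueError unless every entry in `specs` equals the first one."""
    reference = specs[0]
    for index, spec in enumerate(specs[1:], start=1):
        if spec != reference:
            raise ValueError(
                f"{kind} spec of environment {index} differs from environment 0: "
                f"{spec!r} != {reference!r}"
            )
    return reference


# Three environments that all report the same (shape, dtype) observation spec.
observation_specs = [((4,), "float32"), ((4,), "float32"), ((4,), "float32")]
shared_observation_spec = check_specs_match(observation_specs, "observation")
```

In practice this means every constructor passed in `env_constructors` should build environments with identical action and observation specs.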

Attributes
`batch_size`	The batch size of the environment.
`batched`	Whether the environment is batched or not. If the environment supports batched observations and actions, then override this property to return True. A batched environment takes in a batched set of actions and returns a batched set of observations. This means that for all numpy arrays in the input and output nested structures, the first dimension is the batch size. When batched, the left-most dimension is not part of the `action_spec` or the `observation_spec` and corresponds to the batch dimension. When batched and `handle_auto_reset` is set, it checks `np.all(steps.is_last())`.
`envs`
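The relationship between spec shapes and batched array shapes described for `batched` can be sketched with plain tuples. The helper names below are hypothetical, and the built-in `all` stands in for the `np.all(steps.is_last())` check mentioned above.

```python
def batched_shape(batch_size, spec_shape):
    """Shape of a batched array: the batch dimension is prepended to the spec shape."""
    return (batch_size,) + tuple(spec_shape)


def all_episodes_ended(is_last_flags):
    """True only when every environment in the batch reports a final step."""
    return all(is_last_flags)


# Hypothetical values: the spec describes a single observation, without a batch dimension.
observation_spec_shape = (84, 84, 3)
batch_size = 4
observations_shape = batched_shape(batch_size, observation_spec_shape)
```

The spec itself never mentions the batch size; only the arrays exchanged with the environment carry the extra left-most dimension.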

Methods

`action_spec`

`close`

`current_time_step`

`discount_spec`

`get_info`

`get_state`

`observation_spec`

`render`

`reset`

`reward_spec`

`seed`

`set_state`

`should_reset`

`start`

`step`

`time_step_spec`

`__enter__`

`__exit__`