Module: tf_agents.environments.suite_gym

Suite for loading Gym Environments.

Note: we use gym.spec(env_id).make() on Gym environments to avoid getting Gym's TimeLimit wrapper on the environment. OpenAI's TimeLimit wrapper terminates episodes without indicating whether the episode ended because the time limit was reached or because of negative agent behaviour. This prevents us from setting the appropriate discount value for the final step of an episode. To avoid this, we extract the step limit from the environment spec and apply our own TimeLimit wrapper.
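The difference between a timeout and a genuine failure can be sketched in plain Python. This is an illustration of the idea only, not TF-Agents' implementation: the names Step, ToyEnv, and StepLimit are hypothetical, and the toy environment simply "fails" whenever the agent picks action 0.

```python
from dataclasses import dataclass


@dataclass
class Step:
    """Result of one environment step (hypothetical type, for illustration)."""
    reward: float
    discount: float
    is_last: bool


class ToyEnv:
    """Toy environment: the episode truly terminates when the action is 0."""

    def step(self, action: int):
        failed = action == 0
        reward = 0.0 if failed else 1.0
        return reward, failed


class StepLimit:
    """Ends an episode after max_steps, keeping the discount non-zero on timeout."""

    def __init__(self, env, max_steps: int, discount: float = 1.0):
        self._env = env
        self._max_steps = max_steps
        self._discount = discount
        self._num_steps = 0

    def reset(self):
        self._num_steps = 0

    def step(self, action: int) -> Step:
        reward, failed = self._env.step(action)
        self._num_steps += 1
        if failed:
            # True termination: no future reward exists, so the discount is zero.
            return Step(reward, 0.0, True)
        if self._num_steps >= self._max_steps:
            # Timeout: the episode is cut off, but the state is not terminal,
            # so the final step keeps the configured discount.
            return Step(reward, self._discount, True)
        return Step(reward, self._discount, False)


env = StepLimit(ToyEnv(), max_steps=3, discount=0.99)

# Episode 1: the agent never fails, so the step limit ends the episode.
env.reset()
steps = [env.step(1) for _ in range(3)]
timeout_step = steps[-1]

# Episode 2: the agent fails on the first step.
env.reset()
failure_step = env.step(0)
```

Both episodes end with a last step, but only the failure carries a zero discount. A wrapper that reports a bare "done" flag for both cases would make them indistinguishable.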

Functions

load(...): Loads the selected environment and wraps it with the specified wrappers.

wrap_env(...): Wraps the given Gym environment with TF-Agents' GymWrapper.

Type Aliases

TimeLimitWrapperType

Other Members
absolute_import	Instance of `__future__._Feature`
division	Instance of `__future__._Feature`
print_function	Instance of `__future__._Feature`

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2024-04-26 UTC.
