|
bitrl & cuberl Documentation
Simulation engine for reinforcement learning agents
|
The CliffWorld class. More...
#include <cliff_world_env.h>


Public Types | |
| typedef std::vector< std::tuple< real_t, uint_t, real_t, bool > > | dynamics_type |
| dynamics_t | |
| typedef ToyTextEnvBase< TimeStep< uint_t >, 37, 4 >::base_type | base_type |
| The base type. | |
| typedef base_type::time_step_type | time_step_type |
| The time step type we return every time a step in the environment is performed. | |
| typedef base_type::state_space_type | state_space_type |
| The type describing the state space for the environment. | |
| typedef base_type::action_space_type | action_space_type |
| The type of the action space for the environment. | |
| typedef base_type::action_type | action_type |
| The type of the action to be undertaken in the environment. | |
| typedef base_type::state_type | state_type |
| The type of the action to be undertaken in the environment. | |
Public Types inherited from bitrl::envs::gymnasium::ToyTextEnvBase< TimeStep< uint_t >, 37, 4 > | |
| typedef GymnasiumEnvBase< TimeStep< uint_t >, ScalarDiscreteEnv< state_end, action_end, 0, 0 > >::base_type | base_type |
| The base_type. | |
| typedef base_type::time_step_type | time_step_type |
| The time step type we return every time a step in the environment is performed. | |
| typedef base_type::state_space_type | state_space_type |
| The type describing the state space for the environment. | |
| typedef base_type::action_space_type | action_space_type |
| The type of the action space for the environment. | |
| typedef base_type::action_type | action_type |
| The type of the action to be undertaken in the environment. | |
| typedef base_type::state_type | state_type |
| The type of the state. | |
| typedef std::vector< std::tuple< real_t, uint_t, real_t, bool > > | dynamics_t |
| dynamics_t | |
Public Types inherited from bitrl::envs::gymnasium::GymnasiumEnvBase< TimeStepType, SpaceType > | |
| typedef EnvBase< TimeStepType, SpaceType > | base_type |
| Base environment type alias. | |
| typedef base_type::time_step_type | time_step_type |
| Time step returned at each environment step. | |
| typedef base_type::state_space_type | state_space_type |
| Type describing the observation/state space of the environment. | |
| typedef base_type::action_space_type | action_space_type |
| Type describing the action space of the environment. | |
| typedef base_type::action_type | action_type |
| Type representing a valid action to execute. | |
| typedef base_type::state_type | state_type |
| Type representing a state/observation returned by the environment. | |
Public Types inherited from bitrl::envs::EnvBase< TimeStepType, SpaceType > | |
| typedef TimeStepType | time_step_type |
| Alias for the type returned when stepping the environment. | |
| typedef SpaceType::state_space | state_space_type |
| Type describing the environment state space. | |
| typedef SpaceType::state_type | state_type |
| Type describing an individual state. | |
| typedef SpaceType::action_space | action_space_type |
| Type describing the environment action space. | |
| typedef SpaceType::action_type | action_type |
| Type representing an individual action. | |
Public Member Functions | |
| CliffWorld (network::RESTRLEnvClient &api_server) | |
| CliffWorld. | |
| CliffWorld (const CliffWorld &other) | |
| copy constructor | |
| ~CliffWorld ()=default | |
| ~CliffWorld. Destructor | |
| virtual void | make (const std::string &version, const std::unordered_map< std::string, std::any > &options, const std::unordered_map< std::string, std::any > &reset_options) override final |
| make. Builds the environment. Optionally we can choose if the environment will be slippery | |
| virtual time_step_type | step (const action_type &action) override final |
| step | |
Public Member Functions inherited from bitrl::envs::gymnasium::ToyTextEnvBase< TimeStep< uint_t >, 37, 4 > | |
| virtual | ~ToyTextEnvBase ()=default |
| ~FrozenLake. Destructor. | |
| dynamics_t | p (uint_t sidx, uint_t aidx) const |
| P. | |
| uint_t | n_actions () const noexcept |
| n_actions. Returns the number of actions | |
| uint_t | n_states () const noexcept |
| Number of states. | |
Public Member Functions inherited from bitrl::envs::gymnasium::GymnasiumEnvBase< TimeStepType, SpaceType > | |
| virtual | ~GymnasiumEnvBase () |
| Virtual destructor. | |
| virtual bool | is_alive () const |
| Check whether the environment is still alive/connected. | |
| virtual void | close () override |
| Close the environment on the server and release any resources. | |
| virtual time_step_type | reset () override |
| Reset the environment to an initial state using the reset options specified during make. | |
| uint_t | n_copies () const |
| network::RESTRLEnvClient & | get_api_server () const |
| Retrieve the REST API wrapper instance used for communication. | |
| std::string | get_url () const |
| Get the full URL for this environment endpoint on the server. | |
| virtual time_step_type | reset ()=0 |
| Import the reset() overloads from the base class. | |
Public Member Functions inherited from bitrl::envs::EnvBase< TimeStepType, SpaceType > | |
| virtual | ~EnvBase ()=default |
| Virtual destructor. | |
| virtual time_step_type | step (const action_type &action)=0 |
| Perform one step in the environment using an action. | |
| const std::unordered_map< std::string, std::any > & | make_options () const noexcept |
| Access the configuration options provided to make(). | |
| const std::unordered_map< std::string, std::any > & | reset_options () const noexcept |
| Access the configuration options provided to make(). | |
| template<typename T > | |
| T | read_option (const std::string &op_name) const |
| Read a specific make() option and cast it to the requested type. | |
| std::string | idx () const noexcept |
| Get the id identifying this environment within a simulation batch. The id is valid only if make has been called. | |
| bool | is_created () const noexcept |
| Check if make() has successfully initialized the environment. | |
| std::string | env_name () const noexcept |
| Get the name of this environment instance. | |
| std::string | version () const noexcept |
| Get the environment version set during make(). | |
Static Public Attributes | |
| static const std::string | name = "CliffWalking" |
| name | |
| static const std::string | URI = "/gymnasium/cliff-walking-env" |
| The URI for accessing the environment. | |
Static Public Attributes inherited from bitrl::envs::EnvBase< TimeStepType, SpaceType > | |
| static const uint_t | DEFAULT_ENV_SEED = 42 |
| Default seed used in reset() if none provided. | |
Protected Member Functions | |
| virtual time_step_type | create_time_step_from_response_ (const nlohmann::json &response) const override final |
| Handle the reset response from the environment server. | |
Protected Member Functions inherited from bitrl::envs::gymnasium::ToyTextEnvBase< TimeStep< uint_t >, 37, 4 > | |
| ToyTextEnvBase (network::RESTRLEnvClient &api_server, const std::string &name) | |
| Constructor. | |
| ToyTextEnvBase (const ToyTextEnvBase &other) | |
| Copy constructor. | |
| virtual dynamics_t | build_dynamics_from_response_ (const nlohmann::json &response) const |
| build the dynamics from response | |
Protected Member Functions inherited from bitrl::envs::gymnasium::GymnasiumEnvBase< TimeStepType, SpaceType > | |
| GymnasiumEnvBase (network::RESTRLEnvClient &api_server, const std::string &name) | |
| Constructor. | |
| GymnasiumEnvBase (const GymnasiumEnvBase &) | |
| Copy constructor. | |
Protected Member Functions inherited from bitrl::envs::EnvBase< TimeStepType, SpaceType > | |
| EnvBase (const std::string &idx=bitrl::consts::INVALID_STR, const std::string &name=bitrl::consts::INVALID_STR) | |
| Constructor (protected — for subclassing only). | |
| EnvBase (const EnvBase &) | |
| Copy constructor. | |
| void | set_version_ (const std::string &version) noexcept |
| Set internal version string. | |
| void | set_idx_ (const std::string &idx) noexcept |
| Set the id of the environment. | |
| void | set_make_options_ (const std::unordered_map< std::string, std::any > &options) noexcept |
| Store options for future access. | |
| void | set_reset_options_ (const std::unordered_map< std::string, std::any > &options) noexcept |
| Store reset options for future access. | |
| void | invalidate_is_created_flag_ () noexcept |
| Mark environment as not created. | |
| void | make_created_ () noexcept |
| Mark environment creation as successful. | |
| time_step_type & | get_current_time_step_ () noexcept |
| Mutable access to the current time step. | |
| const time_step_type & | get_current_time_step_ () const noexcept |
| Read-only access to the current time step. | |
Protected Attributes | |
| uint_t | max_episode_steps_ |
| Maximum episodes per step. | |
Protected Attributes inherited from bitrl::envs::gymnasium::GymnasiumEnvBase< TimeStepType, SpaceType > | |
| network::RESTRLEnvClient * | api_server_ |
| Server wrapper handling communication with remote Gymnasium environment. | |
The CliffWorld class.
| typedef base_type::action_space_type bitrl::envs::gymnasium::CliffWorld::action_space_type |
The type of the action space for the environment.
| typedef base_type::action_type bitrl::envs::gymnasium::CliffWorld::action_type |
The type of the action to be undertaken in the environment.
| typedef ToyTextEnvBase<TimeStep<uint_t>,37,4>::base_type bitrl::envs::gymnasium::CliffWorld::base_type |
The base type.
| typedef std::vector<std::tuple<real_t, uint_t, real_t, bool> > bitrl::envs::gymnasium::CliffWorld::dynamics_type |
dynamics_t
| typedef base_type::state_space_type bitrl::envs::gymnasium::CliffWorld::state_space_type |
The type describing the state space for the environment.
| typedef base_type::state_type bitrl::envs::gymnasium::CliffWorld::state_type |
The type of the action to be undertaken in the environment.
The time step type we return every time a step in the environment is performed.
| bitrl::envs::gymnasium::CliffWorld::CliffWorld | ( | network::RESTRLEnvClient & | api_server | ) |
| bitrl::envs::gymnasium::CliffWorld::CliffWorld | ( | const CliffWorld & | other | ) |
copy constructor
|
default |
~CliffWorld. Destructor
|
finaloverrideprotectedvirtual |
Handle the reset response from the environment server.
Implements bitrl::envs::gymnasium::GymnasiumEnvBase< TimeStepType, SpaceType >.
|
finaloverridevirtual |
make. Builds the environment. Optionally we can choose if the environment will be slippery
Implements bitrl::envs::EnvBase< TimeStepType, SpaceType >.
|
finaloverridevirtual |
step
| action |
|
protected |
Maximum episodes per step.
|
static |
name
|
static |
The URI for accessing the environment.