|
bitrl & cuberl Documentation
Simulation engine for reinforcement learning agents
|
RLAlgoBase. Base class for RL algorithms. More...
#include <rl_algorithm_base.h>


Public Types | |
| typedef EnvType | env_type |
Public Member Functions | |
| virtual | ~RLSolverBase ()=default |
| Destructor. | |
| virtual void | actions_before_training_begins (env_type &)=0 |
| actions_before_training_begins. Execute any actions the algorithm needs before starting the iterations | |
| virtual void | actions_after_training_ends (env_type &)=0 |
| actions_after_training_ends. Actions to execute after the training iterations have finisehd | |
| virtual void | actions_before_episode_begins (env_type &, uint_t) |
| actions_before_training_episode | |
| virtual void | actions_after_episode_ends (env_type &, uint_t, const EpisodeInfo &) |
| actions_after_training_episode | |
| virtual EpisodeInfo | on_training_episode (env_type &, uint_t)=0 |
| on_episode Do one on_episode of the algorithm | |
Protected Member Functions | |
| RLSolverBase ()=default | |
| Constructor. | |
RLAlgoBase. Base class for RL algorithms.
| typedef EnvType cuberl::rl::algos::RLSolverBase< EnvType >::env_type |
|
virtualdefault |
Destructor.
|
protecteddefault |
Constructor.
|
inlinevirtual |
actions_after_training_episode
Reimplemented in cuberl::rl::algos::td::SarsaSolver< EnvType, PolicyType >, cuberl::rl::algos::dp::IterativePolicyEvalutationSolver< EnvType, PolicyType >, cuberl::rl::algos::dp::PolicyImprovement< EnvType, PolicyType >, cuberl::rl::algos::dp::PolicyIterationSolver< EnvType, PolicyType >, and cuberl::rl::algos::dp::ValueIteration< EnvType >.
|
pure virtual |
actions_after_training_ends. Actions to execute after the training iterations have finisehd
Implemented in cuberl::rl::algos::td::SarsaSolver< EnvType, PolicyType >, cuberl::rl::algos::dp::PolicyImprovement< EnvType, PolicyType >, cuberl::rl::algos::dp::PolicyIterationSolver< EnvType, PolicyType >, cuberl::rl::algos::dp::ValueIteration< EnvType >, cubeai::rl::algos::mc::MCTSSolver< EnvTp, NodeTp >, cuberl::rl::algos::dp::IterativePolicyEvalutationSolver< EnvType, PolicyType >, and cuberl::rl::algos::dp::IterativePolicyEvalutationSolver< env_type, policy_type >.
|
inlinevirtual |
actions_before_training_episode
Reimplemented in cuberl::rl::algos::td::SarsaSolver< EnvType, PolicyType >, cuberl::rl::algos::dp::IterativePolicyEvalutationSolver< EnvType, PolicyType >, cuberl::rl::algos::dp::PolicyImprovement< EnvType, PolicyType >, cuberl::rl::algos::dp::PolicyIterationSolver< EnvType, PolicyType >, and cuberl::rl::algos::dp::ValueIteration< EnvType >.
|
pure virtual |
actions_before_training_begins. Execute any actions the algorithm needs before starting the iterations
Implemented in cuberl::rl::algos::td::SarsaSolver< EnvType, PolicyType >, cuberl::rl::algos::dp::PolicyImprovement< EnvType, PolicyType >, cubeai::rl::algos::mc::MCTSSolver< EnvTp, NodeTp >, cuberl::rl::algos::dp::IterativePolicyEvalutationSolver< EnvType, PolicyType >, cuberl::rl::algos::dp::IterativePolicyEvalutationSolver< env_type, policy_type >, cuberl::rl::algos::dp::PolicyIterationSolver< EnvType, PolicyType >, and cuberl::rl::algos::dp::ValueIteration< EnvType >.
|
pure virtual |
on_episode Do one on_episode of the algorithm
Implemented in cuberl::rl::algos::td::SarsaSolver< EnvType, PolicyType >, cuberl::rl::algos::dp::IterativePolicyEvalutationSolver< EnvType, PolicyType >, cuberl::rl::algos::dp::IterativePolicyEvalutationSolver< env_type, policy_type >, cuberl::rl::algos::dp::PolicyImprovement< EnvType, PolicyType >, cuberl::rl::algos::dp::PolicyImprovement< env_type, policy_type >, cuberl::rl::algos::dp::PolicyIterationSolver< EnvType, PolicyType >, and cuberl::rl::algos::dp::ValueIteration< EnvType >.