|
bitrl & cuberl Documentation
Simulation engine for reinforcement learning agents
|
MCTreeSearchBase. More...
#include <mc_tree_search_solver.h>


Public Types | |
| typedef EnvTp | env_type |
| env_type | |
| typedef NodeTp | node_type |
| node_type | |
| typedef env_type::time_step_type | time_step_type |
| The time step type. | |
Public Types inherited from cuberl::rl::algos::RLSolverBase< EnvTp > | |
| typedef EnvTp | env_type |
Public Member Functions | |
| MCTSSolver (MCTreeSearchConfig config) | |
| MCTreeSearchBase. | |
| virtual void | actions_before_training_begins (env_type &) override final |
| actions_before_training_begins. Execute any actions the algorithm needs before starting the iterations | |
| virtual void | actions_after_training_ends (env_type &) override final |
| actions_after_training_ends. Actions to execute after the training iterations have finisehd | |
| virtual void | actions_before_episode_begins (env_type &, uint_t) |
| actions_before_training_episode | |
| virtual void | actions_after_episode_ends (env_type &, uint_t, const EpisodeInfo &) |
| actions_after_training_episode | |
| virtual EpisodeInfo | on_training_episode (env_type &, uint_t) override final |
| on_episode Do one on_episode of the algorithm | |
| time_step_type | simulate_node (std::shared_ptr< node_type > node, env_type &env) |
| simulate_node | |
| void | expand_node (std::shared_ptr< node_type > node, env_type &env) |
| expand_node | |
| void | backprop (std::shared_ptr< node_type > node) |
| backprop | |
| uint_t | max_depth_tree () const noexcept |
| max_depth_tree | |
Public Member Functions inherited from cuberl::rl::algos::RLSolverBase< EnvTp > | |
| virtual | ~RLSolverBase ()=default |
| Destructor. | |
| virtual void | actions_before_episode_begins (env_type &, uint_t) |
| actions_before_training_episode | |
| virtual void | actions_after_episode_ends (env_type &, uint_t, const EpisodeInfo &) |
| actions_after_training_episode | |
| virtual EpisodeInfo | on_training_episode (env_type &, uint_t)=0 |
| on_episode Do one on_episode of the algorithm | |
Protected Attributes | |
| IterationMixin | itr_mix_ |
| itr_mix_ | |
| std::shared_ptr< node_type > | root_ |
| root_ | |
| uint_t | max_depth_tree_ |
| max_depth_tree_ | |
| real_t | temperature_ |
| temperature_ | |
Additional Inherited Members | |
Protected Member Functions inherited from cuberl::rl::algos::RLSolverBase< EnvTp > | |
| RLSolverBase ()=default | |
| Constructor. | |
MCTreeSearchBase.
| typedef EnvTp cubeai::rl::algos::mc::MCTSSolver< EnvTp, NodeTp >::env_type |
env_type
| typedef NodeTp cubeai::rl::algos::mc::MCTSSolver< EnvTp, NodeTp >::node_type |
node_type
| typedef env_type::time_step_type cubeai::rl::algos::mc::MCTSSolver< EnvTp, NodeTp >::time_step_type |
The time step type.
| cubeai::rl::algos::mc::MCTSSolver< EnvTp, NodeTp >::MCTSSolver | ( | MCTreeSearchConfig | config | ) |
MCTreeSearchBase.
| config |
|
inlinevirtual |
actions_after_training_episode
|
inlinefinaloverridevirtual |
actions_after_training_ends. Actions to execute after the training iterations have finisehd
Implements cuberl::rl::algos::RLSolverBase< EnvTp >.
|
inlinevirtual |
actions_before_training_episode
|
inlinefinaloverridevirtual |
actions_before_training_begins. Execute any actions the algorithm needs before starting the iterations
Implements cuberl::rl::algos::RLSolverBase< EnvTp >.
| void cubeai::rl::algos::MCTSSolver::backprop | ( | std::shared_ptr< node_type > | node | ) |
backprop
| void cubeai::rl::algos::MCTSSolver::expand_node | ( | std::shared_ptr< node_type > | node, |
| env_type & | env | ||
| ) |
expand_node
| node |
|
inlinenoexcept |
max_depth_tree
|
finaloverridevirtual |
on_episode Do one on_episode of the algorithm
| MCTSSolver< Env, NodeTp >::time_step_type cubeai::rl::algos::MCTSSolver::simulate_node | ( | std::shared_ptr< node_type > | node, |
| env_type & | env | ||
| ) |
simulate_node
| node |
|
protected |
itr_mix_
|
protected |
max_depth_tree_
|
protected |
root_
|
protected |
temperature_