bitrl & cuberl Documentation
Simulation engine for reinforcement learning agents
The ReinforceConfig struct. Holds various configuration options for the REINFORCE algorithm.
#include <reinforce_config.h>
Public Member Functions | |
| std::ostream & | print (std::ostream &out) const |
| Print the configuration to the given output stream. | |
| void | load_from_json (const std::string &filename) |
| Load the configuration from the given json file. | |
Public Attributes | |
| bool | normalize_rewards {false} |
| cuberl::utils::TrainEnumType | train_type {cuberl::utils::TrainEnumType::BATCH} |
| How to train the algorithm. | |
| BaselineEnumType | baseline_type {BaselineEnumType::NONE} |
| The baseline to use. | |
| DeviceType | device_type |
| The device type on which PyTorch calculations take place. | |
| uint_t | n_episodes |
| The number of episodes. | |
| uint_t | max_itrs_per_episode |
| Max number of iterations per episode. | |
| real_t | gamma |
| The discount factor. | |
| real_t | baseline_constant {0.0} |
| The constant to use when baseline_type = BaselineEnumType::CONSTANT. | |
| real_t | eps {bitrl::consts::TOLERANCE} |
| Small constant to use as a tolerance. Used when baseline_type = BaselineEnumType::STANDARDIZE. | |
| void cuberl::rl::algos::pg::ReinforceConfig::load_from_json | ( | const std::string & | filename | ) |
Load the configuration from the given json file.
| std::ostream & cuberl::rl::algos::pg::ReinforceConfig::print | ( | std::ostream & | out | ) | const |
| out | The output stream to print the configuration to. |
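The signature of print follows the usual iostream convention of taking a stream and returning it. The sketch below illustrates that pattern on a hypothetical stand-in struct, `ReinforceConfigSketch`, which mirrors only a few of the documented attributes. The stand-in, the `name=value` output format, the `config_to_string` helper, and the default value of `gamma` are all assumptions made for illustration; they are not taken from the cuberl sources.

```cpp
#include <cstddef>
#include <ostream>
#include <sstream>
#include <string>

// Hypothetical stand-in for the documented struct; NOT the cuberl definition.
// Only a subset of the documented attributes is mirrored here.
struct ReinforceConfigSketch {
    bool normalize_rewards{false};
    std::size_t n_episodes{0};            // stands in for uint_t
    std::size_t max_itrs_per_episode{0};  // stands in for uint_t
    double gamma{0.99};                   // stands in for real_t; default is assumed
    double baseline_constant{0.0};

    // Write one "name=value" pair per line and return the same stream,
    // so the call can be chained like any other stream insertion.
    std::ostream& print(std::ostream& out) const {
        out << "normalize_rewards=" << (normalize_rewards ? "true" : "false") << '\n'
            << "n_episodes=" << n_episodes << '\n'
            << "max_itrs_per_episode=" << max_itrs_per_episode << '\n'
            << "gamma=" << gamma << '\n'
            << "baseline_constant=" << baseline_constant << '\n';
        return out;
    }
};

// Hypothetical convenience helper: render the configuration into a string.
inline std::string config_to_string(const ReinforceConfigSketch& cfg) {
    std::ostringstream oss;
    cfg.print(oss);
    return oss.str();
}
```

Returning the stream by reference means the same member can write to `std::cout`, a file stream, or a string stream without any change to the caller.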
| real_t cuberl::rl::algos::pg::ReinforceConfig::baseline_constant {0.0} |
The constant to use when baseline_type = BaselineEnumType::CONSTANT.
| BaselineEnumType cuberl::rl::algos::pg::ReinforceConfig::baseline_type {BaselineEnumType::NONE} |
The baseline to use.
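This reference names three baseline kinds (NONE, CONSTANT, STANDARDIZE) together with the attributes baseline_constant and eps. One plausible reading of how they interact is sketched below. The enum `BaselineKind`, the function `apply_baseline`, the use of the population standard deviation, and the default `eps` of 1.0e-8 are assumptions made for illustration; the actual cuberl behavior and the value of bitrl::consts::TOLERANCE may differ.

```cpp
#include <cmath>
#include <numeric>
#include <vector>

// Hypothetical mirror of the three baseline kinds named in this reference.
enum class BaselineKind { NONE, CONSTANT, STANDARDIZE };

// Return a copy of `returns` with the chosen baseline applied.
// The eps default is an assumed value, not the library's tolerance constant.
std::vector<double> apply_baseline(const std::vector<double>& returns,
                                   BaselineKind kind,
                                   double baseline_constant = 0.0,
                                   double eps = 1.0e-8) {
    std::vector<double> out(returns);
    if (out.empty() || kind == BaselineKind::NONE) {
        return out;  // nothing to adjust
    }

    if (kind == BaselineKind::CONSTANT) {
        // Subtract the same constant from every return.
        for (double& g : out) g -= baseline_constant;
        return out;
    }

    // STANDARDIZE: subtract the mean, then divide by (population std + eps).
    // Adding eps keeps the division finite when all returns are equal.
    const double n = static_cast<double>(out.size());
    const double mean = std::accumulate(out.begin(), out.end(), 0.0) / n;
    double sq_sum = 0.0;
    for (double g : out) sq_sum += (g - mean) * (g - mean);
    const double stddev = std::sqrt(sq_sum / n);
    for (double& g : out) g = (g - mean) / (stddev + eps);
    return out;
}
```

In this reading, eps only matters for STANDARDIZE, where it guards against a zero standard deviation; baseline_constant only matters for CONSTANT.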
| DeviceType cuberl::rl::algos::pg::ReinforceConfig::device_type |
The device type on which PyTorch calculations take place.
| real_t cuberl::rl::algos::pg::ReinforceConfig::eps {bitrl::consts::TOLERANCE} |
Small constant to use as a tolerance. Used when baseline_type = BaselineEnumType::STANDARDIZE.
| real_t cuberl::rl::algos::pg::ReinforceConfig::gamma |
The discount factor.
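In REINFORCE, the discount factor typically enters through the discounted return G_t = r_t + gamma * G_{t+1}, computed backwards over an episode's rewards. The sketch below shows that standard recurrence; the function name `discounted_returns` is hypothetical and the code is not taken from cuberl.

```cpp
#include <cstddef>
#include <vector>

// Compute G_t = r_t + gamma * G_{t+1} for every step of one episode,
// walking the reward sequence from the last step back to the first.
std::vector<double> discounted_returns(const std::vector<double>& rewards,
                                       double gamma) {
    std::vector<double> returns(rewards.size(), 0.0);
    double running = 0.0;  // G_{t+1}; zero past the end of the episode
    for (std::size_t i = rewards.size(); i-- > 0;) {
        running = rewards[i] + gamma * running;
        returns[i] = running;
    }
    return returns;
}
```

For rewards {1, 1, 1} and gamma = 0.5, this yields {1.75, 1.5, 1.0}: later rewards contribute less to earlier steps as gamma shrinks.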
| uint_t cuberl::rl::algos::pg::ReinforceConfig::max_itrs_per_episode |
Max number of iterations per episode.
| uint_t cuberl::rl::algos::pg::ReinforceConfig::n_episodes |
The number of episodes.
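The two counters n_episodes and max_itrs_per_episode most naturally bound an outer and an inner loop. A minimal sketch of that structure follows, assuming an episode ends early when the environment signals termination. The function `run_episodes` and its `step` callback are hypothetical and only illustrate the loop bounds; they are not the cuberl trainer.

```cpp
#include <cstddef>
#include <functional>

// Run up to n_episodes episodes. Each episode stops when `step` returns true
// (episode terminated) or after max_itrs_per_episode iterations.
// `step` receives the episode index and the iteration index.
// Returns the total number of steps taken across all episodes.
std::size_t run_episodes(
    std::size_t n_episodes,
    std::size_t max_itrs_per_episode,
    const std::function<bool(std::size_t, std::size_t)>& step) {
    std::size_t total_steps = 0;
    for (std::size_t episode = 0; episode < n_episodes; ++episode) {
        for (std::size_t itr = 0; itr < max_itrs_per_episode; ++itr) {
            ++total_steps;
            if (step(episode, itr)) {
                break;  // environment reported termination
            }
        }
    }
    return total_steps;
}
```

Under this reading, max_itrs_per_episode acts as a safety cap for environments that never terminate on their own, so the total work is at most n_episodes * max_itrs_per_episode steps.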
| bool cuberl::rl::algos::pg::ReinforceConfig::normalize_rewards {false} |
| cuberl::utils::TrainEnumType cuberl::rl::algos::pg::ReinforceConfig::train_type {cuberl::utils::TrainEnumType::BATCH} |
How to train the algorithm.