Reinforcement Learning: Deep Q-networks
Published:
Online Q-learning can experience instabilities during training. This is because by using experience sampled sequentially from the environment leads to highly correlated gradient steps. Deep Q-networks (DQN) made deep reinforcement learning a viable approach to complex sequential control problems. In this section, we introduce the vanilla DQN algorithm. Next sections will discuss various improvements that have been proposed in the literature.