A values based algorithm in reinforcement learning. Q-learning algorithm performs the action to obtain the reward and the new state. 27.07.2023 17:54 aior