Glossary of Artificial Intelligence (AI), Machine Learning (ML), and Big Data Terms
Off-policy Learning Algorithm
In reinforcement learning, off-policy learning approaches iteratively evaluate and improve a policy that is different from the current policy to evaluate if the other policy is better for enabling an agent to achieve the desired rewards.