Double Deep Q Network with Huber Reward Function for Cart-Pole Balancing Problem
Shaili Mishra and Anuja Arora
Int J Performability Eng . 2022, (9): 644 -653 .  DOI: 10.23940/ijpe.22.09.p5.644653