x39826 / reinforcement_learning_v_mpo
This project is forked from wisnunugroho21/reinforcement_learning_v_mpo
Deep reinforcement learning with V-MPO, an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO).