TODO: implement better "EV" practices (# of wins out of 100 rounds if this strategy) implement a round as turn based for Q-learning efficiency
jto-d / syn-solver Goto Github PK
View Code? Open in Web Editor NEWsolver using q-learning for the game screw your neighbor