AlphaSnake: Policy Iteration on a Nondeterministic NP-Hard MDP (arxiv.org), posted by kenny239, 10mo ago