1AlphaSnake: Policy Iteration on a Nondeterministic NP-Hard MDP (opens in new tab)(arxiv.org)1kenny2397mo ago0