Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning

(arxiv.org)

131 points · aklein · 3y ago · 77 comments

spywaregorilla · 3y ago · 13 in thread

Stratego is an odd choice I feel. Evaluating it must be really hard. A significant chunk of the game is just trying to remember which unit is which of the things you've seen so far. Which humans generally can't do very well but machines can do easily. Beating hand crafted bots is good.

Can expert humans beat the hand crafted bots? I'm guessing no. Also, what's stratego like without the hidden units? Is that... hard?

boringg · 3y ago

I don't think you are that familiar with how good Stratego is played. It isn't strictly about memorizing where your opponent's units are. There's a significant degree of bluffing and posturing.

spywaregorilla · 3y ago

It isn't strictly about memorizing where your opponents are, but having a perfect memory would be an enormous advantage, to the point of being an entirely different game imo.

thekiptxt · 3y ago

To the extent that machines can’t replicate. If I place my hand on a piece that’s illegal to move, feign that I’m reconsidering, and then move a different piece, then my opponent may suspect that the originally touched piece may be moved.

How can I use these 200 IQ moves against a bot?

TemplateRex · 3y ago

The best available bot, which is also mentioned in the paper, is Probe. Expert humans will score the same as DeepNash against Probe. The best humans have no trouble recalling every piece that moved, and the square it originated from. Top-level play usually has very few moved pieces (since they are vulnerable to your opponent's general once your own marshal gets revealed), so memory is important but typically not the main bottleneck.

spywaregorilla · 3y ago

> Top-level play usually has very few moved pieces (since they are vulnerable to your opponent's general once your own marshal gets revealed), so memory is important but typically not the main bottleneck.

Interesting. Is top level play... boring? Stratego doesn't have a lot of nuance to positional advantaging aside from moving forward or back, and while I'd imagine there's stalemate rules, there's probably a lot of nothing moves to dance around getting super minor and uninteresting advantages. Is that a correct statement?

Imnimo · 3y ago

From the paper:

> Successes in the game have been limited, with artificial agents only able to play at a level comparable to a human amateur, see e.g. (14–20).

riku_iki · 3y ago

It could be because no one seriously tried to build a competitive AI player.

dr_orpheus · 3y ago

> A significant chunk of the game is just trying to remember which unit is which of the things you've seen so far

While this does get hard the further you go in the game, I think a more significant portion of the strategy is trying to predict newly moving units based on what is happening in the game. For example, I just took a 7 unit with my 8, and now my opponent is coming straight towards me with a unit from elsewhere. Is it a 9 or 10, is it a bluff to drive me off, or to drive me into a spot where the actual 9 or 10 is waiting?

aidenn0 · 3y ago

Did they renumber the pieces? In the set I played as a kid a Scout was 9 and a Marshal was 1...

spywaregorilla · 3y ago

Computers are better at such problems.

janosett · 3y ago

You might consider reading the linked abstract: “… Stratego has been a grand challenge for the field of AI for decades, and existing AI methods barely reach an amateur level of play.”

spywaregorilla · 3y ago

Sounds like bullshit to me. I'm not convinced. Here's a paper from 2021 that suggests as much. It's hard, sure, but it's also not really seriously explored.

> Compared to other games like Chess and Go, not much work has been done on creating an AI agent for Stratego. As such, the available literature is far and few between, and mostly consists of bachelor’s and master’s theses. In fact, most agents created for Stratego are largely undocumented or closed-source, making it difficult to effectively ascertain exactly which particular methods and techniques have been applied and how effective they were.

edit: the claim about bots not beating humans appears to hold, but I'm not convinced it's just shoddy bot quality.

Buttons840 · 3y ago

I think the best human players can beat the best computer players in Stratego. Thus, Stratego is an excellent choice.

hirundo · 3y ago · 8 in thread

When I was a kid I "won" a Stratego game with a non-move that my friend claimed was against the rules. So he claimed the win. Could I get an umpire's call here?

The issue is that when considering my next move, I picked up the bomb piece, thought for a few moments, put it back down, and moved another piece. My friend then, assuming that I had just given away that it was not a bomb, attempted to capture it, and lost the attacker.

He claimed that it was illegal to pick up that piece and put it down again, although he had no objection until he learned that I'd tricked him. We had never previously announced or enforced a touch-it-move-it rule.

So did I win that game or did he? That's not a question machine learning could answer.

aidenn0 · 3y ago

Official ISF tournament rules[1] say:

> Touching one of his own pieces does not oblige a player to move it.

It also says:

> Psychology, bluff and misleading manoeuvres are considered important aspects of Stratego. Bluffing consists of all verbal communication (talking) or non-verbal communication (acting, mimic or feign) which is intended to mislead your opponent. All forms of bluff are allowed at any time during the game, unless prohibited by any other rule. Abuse will be considered unsporting behaviour and can be penalized accordingly.

So I think you won.

1: https://isfstratego.kleier.net/docs/rulreg/isfgamerules.pdf

toast0 · 3y ago

I agree

> 5.2 Moving

> Flag and Bombs are never moved (For the definition of "move": see chapter 6).

Chapter 6 shows the sequences of moves, and then Chapter 7 says:

> A move is made when:

> a piece is released on another square than the starting one, or

> a player touches an opponent's piece with one of his own pieces or with the hand in which his own piece is held.

So if the piece was picked up and replaced on the starting square, it was not moved, and that's fine.

On the other hand, these rules incorporate some other rules by reference, which include:

> The bombs and the flag may never be moved and therefore remain in the same place throughout the duration of the game

A bomb hasn't exactly remained in the same place if it's been picked up, has it?

HWR_14 · 3y ago

You won because "He claimed that it was illegal to pick up that piece and put it down again, although he had no objection until he learned that I'd tricked him."

I think "touch-move" is perfectly valid to enforce, but you waive your right to do so once you move after that. If someone touches a piece in chess, then moves another piece, you don't get to go all the way until you're mated and then say there was an error you should have won on.

And if you touch an illegal-to-move piece, I would say the penalty would probably be revealing the bomb (or that it is immobile), not forfeiture of the game.

boringg · 3y ago

Hmm that you picked up the piece might have been an infraction. I constantly touch the bomb pieces but I don't lift them off the ground which implies that they can move.

Tough call - glad that you are still carrying this from your childhood though. I would wager that you won but through borderline cheating. Still not sure TBH

Swenrekcah · 3y ago

You won. Deception is obviously a critical part of all warfare.

yborg · 3y ago

By that argument, the opponent rightfully won, because negotiation is also a critical part of warfare. He got his opponent to give away a tactical battlefield victory on the basis of mutually agreed ground rules.

Of course, dissatisfaction with such outcomes has often resulted in further wars.

milesskorpen · 3y ago

If you're playing seriously, I think if you touch a piece you need to move it.

Imnimo · 3y ago

Wow! I did exactly the same thing as a kid. I thought I was sooooo clever.

hervature · 3y ago · 7 in thread

I'm still trying to grok and implement the paper, but I studied AlphaGo/AlphaZero/MuZero during my PhD. The core contribution here is the Nash equilibrium component for imperfect information games using only self-play. Note, there is no MCTS being done in this paper. This differs from counterfactual regret methods (like the most famous Poker AIs) because it does not need to compute over all possible "information sets", a requirement which makes those methods intractable for even sufficiently complicated poker variants. It should also be noted (as they do in the paper) that this is more incremental than methodologically innovative in the way AlphaGo was. This is the AlphaZero step increment to NeuRD. As is my general critique with their previous papers, they generally omit many engineering details that prove to be very important. Here, they admit that fine-tuning is vitally important (one of the 3 core steps) but details are relegated to the supplementary materials. It also opens up the question of whether this new "fine-tuned" policy still guarantees the Nash equilibrium, which it obviously does not, as some mixed strategies are going to have sufficiently small probability. I wish researchers would be more honest with "this is a hack to get things to work on a computer because neural networks have floating point inaccuracies". It doesn't ruin any of the theory and no one is going to hold it against you. But it causes all sorts of confusion when trying to reimplement.
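A small sketch of the concern above about fine-tuning and the Nash guarantee. It assumes that fine-tuning effectively prunes low-probability actions, which is an assumption drawn from the comment's wording and not a description of the paper's actual procedure. The 2x2 zero-sum payoff matrix and the helper names `worst_case_value` and `prune` are hypothetical, chosen so the equilibrium puts only 1% on one action.

```python
from fractions import Fraction

# Row player's payoffs in a hypothetical 2x2 zero-sum game (not from the paper).
PAYOFF = [
    [Fraction(1), Fraction(-1)],
    [Fraction(-99), Fraction(99)],
]


def worst_case_value(row_strategy):
    """Row player's expected payoff when the column player best-responds."""
    n_cols = len(PAYOFF[0])
    return min(
        sum(p * PAYOFF[r][c] for r, p in enumerate(row_strategy))
        for c in range(n_cols)
    )


def prune(strategy, threshold):
    """Zero out actions with probability below `threshold`, then renormalise."""
    kept = [p if p >= threshold else Fraction(0) for p in strategy]
    total = sum(kept)
    return [p / total for p in kept]


# Equilibrium mix for the row player: the column player is indifferent
# between both columns, so the row player cannot be exploited.
nash = [Fraction(99, 100), Fraction(1, 100)]

# Assumed post-processing: drop anything played less than 5% of the time.
pruned = prune(nash, Fraction(1, 20))

print("worst case at equilibrium:", worst_case_value(nash))
print("worst case after pruning: ", worst_case_value(pruned))
```

In this toy game the equilibrium mix cannot be pushed below the game value, while the pruned mix becomes a pure strategy that a best-responding opponent can punish. That is the sense in which a rarely played action can still carry the guarantee.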

TemplateRex · 3y ago

What I don't understand is why they don't try to make inferences about the opponent's private state. I get that the full Bayesian update is intractable, but some sort of RNN or LSTM should be able to produce pretty accurate estimates for the opponent's private info. And with self-play, you can train the deduction head of a NN by adding a KL-divergence between inferred and ex-post observed pieces. That would both make you guess better and also try and "jam" your opponent's inference by randomizing your own piece distribution.
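As a rough illustration of the KL-divergence idea in this comment (not something the paper does, as the comment itself notes), the loss for a single hidden piece could look like the sketch below. The four-rank list, the function names, and the example probabilities are all made up for illustration; a real deduction head would output a distribution over every rank for every unrevealed piece.

```python
import math

# Hypothetical, shortened rank list for one hidden piece.
RANKS = ["bomb", "scout", "miner", "marshal"]


def kl_divergence(target, predicted, eps=1e-12):
    """KL(target || predicted) for two discrete distributions over RANKS."""
    return sum(
        t * math.log(t / max(p, eps))
        for t, p in zip(target, predicted)
        if t > 0
    )


def reveal_loss(predicted, revealed_rank):
    """Loss for one piece once its true rank has been observed ex post."""
    target = [1.0 if rank == revealed_rank else 0.0 for rank in RANKS]
    return kl_divergence(target, predicted)


# Two made-up predictions for a piece that turns out to be the marshal.
confident = [0.05, 0.05, 0.10, 0.80]
unsure = [0.25, 0.25, 0.25, 0.25]

print(reveal_loss(confident, "marshal"))  # lower loss: the guess was good
print(reveal_loss(unsure, "marshal"))     # higher loss: no information used
```

With a one-hot target this reduces to the negative log-probability assigned to the revealed rank, so it is the same as an ordinary cross-entropy classification loss.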

hervature · 3y ago

This is an interesting avenue for future research. The reason why it is not as straightforward as you claim is because all inference is going to depend on your perception of their policy. That's why the Nash equilibrium is sought after first. Because you should assume your opponent is perfect until you start observing their suboptimal behavior that you can exploit. Additionally, you would also have to handle the meta part where the exploiting portion of the algorithm isn't itself being exploited by the opponent. Somehow, you should deviate slowly from the Nash equilibrium but revert quickly if the opponent is abusing your new strategy.

thomasahle · 3y ago

Bayesian play is not necessarily optimal for imperfect information games. The reason is: You don't only need to play optimally with respect to the information you have observed, you also need to hide your own information and balance those two needs.

See the DeepMind "Player of Games" paper from last year for an agent that takes a more game theoretic approach, which is probably needed for "simpler" games like Poker, that we can play to higher levels of accuracy: https://arxiv.org/pdf/2112.03178.pdf

algo_trader · 3y ago

> I'm still trying to grok and implement the paper, but I studied AlphaGo/AlphaZero/MuZero during my PhD

What is the SOTA on solving non-adversarial (single player?) POMDPs? Are those considered to be much simpler problems?

hervature · 3y ago

POMDPs are exactly how one formalizes imperfect information games. This is where the concept of information sets comes from. To answer your question, any two player algorithm is going to apply to single player games as it is trivial to transform. For games like 2048, the "adversary" is simply the opposite of your outcome. For games where you are trying to maximize your score, this is the standard RL setting and any of the Atari algorithms (including MuZero) can be used.

In case you are wondering about cooperative multi-agent games, I would check this group's publications: https://www.cs.ox.ac.uk/people/publications/date/Shimon.Whit...

joe_the_user · 3y ago

> POMDPs? Are those considered to be much simpler problems?

Well, solving Partially Observable Markov Decision Processes in general isn't just NP-complete but actually undecidable. So I'm not sure how one measures SOTA (state of the art).

igorkraw · 3y ago

It strongly depends on what type of structure you can assume and how expensive sampling is. DreamerV2, Agent57 on Atari, and the generalized agent model trained on 600 tasks by DeepMind might be worth looking into for different approaches to POMDPs, but you can do much better if you impose physics priors, e.g. by using neural ODEs for the latent state modeling.

POMDP just means "observations are not state" and that you need to use a stateful policy to infer the state somehow, but without further assumptions it's difficult to answer this question
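One way to read "use a stateful policy to infer the state" is a belief that gets updated with each observation. The sketch below is a plain Bayes update for a single unrevealed Stratego piece. The prior uses the classic piece counts (6 Bombs, 1 Flag, 33 movable pieces out of 40), while the likelihood numbers and the function name `update_belief` are assumptions picked purely for illustration.

```python
def update_belief(prior, likelihood):
    """Bayes rule: posterior(s) is proportional to likelihood(s) * prior(s)."""
    unnormalised = {state: prior[state] * likelihood[state] for state in prior}
    total = sum(unnormalised.values())
    if total == 0:
        raise ValueError("observation has zero probability under this belief")
    return {state: weight / total for state, weight in unnormalised.items()}


# Prior for one unrevealed piece, from the classic 40-piece set.
prior = {"bomb": 6 / 40, "flag": 1 / 40, "movable": 33 / 40}

# Assumed likelihood of "this piece stayed put for a turn": immobile pieces
# always stay put, movable pieces usually do as well (0.9 is a made-up value).
stayed_put = {"bomb": 1.0, "flag": 1.0, "movable": 0.9}
belief = update_belief(prior, stayed_put)

# A later observation: the piece moved, which rules out Bomb and Flag.
moved = {"bomb": 0.0, "flag": 0.0, "movable": 1.0}
belief_after_move = update_belief(belief, moved)

print(belief)
print(belief_after_move)
```

The belief dictionary is the "state" the policy carries between turns. In a full game the likelihoods would depend on the opponent's policy, which is exactly the difficulty raised elsewhere in this thread.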

miiiiiike · 3y ago · 7 in thread

Got a copy of Stratego (one of the old-style ones with descending rank, as pleases the gods) so I could show my board game loving girlfriend one of my favorite games from when I was a kid. She hated it.

jrussino · 3y ago

Did you still enjoy it? I loved playing this game with my Dad when I was a kid and I'm wondering if it still holds up.

jasone · 3y ago

It held up for me. I played a lot of Stratego with family and friends as a kid, but there was one friend in particular who routinely wiped the battlefield with my pieces. I couldn't understand how he so consistently beat me, and as an adult I was able to 1) reason more deeply about the game, and 2) quickly learn deeper strategy from the Internet. As a result, the game is even richer now to me than it was in the 1980s.

TedDoesntTalk · 3y ago

Played it recently with a 12 year old boy. He loves it.

miiiiiike · 3y ago

Loved it. I still think it’s one of the best games out there. The best games tho: Neuroshima Hex! And 51st State from Portal games.

LionTamer · 3y ago

Seeing the paper title gave me so much nostalgia for playing Stratego as a kid; that was always my favorite board game. Glad to see I’m not the only one who used to love that game. Few of my friends growing up played it.

kibwen · 3y ago

Stratego was probably my favorite board game as a kid, but, to be fair, when people these days say that they love board games what they probably mean is that they love Eurogames, which Stratego decidedly is not.

miiiiiike · 3y ago

Yeah, no. She plays everything.

She’s one of the few people who can regularly beat me at war games. The Avalon Hill and GMT types. My favorite example is the time she carefully misled me into believing that she hadn’t found the other end of a wormhole that opened near my home-world in a game of Space Empires 4x. Spent the whole game exploring and increasing ship speed and weapons just enough, waiting for me to commit my heavily armed, but slow-ish ships. Then, giggled, and sighed with relief, as she paid the overnight rate to send an armada to my doorstep. Oh, and the time she slipped a WMD into the US in Labyrinth. Or in Starship Troopers..

She likes everything from Codenames to those Rosenberg games that are so heavy they should come with an OSHA training poster.

In Stratego she had bluffed me into believing her flag was in a corner.. But then she moved a piece that I assumed was a bomb and her face gave away everything. But a momentary lapse of Stratego-face wasn’t the issue. Mostly, it was just too slow for her. Her last obsession was Tyrants of the Underdark. Deck building + area control.

evouga · 3y ago · 7 in thread

I mean. Stratego is a great game; I had a lot of fun playing it at summer camps when I was a young boy. It's cool there's a good AI for it.

But this result feels a bit anticlimactic in a world where AIs can already beat expert humans at go, six-player poker, Starcraft, ...

goodside · 3y ago

It’s explained right in the abstract why Stratego is a more difficult game for AI than go or poker.

TemplateRex · 3y ago

You can view Stratego as the "Cartesian product" of a public information board game and an imperfect information "card" game. The board game has much simpler local tactics than e.g. chess or checkers, although whole board tactics where 2+ high pieces are trapping 1 lower piece defended by 1+ high piece are extremely complicated to reason about.

The "card" game can be viewed as a form of Limit Poker. The bidding in poker is done with secret cards and public bets. In Stratego, you bet with your secret pieces, so it's more like a closed bid auction. But since there are only 10 moveable ranks, the range of bluffs you can pull off is rather limited compared to e.g. No Limit Poker.

Each of the "subgames" in itself is quite tractable for computers. But the numerical product of all public game states times the number of secret information states is humongous. Combine this with the fact that imperfect information game trees don't decompose as nicely as e.g. chess game trees, and computers will also not be able to divide-and-conquer their way out of the numerical complexity with brute force. Whereas humans can come a long way with heuristics.
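To put a number on the "secret information" half of that product, the sketch below counts distinct initial deployments only, not all game states. It assumes the classic 40-piece set per player; the thread itself only mentions the eight Scouts, so the other counts are brought in from the standard rules. `count_deployments` is a hypothetical helper name.

```python
from math import factorial

# Classic Stratego piece counts per player (40 pieces on 40 starting squares).
PIECE_COUNTS = {
    "Flag": 1, "Bomb": 6, "Spy": 1, "Scout": 8, "Miner": 5,
    "Sergeant": 4, "Lieutenant": 4, "Captain": 4, "Major": 3,
    "Colonel": 2, "General": 1, "Marshal": 1,
}


def count_deployments(piece_counts):
    """Distinct arrangements of one player's pieces on their starting squares.

    This is the multinomial coefficient n! / (k1! * k2! * ...), because
    pieces of the same rank are interchangeable.
    """
    total = factorial(sum(piece_counts.values()))
    for count in piece_counts.values():
        total //= factorial(count)
    return total


one_side = count_deployments(PIECE_COUNTS)
both_sides = one_side ** 2  # the two players deploy independently

print(f"{one_side:.3e} deployments per player")
print(f"{both_sides:.3e} joint deployments")
```

This comes out on the order of 10^33 arrangements per player and 10^66 for both, before a single move has been made. Every one of those hidden configurations then multiplies against the public board positions reachable during play.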

60secz · 3y ago

These are interesting not because you're solving for a game, but because you're potentially partially solving a category of problem.

kzrdude · 3y ago

How far did they get with Starcraft? Stratego should be a stepping stone to get there - it introduces imperfect information.

anonymoushn · 3y ago

spywaregorilla left a good reply about Starcraft. In the games I watched, despite the AI being handicapped to use human-like levels of APM and human-like viewport management, it primarily fought using *clearly superhuman* blink stalker micromanagement. Stalkers with barely any health would typically survive engagements and get to re-engage when their shields were fully recovered. On the other hand, a human player managed to confuse the bot by repeatedly airdropping units in its base, picking them up, and dropping them elsewhere. The bot uselessly moved its mass of stalkers back and forth while losing many probes and buildings.

Edit: After looking at some games again, the AI also benefits from precise target selection for the phoenix's graviton beam. A human player might take 3 phoenixes and a bunch of stalkers into a fight against an army containing 3 immortals, some sentries, some stalkers, and some zealots, and use graviton beam to pick up some mix of units including units other than immortals. The bot can pick up only the immortals.

spywaregorilla · 3y ago

It's very difficult to say. There are many cookie cutter strategies for RTS games and it's difficult to handicap the AI to not use its machine precision and quickness of thinking to just win everything on a tactical level. Actions-per-minute handicaps are not nearly nuanced enough to capture this. Generally it seems that the absolute top humans are better than bots strategically but the execution to do so is really, really tough. And then of course there are dumb exploits because you find some dumb weak point that any sane human would quickly adjust their behavior for.

confuseshrink · 3y ago

Starcraft in the form of AlphaStar worked in the sense that it could beat humans, at least in the short term. The problem with the whole technique is that they had to tether it to the human examples they had gathered in the form of a divergence loss.

I haven't checked out the linked paper yet but if they managed to do something from first principles that would still be an interesting development.

Someone · 3y ago · 1 in thread

My gut feeling is that optimum play in Stratego is not to play.

It feels better to let your opponent try and take your piece because, if they take it, you can make sure there will be at least one neighboring piece that can strike back.

If so, every game should end in a draw because of inactivity of both players.

My limited experience confirms that. Playing defensively, only offering my scouts to get intel tends to win games for me.

But then, I’ve never found any strategy guides, and wouldn’t know how good players play.

thomasahle · 3y ago

If you read the article you will see a lot of discussion on different strategies the bot has (re)discovered. You can also try playing against others on the internet and see how well your method works.

thomasahle · 3y ago

I really like the section on initial piece deployment:

> The Flag is almost always put on the back row, and often protected by Bombs. Occasionally, however, DeepNash will not surround the Flag with Bombs. Experts (e.g. Vincent de Boer, 3-fold World Champion) believe that it is indeed good to occasionally not protect the Flag because this unpredictability makes it harder for the opponent in the end-game. Another pattern observed is that the highest pieces, the 10 and 9, are often deployed on different sides of the board. Additionally, the Spy is quite often located not too far away from the 9 (or 8), which protects it against the opponent’s 10. DeepNash does not often deploy Bombs on the front row, which complies with the behavior seen from strong human players. The 3’s (Miner), which can defuse Bombs, are often placed on the back row, which makes sense because their importance typically increases throughout a game as more opponent Bombs and potential Flag positions get revealed. The eight 2’s (Scout) are typically deployed both in the front and more in the back, allowing to scout opponent pieces initially but also in later phases of the game.

voidfunc · 3y ago

I haven't played Stratego since I lost my board when I was in third grade and brought it to play during recess...

Is there a good online version these days?

mensetmanusman · 3y ago

Would it be interesting if they posted the approximate kWh of energy required to train?

bezoz · 3y ago

So we have gone from DQN to AlphaGo to AlphaZero to MuZero to DeepNash? Every time I think I have figured out their naming scheme, they come out with something even more unpredictable.

warrenm · 3y ago

I haven't played Stratego in decades!

Loved it as a kid, though
