undefined | Better HN

0 pointsFiberBundle4y ago0 comments

Well with respect to Go the fundamental difference afaict is that you can apply self-supervised learning, which is an incredibly powerful approach (But note e.g. that even this approach wasn't successful in "solving" Starcraft). Unfortunately it's extremely difficult to frame real-world problems in that setting. I don't know anything about protein-folding and don't know what Deepmind uses to try to solve that problem, so I cannot comment on that.

0 comments

cjbprime4y ago

> this approach wasn't successful in "solving" Starcraft)

Why do you say that? As I understand it, AlphaStar beat pros consistently, including a not widely reported showmatch against Serral when he was BlizzCon champ.

gavagai6914y ago

Two possible reasons.

1. First, though I am not sure of this (i.e. this should be verified), I heard that the team working on AlphaStar initially tried to create a Starcraft AI entirely through "self-play," but this was not successful. (Intuitively, in a real-time game, there are too many bad options too early on that even with a LOT of time to learn, if your approach is too "random" you will quickly enter an unwinnable position and not learn anything useful.) As a result, they replaced this approach with an approach which incorporated learning from human games.

2. "including a not widely reported showmatch against Serral when he was BlizzCon champ." is a mischaracterization. It was not a "showmatch," rather there was a setup at Blizzcon where anyone could sit down and play against AlphaStar, and Serral at some point sat down to play AlphaStar there. He went 0-4 vs AlphaStar's protoss and zerg, and 1-0 vs its Terran. However, not only was he not using his own keyboard and mouse, but he could not use any custom hotkeys. If you do not play Starcraft it may not be obvious just how large of a difference this could make. BTW, when Serral played (perhaps an earlier iteration of) AlphaStar's terran on the SC2 ladder, he demolished it.

I remember when seeing the final report, I was a bit disappointed. It seemed like they cut the project off at a strange point, before AlphaStar was clearly better than humans. I feel that if they had continued they could have gotten to that point, but now we will never know.

fuy4y ago

"It seemed like they cut the project off at a strange point, before AlphaStar was clearly better than humans. I feel that if they had continued they could have gotten to that point" What if that's why they cut it off..

1 more reply

dwohnitmok4y ago

> but he could not use any custom hotkeys.

IIRC you could and Serral did set his own custom keybindings on the machine. The main difference was different keyboard and mouse.

1 more reply

zwaps4y ago

Not once humans adapted to it afaik. AlphaStar got to top grandmaster level and then that was it, as people found ways to beat it. Now, it may be that the team considered the project complete and stopped training it. But technically - as it stands - Starcraft is still the one game where humans beat AI.

cjbprime4y ago

No, the version which played on ladder was much weaker than the later version which played against pros and was at BlizzCon -- the later version was at professional level of play.

callmekit4y ago

There were numerous issues. First one (somewhat mitigated lately) was extremely large number of actions per minute and (most importantly) extremely fast reaction speed.

Another big issue is that the bot communicated with the game via a custom API, not a via images and clicks. Details of this API are unknown - like how invisible units were handled, but it was much higher level than a human would have (pixels).

If you look at the games, the bot wasn't clever (which was a hope), just fast and precise. And some people far from the top were able to beat it convincingly.

And now the project is gone, even before people had a chance to really play against the bot and find more weaknesses.

bglazer4y ago

That’s not entirely correct, as I know of at least one approach to neural program synthesis that employs self supervised learning.

https://arxiv.org/abs/2006.08381

It’s a slightly different, easier problem: generating programs based on example outputs, rather than natural language specifications.

YeGoblynQueenne4y ago

The difference is that DreamCoder has a hand-crafted PCFG [1] that is used to generate programs, rather than a large language model. So the difference is in how programs are generated.

________

[1] The structure of the PCFG is hand-crafted, but the weights are trained during learning in a cycle alternating with neural net training. It's pretty cool actually, thought a bit over-engineered if you ask me.

bglazer4y ago

Right, I think it’s a bit crazy not to use a grammar as part of the generation process when you have one. My guess is that constraining LLM generation with a grammar would make it way more efficient. But that’s more complicated than just throwing GPT3 at all of Github.

Also, my understanding is that Dreamcoder does some fancy PL theory stuff to factorize blocks of code with identical behavior into functions. Honestly I think that’s the key advance in the paper, more than the wake-sleep algorithm they focus on.

Anyways the point was more that self supervised learning is quite applicable to learning to program. I think the downside is that the model learns its own weird, non-idiomatic conventions, rather than copying github.

1 more reply

j / k navigate · click thread line to collapse

0 comments

cjbprime4y ago

> this approach wasn't successful in "solving" Starcraft)

Why do you say that? As I understand it, AlphaStar beat pros consistently, including a not widely reported showmatch against Serral when he was BlizzCon champ.

gavagai6914y ago

Two possible reasons.

fuy4y ago

1 more reply

dwohnitmok4y ago

> but he could not use any custom hotkeys.

IIRC you could and Serral did set his own custom keybindings on the machine. The main difference was different keyboard and mouse.

1 more reply

zwaps4y ago

cjbprime4y ago

No, the version which played on ladder was much weaker than the later version which played against pros and was at BlizzCon -- the later version was at professional level of play.

callmekit4y ago

There were numerous issues. First one (somewhat mitigated lately) was extremely large number of actions per minute and (most importantly) extremely fast reaction speed.

If you look at the games, the bot wasn't clever (which was a hope), just fast and precise. And some people far from the top were able to beat it convincingly.

And now the project is gone, even before people had a chance to really play against the bot and find more weaknesses.

bglazer4y ago

That’s not entirely correct, as I know of at least one approach to neural program synthesis that employs self supervised learning.

https://arxiv.org/abs/2006.08381

It’s a slightly different, easier problem: generating programs based on example outputs, rather than natural language specifications.

YeGoblynQueenne4y ago

The difference is that DreamCoder has a hand-crafted PCFG [1] that is used to generate programs, rather than a large language model. So the difference is in how programs are generated.

________

bglazer4y ago

1 more reply

j / k navigate · click thread line to collapse