Richard Sutton and Andrew Barto Win 2024 Turing Award

(awards.acm.org)

520 points · camlinke · 1y ago · 112 comments

75 comments · 22 top-level

ofirpress 1y ago · 18 in thread

Good time to re-read The Bitter Lesson: https://www.cs.utexas.edu/~eunsol/courses/data/bitter_lesson...

cxr 1y ago

Canonical URL: <http://www.incompleteideas.net/IncIdeas/BitterLesson.html>

khaledh 1y ago

Indeed a bitter lesson. I once enjoyed encoding human knowledge into a computer because it gives me understanding of what's going on. Now everything is becoming a big black box that is hard to reason about. /sigh/

Also, Moore's law has become a self-fulfilling prophecy. Now more than ever, AI is putting a lot of demand on computational power, to the point where it drives chip makers to create specialized hardware for it. It's becoming a flywheel.

anonzzzies 1y ago

I am still hoping AI progress will get to the point where the AI can eventually create AIs that are built up out of robust and provable logic which can be read and audited. Until that time, I wouldn't trust it for risky stuff. Unfortunately, it's not my choice and within a scarily short timespan, black boxes will make painfully wrong decisions about vital things that will ruin lives.

amelius 1y ago

Well, take compiler optimization for example. You can allow your AI to use correctness-preserving transformations only. This will give you correct output no matter how weird the AI behaves.

The downside is that you will sometimes not get the optimizations that you want. But, this is sort of already the case, even with human made optimization algorithms.
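
A minimal sketch of the idea in the comment above, assuming a toy expression language of integer `add`/`mul` nodes. The rule set and the `chooser` callback (standing in for the AI) are hypothetical names invented for this illustration, not taken from any real compiler. The point is only that a chooser restricted to semantics-preserving rewrites cannot change the program's result, however badly it picks.

```python
import random

# Expressions: int literals, variable names (str), or tuples ("add" | "mul", left, right).

def evaluate(expr, env):
    """Evaluate an expression with integer variables taken from env."""
    if isinstance(expr, int):
        return expr
    if isinstance(expr, str):
        return env[expr]
    op, left, right = expr
    a, b = evaluate(left, env), evaluate(right, env)
    return a + b if op == "add" else a * b

# Each rule returns an equivalent expression, or None when it does not apply.
def mul_by_one(e):
    if isinstance(e, tuple) and e[0] == "mul":
        if e[2] == 1:
            return e[1]
        if e[1] == 1:
            return e[2]
    return None

def add_zero(e):
    if isinstance(e, tuple) and e[0] == "add":
        if e[2] == 0:
            return e[1]
        if e[1] == 0:
            return e[2]
    return None

def mul_by_zero(e):
    if isinstance(e, tuple) and e[0] == "mul" and (e[1] == 0 or e[2] == 0):
        return 0
    return None

def fold_constants(e):
    if isinstance(e, tuple) and isinstance(e[1], int) and isinstance(e[2], int):
        return evaluate(e, {})
    return None

RULES = [mul_by_one, add_zero, mul_by_zero, fold_constants]

def apply_anywhere(expr, rule):
    """Apply rule at the root if it matches, otherwise try the subexpressions."""
    rewritten = rule(expr)
    if rewritten is not None:
        return rewritten
    if isinstance(expr, tuple):
        op, left, right = expr
        return (op, apply_anywhere(left, rule), apply_anywhere(right, rule))
    return expr

def optimize(expr, chooser, steps=20):
    """Let an untrusted chooser pick which rule to try at each step."""
    for _ in range(steps):
        expr = apply_anywhere(expr, chooser(expr, RULES))
    return expr

# A random chooser stands in for an arbitrarily "weird" AI.
rng = random.Random(0)
program = ("add", ("mul", "x", 1), ("mul", ("add", 2, 3), 0))
optimized = optimize(program, lambda expr, rules: rng.choice(rules))
```

A bad chooser can only waste steps or miss simplifications, which matches the downside mentioned above: the output is always correct, but not always as optimized as you would like.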

kleiba 1y ago

This depends a little bit on what the goal of AI research is. If it is (and it might well be) to build machines that excel at tasks previously thought to be exclusively reserved to, or needing to involve, the human mind, then these bitter lessons are indeed worthwhile.

But if you do AI research with the idea that by teaching machines how to do X, we might also be able to gain insight in how people do X, then ever more complex statistical setups will be of limited information.

Note that I'm not taking either point of view here. I just want to point out that perhaps a more nuanced approach might be called for here.

visarga 1y ago

> if you do AI research with the idea that by teaching machines how to do X, we might also be able to gain insight in how people do X, then ever more complex statistical setups will be of limited information

At the very least we know consistent language and vision abilities don't require lived experience. That is huge in itself, it was unexpected.

jdright 1y ago

> In computer vision, there has been a similar pattern. Early methods conceived of vision as searching for edges, or generalized cylinders, or in terms of SIFT features. But today all this is discarded. Modern deep-learning neural networks use only the notions of convolution and certain kinds of invariances, and perform much better.

I was there, at that moment where pattern matching for vision started to die. That was not completely lost though; what we learned from that time is still useful in other places today.

abdullahkhalids 1y ago

I was an undergrad interning in a computer vision lab in the early 2010s. During a group meeting, someone presented a new paper that was using abstract machine-learning-like stuff to do vision. The prof was so visibly perturbed and agnostic. He could not believe that this approach was even a little bit viable, when it so clearly was.

Best lesson for me - vowed never to be the person opposed to new approaches that work.

Buttons840 1y ago

Oof. Imagine the bitter lesson classical NLP practitioners learned. That paper is as true today as ever.

DavidPiper 1y ago

This describes Go AIs as a brute force strategy with no heuristics, which is false as far as I know. Go AIs don't search the entire sample space, they search based on their training data of previous human games.

HarHarVeryFunny 1y ago

First there was AlphaGo, which had learnt from human games, then further improved from self-play, then there was AlphaGo Zero which taught itself from scratch just by self-play, not using any human data at all.

Game programs like AlphaGo and AlphaZero (chess) are all brute force at core - using MCTS (Monte Carlo Tree Search) to project all potential branching game continuations many moves ahead. Where the intelligence/heuristics come into play is in pruning away unpromising branches from this expanding tree to keep the search space under control; this is done by using a board evaluation function to assess the strength of a given considered board position and assess if it is worth continuing to evaluate that potential line of play.

In DeepBlue (old IBM "chess computer" that beat Kasparov) the board evaluation function was hand written using human chess expertise. In modern neural-net based engines such as AlphaGo and AlphaZero, the board evaluation function is learnt - from human games and/or from self-play, learning what positions lead to winning outcomes.

So, not just brute force, but that (MCTS) is still the core of the algorithm.
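
For readers who want to see the shape of what this comment describes, here is a minimal sketch of Monte Carlo Tree Search with a pluggable evaluation function. It is not AlphaGo's algorithm: the game (a take-1-to-3 Nim variant where taking the last stone wins), the `Node` class, and the `evaluate` hook are all assumptions chosen to keep the example small. Swapping the random-rollout evaluator for a learned value function is roughly where a neural network would plug in.

```python
import math
import random

class Node:
    def __init__(self, stones, parent=None, move=None):
        self.stones = stones          # stones left; the player to move takes 1-3
        self.parent = parent
        self.move = move              # move that led from parent to this node
        self.children = []
        self.untried = [m for m in (1, 2, 3) if m <= stones]
        self.visits = 0
        self.wins = 0.0               # wins for the player who moved INTO this node

def rollout_value(stones, rng):
    """Random playout; returns 1.0 if the player to move at `stones` wins."""
    player = 0
    while stones > 0:
        stones -= rng.randint(1, min(3, stones))
        if stones == 0:
            return 1.0 if player == 0 else 0.0
        player ^= 1
    return 0.0  # no stones left: the previous player took the last one

def ucb(parent, child, c):
    """UCB1 score: average result plus an exploration bonus."""
    return child.wins / child.visits + c * math.sqrt(
        math.log(parent.visits) / child.visits)

def mcts(root_stones, iterations, evaluate=rollout_value, seed=0, c=1.4):
    rng = random.Random(seed)
    root = Node(root_stones)
    for _ in range(iterations):
        node = root
        # 1. Selection: follow the best UCB child while fully expanded.
        while not node.untried and node.children:
            node = max(node.children, key=lambda ch, p=node: ucb(p, ch, c))
        # 2. Expansion: add one untried move, if any.
        if node.untried:
            move = node.untried.pop()
            child = Node(node.stones - move, parent=node, move=move)
            node.children.append(child)
            node = child
        # 3. Evaluation: value from the point of view of the player to move.
        value = evaluate(node.stones, rng)
        # 4. Backpropagation: flip the perspective at every level.
        result = 1.0 - value
        while node is not None:
            node.visits += 1
            node.wins += result
            result = 1.0 - result
            node = node.parent
    return max(root.children, key=lambda ch: ch.visits).move

best_move = mcts(5, 2000)
```

With enough iterations the search tends to prefer leaving the opponent a multiple of four stones, which is the known winning strategy for this game. Passing a different `evaluate` function (for example a hand-written or learned position evaluator) changes only step 3; the tree search itself stays the same.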

signa11 1y ago

> ... This describes Go AIs as a brute force strategy with no heuristics ...

no, not really, from the paper

>> Also important was the use of learning by self play to learn a value function (as it was in many other games and even in chess, although learning did not play a big role in the 1997 program that first beat a world champion). Learning by self play, and learning in general, is like search in that it enables massive computation to be brought to bear.

important notion here is, imho "learning by self play". required heuristics emerge out of that. they are not programmed in.

dfan 1y ago

The paragraph on Go AI looked accurate to me. Go AI research spent decades trying to incorporate human-written rules about tactics and strategy. None of that is used any more, although human knowledge is leveraged a bit in the strongest programs when choosing useful features to feed into the neural nets. (Strong) Go AIs are not trained on human games anymore. Indeed they don't search the entire sample space when they perform MCTS, but I don't see Sutton claiming that they do.

crabbone 1y ago

I remember the article, and remember how badly it missed the point... The goal of writing a chess program that could beat a world champion wasn't to beat the world champion... the goal was to gain an understanding of how anyone can play chess well. The victory in that match would've been equivalent to e.g. drugging Kasparov prior to the match, or putting a gun to his head and telling him to lose: even cheaper and more effective.

krallistic 1y ago

"The goal of Automated driving is not to drive automatically but to understand how anyone can drive well"...

The goal of DeepBlue was to beat the human with a machine, nothing more.

While the quest for deeper understanding motivates a lot of research, most AI (read: modern DL) research is not about understanding human intelligence, but about automating things we could not do before. (Understanding human intelligence is nowadays a different field.)

perks_12 1y ago

The Bitter Lesson seems to be generally accepted knowledge in the field. Wouldn't that make DeepSeek R1 even more of a breakthrough?

currymj 1y ago

that was “bitter lesson” in action.

for example there are clever ways of rewarding all the steps of a reasoning process to train a network to “think”. but deepseek found these don’t work as well as much simpler yes/no feedback on examples of reasoning.

blufish 1y ago

nice read and insightful

vonneumannstan 1y ago · 17 in thread

Good time to remind everyone that Sutton is a human successionist and doesn't care if humans all die. He is not to be trusted nor celebrated: https://www.youtube.com/watch?v=NgHFMolXs3U

textlapse 1y ago

The ACM award is for their professional academic achievements - this fetishism to dig into another person’s personal life and find the most weird thing they said as the thing that paints over all of their life’s achievements as evil must stop.

It’s silly and dangerous. Because you don’t like thing A and they said/did thing A all of their lofty accomplishments get nullified by anyone. And worst of all internet gives your opinion the same weight as someone else (or the rest of us) who knows a lot about thing B that could change the world. From a strictly professional capacity.

This works me up because this is what’s dividing up people right now at a much larger scale.

I wish you well.

vonneumannstan 1y ago

>this fetishism to dig into another person’s personal life and find the most weird thing they said as the thing that paints over all of their life’s achievements as evil must stop.

This has nothing to do with his professional life. He has made these comments in a professional capacity at an industry AI conference... The rest of your comment is a total non sequitur.

>And worst of all internet gives your opinion the same weight as someone else (or the rest of us) who knows a lot about thing B that could change the world. From a strictly professional capacity.

I've worked professionally in the ML field for 7 years so don't try some appeal to authority bs on me. Geoff Hinton, Yoshua Bengio, Demis Hassabis, Dario Amodei and countless other leaders in the field all recognize and highlight the possible dangers of this technology.

kalkin 1y ago

> all of their lofty accomplishments get nullified by anyone

I don't think it's a question of whether their achievements are nullified, but as you mention, how to weight the opinions of various people. Personally, I think both a Turing award for technical achievement and a view that humanity ought to be replaced are relevant in evaluating someone's opinions on AI policy, and we shouldn't forget the latter because of the former.

(Also, this isn't about Sutton's personal life - that's a pretty bad strawman.)

jffhn 1y ago

>the most weird thing they said

Reminds me of a quote from Jean Cocteau, of which I could not find the exact words, but which roughly says that if the public knew what thoughts geniuses can have, it would be more terrified than admiring.

317070 1y ago

Have you ever met Sutton? He is the most heart-warming, caring and passionate hippy I have ever met. He does not want all humans to die. The talk you link also doesn't support your claim. Perhaps I missed it, in that case, do leave a timestamp.

In the talk, he says it will lead to an era of prosperity for humanity, however without humanity being in sole control of their destiny. His conclusion slide (at 12:33) literally has the bullet point "the best hope for a long-term future for humanity". That is opposite to you saying he "doesn't care if humans all die".

If I plan for my succession, I don't hope nor expect my daughter will murder me. I'm hoping for a long retirement in good health after which I will quietly pass in my sleep, knowing I left her as well as I could in a symbiotic relationship with the universe.

vonneumannstan 1y ago

Here's the difference, you are not personally building the device which will cause your demise and your succession. We as humanity ARE doing that and have agency to choose NOT to do that.

zoogeny 1y ago

> doesn't care if humans all die

That seems to be a harsh and misleading framing of his position. My own reading is that he believes it is inevitable that humans will be replaced by transhumans. That seems more like wild sci-fi utopianism than ill-will. It doesn't seem like a reason to avoid celebrating his academic achievements.

smokel 1y ago

It is interesting that you bring this to our attention, but I don't see why we should not trust or celebrate someone if they have views that you don't agree with.

Edit: especially since I think your implied claim that Sutton would actively want everyone to die seems very much unfounded.

cowsandmilk 1y ago

> doesn't care if humans all die

His last slide literally says “best hope for a long-term future for humanity”. That’s literally the opposite of what you’re claiming.

visarga 1y ago

I think he is trying to take the positive side of what is probably an inevitability.

vonneumannstan 1y ago

Or we could just you know, not build the thing that will probably kill us all and at minimum will obsolete all our labor value.

nycticorax 1y ago

This is so silly. Do you imagine temporal difference learning is some kind of human successionist plot?

vonneumannstan 1y ago

The video is not about his technical work but rather his view that AI will or should take over the future.

Version467 1y ago

Very disappointing. I do not understand how people earnestly defend the successionist view as a good future, but I thought he might at least give some interesting arguments.

This talk isn't that. There are no substantive arguments for why we should embrace this future and his representation of the opposite side isn't in good faith either, instead he chose to present straw-man versions of them.

He concludes with "A successful succession offers [...] the best hope for a long-term future for humanity." How this can possibly be true when ai succession necessarily includes replacement eludes me. He does mention transhumanism on a slide, but it seems extremely unlikely that he's actually talking about that and the whole succession spiel is just unfortunate wording.

neuroticnews25 1y ago

I'm not trying to be edgy or misanthropic but I don't understand why you would attach emotional value to the abstract concept of existence of humanity over the next millennia. Isn't it the same kind of extrapolation of kin selection instincts far into the domain of values as for example favouring your race over others?

To me robots are just as cool.

visarga 1y ago

> ai succession necessarily includes replacement

How is AI going to make its own chips and energy? The supply chain for AI hardware is long and fragile. AGI will have an interest in maintaining peace for this reason.

And why would it replace us, our thoughts are like food for AI. Our bodies are very efficient and mobile, biology will certainly be an option for AGI at some point.

ks2048 1y ago

At least his Twitter profile no longer has the bitcoin-meme-red-eyes thing.

porridgeraisin 1y ago · 6 in thread

Their book "Introduction to Reinforcement Learning" is one of the most accessible texts in the AI/ML field, highly recommend reading it.

barrenko 1y ago

I've tried descending down the RL branch, always seem way out of my depth with those formulas and star-this, star-that.

porridgeraisin 1y ago

Yeah, the formalisations can be hard to crunch through (especially because of [1]). But this book in particular is quite well laid out. I'd suggest getting a math background on the (very) basics of "contraction mappings", as this is something the book kind of assumes you have the knowledge of.

[1] There's a lot of confusing naming. For example, due to its historic ties with behavioural psychology, there are a bunch of things called "eligibility traces" and so on. Also, even more than the usual "obscurity through notation" seen in all of math and AI, early RL literature has particularly bad notation. You'd see the same letter mean completely different things (sometimes even opposite!) in two different papers.
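
For anyone wondering why contraction mappings come up at all, here is a short worked statement in standard notation, which may differ from the book's. It assumes a finite set of states, bounded rewards, a fixed policy $\pi$, and a discount factor $\gamma < 1$. The Bellman expectation operator $T^\pi$ maps a value function $v$ to a new one:

$$
(T^\pi v)(s) = \sum_a \pi(a \mid s) \sum_{s', r} p(s', r \mid s, a)\,\bigl[r + \gamma\, v(s')\bigr]
$$

For any two value functions $v$ and $u$, the reward terms cancel and the remaining weights sum to one, so

$$
\bigl|(T^\pi v)(s) - (T^\pi u)(s)\bigr|
= \gamma \,\Bigl|\sum_a \pi(a \mid s) \sum_{s', r} p(s', r \mid s, a)\,\bigl[v(s') - u(s')\bigr]\Bigr|
\le \gamma \max_{s'} \bigl|v(s') - u(s')\bigr|
$$

Taking the maximum over $s$ gives $\lVert T^\pi v - T^\pi u \rVert_\infty \le \gamma \lVert v - u \rVert_\infty$, i.e. $T^\pi$ is a $\gamma$-contraction in the max norm. By the Banach fixed-point theorem it therefore has a unique fixed point $v_\pi$, and applying $T^\pi$ repeatedly converges to $v_\pi$ from any starting guess. That is the fact that iterative policy evaluation leans on.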

incognito124 1y ago

What is your background? Unfortunately I did not find it very accessible.

jxjnskkzxxhx 1y ago

That book is a joy. Strong recommend.

zelphirkalt 1y ago

You mean "Reinforcement Learning: An Introduction"? Or did they write another one?

porridgeraisin 1y ago

Yeah that one. Messed up the name.

j7ake 1y ago · 5 in thread

Amazing that Sutton (American) chooses to live in Edmonton, AB rather than USA.

Shows he has integrity and is not a careerist focused on prestige and money above all else.

armSixtyFour 1y ago

https://nationalpost.com/news/canada/ai-guru-rich-sutton-dee...

He gave up his US citizenship years ago but he explains some of the reasons why he left. I'll also say that the AI research coming out of Canada is pretty great as well so I think it makes sense to do research there.

tbrockman 1y ago

As someone who grew up in Edmonton, attended the U of A, and had the good fortune of receiving an incredible CS education at a discount price, I'm incredibly grateful for his (and the other amazing professors there) immense sacrifice.

Great people and cheap cost of living, but man do I not miss the city turning into brown sludge every winter.

jp57 1y ago

He's been there since he left Bell Labs, in the mid-2000s, I think. The U of A is, or was, rich with Alberta oil sands money and willing to use it to fund "curiosity-driven research", which is pretty nice if you're willing to live where the temperatures go down to -40 in the winter.

Philpax 1y ago

Keen is a fully remote outfit, so he can work wherever. It's pretty likely that his reputation would open that door for him no matter where he goes.

j7ake 1y ago

At his level it is much more than just being able to do what he wants, it’s about attracting resources and talent to accomplish his goals.

From that perspective location still matters if you want to maximise impact

zackkatz 1y ago · 3 in thread

Very cool to see this! It turns out my wife and I bought Andy Barto’s (and his wife’s) house.

During the process, there was a bidding war. They said “make your prime offer” so, knowing he was a mathematician, we made an offer that was a prime number :-)

So neat to see him be recognized for his work.

dustfinger 1y ago

Ha haa, that is fantastic. You should have joked and said - "I'd like to keep things even between us, how about $2?"

grumpopotamus 1y ago

> we made an offer that was a prime number

$12345678910987654321?

HPMOR 1y ago

This is a crazy story!! Hahaha wow. What was the prime number?

wegfawefgawefg 1y ago · 3 in thread

These guys are great but unfortunately the ai sutton and barto book is really bad. You would do better with Grokking Machine Learning by trask, and then a couple months of implementing ml papers.

Buttons840 1y ago

I second this suggestion. Read Grokking Deep Reinforcement Learning before reading Sutton. Well, the Sutton book is free, so take a peek, but if the formulas scare you then read Grokking Deep Reinforcement Learning.

317070 1y ago

These books are about different topics? Sutton and Barto is about Reinforcement learning, and the other book you mention by Trask is on Deep Learning?

wegfawefgawefg 1y ago

The sutton and barto book is often given as an introductory ai book to people with no experience in ai or rl. This is unfortunate as it functions as neither a good rl book nor a good ai book.

Whereas the introductory book Grokking Deep Learning walks you through implementing your own pytorch, and has a portion about rl near the end, then has a follow-up book on rl, and it is trivial to have your own from-scratch model and framework playing tic tac toe, snake, even without any math skills beyond multiplication.

This happens without just smacking the reader with the modified bellman equation, and a bunch of chain rule backwards, and padded paragraphs intended to sell additional versions to universities.

rvz 1y ago · 1 in thread

Absolutely well deserved.

darosati 1y ago

Hear hear

mark_l_watson 1y ago

Nice! Well deserved. They make both editions of their RL textbook available as a free to read PDF. I have been a paid AI practitioner since 1982, and I must admit that RL is one subject I personally struggle to master, and the Sutton/Barto book, the Coursera series on RL taught by Professors White and White, etc. personally helped me: recommended!

EDIT: the example programs for their book are available in Common Lisp and Python. http://incompleteideas.net/book/the-book-2nd.html

darkoob12 1y ago

They should have given it to some physicists to make it even.

rhema 1y ago

I used their RL book for a course I taught. It's beautifully written and freely available (http://incompleteideas.net/book/the-book-2nd.html)! I kept getting so distracted by the beautiful writing that I would miss the actual content.

cxie 1y ago

Huge congratulations to Andrew Barto and Richard Sutton on the well-deserved Turing Award! As a student, their textbook Reinforcement Learning: An Introduction was my gateway into the field. I still remember how Chapter 6 on ‘Temporal Difference Learning’ fundamentally reshaped the way I thought about sequential decision-making.

a timeless classic that I still highly recommend reading today!
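
For readers who have not met temporal-difference learning, here is a minimal sketch of tabular TD(0), modeled on the small five-state random walk that the chapter uses as an example. The function name, step size, and episode count are assumptions picked for this illustration, not values taken from the book.

```python
import random

def td0_random_walk(episodes=5000, alpha=0.02, gamma=1.0, seed=0):
    """Estimate state values for a 5-state random walk with tabular TD(0).

    Each episode starts in the middle state and steps left or right with
    equal probability. Stepping off the right end gives reward 1, stepping
    off the left end gives reward 0, and either one ends the episode.
    """
    rng = random.Random(seed)
    n_states = 5
    values = [0.5] * n_states            # initial guess for every state
    for _ in range(episodes):
        state = n_states // 2
        while True:
            next_state = state + rng.choice((-1, 1))
            if next_state < 0:           # fell off the left end
                reward, next_value, done = 0.0, 0.0, True
            elif next_state >= n_states: # fell off the right end
                reward, next_value, done = 1.0, 0.0, True
            else:
                reward, next_value, done = 0.0, values[next_state], False
            # TD(0) update: move the estimate toward the one-step target.
            td_error = reward + gamma * next_value - values[state]
            values[state] += alpha * td_error
            if done:
                break
            state = next_state
    return values

estimated = td0_random_walk()
```

The true values for this walk are 1/6, 2/6, ..., 5/6 from left to right, so the estimates should land near those. The key design point is that each update uses the current estimate of the next state rather than waiting for the episode's final outcome.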

textlapse 1y ago

This is a long time coming. To see through an idea from start to finish and make this span an entire field instead of a sub chapter in a dynamic programming book.

I wish a lot more games actually ended up using RL - the place where all of this started in the first place - would be really cool!

jimbohn 1y ago

Well deserved, RL will only gain more importance as time goes on thanks to its (and neural nets') flexibility. The bitter lesson won't feel so bitter as we scale.

optimalsolver 1y ago

So 2025 really is the year of agents.

jamesblonde 1y ago

Built a lot of my PhD on their work 20 years ago. It really stood the test of time.

carabiner 1y ago

Wonder if he's still working in AGI with Carmack.

pklee 1y ago

Very well deserved !! Amazing contributions !!

PartiallyTyped 1y ago

This made my day! Well deserved!

nextworddev 1y ago

RL may prove to be the most important tech going fwd due to test time compute

byyoung3 1y ago

they deserve it. definitely recommend their book

vicentwu 1y ago

Great!

ignoramous 1y ago

Congratulations to Prof Barto & Prof Sutton. I'm sure the late Harry Klopf is all smiles (:

> The ACM A.M. Turing Award, often referred to as the "Nobel Prize in Computing," carries a $1 million prize with financial support provided by Google, Inc.

Good on Google, but there will be questions if their mere sponsorship in any way influences the awards.

If ACM wanted, could it not raise $1m prize money from non-profits/trusts without much hassle?
