TensorKart: self-driving MarioKart with TensorFlow (opens in new tab)

(kevinhughes.ca)

631 pointspickle279y ago68 comments

68 comments

50 comments · 22 top-level

cr0sh9y ago· 6 in thread

This is pretty cool; as someone who is currently working on the second project (traffic sign recognition) for the Udacity "Self-Driving Car Engineer" nanodegree, using TensorFlow - it is interesting to me how it seems like the "standard" MNIST CNN can be adapted to so many other use cases.

For the project I am currently working on, I'm using a slightly modified form of LeNet - which isn't too different from the TF MNIST tutorial; after all, recognizing traffic signs isn't much different than recognizing hand-written numbers...

...but "driving" a course? That seems radically different to my less-than-expert-at-TensorFlow understanding, but that is only due to my ignorance.

I'm glad that these examples and demos are being investigated and made public for others - especially people learning like myself - to look at and learn from.

halflings9y ago

From the post:

> Later, I switched to use Nvidia’s Autopilot...

So I guess he didn't use the MNIST CNN model.

cr0sh9y ago

However, if you look at the code:

https://github.com/SullyChen/Autopilot-TensorFlow/blob/maste...

You can see that it follows much the same pattern as LeNet CNN for MNIST - a few (ok, more than a few!) convolutional layers followed by a few fully connected layers.

Maybe you could call it a "follow on" or perhaps an ANN pattern?:

Conv -> Conv -> Reshape/Flatten -> FC -> FC -> FC

(disregarding activation and such)

...which is really the lesson of the LeNet MNIST CNN - at least, that's my takeaway.

1 more reply

glidek9y ago

As someone who's interested in taking the Udacity course, would your recommend it? Do you think the course prepares you enough find a Self-Driving developer job? Would you learn enough to compete/work along side people who got their Masters/PhD in Machine Learning? Appreciate your input.

cr0sh9y ago

> As someone who's interested in taking the Udacity course, would your recommend it?

So far, yes - but that has a few caveats:

See - I have some background prior to this, and I think it biases me a bit. First, I was one of the cohort that took the Stanford-sponsored ML Class (Andrew Ng) and AI Class (Thrun/Norvig), in 2011. While I wasn't able to complete the AI Class (due to personal reasons), I did complete the ML Class.

Both of these courses are now offered by Udacity (AI Class) and Coursera (ML Class):

https://www.udacity.com/course/intro-to-artificial-intellige...

https://www.coursera.org/learn/machine-learning

If you have never done any of this before, I encourage you to look into these courses first. IIRC, they are both free and self-paced online. I honestly found the ML Class to be easier than the AI class when I took them - but that was before the founding of these two MOOC-focused companies, so the content may have changed or been made more understandable since then.

In fact, now that I think about it, I might try taking those courses again myself as a refresher!

After that (and kicking myself for dropping out of the AI Class - but I didn't have a real choice there at the time), in 2012 Udacity started, and because of (reasons...) they couldn't offer the AI Class as a course (while for some reason, Coursera could offer the ML Class - there must have been licensing issues or something) - so instead, they offered their CS373 course in 2012 (at the time, titled "How to Build Your Own Self-Driving Vehicle" or something like that - quite a lofty title):

https://www.udacity.com/course/artificial-intelligence-for-r...

I jumped at it - and completed it as well; I found it to be a great course, and while difficult, it was very enlightening on several fronts (for the first time, it clearly explained to me exactly how a Kalman filter and PID worked!).

So - I have that background, plus everything else I have read before then or since (AI/ML has been a side interest of mine since I was a child - I'm 43 now).

My suggestion if you are just starting would be to take the courses in roughly this order - and only after you are fairly comfortable with both linear algebra concepts (mainly vectors/matrices math - dot product and the like) and stats/probabilities. To a certain extent (and I have found this out with this current Udacity course), having a knowledge of some basic calculus concepts (derivatives mainly) will be of help - but so far, despite that minor handicap, I've been ok without that greater knowledge - but I do intend to learn it:

1. Coursera ML Class 2. Udacity AI Class 3. Udacity CS373 course 4. Udacity Self-Driving Car Engineer Nanodegree

> Do you think the course prepares you enough find a Self-Driving developer job?

I honestly think it will - but I also have over 25 years under my belt as a professional software developer/engineer. Ultimately, it - along with the other courses I took - will (and have) help me in having other tools and ideas to bring to bear on problems. Also - realize that this knowledge can apply to multiple domains - not just vehicles. Marketing, robotics, design - heck, you name it - all will need or do currently need people who understand machine learning techniques.

> Would you learn enough to compete/work along side people who got their Masters/PhD in Machine Learning?

I believe you could, depending on your prior background. That said, don't think that these courses could ever substitute for graduate degree in ML - but I do think they could be a great stepping stone. I am actually planning on looking into getting my BA then Masters (hopefully) in Comp Sci after completing this course. Its something I should have done long ago, but better late than never, I guess! All I currently have is an associates from a tech school (worth almost nothing), and my high school diploma - but that, plus my willingness to constantly learn and stay ahead in my skills has never let me down career-wise! So I think having this ML experience will ultimately be a plus.

Worst-case scenario: I can use what I have learned in the development of a homebrew UGV (unmanned ground vehicle) I've been working at on and off for the past few years (mostly "off" - lol).

> Appreciate your input.

No problem, I hope my thoughts help - if you have other questions, PM me...

melvinmt9y ago

> the Udacity "Self-Driving Car Engineer" nanodegree

That looks like a great course by the way, thanks for sharing.

jrheard9y ago

I'm one of cr0sh's classmates. I don't have any background in ML/AI/etc, so I've had to supplement the Udacity course materials with a lot of external resources (just finished watching the Stanford CS231n course, which was very helpful), but overall the course been really interesting+fun so far. It's really nice to be exposed to new kinds of tech I've never heard of / used before. Refreshing change from webdev.

If you're strapped for cash and don't want to pay the $800/term, you could definitely learn these things on your own using free online resources. If you don't mind the price, though, I've found this course worth the time+money+effort so far. [they're not paying me to say this :)]

3 more replies

JonnieCache9y ago· 4 in thread

In contrast, here's what is effectively an oracle machine playing mario kart: https://www.youtube.com/watch?v=ZBNgbJ5hXtQ

(Amazingly detailed) info: http://tasvideos.org/5243S.html

rl39y ago

I like how it just glitches itself to an almost instant win on half of the maps.

dclowd99019y ago

These are Tool-Assisted Speedruns. That means that it's a human player using things like slow motion, mem dumps and other mechanisms to play perfect games. It's more an example of human's abilities when augmented with computers than AI discovering those glitches itself.

1 more reply

prezjordan9y ago

Thank you for sharing! I thought the Moo Moo Farm run was incredible on its own, THEN I saw the write-up. Blown away.

hmhrex9y ago

This is awesome! So much fun to watch these shortcuts.

adyus9y ago· 3 in thread

Congrats on finishing the project! As you've already linked at the bottom of your post, it's possible that OpenAI could've solved most of your I/O issues.

One thing I'd suggest is exploring a reward function, instead of using only pre-recorded training data. That is, give the AI a goal to complete (in this case, finish the race) and let it learn by itself!

Drdrdrq9y ago

I would love to learn how to do that - any suggestions?

EDIT: to clarify: what should I google for?

adyus9y ago

Here's what I could find in a couple minutes:

https://github.com/openai/universe-starter-agent

OpenAI's example universe agent. Remember that while their goal is an agent that works in any and all environments (read: games), you could certainly optimize yours just for MarioKart.

1 more reply

paulbaumgart9y ago

Reinforcement Learning. Here's a good intro: http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching.html

1 more reply

jostmey9y ago· 3 in thread

Quote: "Driving a new (untrained) section of the Royal Raceway:"

So the author did a proper test of the model by scoring it on an unseen track to make sure it generalizes! This is very awesome!

ska9y ago

How did we get from "bare minimum sensible testing" to "This is very awesome!"? Are things that bad on average?

kevinwang9y ago

There's probably a broad range of people in the hn community.

1 more reply

jupiter900009y ago

It is pretty cool! I'd love to see how it did on the other parts of that track as well, if that was tested

eli_gottlieb9y ago· 3 in thread

Personally, I'm just a little impressed that you can train an active agent to play a game using old-fashioned supervised learning on screen states and controller states rather than relying on "action-oriented" learning techniques like reinforcement learning, online learning, or even a recurrent model.

It really shows how simple many control tasks actually are!

tobilarscheid9y ago

This is exactly what I wondered about. So what exactly is the function you are training for? Is it basically like "if the screen (showing the track) looks like this, apply these controls"?

PeterisP9y ago

An more accurate description of the function would be "given this picture of a screen, what is the most likely key my author was pressing in this situation" - no goals, no values, no optimization, but simply learning to imitate the actions performed by a human.

Coincidentally, one of the neural network components in AlphaGo did pretty much the same, i.e. attempted to guess what human player would usually play in this situation purely based on the image and nothing else.

eli_gottlieb9y ago

In TFA it says that he was training a supervised learner to predict the control state from the screen state. So yes, "if the screen looks like this, apply these controls", and that can play Mario Kart 64.

bduerst9y ago· 2 in thread

Personally I think the most impressive thing here isn't that you created a self-driving MarioKart, but that you trained TensorFlow based on input screenshots of your desktop.

I feel like that could be a good next step - a ubiquitous neural net model that, after mapping inputs, will learn to play any video game that's on your screen.

jboggan9y ago

Especially since the hard work of increasing the screen resolution has already been done.

Also, bravo on including the stupid little bugs that gave you trouble. It always sustains me working on a hard project to know that a self-driving video game was blocked by a missing newline in a C HTTP request. It makes me step back and laugh at the ridiculous complexity of what we take for granted in our day to day work.

Hydraulix9899y ago

There's work being done to allow reinforcement nets to do transfer learning.

nartam119y ago· 2 in thread

How are the original computer opponents able to play MarioKart?

taway_12129y ago

1. The AI in games has access to internal representations of game state and does not have to recognize it from pixels on screen. This is a massive difference.

2. The logic is usually a bunch of (human-authored) scripts consisting of if-else spaghetti.

bryondowd9y ago

Also, the AI opponents don't have to play by the same rules. They go by fun > fairness to keep things interesting. That's why you normally can't keep a huge lead on AI opponents, because they "rubberband" back up to you faster than they should be able to.

Wouldn't surprise me if they don't even 'drive' in any sense while off-screen, just increment some abstract position relative to the track length. But I don't know this for a fact.

3 more replies

tjfontaine9y ago· 2 in thread

Next, have it upload its race results to kartlytics

TomAnthony9y ago

Yeah - that was a very cool project!

glaberficken9y ago

omg! thanks for the refference to kartlytics. I had no idea this existed!

CM309y ago· 2 in thread

Pretty interesting I must say. Have to admit though, I kind of expected the self driving AI to be trying to win Grand Prix or Versus races instead of doing well in Time Trials. But hey, I can see how that would be utterly painful to try and set up, especially given how times you get hit by items or rammed off the track in more recent games.

bisby9y ago

Step 1 is to make the AI find an ideal path through the course.

Step 2 is to make AI figure out how to return to the ideal path through the course when other people are stealing your items or shelling you.

step 3 is to make the AI figure out how to counter attack to slow down the opponents.

Step 4 is OH GOD WE TAUGHT THE AI HOW TO ATTACK RUN FOR YOUR LIVES.

bryondowd9y ago

Step 2.5 would be to make the AI figure out how to evade or minimize the effect of or ability to initiate opponents' offensive moves. That would be the most interesting bit to me. Would be neat to see an AI intentionally stay in 2nd place with an item at the ready until the home stretch, to avoid being blue-shelled.

1 more reply

xigency9y ago· 1 in thread

I'm interested in knowing why the Python and C components communicate with HTTP, beyond reading about the bugfix. Wouldn't it be easier to use sockets or files or some other mechanism to integrate the two languages?

Just something to think about as a developer. I would imagine that on a local machine, using HTTP as the protocol might add latency.

haikuginger9y ago

This was my initial reaction as well; it seems like a raw socket or even embedding a Python interpreter would be better ways to go.

rl39y ago

The inevitable follow-up article that delves into training offensive banana peel usage should be interesting.

cjmcqueen9y ago

Best part, "With this in mind I played more MarioKart to record new training data. I remember thinking to myself while trying to drive perfectly, “is this how parents feel when they’re driving with their children who are almost 16?”"

bitL9y ago

It's basically a project these days at Udacity's Self-driving car nanodegree under "Behavioral Cloning" ;-)

jordigh9y ago

I was ready to be impressed about seeing an AI that could consistently beat the game's own AI, with blue turtle shells and all. Oh well, still pretty impressive to be able to drive on the easiest course without opponents.

sakabaro9y ago

Check out also MarI/O, very impressive: https://www.youtube.com/watch?v=qv6UVOQ0F44

TomAnthony9y ago

It would be very interesting to see how well this does with more training data, especially with multiple players.

holografix9y ago

This is very cool and I think if Kevin spends a bit of time learning reinforcement learning it could be amazing.

It seems like a lot of people doing reinforcement learning on video games get bogged down on training on raw pixels only... it would take a tremendous amount of data to make the driver recognise when and where to use certain power ups, however if you encoded this as a variable, wow it could be really cool.

I believe this is fundamentally how we humans learn with so few examples. Other humans "encode features for our brain to track" by telling us how it should be done and what information to prioritise.

ramzyo9y ago

This is really cool, and any reason to bring this game back into my life is warmly welcomed

tomrod9y ago

I love this!

I'm working (albeit very slowly, as a beginner) on a similar project with Geometry Dash and Python. You're a great inspiration!

gm-conspiracy9y ago

I appreciate the write-up. Thank you!

dylanbfox9y ago

great write up! this is awesome

jondiggsit9y ago

No power slide? Failure.

j / k navigate · click thread line to collapse

68 comments

50 comments · 22 top-level

cr0sh9y ago· 6 in thread

...but "driving" a course? That seems radically different to my less-than-expert-at-TensorFlow understanding, but that is only due to my ignorance.

I'm glad that these examples and demos are being investigated and made public for others - especially people learning like myself - to look at and learn from.

halflings9y ago

From the post:

> Later, I switched to use Nvidia’s Autopilot...

So I guess he didn't use the MNIST CNN model.

cr0sh9y ago

However, if you look at the code:

https://github.com/SullyChen/Autopilot-TensorFlow/blob/maste...

You can see that it follows much the same pattern as LeNet CNN for MNIST - a few (ok, more than a few!) convolutional layers followed by a few fully connected layers.

Maybe you could call it a "follow on" or perhaps an ANN pattern?:

Conv -> Conv -> Reshape/Flatten -> FC -> FC -> FC

(disregarding activation and such)

...which is really the lesson of the LeNet MNIST CNN - at least, that's my takeaway.

1 more reply

glidek9y ago

cr0sh9y ago

> As someone who's interested in taking the Udacity course, would your recommend it?

So far, yes - but that has a few caveats:

Both of these courses are now offered by Udacity (AI Class) and Coursera (ML Class):

https://www.udacity.com/course/intro-to-artificial-intellige...

https://www.coursera.org/learn/machine-learning

In fact, now that I think about it, I might try taking those courses again myself as a refresher!

https://www.udacity.com/course/artificial-intelligence-for-r...

So - I have that background, plus everything else I have read before then or since (AI/ML has been a side interest of mine since I was a child - I'm 43 now).

1. Coursera ML Class 2. Udacity AI Class 3. Udacity CS373 course 4. Udacity Self-Driving Car Engineer Nanodegree

> Do you think the course prepares you enough find a Self-Driving developer job?

> Would you learn enough to compete/work along side people who got their Masters/PhD in Machine Learning?

Worst-case scenario: I can use what I have learned in the development of a homebrew UGV (unmanned ground vehicle) I've been working at on and off for the past few years (mostly "off" - lol).

> Appreciate your input.

No problem, I hope my thoughts help - if you have other questions, PM me...

melvinmt9y ago

> the Udacity "Self-Driving Car Engineer" nanodegree

That looks like a great course by the way, thanks for sharing.

jrheard9y ago

3 more replies

JonnieCache9y ago· 4 in thread

In contrast, here's what is effectively an oracle machine playing mario kart: https://www.youtube.com/watch?v=ZBNgbJ5hXtQ

(Amazingly detailed) info: http://tasvideos.org/5243S.html

rl39y ago

I like how it just glitches itself to an almost instant win on half of the maps.

dclowd99019y ago

1 more reply

prezjordan9y ago

Thank you for sharing! I thought the Moo Moo Farm run was incredible on its own, THEN I saw the write-up. Blown away.

hmhrex9y ago

This is awesome! So much fun to watch these shortcuts.

adyus9y ago· 3 in thread

Congrats on finishing the project! As you've already linked at the bottom of your post, it's possible that OpenAI could've solved most of your I/O issues.

Drdrdrq9y ago

I would love to learn how to do that - any suggestions?

EDIT: to clarify: what should I google for?

adyus9y ago

Here's what I could find in a couple minutes:

https://github.com/openai/universe-starter-agent

OpenAI's example universe agent. Remember that while their goal is an agent that works in any and all environments (read: games), you could certainly optimize yours just for MarioKart.

1 more reply

paulbaumgart9y ago

Reinforcement Learning. Here's a good intro: http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching.html

1 more reply

jostmey9y ago· 3 in thread

Quote: "Driving a new (untrained) section of the Royal Raceway:"

So the author did a proper test of the model by scoring it on an unseen track to make sure it generalizes! This is very awesome!

ska9y ago

How did we get from "bare minimum sensible testing" to "This is very awesome!"? Are things that bad on average?

kevinwang9y ago

There's probably a broad range of people in the hn community.

1 more reply

jupiter900009y ago

It is pretty cool! I'd love to see how it did on the other parts of that track as well, if that was tested

eli_gottlieb9y ago· 3 in thread

It really shows how simple many control tasks actually are!

tobilarscheid9y ago

This is exactly what I wondered about. So what exactly is the function you are training for? Is it basically like "if the screen (showing the track) looks like this, apply these controls"?

PeterisP9y ago

eli_gottlieb9y ago

bduerst9y ago· 2 in thread

Personally I think the most impressive thing here isn't that you created a self-driving MarioKart, but that you trained TensorFlow based on input screenshots of your desktop.

I feel like that could be a good next step - a ubiquitous neural net model that, after mapping inputs, will learn to play any video game that's on your screen.

jboggan9y ago

Especially since the hard work of increasing the screen resolution has already been done.

Hydraulix9899y ago

There's work being done to allow reinforcement nets to do transfer learning.

nartam119y ago· 2 in thread

How are the original computer opponents able to play MarioKart?

taway_12129y ago

1. The AI in games has access to internal representations of game state and does not have to recognize it from pixels on screen. This is a massive difference.

2. The logic is usually a bunch of (human-authored) scripts consisting of if-else spaghetti.

bryondowd9y ago

Wouldn't surprise me if they don't even 'drive' in any sense while off-screen, just increment some abstract position relative to the track length. But I don't know this for a fact.

3 more replies

tjfontaine9y ago· 2 in thread

Next, have it upload its race results to kartlytics

TomAnthony9y ago

Yeah - that was a very cool project!

glaberficken9y ago

omg! thanks for the refference to kartlytics. I had no idea this existed!

CM309y ago· 2 in thread

bisby9y ago

Step 1 is to make the AI find an ideal path through the course.

Step 2 is to make AI figure out how to return to the ideal path through the course when other people are stealing your items or shelling you.

step 3 is to make the AI figure out how to counter attack to slow down the opponents.

Step 4 is OH GOD WE TAUGHT THE AI HOW TO ATTACK RUN FOR YOUR LIVES.

bryondowd9y ago

1 more reply

xigency9y ago· 1 in thread

Just something to think about as a developer. I would imagine that on a local machine, using HTTP as the protocol might add latency.

haikuginger9y ago

This was my initial reaction as well; it seems like a raw socket or even embedding a Python interpreter would be better ways to go.

rl39y ago

The inevitable follow-up article that delves into training offensive banana peel usage should be interesting.

cjmcqueen9y ago

bitL9y ago

It's basically a project these days at Udacity's Self-driving car nanodegree under "Behavioral Cloning" ;-)

jordigh9y ago

sakabaro9y ago

Check out also MarI/O, very impressive: https://www.youtube.com/watch?v=qv6UVOQ0F44

TomAnthony9y ago

It would be very interesting to see how well this does with more training data, especially with multiple players.

holografix9y ago

This is very cool and I think if Kevin spends a bit of time learning reinforcement learning it could be amazing.

I believe this is fundamentally how we humans learn with so few examples. Other humans "encode features for our brain to track" by telling us how it should be done and what information to prioritise.

ramzyo9y ago

This is really cool, and any reason to bring this game back into my life is warmly welcomed

tomrod9y ago

I love this!

I'm working (albeit very slowly, as a beginner) on a similar project with Geometry Dash and Python. You're a great inspiration!

gm-conspiracy9y ago

I appreciate the write-up. Thank you!

dylanbfox9y ago

great write up! this is awesome

jondiggsit9y ago

No power slide? Failure.

j / k navigate · click thread line to collapse