Also this "real GPU" is explicitly called out in the Google docs as unsupported.
(I've since fixed this - I hadn't chmod'd right, and didn't account for working from an ubuntu machine)
I like the ones that are hardcoded to a specific name in a home directory best. Especially when it doesn't match the github name of the "creator."
If you're just interested in playing around, then your laptop will do fine - TensorFlow is happy with just about any hardware you throw at it. Hell, your modern Android phone will run it =]
If you're interested in a more involved experiment, develop and debug your task locally on your laptop. By the time you're ready for large scale training, there might be a stable and battle tested AMI such that people are no longer reporting issues in [1] about it.
Again, if you're interested, follow the CUDA 3.0 issue on GitHub[1] - this is nowhere near a solved problem and will only cause headaches if you're using it for education.
https://github.com/tensorflow/tensorflow/issues/25#issuecomm... is one verification :)
I put together an AMI in virginia: ami-cf5028a5 and if you have a masochistic streak, here are the steps to do it yourself: https://gist.github.com/erikbern/78ba519b97b440e10640
The main issue I'm still seeing is that g2.8xlarge with my AMI doesn't run 4x faster than g2.2xlarge even if correctly detects the 4 GPU's. Haven't had time to identify the issue though.