Why Tesla removed radar and ultrasonic sensors [video] (opens in new tab)

(youtube.com)

241 pointsshekhar1013y ago471 comments

471 comments

197 comments · 43 top-level

CharlesW3y ago· 43 in thread

I thought it was telling that Andrej immediately "reframed" the question because Lex asked the "wrong question". This is a classic evasion technique one learns from experience and/or media training. Lex's comment immediately after was a clever and gentle dig at Andrej's response.

It seemed like all the "full cost" negatives Andrej mentioned were related to Tesla's ability to execute, and not what would actually produce better results. Tesla would have to be able to reliably procure parts, write reliable firmware, create designs and processes that won't increase unexpected assembly line stops, etc.

Regarding results, the best Andrej can do is, "In this case, we looked at using it and not using it, and the delta was not massive." In other words, the results are better, but not enough to make up for the fact that Telsa can't support additional sensors without incurring a prohibitive amount of additional risk to Tesla. Risk to passengers doesn't appear to be a consideration.

flashgordon3y ago

That was hilarious. Basically (unless this needs a reframing/realignment/repositioning/reorienting):

Q: "are less sensors less safe/effective?"

A: "well more sensors are costly to the organization and add more tech debt so safety is orthogonal and not worth answering".

jholman3y ago

Uh, that's not at all a good paraphrase.

Q: "Does [removing some sensors] make the perception problem harder, or easier?"

(note, this is literally what Lex asked, your restatement is misleading)

A: [paraphrasing] "Well more sensor diversity makes it harder to focus on the thing that I believe really moves the needle, so by narrowing the space of consideration, I think we'll get better results"

Karpathy might not be telling the truth, I don't know. But it's a much more credible pitch than you make it sound, because it's often true that you can deliver better by focusing on a smaller number of things. Engineering has always been about tradeoffs. Nobody is offering Karpathy infinite money plus infinite resources plus infinite time to do the job.

Again, I'm not saying Karpathy is honest or correct. I'm saying that the rephrasings in this comment and this thread are hilariously unfair.

3 more replies

judge20203y ago

Munro’s cost breakdown is much more informative in just how much it’ll save in terms of parts/labor. https://youtu.be/LS3Vk0NPFDE

In general the ‘harm to consumers’ is really just making it more likely they damage the car in a parking lot or their garage, which tells you where their priorities are (sales, Automotive gross profit). Assuming occupancy network works, the only real blind spot left is if something in front of the car changes in between it turning off and on (assuming occupancy will 'remember' the map around it when it goes to sleep).

Also, Tesla’s strategy for safety is seemingly “excel in industry standard tests, ie. IIHS and EuroNCAP”, so this might be a case of the measure becoming a target.

1 more reply

Retric3y ago

The video has a more reasonable answer.

The sensors are unreliable and expensive in terms of R&D. Having marginal parts which takes money from a finite R&D budget can easily result in a worse product. “They contribute noise and entropy into everything.” … “you’re investing fully into that [vision] and you can make that extremely good. You only have a finite amount of spend of focus across different facets of the system.”

His standpoint can be summed up as “I think some of the other companies are going to drop it.” Which would be really interesting if true.

2 more replies

seanmcdirmid3y ago

It is a hard question to answer. It’s like asking if more programmers on a project will allow it to be completed faster with higher quality. Ya, theoretically they could, in practice not likely. More sensors are like more programmers, theoretically they can be safer and more effective, but in practice they won’t be. Sensor fusion is as hard a problem as scaling up a software team.

4 more replies

aeternum3y ago

Lack of focus is a major problem for companies and we all know that tech debt leads to increased bug counts.

Team focus on vision which is by far the highest accuracy and bandwidth sensor allows for a faster rate of safety innovation given a constant team size.

5 more replies

FreakLegion3y ago

> In other words, the results are better, but not enough to make up for the fact that Telsa can't support additional sensors without incurring a prohibitive amount of additional risk to Tesla. Risk to passengers doesn't appear to be a consideration.

You may be right about the actual decision process Tesla went through, but Karpathy is right in principle. One of the first things he says is "there can be problems with [the sensors]", and a lot of what he mentions increases the risk of run-time failure, not just cost.

It's easy to cast this as an optimization problem where you're trading off asymptotically improved sensing for linearly or superlinearly increased failure rates. There's certainly a point where the complexity of more sensors or certain types of sensors outweighs any marginal benefit they provide.

YZF3y ago

Taking his point to the extreme why use 8 cameras? just use 4? 1? One photo-diode?

Cameras can also fail at run-time there can (and is) be variability in how they're mounted, in the lenses, in the sensors. They can get blinded or not get enough light. Their cabling can fail random components can fail.

Tesla has claimed that vision outperforms vision+radar but anecdotal reports don't seem to support that conclusion. IMHO these technologies are not directly replaceable, but are complementary. It's like you can't replace your ears with your eyes (yeah, you can read lips, if they're visible).

But sure, there is a sweet spot. Is Tesla really optimizing for best performance at any cost or are they optimizing making more money and selling that to us as an improvement? That's really the question and I don't think we got a frank answer there.

4 more replies

bumby3y ago

There are other ways of optimizing for reliability, though, like redundancy in parallel or higher spec’d sensors. But that still gets back to the same issue where they are going to be concerned about cost.

latchkey3y ago

Not just risk to passengers, risk to any thing in proximity to the vehicle while it is in motion.

threeseed3y ago

It was an ominous answer.

We really should be focusing on what is the best solution and trying to solve price issues through existing techniques e.g. economies of scale, competition, miniaturisation. Instead they are trying to build whatever solution they can that fits in a pre-defined cost window.

Except this isn't a new phone or sneakers we are trying to take to market it's something that will directly impact people's lives.

Nomentatus3y ago

You can get to this conclusion if you're sure Andrej is lying, and that the risks cited are smoke; but only then. BTW, I've upgraded my sneakers after a couple falls on a rough beach with tangled driftwood (drift trees, really) proved their cheap too-slick surface had real world consequences. I was lucky not to break a bone. I'm going to bet he isn't lying, but I can understand someone making the opposite bet, market competition being market competition.

1 more reply

throwawaylinux3y ago

Everything in engineering has a cost tradeoff, and always has. And peoples lives are improved by things which they can afford There is no "best solution" you can talk seriously about without talking about cost.

Why not have a thousand sensors if more is better?

mensetmanusman3y ago

It can’t be solved without a few 10s of billion in infrastructure investment.

taneq3y ago

Whose money should “we” be spending on this grail quest?

This mindset is something I see a lot, that “best” means the technically optimal (or sometimes just personally most convenient) solution to the specific problem that they personally are working on. If they take a step back and look at the bigger picture, the technical merits are usually only a tiny part of the whole decision.

minhazm3y ago

There's an opportunity cost trying to get to the best solution. What about all of the people that die in the mean time while we delay rolling out something due to it not being perfect? Just doing some googling the Waymo stack is estimated to cost somewhere in the range of $50-100k, not including the car. A better solution that no one can afford is no solution at all.

Ultimately the only requirement is that the system is safer than humans by some margin that people are comfortable with buying such a system. If that amount is even as little as 2x safer than humans, we still have a moral obligation to roll that out even if we could be 5x safer if we had another $50k worth of sensors and processors on the car.

2 more replies

3apo3y ago

I feel the other reason is that Tesla has not figured out a way to put Radar into their ML pipeline. If you take the Range-Doppler Map from the radar as the 'pixel' map, that data is inherently very dependent on the scenario and the radar sensor intrinsic parameters. This variability in what the radar sees in the RD space is what makes this a challenge for ML/AI pipelines. If Tesla were to 'fuse' information from these sensors in the object track level - I believe they will be less susceptible to this variability.

dreamcompiler3y ago

Exactly. Radar gives you direct range data; camera pixels need to be processed by ML to infer range data, and the latter is never going to be as close to ground truth as the former, so the former should be prioritized.

1 more reply

touch_abs3y ago

Its interesting, that kind of object level fusion is a fairly different problem to training visual perception, following some of the less in fashion robotics techniques. I wonder if its a case of the Tesla engineers focusing on the fad technologies (or just their strengths) more than its a hardware cost thing.

steve_adams_863y ago

That’s one way to read it, but in my own experience the “do one thing really well” approach can yield far better results. Meaning, if vision is truly sufficient and you do it really well rather than a bunch of sensors “okay”, that may actually be safer overall. You might get far more focused and practical results from your efforts.

I’m not saying this is definitely true, and at the moment we probably can’t verify it either. I’m just “steel manning” his case, as Lex loves to say.

I think you’re probably correct that the business aspect was a significant factor, but perhaps it wasn’t everything.

hammock3y ago

Devils advocate, if the cost of working to improve cameras to the point where they eliminate that delta is lower than the cost of using the sensors instead, then it is a net benefit

tophi3y ago

Yes, the current delta was not massive and will shrink over time.

By getting rid of the extra sensors they eliminate a temporary crutch and focus resources on the simple solution.

Not a new concept by the way. Henry Ford was obsessed with simplifying and eliminating every part that wasn’t necessary on the model T for virtually all the same reasons.

2 more replies

itsoktocry3y ago

>f the cost of working to improve cameras to the point where they eliminate that delta is lower than the cost of using the sensors instead

In a vacuum, how can cameras ever be better than cameras + other sensors?

2 more replies

adolph3y ago

> Regarding results, the best Andrej can do is, "In this case, we looked at using it and not using it, and the delta was not massive." In other words, the results are better, but not enough to make up for the fact that Telsa can't support additional sensors without incurring a prohibitive amount of additional risk to Tesla. Risk to passengers doesn't appear to be a consideration.

I think this mischaracterizes Andrej's response. If anything he is referring to a wholistic view of the vehicle, which includes but doesn't entirely consist of Tesla. For example, 5-10 years down the road, when sensors start going bad, consumers will appreciate fewer things to go wrong with a vehicle--that is one of the advantages of electric over ICE after all.

If anything this is an acknowledgement that George Hotz was right in focusing on optical sensors with Comma.ai.

pmarreck3y ago

Two thoughts:

1) He's not touching on the software cost of integrating different sensor data into the same trained machine learning model; it is likely far simpler to just stick to stereoscopic vision data (the same thing the human genome decided!)

2) That said, it seems at least theoretically advantageous to have a sensory system that exceeds that which humans are limited to; things like LIDAR can work in complete darkness and potentially spot, for example, pedestrians crossing a dark road without any reflective clothing on, where a vision-based system would fail (perhaps add infrared sensation?)

Anyway, doesn't AEB (automatic emergency braking) have to be installed in every car, by law, in the US, around now? And wouldn't that be less reliable if done via vision?

n0tth3dro1ds3y ago

>it is likely far simpler to just stick to stereoscopic vision data (the same thing the human genome decided!)

There’s a lot more to perception while driving than just stereoscopic vision.

First, your stereoscopic “cameras” (eyes) are mounted in free-rotating sockets, which are themselves mounted in a rotating and swiveling base (your head/neck). Your eyes can do rapid single-point autofocus better than any existing camera. They also have built in glare mitigations —- squinting, sunglasses, and sun visors. This system is way more advanced than fixed cameras. Yes, even an array of fixed cameras with a 360 degrees field of view.

Then you have your sense of touch, your hearing, and your equilibrio sense. You feel motion in the car. You feel vibrations in the pedals. You hear road noise, other cars, sirens, and the engine (not much in EVs). You smell weird smells and know when you’re driving with your e-brake on or when there’s a skunk nearby. There’s a lot getting fused with the vision to make it all happen, and I think you’d be surprised how “broken” your driving capabilities would be if you took one of these “background” senses out of the equation.

My anecdote: I drive a manual transmission car. A few months back, I woke up with no hearing in my right ear. Spooked, I drove to urgent care. I could not drive well at all —- I was holding low gears for way too long. I learned that I use hearing almost exclusively to know when to shift. If you had asked me beforehand, I probably would have said that I’m visually monitoring the tachometer to know when to shift. Not the case. Also, I had a TERRIBLE sense of my surroundings. As I drive, I’m definitely building a model of the environment around me based on road noise, sound from other cars, sirens, and the like. Without hearing in just one ear, I felt very disconnected and unsafe. Living in California where lanesplitting is legal, I had several motorcycles catch me completely off guard. I had my hearing restored at urgent care and everything went back to normal immediately on the drive home.

I think Andrej and Tesla massively overestimate vision’s sole ability to solve the problem. Humans are fusing lots of sensation to drive well.

1 more reply

sfifs3y ago

> likely far simpler to just stick to stereoscopic vision data (the same thing the human genome decided!)

Yeah and till we had reliable and powerful artificial lighting, it was highly unsafe to journey in low visibility/ darkness. We used to finish journeys when darkness fell.

Animals that do require precise movement in low visibility (bats, dolphins) conditions often evolved ultrasound solutions.

So should we license Tesla vehicles to only operate when visibility and weather forecast is good and not drive in the dark at all?

1 more reply

quonn3y ago

I think the key point he‘s trying to make is that the size of the fleet is more important than the quality of the sensor. The risk would be reduced by a better system and he seems to be convinced that rolling out vision to more and cheaper cars would get you there.

L0stLink3y ago

There is a great argument for having ultrasonic sensors and radar in a recent video by Rayan from FortNine discussing two fatal accidents involving tesla autopilot https://www.youtube.com/watch?v=yRdzIs4FJJg

ion_fury3y ago

lex's comment did not strike me as a dig. i am actually concerned by your comment because it makes me wonder if i am missing other things too? it just doesnt seem like a dig. it seems like he thought of something funny and wanted to share it. am i alone in this?

and also i dont understand your assertion that it was some kind of cynical maneuver to re-frame the question. he could have also said "yes, more sensors are always better but you can add an arbitrary number of sensors and so we had to decide where to draw the line. the cameras we use are capable of meeting our goal of full self driving that is significantly safer than a human driver. and this also streamlines the production and software which has a material impact on our ability to actually produce the cars which is of course necessary to meet the goal of making self driving cars. bloat could actually kill tesla."

this is logically the same thing that he said in the interview, so whats cynical about it? how is it underhanded?

also is there some intrinsic limitation of the dynamic range of cameras? people are talking about problems with dynamic range being intrinsic to cameras but im pretty sure that cameras and especially camera suites that do not have more problems with dynamic range than a human eye are possible to make and probably already on the market.

tsimionescu3y ago

> also is there some intrinsic limitation of the dynamic range of cameras? people are talking about problems with dynamic range being intrinsic to cameras but im pretty sure that cameras and especially camera suites that do not have more problems with dynamic range than a human eye are possible to make and probably already on the market.

I think it's possible that professional movie cameras (with the appropriate lenses) may have higher dynamic range than human vision. Good luck getting those cheaper than a lidar.

1 more reply

JumpCrisscross3y ago

> did not strike me as a dig

It wasn't a dig. It was calling out a bullshit move that, in my opinion, Andrej deployed out of panic more than strategically. (My evidence for this being Andrej eventually gave a good answer.)

2 more replies

Nomentatus3y ago

Andrej did get around to answering the original question, he just wanted to say more, to put it into a bigger frame with more context. I had the same "weasling" concern at first; but his answer was more or less "You lose more than you gain, but yes there was a small delta; in exchange for which any organization would take on not just an economic hit but a lot of additional opportunities for process and maintenance errors; plus distracting the team." So he'll agree that in an ideal world you'd want 'em, just not want 'em that much; but in the real world, more geegaws that aren't really pulling their weight are a terrible idea.

Although he didn't explicitly say so, neither his answer nor Elon's "take it out 'cause you can always put it back in if it turns out you really need it" philosophy absolutely rule out lidar coming back in the future if some remaining edge case just requires it. Clearly he thinks this is quite unlikely, however.

strangescript3y ago

You are making a lot of potentially faulty assumptions. 1) The "delta" was wide enough to save/harm people, you have no idea. 2) The extra information provided would always be valuable and/or not be overcome with better AI models using the visual sensors in the future. 3) The amount of technical overhead generated by the extra sensors were not prohibitive long term. When working with AI there are often times where it would seem logical that extra relevant data will always improve a model, but that turns out not to always be the case, or provide so little value that managing another dataset is just not worth it.

asah3y ago

Risk to PEDESTRIANS even less.

patrec3y ago

> I thought it was telling that Andrej immediately "reframed" the question because Lex asked the "wrong question". This is a classic evasion technique

I agree with this assessment. However:

> Telsa can't support additional sensors without incurring a prohibitive amount of additional risk to Tesla. Risk to passengers doesn't appear to be a consideration.

This is a stupidifying take. Of course when you work in a line of business producing gadgets that, as an unintended side-effect, kill a lot of people (napkin math suggests above 2 milli-kills per car in the US), you will need to pick a point at which you say further fatality reduction is no longer justified given the economic cost of achieving it. Even if you are a pure altruist (if you go out of business, less safe cars will replace yours). Conversely, even if you are the embodiment of capitalist evil, risks to passengers will absolutely affect your bottom line and if are rational you will take them into consideration. Any meaningful criticism needs to be about the trade-offs they make, not that they make them or are loath to explicitly say so on camera.

CharlesW3y ago

> …you will need to pick a point at which you say further fatality reduction is no longer justified given the economic cost of achieving it.

You're right — the sad truth is that corporations put costs on human lives every day. Where I think we disagree is that you believe they made the decision based primarily on costs. After watching this video, I believe they made the decision because they didn't think they could reliably implement and support a sensor fusion approach.

(BTW, I enjoyed "stupidifying"! I'm sorry I made people stupider.)

ClumsyPilot3y ago

> risks to passengers will absolutely affect your bottom line and if are rational you will take them into consideration

percieved, not real risks to customers. PR matters more than reality

1 more reply

snotrockets3y ago

Tesla hasn't proven itself to be a capable major car manufacturer (they probably lead the minor category, at least in deliveries) in all but one: their de-prioritizing of human life.

kotlin23y ago

Is there hard data on how deadly they are vs. other auto manufacturers? There is definitely a narrative that the cars are dangerous, but I'd like to see that quantified.

xodjmk3y ago

The majority of comments surrounding stereo-camera/lidar questions have a ridiculously simplified idea of the problem. It's obviously the case that 'more sensor good, less sensor bad'. This is frankenstein's monster level technical analysis. Why don't the majority of large-brained animals have many eyes, and many different antannea appendages processing an array of diverse sensory input? You don't just automatically gain by adding lots of sensors. The signals have to be fused together and reliably cooperate and come to agreement in real time for any decision. Any sensor is only providing raw crude data. The majority of the work involved is done by processing this crude data and inferring a much more sophisticated approximation of the real environment from prior knowledge, hence using neural nets with pre-trained data. It is a good debate whether the approximation can be done better by adding more sensor input and diverting R&D & processing resources towards fusion as opposed to improving the results that can be obtained from stereo image sensor. It's not obvious to anyone. And nature seems to inform us that most large brain animals evolve to rely heavily on two eyes instead of 16 eyes + lasers. This is an interesting discussion, but the issue isn't 'tesla could just bolt a Lidar box to the roof and magic, but they want to scam you out of a few extra bucks'. That is a moronic idea.

JumpCrisscross3y ago

> Why don't the majority of large-brained animals have many eyes

Because of the cost of additional eyes. If Tesla is optimizing for cost against safety, that's sort of the point.

I don't believe that's totally the case. Andrej later makes a better argument regarding limited R&D bandwidth, noise and entropy. But the "I would almost reframe the question" evasion was disconcerting. It's a textbook media trained tactic for avoiding a question to which you have no good answer. That it was deployed here badly against a skilled interviewer such that it backfired is a valid observation.

1 more reply

wonnage3y ago

The issue here is trying to infer distance based on complex image processing or just… measuring the damn distances.

1 more reply

petilon3y ago· 30 in thread

I didn't find his answers particularly convincing. His answer focused on costs mainly, and how "the best part is no part". We have already seen multiple accidents caused by camera's limitations [1] which would not have happened if Tesla used Lidars.

Cameras have poor dynamic range and can be easily blinded by bright surfaces. While it is true that humans do fine with only eyes, our eyes are significantly better than cameras.

More importantly, expectations are higher when an automated system is driving the car. It is not sufficient if, in aggregate, self-driving cars have fewer accidents. If you lose a loved one in an accident where the accident could have been easily avoided if a human was driving, then you're not going to be mollified to hear that in aggregate, fewer people are being killed by self-driving cars! You'd be outraged to hear such a justification! The expectation therefore is that in each individual injury accident a human clearly could not have handled the situation any better. Self-driving cars have to be significantly better than humans to be accepted by society, and that means it has to have better-than-human levels of vision (which lidars provide).

[1] https://www.youtube.com/watch?v=X3hrKnv0dPQ

aeternum3y ago

The dynamic range is the reason Tesla know counts photons rather than use traditional camera processing. They basically remove the concept of exposure entirely and simply pass the sensor photon counts to the neural net.

This approach not only simpler as it removes photo processing/encoding but the result is that the NN can operate with a very high dynamic range similar to the human eye and in many cases can be sensitive on the single-photon level.

davidgay3y ago

> They basically remove the concept of exposure entirely and simply pass the sensor photon counts to the neural net.

That sentence does not make sense. There's no such thing as a count without a corresponding interval that count occurred over. That interval is the exposure.

You can of course do lots of (very) short exposures to avoid sensor saturation. That's "just" a movie at a very high frame rate. And then you can post-process this in lots of exciting ways, align the frames, average them, etc, etc.

1 more reply

lambdasquirrel3y ago

Counting photons won't keep a camera from being "jammed." Unless you are using a physically perfect polarizing filter, such that each pixel on the sensor only receives photons from the exact angular window, traced back through the lenses, you have a camera that can ultimately be "jammed."

The human eye isn't so great on those terms. But humans can raise their hand to block the sun if it's straight at our eyes.

petilon3y ago

But it doesn't appear to be helping. Here's an example accident where depth data from Lidar would have helped:

"Tesla later said that during the crash, Autopilot’s camera could not distinguish between the white truck and the bright sky."

https://www.nytimes.com/2021/12/06/technology/tesla-autopilo...

2 more replies

whiddershins3y ago

The replies to your comment don't seem to understand you at all. in the video link here

https://youtu.be/ODSJsviD_SU?t=4424

he clearly states 16x dynamic range as a result of direct photon processing.

emkoemko3y ago

how do you count photons continuously? what... this makes no sense, if you pass "the photon count" you just did a exposure... also how does a photo diode count photons?

moralestapia3y ago

Does it have electrolytes as well?

Nice tech and single photons and whatnot but it still runs into things that a radar with some really simple code wouldn't. ¯\_(ツ)_/¯

lostsock3y ago

That video is from 2020, but Tesla didn't remove radar until 2021. Meaning that the crash occurred with radar still active, which I feel just backs up what Karpathy was saying.

petilon3y ago

Well, the car may have had radar hardware but there are questions as to whether the software was using it:

https://www.nytimes.com/2021/07/05/business/tesla-autopilot-...

Excerpt:

Mr. Rajkumar of Carnegie Mellon, who reviewed the video and data at the request of The Times, said Autopilot might have failed to brake for the Explorer because the Tesla’s cameras were facing the sun or were confused by the truck ahead of the Explorer. The Tesla was also equipped with a radar sensor, but it appears not to have helped.

“A radar would have detected the pickup truck, and it would have prevented the collision,” Mr. Rajkumar said in an email. “So the radar outputs were likely not being used.”

https://www.nytimes.com/2021/12/06/technology/tesla-autopilo...

Excerpt:

Tesla later said that during the crash, Autopilot’s camera could not distinguish between the white truck and the bright sky. Tesla has never publicly explained why the radar did not prevent the accident.

2 more replies

mensetmanusman3y ago

I have some experience with LiDAR, they fail easily if a water droplet is on the cover or if signs are too bright. It’s a whole different technology challenge.

imglorp3y ago

Why is LiDAR still expensive enough for the cost to be a problem for anyone? Why have large numbers not driven these down to commodity devices at this point? Or maybe similar tech at another frequency like spread spectrum microwave with phased array semiconductor antennae?

3 more replies

amelius3y ago

The problem with self-driving is that it is based on data, but the environment may change. See e.g. the case where Tesla thinks that firetrucks are roads.

So if fashion changes, pedestrians may suddenly look like road too, as just an example.

Another problem is that state-of-the-art classification networks have an accuracy in the 90% range. Given that a car has to take hundreds of decisions in a single ride, then even if the accuracy was 99%, you see that error rate simply gets too high.

martindbp3y ago

> state-of-the-art classification networks have an accuracy in the 90% range.

If you're referring to ImageNet SOTA, is has 20000 different classes, including 120 different dog breeds [1]. This is a vastly different task than reliably detecting pedestrians where Tesla can actively curate a dataset of hard examples (from their fleet), whereas ImageNet is fixed, sometimes with low quality labels and as few as a couple of hundred examples. Tesla can also pick a point on the ROC curve to give higher recall but more false positives (which is important for VRUs specifically). Another big factor is that Tesla is using video, not still images, which makes predictions even more robust.

And that's just for pedestrians, Tesla are also using a general ViDAR (visual LiDAR) which is trained to detect obstacles that do not have a specific class. The ViDAR again operates on image sequences, not a single image, and can thus pick out structure from motion.

[1] https://en.wikipedia.org/wiki/ImageNet

akira25013y ago

> While it is true that humans do fine with only eyes, our eyes are significantly better than cameras.

They also have better failure modes and a really sophisticated error management system. They are susceptible to optical illusions, though.

> It is not sufficient if, in aggregate, self-driving cars have fewer accidents.

This is the incorrect analysis anyways. This was always going to be true because a large portion of accidents are single vehicle accidents where the driver was at fault for the crash. Usually due to speeding, alcohol, youth, or a combination of them.

If they didn't have fewer accidents then something is very very wrong with the entire idea. Which may very well be the outcome here. Looking at multi-vehicle accidents where there was no fault of the driver who died, it's not clear that an automated system driving the car would have saved them.

Roads are built right next to cliffs and bodies of water. Semi trucks can completely destroy your vehicle in an instant. Large accidents on snowy or foggy highways happen. Drunk drivers exist and sometimes literally do come out of nowhere, a pickup truck moving at 60mph has enough energy to knock a firetruck onto it's side if you hit it side-on and freeway ramps dump out right onto residential streets. Parts fail, floormats get stuck, people don't wear their seatbelts, and you can get a license to ride on a motorcycle if you want.

It's a guess based on the research I've done, but my expectation is around 20% of fatal accidents can in some way be prevented by automation. You'd honestly prevent more fatalities by putting an ignition interlock on everyone's vehicle or building real barriers between traffic and pedestrians.

BurningFrog3y ago

> a large portion of accidents are single vehicle accidents where the driver was at fault for the crash. Usually due to speeding, alcohol, youth, or a combination of them.

Also plenty of suicides in that group, which confuses the stats.

We really need SDCs to have fewer accidents than human drivers, excluding the suicides.

oldgradstudent3y ago

> but my expectation is around 20% of fatal accidents can in some way be prevented by automation.

Assuming, of course, that automation does not introduce its own failure modes.

That's a strong assumption.

bergenty3y ago

Not to mention waymo works well with LiDAR, cameras and radars. If you’re argument is it’s too hard to deal with that much data, it’s definitely the wrong answer.

jandrese3y ago

> While it is true that humans do fine with only eyes

We do not. Humans are terrible at driving. Traffic accidents are one of the leading causes of death in the developed world. Billions of dollars of property damage occur every year because humans are not up to the task. A self driving system that is as safe as an average human driver would be an absolute failure.

paulcole3y ago

> Traffic accidents are one of the leading causes of death in the developed world.

Perhaps leading cause of premature death or leading cause of accidental death or leading cause in demographics who are otherwise unlikely to die, but they are nowhere close to the top of the overall list.

croes3y ago

>We do not.

But not because of a lack of visual information.

Most of the time it's a la k of concentration or an overestimation of one's own driving abilities.

1 more reply

ec1096853y ago

Humans are amazing at driving. We typically go millions of miles in a lifetime without causing any fatal vehicle crashes and can generally handle unknown situations just fine.

AI is nowhere near that.

P_I_Staker3y ago

We are excellent at driving. It's shocking that there aren't more accidents.

rootusrootus3y ago

> Traffic accidents are one of the leading causes of death in the developed world.

Not even close, really. A bit under 1%. You are more likely to die from an overdose, or suicide. And much, much, much more likely to die from cancer or heart disease.

And that is without getting into the trade-offs. Cars at least have a significant utility value, which is not true of suicide, opiate addiction, cancer, or heart disease. We should try to reduce traffic deaths, but we should not lose perspective.

1 more reply

Nomentatus3y ago

You're right to note the advantages of lidar and (narrow range of) contrast problem for cameras (they arent eyes.) This is why the Uber human driver shouldn't have trusted the machine at night, in particular.

But you still have to address his system argument, which was that adding geegaws that added little would actually increase overall risks along the supply chain (plus maintenance) while distracting the team and adding more risk that way, for very little apparent (but only apparent) gain. The team does believe that they'll get to better than human driving, and do that without lidar.

freejazz3y ago

"But you still have to address his system argument"

Do you? It's his argument that he needs to substantiate... it's not my burden to confirm his conjecture. And even looking at it on its face... it's clearly self-serving bs. It doesn't seem to be a problem for any other car company, so I'm a bit confused as to why it's such a problem for Tesla. Of course, the obvious answer is that Tesla is cheap and doesn't want to pay to have a team that would have sufficient bandwidth to do what every other car company and self-driving system is doing...

1 more reply

Marazan3y ago

They cannot actually believe that.

2 more replies

whiddershins3y ago

He's not talking about costs - money. He's talking about costs - engineering.

It's about more information is not always better. It can instead muddy the waters. It can create confusion.

watwut3y ago

> It is not sufficient if, in aggregate, self-driving cars have fewer accidents

It would be sufficient if it would be the case. With actual proof.

Reality is that in limited abstract situations, self driving card maybe have some advantages. But, that is all that we can claim. And when self driving fails, somehow human is always the cause.

bumby3y ago

I disagree, but mainly because of the way humans perceive risk.

From a public standpoint I don’t think it’s sufficient because there’s inherent trust lacking in an automated system. With ape-driven systems we have a certain amount of trust because we can more accurately intuit what the other ape is reasonably thinking. This is not the case with autonomous driving which leads to a wider amount of uncertainty. Not unlike how we are intuitively less trusting of someone who is legitimately “crazy” even if statistically we don’ can’t say they are shown to be more dangerous.

1 more reply

masswerk3y ago

Having seen their AI Day, I supposed this was all about a unified, pseudo-visual voxel representation – and especially about generating scenarios. Apparently these have become a crucial part of the system and generating a broader variety of sensor data would be a considerable liability.

TheLoafOfBread3y ago· 22 in thread

This whole question about the vision boils down to "humans don't need it so cars should not need it too" the problem with this statement is that humans does not have wheels to move around, they have legs, but wheels are ridiculously simple compared to 4 legs tapping 160km/h on a highway. Same for birds - they also does not need jet engines to fly around, but imagine Airbus A380 flapping its wings and what kind of complexity would you need to flap 800km/h through air.

elteto3y ago

More importantly, we have a tremendous data "engine" processing input from our senses. So assuming for a second that cameras match what our eyes can do, you still do not have a processing engine on the level of our brain to make sense of those inputs.

AtlasBarfed3y ago

soooo... you're agreeeing that non-vision isn't necessary since the control domain is so much simpler?

I personally think they should use as much data inputs as possible: radar, IR, LIDAR, mesh networks, fixed route information.

Where tesla went particularly wrong IMO is ignoring some sort of route-based chunk information which is how humans navigate. IIRC Elon said something to the effect of just having an algorithm to work everywhere.

Humans use the basic algorithm "stay in lane, drive forward" and then decorate with signs, knowledge of curves, locations of potholes, dangerous low-viz corners, likelihood of surprise stopped traffic, obscured driveways, general character of neighborhoods, road purpose. Weather. Windy sections, icy sections, light availability anomalies. What type of vehicle. Repair state of vehicle.

A general AI algorithm will never be able to properly account for flavors/tags/chunk info on routes. Especially since cloud precomputation is so available these days.

Anyway, while recognizing that Tesla's "Fully Self Driving" is not as advertised, and we are a ways from self driving for any statistical measure of superiority to a healthy aware adult, it is still damn impressive what FSD vids show.

Do AI driving systems try to make "subsystems" of AI networks to reduce inputs to various higher-level inputs, or do other just throw a ton of inputs at a big ass network and just let the entire system rise from the soup of information?

1132453y ago

The Tesla AI day videos [1] go into some detail about this. They use multiple networks that are dedicated to specific tasks.

[1]https://www.youtube.com/watch?v=j0z4FweCy4M (2021), https://www.youtube.com/watch?v=ODSJsviD_SU (2022)

latchkey3y ago

> Humans use the basic algorithm "stay in lane, drive forward"

If you've ever driven in Vietnam, that is so not true.

1 more reply

ajross3y ago

But the question at hand is system control, not locomotion! You're not asking the automation to walk (well, I mean someday we will, but Teslas have wheels), nor the aircraft to flap. We want the automation to do what a human pilot would do. And that works with eyes.

No, I think this argument is largely correct. And frankly settled: anyone who's driven recent FSD beta versions knows very well that the cars "see just fine". They don't hit anything, they see and avoid obstacles. Frankly they're much more observant than humans are, my car will twitch when pedestrians turn as if they're going to enter the road (where human drivers mostly don't notice, and if they do they ignore it). What problems still exist are in planning: things like sign reading, lane selection, etc... still need some work. But collision avoidance just isn't an issue. It isn't. The LIDAR folks were wrong, basically.

(I will admit though that I'm a little sad about the removal of the ultrasound sensors though. It's true the autonomy probably doesn't need them, but I really like having the chimes to guide parking and garage maneuvering.)

enragedcacti3y ago

> No, I think this argument is largely correct. And frankly settled: anyone who's driven recent FSD beta versions knows very well that the cars "see just fine". They don't hit anything, they see and avoid obstacles.

Only if you ignore times where intervention stopped it from hitting something, times where it did actually hit something, massive amounts of jitter and popping in the visual output, phantom braking, etc.

Unless of course "recent" means n+1 where n is the version that crashed into something.

Collision with bollard in Feb 2022: https://www.youtube.com/watch?v=sbSDsbDQjSU

attempts to plow through cyclist Feb 2022: https://www.youtube.com/watch?v=a5wkENwrp_k

almost crashes into tram (can't gauge speed or direction?) Jun 2022: https://www.youtube.com/watch?v=yxX4tDkSc_g

Crashes into curb Aug 2022: https://youtube.com/shorts/8Mh1GjejdsI

Phantom brake Sep 2022: https://www.youtube.com/shorts/5v6j_oL7S-g

Almost colliding with bridge pillar 2 weeks ago: https://www.youtube.com/watch?v=5CMYkDWaqn0

Crashes into various objects in testing 2 weeks ago: https://www.youtube.com/watch?v=yyDxqEzV5Zc

threeseed3y ago

> The LIDAR folks were wrong, basically

I think your mistake is thinking LiDAR exists to solve the happy day scenario. It doesn't.

Vision is sufficient for the majority of use cases. Where LiDAR comes into its own is in the edge cases because it almost guarantees accurate bounding box detection. Which is where vision is at its weakest.

So I want to know what does FSD do when it sees a billboard of a person or when it is seeing a new object for the first time.

1 more reply

elteto3y ago

> The LIDAR folks were wrong, basically.

This is far, far from settled at this point.

1 more reply

fzeroracer3y ago

> The LIDAR folks were wrong, basically.

According to who? Tesla? Because Tesla has a vested interest in trying to prove that they're right even if they're obviously wrong. That's why they constantly try to downplay failures, software issues, device issues etc.

I'm very confused by the attempts to discredit the usefulness of LIDAR. It's another tool you can use to improve the accuracy of your model. Sure, you can use a screwdriver, flip it around and use it as a hammer. But if you need to deal with nails, it's better to grab a hammer instead.

1 more reply

Slartie3y ago

> my car will twitch when pedestrians turn as if they're going to enter the road (where human drivers mostly don't notice, and if they do they ignore it)

As long as those pedestrians DO NOT actually enter the road after those turns, any "twitching" of your car in response is an ADDITIONAL SAFETY PROBLEM, because other drivers might notice the erratic movements of your car and do erratic things as well, which in the end might result in accidents that wouldn't have happened had your car not "twitched".

Especially "twitchy" AIs like that of your car might very well "re-twitch" on noticing your car doing small, but erratic and rapid changes in behavior, thereby initiating a "twitch escalation spiral".

freejazz3y ago

Disregarding everything else about your post, which was better addressed by others, I'm amused that you think the FSD being twitchy reflects safety.

Gordonjcp3y ago

The thing is, they don't see as well as humans. They don't respond to changes in the environment until a car is actually in the middle of changing lanes.

It's like being driven around by a drunk person - the reaction happens loooooong after the action that causes it has started.

jeromenerf3y ago

> We want the automation to do what a human pilot would do. And that works with eyes.

Humans can’t really turn senses off, so they have coffee when driving. Touch and hearing are quite important to “read the road”. Equilibrium too.

1 more reply

clouddrover3y ago

> You're not asking the automation to walk

Tesla should aim for parking first. Teslas do poorly at self parking:

https://www.youtube.com/watch?v=nsb2XBAIWyA

codeflo3y ago

Exactly. The way biology solved something may not always be the best way to do it with technology, because the constraints or so different. And to be more blunt, I think none of the problems where technology surpassed human performance were achieved by doing it the exact same way. From locomotion (legs vs. wheels) to playing chess (strategic intuition vs. billions of calculations).

m4633y ago

> "humans don't need it so cars should not need it too"

I think of parking and I'm reminded of "the camry dent"

https://duckduckgo.com/?q=the+camry+dent&iax=images&ia=image...

yarg3y ago

Human binocular vision is what has been used to drive cars up until now, so it can be done (with a few thousand million years of iteration).

Ideally cars will be self-driving using only passive sensors - but I do think that Musk/Tesla completely missed the value of active sensors in training.

gibolt3y ago

Pretty sure humans haven't been striving for drivers licenses for millions of years...

Tesla does use Lidar on a small number of test vehicles for assessing ground truth. However, they have built enough of a data pipeline and fleet data acquisition to use repeat clips to determine ground truth better than human labelers.

1 more reply

Nomentatus3y ago

No, he precisely said that the difference Lidar made was tested, and the delta (difference made) was quite small; not enough to outweigh the downsides. Elon has noted that humans do well, and that's relevant, but that observation was also tested, re lidar.

croes3y ago

>and the delta (difference made) was quite small

But why. Because LIDAR doesn't help much in general or because the Tesla engineers aren't good at using the sensor data?

Same with the manufacturing.

Sounds to me like Tesla can't handle complexity. And if they can't handle the complexity of manufacturing, they surely can't handle the complexity of full autonomous driving.

1 more reply

NBJack3y ago

Basically everything he said as a justification (sourcing, firmware, etc.) applies to every sufficiently advanced part of the vehicle. By that logic, they should not be using touchscreens on the center console, etc.

1 more reply

dmix3y ago

> This whole question about the vision boils down to

Is that really what the problem boils down to? Or how it was decided? Or are you just questioning a common meme that comes up in internet debates about car AI?

eachro3y ago· 9 in thread

So the key question is how much of an improvement does radar/sensors/etc give you over just using computer vision?

diskzero3y ago

As someone working in the field, I would never choose to eliminate the information provided by radar, lidar and any other sensor technology. Depending only on camera information would be too limiting.

bpanon3y ago

You haven't solved the problem though.

throwntoday3y ago

If we're to trust what Elon and the team said during the last few AI day, none. They stated that the ultrasonic and radar sensors were actually performing worse than their pure vision stack.

justapassenger3y ago

Real life performance of vision only stack doesn’t agree with it.

quonn3y ago

I‘m ready to be convinced that this will be true at some point for the ultrasonic sensors. But by design the radar can see things that vision can never see. It seems like a bad idea to take that away.

1 more reply

kevin_thibedeau3y ago

Vision systems don't work at all in fog or heavy rain/snow.

Dunedan3y ago

Up to a certain degree they work, as humans can drive in fog or heavy rain/snow as well. If visibility is so bad that a human wouldn't be able to drive, I wouldn't want to sit in a self-driving car either, no matter if it does use vision only or has additional sensors.

nicbou3y ago

Any more or less than the human equivalent?

I'm not following the news, but I haven't seen any videos set in what Canada looks like 4 months per year.

m4633y ago

A better "answer" might be to make them an option and let the market decide.

For many (MANY) years airbags were fought by the auto industry even though people wanted them.

bekantan3y ago· 8 in thread

He explains it quite well: all necessary information is already in the pixel-space and adding more sensors slows team down more than it improves the system performance. My understanding is that major blockers are not in perception area anyways, would be great if someone with relevant experience could comment if this is indeed the case.

diskzero3y ago

I am a principal engineer for a major autonomous vehicle company. You can break this statement down into two components:

Adding more sensors slows his team now more than it improves system performance

I'll take his word on this. It is a lot of work to incorporate multiple sensors.

All necessary information is already in the pixel-space.

I hate to disagree with someone as distinguished as Karpathy, but this is simply not what I have observed from all of that data that we have access to. Given my knowledge of the various stacks deployed today, I would never ever ever get into a vehicle using a vision only stack and expect it to perform in some of the challenging environments encountered during testing.

alsodumb3y ago

I think one should distinguish between 'all necessary information is already in the pixel-space' vs 'we already know how to extract all the information needed from pixel-space'

The fact that (most) humans manage to drive around safely and successfully in current roads proves that the information needed exists in the pixel-space (not just current image, but say current + history). We don't yet have stacks that can successfully map everything needed from this information but I don't think Dr. Karpathy ever claimed that.

(I am not a principal engineer but a mere PhD student who argues daily with people on how RGB information is underappreciated and under utilized)

3 more replies

kfarr3y ago

Full on agreement. There are literally videos of Teslas smashing into stationary vehicles on the highway at night using only vision camera for FSD. No way any rational actor could claim the visible pixel space is sufficient in that scenario compared to LIDAR, Radar, etc

1 more reply

cma3y ago

Compare their occupancy map with what you get out of the latest LIDAR Waymo is using and it is scary (occupancy is harder as it fills in what is occluded, but Tesla's looks like Minecraft-style 1x1x1m resolution).

Dunedan3y ago

Out of curiosity: Could you please elaborate what such challenging environments can be?

jbverschoor3y ago

It’s good enough for people, so all the info is there.

Doesn’t mean it’s better or easier

6stringmerc3y ago

I have driven in extreme rain flash flood conditions in north Texas and I consider this a specific challenge, natural, that would defeat his system.

pclmulqdq3y ago

Any amount of snow would do this too. It severely reduces the color space of road features.

1 more reply

woeirua3y ago· 7 in thread

Andrej's argument about more sensors adding entropy strikes me as disingenuous considering that in the next question he then says that Tesla's biggest advantage over everyone else is "the fleet", which clearly introduces orders of magnitude more entropy into the system than anything else. Can you imagine the infrastructure required to gather video from "the fleet" anytime a car sees something unexpected? How about diagnosing what went wrong in that specific instance? How many thousands of these cases do they see everyday?

Given the progress of the FSD "beta" to date, and the fact that Andrej _left_ Tesla, I'd wager that he knows that this approach is a dead end, but he won't say that because he'd get himself in hot water with Elon.

dmix3y ago

One is infrastructure entropy and the other is software engineering entropy (we're still talking about data from one type of sensor, just at larger scale).

Most tech startups have 10x+ more problems with the engineering part than the infrastructure/ops part.

Also this is one person's perspective from a large team. His answer might be biased because he's an engineer and I doubt his was the only voice in the debate.

ralfd3y ago

Assuming Karpathy is lying is quite a hot take to disregard his opinion.

No. He makes it clear that he is very convinced about it. There is no relativism, no weasel words or couching in maybes. He could be wrong, of course, but he believes in what he is saying.

nicbou3y ago

> no weasel words

The video starts with him reframing the question instead of answering it

1 more reply

woeirua3y ago

Here's the thing though, if you're Karpathy and you are 100% confident that Tesla's approach is on the cusp of delivering full L5 autonomous driving, then why leave? Surely, Tesla would become the most valuable company overnight if they could actually do it. He would be showered with accolades when they finally finish it. To leave, _before_ any of that happens, says it all.

pyinstallwoes3y ago

Your comparison is a little short sighted because your example would require fleet * additional sensor entropy rather than fleet on itself. And if fleet on itself is adequate then anything extra is simply inefficient at best.

woeirua3y ago

No, my point was that the fleet itself adds entropy to the system because it creates a lot of noise for engineers to track down edge cases and other weird things that happen when you're collecting data from a lot of "sensors", i.e. cars, at once. The exact same argument Karpathy made about why they dumped ultrasonics and radar.

The problem is that so far, Tesla has yet to demonstrate that the fleet _is_ sufficient. IMO, if the fleet was enough to get to L5 autonomous driving, then they would already be there.

Nimitz143y ago

You don't understand the term entropy.

011000113y ago· 6 in thread

I still suspect it's because they need to preserve compute resources for vision processing. Sensor fusion is likely eating up too much of their current HW and limiting their progress in other areas. I suspect Tesla will have to admit they need to upgrade the current HW before they ever 'solve' FSD.

snovv_crash3y ago

The amount of compute that sensor fusion uses is miniscule compared to running a NN or computing stereo depth maps. Sensor fusion runs in the background of your phone the whole time to power things like [0] for example.

0. https://sensor-js.xyz/demo.html

011000113y ago

You are confused my friend.

Reading sensor data is not the same as feeding that data to a neural network and asking it to form a worldview composed of possibly conflicting sensor data streams(i.e. lidar vs vision vs ultrasonic).

You are somewhat correct that it is quite trivial to read sensor data. For many sensors, there is some work which needs to be done to denoise or cleanup the input data. That's not where the story ends, however.

1 more reply

Nomentatus3y ago

Well, yes and no. Integrating the data and adjudicating conflicts between sensors is a real task, too. Also having just two opinions doesn't necessarily help if they conflict, and the lidar is the thinnest source of data. How do you coin flip that? You likely end up just discarding the Lidar's conflicting opinion.

1 more reply

vhold3y ago

One camera produces millions of bytes of data every single frame, an ultrasonic sensor is useful producing just 1 or 2 bytes of data in the same time span. (distance to something within the sensor's cone).

So it seems like a totally ridiculous argument that ultrasonic sensors create some kind of data processing overload.

An ultrasonic sensor makes it possible to implement incredibly simple and reliable safety features with well known performance characteristics. Processing an image with ML to produce the same effect has tons of edge cases where it might not work, and nobody knows when it won't work, and every update to the system could introduce regressions.

It's why they had to disable certain features when they got rid of the ultrasonic sensors. Those features may come back some day, but I bet they'll never be as reliable, and certainly won't be as predictable.

https://www.pcmag.com/news/tesla-removes-ultrasonic-sensors-...

011000113y ago

Not saying they create a data processing overload. I'm saying they're fed to a deep learning architecture that must then try to fuse disparate sets of data into a coherent picture. The neural network becomes simpler when you remove that function and just focus on visual processing.

another_devy3y ago

I think not using LiDAR would be a good bet. LiDAR in nutshell allows you to give an object in vision 3D space, relative speed which 2 human eyes can do very fast. Problems with solving this in vision based input is in dataset and interpretation. Computer vision and AI can’t effectively apply a human drivers judgment with better camera and processing power, at least not yet.

nelox3y ago· 4 in thread

“The world is designed for human visual consumption” and “[vision] has all the information you need for driving”. While vision may be sufficient, I would say that other senses, such as hearing, touch and smell augment driving very well. Especially with regard to situational awareness. e.g. The sirens of emergency vehicles are typically the first indication of their presence, which often can be felt as well. The wail of a tornado siren, similarly. Loud throbbing motorcycles do much to improve rider safety simply due to them broadcasting their presence. At railway level crossings, drivers should slow down, look and listen for oncoming trains. The smell of wildfire or bushfire smoke provides enhanced warning or nearby danger. So to say vision is sufficient, does not fully take in to account the driver experience, especially where safety and situational awareness are concerned.

stonogo3y ago

One thing that Tesla engineers seem to keep forgetting is that human eyes are not fixed into a steel block. We can tilt our heads, crane our necks, hold hands up to block glare, and so much more. Human vision is responsive, adaptable, and not comparable to some cameras bolted to a car.

And even with all these advantages, tens of thousands of people are killed in car crashes every year. Some people make a compelling argument that this is evidence that human vision doesn't have all the information you need for driving. While I don't go that far, I do think autonomous driving has a long way to go.

djleni3y ago

THANK YOU. Reading this thread was getting to me because so many comments say humans drive eyes only.

I use far more than just vision driving:

- sound, for emergency vehicles, detecting vehicles outside of my field of view if my windows are down or the vehicle is loud, tire sound (especially in snow and rain), engine sound (more feedback in snow or ice about what my tires are doing)

- touch (steering feedback, gives information about grip in some circumstances)

- acceleration (can feel if the rear tires break loose in a turn on snow or ice, or if I’m sliding while breaking)

And probably many more

AlotOfReading3y ago

It's worth noting that most autonomous vehicle solutions have dedicated microphones for emergency vehicles and sensors that can detect slip. I had a little dashboard measuring wheel slip at <company>. It mainly ended up mapping train tracks and potholes.

2 more replies

mola3y ago

We even use Doppler! Our hearing is capable of sensing movement (speed and acceleration) using the Doppler effect. Our hearing also has a remarkable ability to locating sound source incoming direction.

mongol3y ago· 4 in thread

What is it that makes Lidar so expensive? Is it something intrinsic to the technology that prevents costs from coming down?

fooblaster3y ago

Lidar is coming down significantly in the next few years and is available from multiple tier 1 automotive suppliers. Technologies like high power vcsel arrays and highly integrated and photosensitive SPAD arrays/detector logic are making this possible. Prior lidar devices used non automotive grade discrete components like edge emitting laser diodes, high speed adcs, and APDs. These were expensive and hard to integrate, and aren't present in mass market inexpensive AM lidar coming to the market.

diskzero3y ago

A LIDAR sensor is a complex device with spinning motors, mirrors, lasers and more. Costs are coming down and less-expensive and more capable devices are coming to market. Once the price-points come way down, I wouldn't be surprised to see Tesla reconsider their decision to exclude them from their sensor platform.

Nasrudith3y ago

They are also fairly power-hungry from what I heard.

frxx3y ago

There are also solid state LIDARs these days which involve no moving parts. Still use more power than radar though.

dreamcompiler3y ago· 3 in thread

It's obviously a stupid decision to remove a direct source of range data (radar and ultrasound) in favor of an indirect one (vision).

But on second thought this doesn't bother me that much because Tesla FSD is absolute garbage even with radar (and I don't think Tesla will get away with selling the FSD snake oil for much longer), so if vision-only is good enough for the base-level lane-keeping autopilot functionality and it makes the cars cheaper, maybe that's a good thing.

dmix3y ago

Even though the risks are high the outcomes will always keep them honest. Whether they like it or not.

This isn't like Facebook continually releasing a product that sucks but people will use anyway.

Tesla is constantly working against the clock and everything they do has real world consequences. There are multiple gov agencies watching over it at all times. Of course there's lots of people with far higher risk tolerance than is being exhibited but if it does turn out badly IRL this will get shut down pretty quickly.

The good news is Tesla has the ability to cripple this feature remotely without a costly/lengthy recall if that does happen.

AlotOfReading3y ago

Outcomes have to be truly and utterly fucked for "new" types of problems to be noticed in the automotive industry. Take the Toyota Unintended Acceleration case, where completely negligent software quality took over a decade and at least 89 lives to be noticed and (partially) rectified.

Regulators have keen noses for very particular types of issues and rely heavily on manufacturer judgements on a lot of the rest. Issues that aren't in any of those fairly narrow categories need to be extremely public or extremely egregious to attract their notice.

1 more reply

epgui3y ago

Even if you were right, there's nothing "obvious" about that.

hbarka3y ago· 2 in thread

Intuition + other examples tell you that radar and ultrasonic sensors work. Why do we twist ourselves to believe otherwise?

Elon removed the radar and ultrasonics for the simple fact that its supply chain logjam was screwing up the manufacturing schedule. They also realized that the profit margin can be sustained in an inflationary environment by simply removing these parts [1]. “Oh, we were going to remove them anyway because humans can see fine with just eyes and no radar, why can’t cars?” Tesla then turned up the marketing of the AI/vision hype lever once more to toss another shiny tech object and get buyers to ignore the fact that there is a regression of features in the newer cars going forward.

[1] https://youtu.be/LS3Vk0NPFDE

mgoetzke3y ago

So all those engineers are lying when they talk about this topic ?

9935c101ab17a663y ago

Uh, which engineers are you referring to?

oxplot3y ago· 2 in thread

Here's the summary (mixed with observations from Munro and past Tesla presentations):

- Costs money: the physical sensors (a dozen of them), wiring it up, assembling it, maintain inventory, code it, etc.

- Time spent on maintaining, improving software stack for the non-vision sensors as well as efforts needed to fuse the data with vision, takes away from focusing on vision alone. It also holds back vision in relevant areas.

- Existing non-vision sensors used by Tesla are orders of magnitude lower fidelity than vision. It has historically (as the case with radar) led to vision essentially having to overriding radar because vision just performed much better (see AI day 2021).

My take:

As with any new tech, it likely sucks at the start (think HDD and SSDs, and how a mechanical thing with lots of moving parts was way more reliable than SSDs at the start). However, by essentially moving past the local maxima, you get to innovate better, faster in the future.

In case of ultrasonic sensors, they are for low speed cases anyway and most people are fine without them. Majority of fatalities and injuries happen at higher speeds.

rootusrootus3y ago

That's great for them. But when I'm shopping for a car, I get to choose between a manufacturer that installs the extra sensors and seems to be able to get them to work, and Tesla.

Used to be that Tesla was blazing a trail and if you wanted a good EV, that was what you got. Now, if you want the best EV, it's usually not going to be a Tesla. And I don't see that they're making any decisions that will regain them that title. The incumbent manufacturers are quickly proceeding to eat their lunch, just like many of us predicted would happen. Turns out the hard part of making a successful car isn't the drivetrain.

oxplot3y ago

> Now, if you want the best EV, it's usually not going to be a Tesla.

Would love to hear what you consider "good" and what specific EV ticks the most good features that a Tesla Model 3 doesn't.

2 more replies

a-dub3y ago· 2 in thread

i'm not sure if i buy his argument that the "delta is not big enough." i have some experience with realtime ai systems and i've noticed something interesting about them.

they have a non-smooth capability curve, where they can demonstrate proficiency in activities that in regular computer programs or people would imply a complete and continuous path of capability that has been mastered to achieve the demonstration, but ai systems are weird in that can do amazing things, but have loads of little holes and failure modes along the way.

for example: gpt-3 can write you a shell script that will emit a c program that prints a poem about people you know, but will fail at very basic logic, sometimes.

in light of that, having additional support data like radar or lidar seems like the right move for plugging all those little holes in capability that turn up in real ai systems.

because at the end of the day, when you're driving a car in the real world and lives are at stake, simply interpolating or averaging over uncertainty seems awfully deadly and the only way to ameliorate that uncertainty seems to have multiple redundant sensory systems that can stand in for each other as conditions change. just like us!

Nomentatus3y ago

They do "fall off the edge of the world" a lot; but so do human neural networks; I've seen a bad crash as a result of a human simply pulling out of a driveway right in front of a motorcycle, 'cause they're rare. She had tagged the motorcycle as a bike while it was farther away, then boom. Her interpolation (while checking the other side) didn't work, and her averaging over uncertainty didn't work either because motorcycles are rather rare up north, they aren't the average vehicle. I've made a similar error re a kid on a wall (he suddenly jumped directly into the bikepath) but managed to avoid him (my bike zoomed to his left and I tumbled past his right. He wasn't hurt, although I got a severe wrist sprain from throwing the bike to the left.) As a driver I behave very differently around kids on walls, now. It was just an edge case I'd never encountered, and I didn't have enough data to calculate under uncertainty.

api3y ago

I see surprisingly little discussion of overall statistics on safety of self drive vs humans, and what I do see is often self reported by companies or by equally potentially biased sources in the media. I’ve searched many times and a straightforward stat seems hard to find.

3 more replies

JaggerJo3y ago· 2 in thread

This sucks for parking. It is simply (physically) not possible for the existing cameras to see the area directly in front of the car.

So how would this work for parking?

A: Add more cameras so there are no dead areas in front of the car

B: build a model in vector space when driving towards a parking spot and assume blind spots don't change. (still sucks)

oxplot3y ago

> It is simply (physically) not possible for the existing cameras to see the area directly in front of the car.

Think about how a human driver does it, given his/her even worse vantage point. They model what's in front/behind the car from afar and remember what's where as they approach it. There are other signals as well, such as continuation of a kerb, etc.

I think people keep forgetting that Teslas run hundreds of ML prediction tasks all the time. Watch recent AI day and their talks about "occupancy network" to get a sense of the car's ability to:

1. Construct 3D model of its surrounding in real time; 2. Remember occluded sections based on what's it's seen previously.

watwut3y ago

Human driver constantly turns head around to where he is Mos likely to hit something.

1 more reply

friend_and_foe3y ago· 2 in thread

I remember watching an interview with George Hotz when Comma.ai was young, where he essentially said this as a critique of Tesla. He's a bit of a showman and likes to invite a little controversy when he says things, but I found myself agreeing with his point. It's not surprising to see such a practical company like Tesla face the facts about all these sensors eventually.

P_I_Staker3y ago

> such a practical company like Tesla

Where are you getting that from? Tesla has always seemed pie in the sky, and hardly a down to earth company at all throughout the history.

I'm basing this one both their public record, and reputation within the auto industry.

friend_and_foe3y ago

Tesla created the modern electric vehicle industry. They're innovators, sure, and they push limits, but their priority has always been to actually build. And they do.

lawrenceyan3y ago· 2 in thread

I can see a path where with only cameras, Tesla might be able to reach level 4 autonomy in perfect conditions.

But the biggest thing that comes to mind is what happens at night. Are they only going to enable self-driving during the day?

speedgoose3y ago

Wouldn’t turning on the headlights fix the problem at night?

Snow and ice may be another challenge but night sounds easy.

lawrenceyan3y ago

When you drive at night with headlights, tell me honestly how confident you feel driving versus during the daytime.

yreg3y ago· 1 in thread

They are pretending as if the USS were there only for self driving.

I use them as well!

georgeg233y ago

Indeed the ultrasonic sensors are pretty critical for (human) parking and backing up.

post_break3y ago· 1 in thread

I think this is the real reason: https://www.youtube.com/watch?v=LS3Vk0NPFDE

Cost cutting.

taf23y ago

Probably a benefit but also imagine the difference in software. You Boolean logic like radar says we are gonna hit , vision sensor says we have nothing there, sonar indicator says nothing there… so the idea of having just a really good single source of truth probably makes a lot of code a lot less complex… I have no way of knowing either way but from a less is less that can break point of view this seems kinda good… like many things only time can tell and at least we have different groups of people pushing on different potentially viable paths forward so that we can soon hopefully know if self driving is possible (wide scale) one way or the other

nova220333y ago· 1 in thread

At 2:05. Suddenly you need a column in your sqlite telling you what type of sensor it is....

Seriously? This is a major technical challenge?

danpalmer3y ago

The challenge isn't the storing of the flag that says which sensor it has, it's testing the combinations, training for the different scenarios, treating the incoming data differently, and so on.

0xfffafaCrash3y ago· 1 in thread

Seems like a very political answer from Andrej. Of course he’s not going to outright say “yeah, we’re prioritizing the profit margin over accuracy and safety considerations” if he wants to keep his job, but that seems to be the short of it. Others may choose to follow, at least in the short term, but it won’t be because of “entropy” making the system worse (you can always build a model without a data source and then refine the results based on the added value of a data source) but doing so just because it will save lives doesn’t cut it when the goal is to cut corners and costs to maximize profit. I can believe that some types of sensors aren’t worth the trouble in terms of additional signal to noise ratio, but I can’t believe this is one of them.

Nomentatus3y ago

The entropty he's taking about comes from many sources, in particular opening yourself up to maintenance or supply side errors, and just overloading team attention (always at a premium) for no net return. It's not just CPU cycles (although that's part of it, that hardware could be doing something else useful.)

nielsbot3y ago· 1 in thread

All I heard was "cost savings, cost savings, cost savings"

oxplot3y ago

Well watch it again and again and again. He talks about the determental effect of lo-fi sensors in conjunction with vision among other things.

justapassenger3y ago· 1 in thread

TL;DW.

Tesla doesn’t know how to do change management.

Nomentatus3y ago

Given Andrej's explanation this verges on mere gainsaying. Could you expand on what you think they should be doing? Other firms would also encounter "entropy", they always do; what's your way of reducing that severely?

1 more reply

throwaway4good3y ago

Let me reframe the answer:

"We removed them because they cost money. And we are trying to make money ... at least right now.

Listen, this pure autonomous self-driving car stuff is never going to work, so who cares if we have these gadgets or not ..."

Animats3y ago

From my DARPA Grand Challenge days, I used to have an Eaton VORAD automotive radar. This was an early design - 24GHZ, 1 scanning axis. It could see cars, but not bicycles, at least not reliably. For several months, I had one pointed out the window of my house, looking at an intersection. So I had a V-shaped wedge on screen, and could watch the cars go by.

It's a Doppler radar, so you don't get any info from things stationary relative to the radar, but you do get range and range rate. And the quality of that data is independent of distance. We used it mainly as a backup system for the world model built with LIDAR and (to a very limited extent) vision. The VORAD data could lower the speed limit for the rest of the system, and if a collision was about to happen, it would slam on the brakes independently of the world model.

The big problem with coarse automotive radar is that it can detect targets, but doesn't tell you much about them. Cars, trash cans, and metal road debris all look about the same. There's also a lot of trouble from big flat metal surfaces being mirrors for radar. We were willing to accept slowing down for ambiguous cases until the other sensors could get a good look. Drivers hate that if road-oriented systems do it.

Modern units are up around 70-80GHz and often have 2D scanning, which is a big help. I haven't seen the output from a modern automotive radar. I was expecting that by now, low cost millimeter microwave systems (200-300GHz) would be available, providing detailed images somewhat coarser than you can get with light. You get range and range rate, and you can usually steer the beam electronically rather than mechanically. The technology exists to get high-resolution radar images, but is mostly used for scanning people for weapons at checkpoints. It hasn't become cheap yet.

dane-pgp3y ago

I think there's an interesting general optimisation problem here of balancing the accuracy/performance of a software/hardware system, against the goal of making that system easier to iterate on and develop.

Presumably this is a matter of working out if you are at a local maximum or not, and thinking about what properties the ideal solution will have. It also matters if you have other competitors that might be racing towards the ideal solution faster than you, potentially patenting their progress along the way.

60Vhipx7b4JL3y ago

From an engineering perspective I would ask: Can your sensor package understand the environment to the required (low) failure rate?

Radar/Lidar/Ultrasonic is going to give you information that your camera systems will not give you. It does not matter if the delta of information is little. If this little is required because you can't obtain it otherwise, you still need it.

If you just rely on the fleet, you rely on the things you have seen. What about the objects that you have not yet seen?

gnicholas3y ago

How does it make sense to not even have sensors for parking? If you think they don't help during normal-speed driving, that's one thing. But they obviously help during parking, since (IIRC) they've had to disable certain autonomous features until they get their vision-based systems upgraded to be able to fill in this gap.

xnx3y ago

Sensor fusion seems to be another thing that Tesla is not good at.

EVa5I7bHFq9mnYK3y ago

From the first principles point of view, it comes down to radar and ultrasonic having much higher wavelengths than optical. Which results in much lower amount of incoming information, worse resolution and higher interference if many cars radiate the same signals on a busy street.

jakeogh3y ago

Giving human[1][2] drivers better situational awareness[3] is the future. Specifically open[4]:

a. Windshields that clean the inside as well as the outside.

b. Better eyeglasses[5].

c. User controllable hi-res HUD thermal IR overlay.

d. Headlights with adaptive notch filters so the oncoming vehicle can pick an empty spectral range... without the source being monochromatic (with required adaptive filters on the recieving end)... and/or really good coronagraph's.

e. Brake control[6].

Any entity capable of driving[7] in a population of humans (including adversarial humans) is sentient[8], and has real skin in the game. It would be unethical to lock one in a car:

[1] https://news.ycombinator.com/item?id=33213860 (analog FPGA)

[2] https://news.ycombinator.com/item?id=21106367 (general AI)

[3] https://news.ycombinator.com/item?id=16646112 (2018)

[4] https://www.tesla.com/blog/all-our-patent-are-belong-you (2014)

[5] https://patents.google.com/patent/US7744217 (2007)

[6] https://news.ycombinator.com/item?id=18013388 (2018)

[7] no human behind the wheel, no human to correct impending mistakes, but (critically) with one or more humans in the car.

[8] The idea that non-biological machines can have 'self' is a window into modern mass transformation. Please checkout the analog FPGA experiments linked above.

danbmil993y ago

Musk is also famously against using lidar. He doesn't understand/accept that an autonomous vehicle needs any sensors that humans do not posess.

sidcool3y ago

I feel that was more an operational answer than an engineering one .. I still feel that depth perception of vision alone is unreliable.

dncornholio3y ago

Just remember folks, we will have full self driving vehicles by the end of this year!

ra73y ago

If I were a Tesla fan/investor/FSD customer, I’d be very concerned that the former (effective) tech lead of FSD doesn’t know about sensor fusion or that it’s a solved problem for majority of the companies in this space.

ornel3y ago

Video summary:

https://www.summarize.tech/www.youtube.com/watch?v=_W1JBAfV4...

superkuh3y ago

Humans don't use radar or ultrasound sense to drive. If we want cars that drive like humans drive they should use the same senses. For example, in the northern parts of the USA there is snow cover for much of the year and lanes are emergent from flocking without any absolute reference to the actual location of the lanes. The reasons everyone choses the same places to drive are that they see the same environment with the same senses. Even if autonomous driving with radar and ultrasound was made to work if it picks the correct lane position and all the humans pick the wrong new lane position then the car is wrong, not the humans.

mavili3y ago

Did anyone else catch Andrej's "sqlite" comment? If that is not just a simple analogy, Tesla may be using sqlite in their cars? :D

sgjohnson3y ago

Does this mean that now when someone smashes one of their bumpers on a Tesla, the insurance will no longer have to total the entire vehicle?

solardev3y ago

Cuz Muskdaddy wanted mo money. There, mystery solved.

julienreszka3y ago

Geohot said something similar years ago already

smrtinsert3y ago

Hm I'd rather have someone from Twitter audit this decision

bigtex3y ago

Did Lex ask him why Tesla love to crash into emergency vehicles?

1 more reply

KVFinn3y ago

TLDR: Tesla thinks LIDAR hardware is more expensive than the performance improvement it provides.

I didn't like his line of logic about how vision is necessary and sufficient, because that's how humans drive. Okay sure, but if some combinations of non-human sensors could drive better and/or cheaper than a vision only driving system, surely he would not argue for sticking with vision only? Maybe adding non-vision sensors lets you save hardware and software resources on the vision part of the system.

j / k navigate · click thread line to collapse

471 comments

197 comments · 43 top-level

CharlesW3y ago· 43 in thread

flashgordon3y ago

That was hilarious. Basically (unless this needs a reframing/realignment/repositioning/reorienting):

Q: "are less sensors less safe/effective?"

A: "well more sensors are costly to the organization and add more tech debt so safety is orthogonal and not worth answering".

jholman3y ago

Uh, that's not at all a good paraphrase.

Q: "Does [removing some sensors] make the perception problem harder, or easier?"

(note, this is literally what Lex asked, your restatement is misleading)

Again, I'm not saying Karpathy is honest or correct. I'm saying that the rephrasings in this comment and this thread are hilariously unfair.

3 more replies

judge20203y ago

Munro’s cost breakdown is much more informative in just how much it’ll save in terms of parts/labor. https://youtu.be/LS3Vk0NPFDE

Also, Tesla’s strategy for safety is seemingly “excel in industry standard tests, ie. IIHS and EuroNCAP”, so this might be a case of the measure becoming a target.

1 more reply

Retric3y ago

The video has a more reasonable answer.

His standpoint can be summed up as “I think some of the other companies are going to drop it.” Which would be really interesting if true.

2 more replies

seanmcdirmid3y ago

4 more replies

aeternum3y ago

Lack of focus is a major problem for companies and we all know that tech debt leads to increased bug counts.

Team focus on vision which is by far the highest accuracy and bandwidth sensor allows for a faster rate of safety innovation given a constant team size.

5 more replies

FreakLegion3y ago

YZF3y ago

Taking his point to the extreme why use 8 cameras? just use 4? 1? One photo-diode?

4 more replies

bumby3y ago

latchkey3y ago

Not just risk to passengers, risk to any thing in proximity to the vehicle while it is in motion.

threeseed3y ago

It was an ominous answer.

Except this isn't a new phone or sneakers we are trying to take to market it's something that will directly impact people's lives.

Nomentatus3y ago

1 more reply

throwawaylinux3y ago

Why not have a thousand sensors if more is better?

mensetmanusman3y ago

It can’t be solved without a few 10s of billion in infrastructure investment.

taneq3y ago

Whose money should “we” be spending on this grail quest?

minhazm3y ago

2 more replies

3apo3y ago

dreamcompiler3y ago

1 more reply

touch_abs3y ago

steve_adams_863y ago

I’m not saying this is definitely true, and at the moment we probably can’t verify it either. I’m just “steel manning” his case, as Lex loves to say.

I think you’re probably correct that the business aspect was a significant factor, but perhaps it wasn’t everything.

hammock3y ago

Devils advocate, if the cost of working to improve cameras to the point where they eliminate that delta is lower than the cost of using the sensors instead, then it is a net benefit

tophi3y ago

Yes, the current delta was not massive and will shrink over time.

By getting rid of the extra sensors they eliminate a temporary crutch and focus resources on the simple solution.

Not a new concept by the way. Henry Ford was obsessed with simplifying and eliminating every part that wasn’t necessary on the model T for virtually all the same reasons.

2 more replies

itsoktocry3y ago

>f the cost of working to improve cameras to the point where they eliminate that delta is lower than the cost of using the sensors instead

In a vacuum, how can cameras ever be better than cameras + other sensors?

2 more replies

adolph3y ago

If anything this is an acknowledgement that George Hotz was right in focusing on optical sensors with Comma.ai.

pmarreck3y ago

Two thoughts:

Anyway, doesn't AEB (automatic emergency braking) have to be installed in every car, by law, in the US, around now? And wouldn't that be less reliable if done via vision?

n0tth3dro1ds3y ago

>it is likely far simpler to just stick to stereoscopic vision data (the same thing the human genome decided!)

There’s a lot more to perception while driving than just stereoscopic vision.

I think Andrej and Tesla massively overestimate vision’s sole ability to solve the problem. Humans are fusing lots of sensation to drive well.

1 more reply

sfifs3y ago

> likely far simpler to just stick to stereoscopic vision data (the same thing the human genome decided!)

Yeah and till we had reliable and powerful artificial lighting, it was highly unsafe to journey in low visibility/ darkness. We used to finish journeys when darkness fell.

Animals that do require precise movement in low visibility (bats, dolphins) conditions often evolved ultrasound solutions.

So should we license Tesla vehicles to only operate when visibility and weather forecast is good and not drive in the dark at all?

1 more reply

quonn3y ago

L0stLink3y ago

ion_fury3y ago

this is logically the same thing that he said in the interview, so whats cynical about it? how is it underhanded?

tsimionescu3y ago

I think it's possible that professional movie cameras (with the appropriate lenses) may have higher dynamic range than human vision. Good luck getting those cheaper than a lidar.

1 more reply

JumpCrisscross3y ago

> did not strike me as a dig

It wasn't a dig. It was calling out a bullshit move that, in my opinion, Andrej deployed out of panic more than strategically. (My evidence for this being Andrej eventually gave a good answer.)

2 more replies

Nomentatus3y ago

strangescript3y ago

asah3y ago

Risk to PEDESTRIANS even less.

patrec3y ago

> I thought it was telling that Andrej immediately "reframed" the question because Lex asked the "wrong question". This is a classic evasion technique

I agree with this assessment. However:

> Telsa can't support additional sensors without incurring a prohibitive amount of additional risk to Tesla. Risk to passengers doesn't appear to be a consideration.

CharlesW3y ago

> …you will need to pick a point at which you say further fatality reduction is no longer justified given the economic cost of achieving it.

(BTW, I enjoyed "stupidifying"! I'm sorry I made people stupider.)

ClumsyPilot3y ago

> risks to passengers will absolutely affect your bottom line and if are rational you will take them into consideration

percieved, not real risks to customers. PR matters more than reality

1 more reply

snotrockets3y ago

Tesla hasn't proven itself to be a capable major car manufacturer (they probably lead the minor category, at least in deliveries) in all but one: their de-prioritizing of human life.

kotlin23y ago

Is there hard data on how deadly they are vs. other auto manufacturers? There is definitely a narrative that the cars are dangerous, but I'd like to see that quantified.

xodjmk3y ago

JumpCrisscross3y ago

> Why don't the majority of large-brained animals have many eyes

Because of the cost of additional eyes. If Tesla is optimizing for cost against safety, that's sort of the point.

1 more reply

wonnage3y ago

The issue here is trying to infer distance based on complex image processing or just… measuring the damn distances.

1 more reply

petilon3y ago· 30 in thread

Cameras have poor dynamic range and can be easily blinded by bright surfaces. While it is true that humans do fine with only eyes, our eyes are significantly better than cameras.

[1] https://www.youtube.com/watch?v=X3hrKnv0dPQ

aeternum3y ago

davidgay3y ago

> They basically remove the concept of exposure entirely and simply pass the sensor photon counts to the neural net.

That sentence does not make sense. There's no such thing as a count without a corresponding interval that count occurred over. That interval is the exposure.

1 more reply

lambdasquirrel3y ago

The human eye isn't so great on those terms. But humans can raise their hand to block the sun if it's straight at our eyes.

petilon3y ago

But it doesn't appear to be helping. Here's an example accident where depth data from Lidar would have helped:

"Tesla later said that during the crash, Autopilot’s camera could not distinguish between the white truck and the bright sky."

https://www.nytimes.com/2021/12/06/technology/tesla-autopilo...

2 more replies

whiddershins3y ago

The replies to your comment don't seem to understand you at all. in the video link here

https://youtu.be/ODSJsviD_SU?t=4424

he clearly states 16x dynamic range as a result of direct photon processing.

emkoemko3y ago

how do you count photons continuously? what... this makes no sense, if you pass "the photon count" you just did a exposure... also how does a photo diode count photons?

moralestapia3y ago

Does it have electrolytes as well?

Nice tech and single photons and whatnot but it still runs into things that a radar with some really simple code wouldn't. ¯\_(ツ)_/¯

lostsock3y ago

That video is from 2020, but Tesla didn't remove radar until 2021. Meaning that the crash occurred with radar still active, which I feel just backs up what Karpathy was saying.

petilon3y ago

Well, the car may have had radar hardware but there are questions as to whether the software was using it:

https://www.nytimes.com/2021/07/05/business/tesla-autopilot-...

Excerpt:

“A radar would have detected the pickup truck, and it would have prevented the collision,” Mr. Rajkumar said in an email. “So the radar outputs were likely not being used.”

https://www.nytimes.com/2021/12/06/technology/tesla-autopilo...

Excerpt:

2 more replies

mensetmanusman3y ago

I have some experience with LiDAR, they fail easily if a water droplet is on the cover or if signs are too bright. It’s a whole different technology challenge.

imglorp3y ago

3 more replies

amelius3y ago

The problem with self-driving is that it is based on data, but the environment may change. See e.g. the case where Tesla thinks that firetrucks are roads.

So if fashion changes, pedestrians may suddenly look like road too, as just an example.

martindbp3y ago

> state-of-the-art classification networks have an accuracy in the 90% range.

[1] https://en.wikipedia.org/wiki/ImageNet

akira25013y ago

> While it is true that humans do fine with only eyes, our eyes are significantly better than cameras.

They also have better failure modes and a really sophisticated error management system. They are susceptible to optical illusions, though.

> It is not sufficient if, in aggregate, self-driving cars have fewer accidents.

BurningFrog3y ago

> a large portion of accidents are single vehicle accidents where the driver was at fault for the crash. Usually due to speeding, alcohol, youth, or a combination of them.

Also plenty of suicides in that group, which confuses the stats.

We really need SDCs to have fewer accidents than human drivers, excluding the suicides.

oldgradstudent3y ago

> but my expectation is around 20% of fatal accidents can in some way be prevented by automation.

Assuming, of course, that automation does not introduce its own failure modes.

That's a strong assumption.

bergenty3y ago

Not to mention waymo works well with LiDAR, cameras and radars. If you’re argument is it’s too hard to deal with that much data, it’s definitely the wrong answer.

jandrese3y ago

> While it is true that humans do fine with only eyes

paulcole3y ago

> Traffic accidents are one of the leading causes of death in the developed world.

croes3y ago

>We do not.

But not because of a lack of visual information.

Most of the time it's a la k of concentration or an overestimation of one's own driving abilities.

1 more reply

ec1096853y ago

Humans are amazing at driving. We typically go millions of miles in a lifetime without causing any fatal vehicle crashes and can generally handle unknown situations just fine.

AI is nowhere near that.

P_I_Staker3y ago

We are excellent at driving. It's shocking that there aren't more accidents.

rootusrootus3y ago

> Traffic accidents are one of the leading causes of death in the developed world.

Not even close, really. A bit under 1%. You are more likely to die from an overdose, or suicide. And much, much, much more likely to die from cancer or heart disease.

1 more reply

Nomentatus3y ago

freejazz3y ago

"But you still have to address his system argument"

1 more reply

Marazan3y ago

They cannot actually believe that.

2 more replies

whiddershins3y ago

He's not talking about costs - money. He's talking about costs - engineering.

It's about more information is not always better. It can instead muddy the waters. It can create confusion.

watwut3y ago

> It is not sufficient if, in aggregate, self-driving cars have fewer accidents

It would be sufficient if it would be the case. With actual proof.

Reality is that in limited abstract situations, self driving card maybe have some advantages. But, that is all that we can claim. And when self driving fails, somehow human is always the cause.

bumby3y ago

I disagree, but mainly because of the way humans perceive risk.

1 more reply

masswerk3y ago

TheLoafOfBread3y ago· 22 in thread

elteto3y ago

AtlasBarfed3y ago

soooo... you're agreeeing that non-vision isn't necessary since the control domain is so much simpler?

I personally think they should use as much data inputs as possible: radar, IR, LIDAR, mesh networks, fixed route information.

A general AI algorithm will never be able to properly account for flavors/tags/chunk info on routes. Especially since cloud precomputation is so available these days.

1132453y ago

The Tesla AI day videos [1] go into some detail about this. They use multiple networks that are dedicated to specific tasks.

[1]https://www.youtube.com/watch?v=j0z4FweCy4M (2021), https://www.youtube.com/watch?v=ODSJsviD_SU (2022)

latchkey3y ago

> Humans use the basic algorithm "stay in lane, drive forward"

If you've ever driven in Vietnam, that is so not true.

1 more reply

ajross3y ago

enragedcacti3y ago

Unless of course "recent" means n+1 where n is the version that crashed into something.

Collision with bollard in Feb 2022: https://www.youtube.com/watch?v=sbSDsbDQjSU

attempts to plow through cyclist Feb 2022: https://www.youtube.com/watch?v=a5wkENwrp_k

almost crashes into tram (can't gauge speed or direction?) Jun 2022: https://www.youtube.com/watch?v=yxX4tDkSc_g

Crashes into curb Aug 2022: https://youtube.com/shorts/8Mh1GjejdsI

Phantom brake Sep 2022: https://www.youtube.com/shorts/5v6j_oL7S-g

Almost colliding with bridge pillar 2 weeks ago: https://www.youtube.com/watch?v=5CMYkDWaqn0

Crashes into various objects in testing 2 weeks ago: https://www.youtube.com/watch?v=yyDxqEzV5Zc

threeseed3y ago

> The LIDAR folks were wrong, basically

I think your mistake is thinking LiDAR exists to solve the happy day scenario. It doesn't.

So I want to know what does FSD do when it sees a billboard of a person or when it is seeing a new object for the first time.

1 more reply

elteto3y ago

> The LIDAR folks were wrong, basically.

This is far, far from settled at this point.

1 more reply

fzeroracer3y ago

> The LIDAR folks were wrong, basically.

1 more reply

Slartie3y ago

> my car will twitch when pedestrians turn as if they're going to enter the road (where human drivers mostly don't notice, and if they do they ignore it)

freejazz3y ago

Disregarding everything else about your post, which was better addressed by others, I'm amused that you think the FSD being twitchy reflects safety.

Gordonjcp3y ago

The thing is, they don't see as well as humans. They don't respond to changes in the environment until a car is actually in the middle of changing lanes.

It's like being driven around by a drunk person - the reaction happens loooooong after the action that causes it has started.

jeromenerf3y ago

> We want the automation to do what a human pilot would do. And that works with eyes.

Humans can’t really turn senses off, so they have coffee when driving. Touch and hearing are quite important to “read the road”. Equilibrium too.

1 more reply

clouddrover3y ago

> You're not asking the automation to walk

Tesla should aim for parking first. Teslas do poorly at self parking:

https://www.youtube.com/watch?v=nsb2XBAIWyA

codeflo3y ago

m4633y ago

> "humans don't need it so cars should not need it too"

I think of parking and I'm reminded of "the camry dent"

https://duckduckgo.com/?q=the+camry+dent&iax=images&ia=image...

yarg3y ago

Human binocular vision is what has been used to drive cars up until now, so it can be done (with a few thousand million years of iteration).

Ideally cars will be self-driving using only passive sensors - but I do think that Musk/Tesla completely missed the value of active sensors in training.

gibolt3y ago

Pretty sure humans haven't been striving for drivers licenses for millions of years...

1 more reply

Nomentatus3y ago

croes3y ago

>and the delta (difference made) was quite small

But why. Because LIDAR doesn't help much in general or because the Tesla engineers aren't good at using the sensor data?

Same with the manufacturing.

Sounds to me like Tesla can't handle complexity. And if they can't handle the complexity of manufacturing, they surely can't handle the complexity of full autonomous driving.

1 more reply

NBJack3y ago

1 more reply

dmix3y ago

> This whole question about the vision boils down to

Is that really what the problem boils down to? Or how it was decided? Or are you just questioning a common meme that comes up in internet debates about car AI?

eachro3y ago· 9 in thread

So the key question is how much of an improvement does radar/sensors/etc give you over just using computer vision?

diskzero3y ago

bpanon3y ago

You haven't solved the problem though.

throwntoday3y ago

If we're to trust what Elon and the team said during the last few AI day, none. They stated that the ultrasonic and radar sensors were actually performing worse than their pure vision stack.

justapassenger3y ago

Real life performance of vision only stack doesn’t agree with it.

quonn3y ago

1 more reply

kevin_thibedeau3y ago

Vision systems don't work at all in fog or heavy rain/snow.

Dunedan3y ago

nicbou3y ago

Any more or less than the human equivalent?

I'm not following the news, but I haven't seen any videos set in what Canada looks like 4 months per year.

m4633y ago

A better "answer" might be to make them an option and let the market decide.

For many (MANY) years airbags were fought by the auto industry even though people wanted them.

bekantan3y ago· 8 in thread

diskzero3y ago

I am a principal engineer for a major autonomous vehicle company. You can break this statement down into two components:

Adding more sensors slows his team now more than it improves system performance

I'll take his word on this. It is a lot of work to incorporate multiple sensors.

All necessary information is already in the pixel-space.

alsodumb3y ago

I think one should distinguish between 'all necessary information is already in the pixel-space' vs 'we already know how to extract all the information needed from pixel-space'

(I am not a principal engineer but a mere PhD student who argues daily with people on how RGB information is underappreciated and under utilized)

3 more replies

kfarr3y ago

1 more reply

cma3y ago

Dunedan3y ago

Out of curiosity: Could you please elaborate what such challenging environments can be?

jbverschoor3y ago

It’s good enough for people, so all the info is there.

Doesn’t mean it’s better or easier

6stringmerc3y ago

I have driven in extreme rain flash flood conditions in north Texas and I consider this a specific challenge, natural, that would defeat his system.

pclmulqdq3y ago

Any amount of snow would do this too. It severely reduces the color space of road features.

1 more reply

woeirua3y ago· 7 in thread

dmix3y ago

One is infrastructure entropy and the other is software engineering entropy (we're still talking about data from one type of sensor, just at larger scale).

Most tech startups have 10x+ more problems with the engineering part than the infrastructure/ops part.

Also this is one person's perspective from a large team. His answer might be biased because he's an engineer and I doubt his was the only voice in the debate.

ralfd3y ago

Assuming Karpathy is lying is quite a hot take to disregard his opinion.

No. He makes it clear that he is very convinced about it. There is no relativism, no weasel words or couching in maybes. He could be wrong, of course, but he believes in what he is saying.

nicbou3y ago

> no weasel words

The video starts with him reframing the question instead of answering it

1 more reply

woeirua3y ago

pyinstallwoes3y ago

woeirua3y ago

The problem is that so far, Tesla has yet to demonstrate that the fleet _is_ sufficient. IMO, if the fleet was enough to get to L5 autonomous driving, then they would already be there.

Nimitz143y ago

You don't understand the term entropy.

011000113y ago· 6 in thread

snovv_crash3y ago

0. https://sensor-js.xyz/demo.html

011000113y ago

You are confused my friend.

1 more reply

Nomentatus3y ago

1 more reply

vhold3y ago

So it seems like a totally ridiculous argument that ultrasonic sensors create some kind of data processing overload.

https://www.pcmag.com/news/tesla-removes-ultrasonic-sensors-...

011000113y ago

another_devy3y ago

nelox3y ago· 4 in thread

stonogo3y ago

djleni3y ago

THANK YOU. Reading this thread was getting to me because so many comments say humans drive eyes only.

I use far more than just vision driving:

- touch (steering feedback, gives information about grip in some circumstances)

- acceleration (can feel if the rear tires break loose in a turn on snow or ice, or if I’m sliding while breaking)

And probably many more

AlotOfReading3y ago

2 more replies

mola3y ago

mongol3y ago· 4 in thread

What is it that makes Lidar so expensive? Is it something intrinsic to the technology that prevents costs from coming down?

fooblaster3y ago

diskzero3y ago

Nasrudith3y ago

They are also fairly power-hungry from what I heard.

frxx3y ago

There are also solid state LIDARs these days which involve no moving parts. Still use more power than radar though.

dreamcompiler3y ago· 3 in thread

It's obviously a stupid decision to remove a direct source of range data (radar and ultrasound) in favor of an indirect one (vision).

dmix3y ago

Even though the risks are high the outcomes will always keep them honest. Whether they like it or not.

This isn't like Facebook continually releasing a product that sucks but people will use anyway.

The good news is Tesla has the ability to cripple this feature remotely without a costly/lengthy recall if that does happen.

AlotOfReading3y ago

1 more reply

epgui3y ago

Even if you were right, there's nothing "obvious" about that.

hbarka3y ago· 2 in thread

Intuition + other examples tell you that radar and ultrasonic sensors work. Why do we twist ourselves to believe otherwise?

[1] https://youtu.be/LS3Vk0NPFDE

mgoetzke3y ago

So all those engineers are lying when they talk about this topic ?

9935c101ab17a663y ago

Uh, which engineers are you referring to?

oxplot3y ago· 2 in thread

Here's the summary (mixed with observations from Munro and past Tesla presentations):

- Costs money: the physical sensors (a dozen of them), wiring it up, assembling it, maintain inventory, code it, etc.

My take:

In case of ultrasonic sensors, they are for low speed cases anyway and most people are fine without them. Majority of fatalities and injuries happen at higher speeds.

rootusrootus3y ago

That's great for them. But when I'm shopping for a car, I get to choose between a manufacturer that installs the extra sensors and seems to be able to get them to work, and Tesla.

oxplot3y ago

> Now, if you want the best EV, it's usually not going to be a Tesla.

Would love to hear what you consider "good" and what specific EV ticks the most good features that a Tesla Model 3 doesn't.

2 more replies

a-dub3y ago· 2 in thread

i'm not sure if i buy his argument that the "delta is not big enough." i have some experience with realtime ai systems and i've noticed something interesting about them.

for example: gpt-3 can write you a shell script that will emit a c program that prints a poem about people you know, but will fail at very basic logic, sometimes.

in light of that, having additional support data like radar or lidar seems like the right move for plugging all those little holes in capability that turn up in real ai systems.

Nomentatus3y ago

api3y ago

3 more replies

JaggerJo3y ago· 2 in thread

This sucks for parking. It is simply (physically) not possible for the existing cameras to see the area directly in front of the car.

So how would this work for parking?

A: Add more cameras so there are no dead areas in front of the car

B: build a model in vector space when driving towards a parking spot and assume blind spots don't change. (still sucks)

oxplot3y ago

> It is simply (physically) not possible for the existing cameras to see the area directly in front of the car.

I think people keep forgetting that Teslas run hundreds of ML prediction tasks all the time. Watch recent AI day and their talks about "occupancy network" to get a sense of the car's ability to:

1. Construct 3D model of its surrounding in real time; 2. Remember occluded sections based on what's it's seen previously.

watwut3y ago

Human driver constantly turns head around to where he is Mos likely to hit something.

1 more reply

friend_and_foe3y ago· 2 in thread

P_I_Staker3y ago

> such a practical company like Tesla

Where are you getting that from? Tesla has always seemed pie in the sky, and hardly a down to earth company at all throughout the history.

I'm basing this one both their public record, and reputation within the auto industry.

friend_and_foe3y ago

Tesla created the modern electric vehicle industry. They're innovators, sure, and they push limits, but their priority has always been to actually build. And they do.

lawrenceyan3y ago· 2 in thread

I can see a path where with only cameras, Tesla might be able to reach level 4 autonomy in perfect conditions.

But the biggest thing that comes to mind is what happens at night. Are they only going to enable self-driving during the day?

speedgoose3y ago

Wouldn’t turning on the headlights fix the problem at night?

Snow and ice may be another challenge but night sounds easy.

lawrenceyan3y ago

When you drive at night with headlights, tell me honestly how confident you feel driving versus during the daytime.

yreg3y ago· 1 in thread

They are pretending as if the USS were there only for self driving.

I use them as well!

georgeg233y ago

Indeed the ultrasonic sensors are pretty critical for (human) parking and backing up.

post_break3y ago· 1 in thread

I think this is the real reason: https://www.youtube.com/watch?v=LS3Vk0NPFDE

Cost cutting.

taf23y ago

nova220333y ago· 1 in thread

At 2:05. Suddenly you need a column in your sqlite telling you what type of sensor it is....

Seriously? This is a major technical challenge?

danpalmer3y ago

The challenge isn't the storing of the flag that says which sensor it has, it's testing the combinations, training for the different scenarios, treating the incoming data differently, and so on.

0xfffafaCrash3y ago· 1 in thread

Nomentatus3y ago

nielsbot3y ago· 1 in thread

All I heard was "cost savings, cost savings, cost savings"

oxplot3y ago

Well watch it again and again and again. He talks about the determental effect of lo-fi sensors in conjunction with vision among other things.

justapassenger3y ago· 1 in thread

TL;DW.

Tesla doesn’t know how to do change management.

Nomentatus3y ago

1 more reply

throwaway4good3y ago

Let me reframe the answer:

"We removed them because they cost money. And we are trying to make money ... at least right now.

Listen, this pure autonomous self-driving car stuff is never going to work, so who cares if we have these gadgets or not ..."

Animats3y ago

dane-pgp3y ago

60Vhipx7b4JL3y ago

From an engineering perspective I would ask: Can your sensor package understand the environment to the required (low) failure rate?

If you just rely on the fleet, you rely on the things you have seen. What about the objects that you have not yet seen?

gnicholas3y ago

xnx3y ago

Sensor fusion seems to be another thing that Tesla is not good at.

EVa5I7bHFq9mnYK3y ago

jakeogh3y ago

Giving human[1][2] drivers better situational awareness[3] is the future. Specifically open[4]:

a. Windshields that clean the inside as well as the outside.

b. Better eyeglasses[5].

c. User controllable hi-res HUD thermal IR overlay.

e. Brake control[6].

Any entity capable of driving[7] in a population of humans (including adversarial humans) is sentient[8], and has real skin in the game. It would be unethical to lock one in a car:

[1] https://news.ycombinator.com/item?id=33213860 (analog FPGA)

[2] https://news.ycombinator.com/item?id=21106367 (general AI)

[3] https://news.ycombinator.com/item?id=16646112 (2018)

[4] https://www.tesla.com/blog/all-our-patent-are-belong-you (2014)

[5] https://patents.google.com/patent/US7744217 (2007)

[6] https://news.ycombinator.com/item?id=18013388 (2018)

[7] no human behind the wheel, no human to correct impending mistakes, but (critically) with one or more humans in the car.

[8] The idea that non-biological machines can have 'self' is a window into modern mass transformation. Please checkout the analog FPGA experiments linked above.

danbmil993y ago

Musk is also famously against using lidar. He doesn't understand/accept that an autonomous vehicle needs any sensors that humans do not posess.

sidcool3y ago

I feel that was more an operational answer than an engineering one .. I still feel that depth perception of vision alone is unreliable.

dncornholio3y ago

Just remember folks, we will have full self driving vehicles by the end of this year!

ra73y ago

ornel3y ago

Video summary:

https://www.summarize.tech/www.youtube.com/watch?v=_W1JBAfV4...

superkuh3y ago

mavili3y ago

Did anyone else catch Andrej's "sqlite" comment? If that is not just a simple analogy, Tesla may be using sqlite in their cars? :D

sgjohnson3y ago

Does this mean that now when someone smashes one of their bumpers on a Tesla, the insurance will no longer have to total the entire vehicle?

solardev3y ago

Cuz Muskdaddy wanted mo money. There, mystery solved.

julienreszka3y ago

Geohot said something similar years ago already

smrtinsert3y ago

Hm I'd rather have someone from Twitter audit this decision

bigtex3y ago

Did Lex ask him why Tesla love to crash into emergency vehicles?

1 more reply

KVFinn3y ago

TLDR: Tesla thinks LIDAR hardware is more expensive than the performance improvement it provides.

j / k navigate · click thread line to collapse