

It is possible, but it is not error-free. For example, there is no way to tell the distance to a featureless wall that fills the entire field of view. But Teslas don’t even have stereoscopic vision. The moment you rely on neural networks and monoscopic vision, you become susceptible to all the visual illusions that affect humans. And that is on top of the system not being as good as humans at processing visual information.
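To make the featureless-wall point concrete, here is a rough sketch of how a rectified stereo pair turns a matched feature into a distance. It uses the standard pinhole relation Z = f * B / d; the function name and the numbers (focal length in pixels, baseline in metres) are made up for illustration and do not describe any real car.

```python
from typing import Optional


def stereo_depth_m(focal_px: float, baseline_m: float,
                   disparity_px: Optional[float]) -> Optional[float]:
    """Depth of a matched point from a rectified stereo pair: Z = f * B / d.

    disparity_px is the horizontal shift of the same feature between the
    left and right images. None means the matcher found nothing to match.
    """
    if disparity_px is None or disparity_px <= 0:
        # A featureless wall gives the matcher nothing to lock onto,
        # so there is no disparity and therefore no depth.
        return None
    return focal_px * baseline_m / disparity_px


# Hypothetical rig: 1000 px focal length, 0.5 m between the cameras.
textured_object = stereo_depth_m(1000.0, 0.5, 10.0)   # a feature shifted by 10 px
blank_wall = stereo_depth_m(1000.0, 0.5, None)        # nothing could be matched
print(textured_object, blank_wall)
```

The whole calculation hinges on finding the same point in both images. With no texture there is no match, and the formula has nothing to work with.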

To have stereoscopic vision you need two cameras with the same optics. Tesla has three front-facing cameras, each with a different field of view, and they are too close together. But Tesla never claimed to have stereo vision. You don’t need it on the road: all the cars are roughly the same size, so it is easy to “guess” the distance to them. I just wouldn’t trust it with anything off the road.
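The "guess" from a single camera can be sketched the same way. If you assume how wide an object really is, its width in pixels gives you a distance through the pinhole model. This is only an illustration of the idea, not how any actual system is implemented; the 1.8 m car width, the 1000 px focal length, and the function name are all assumptions.

```python
def mono_distance_m(focal_px: float, assumed_width_m: float,
                    width_px: float) -> float:
    """Distance to an object of assumed real width: Z = f * W / w."""
    if width_px <= 0:
        raise ValueError("object must have a positive width in the image")
    return focal_px * assumed_width_m / width_px


FOCAL_PX = 1000.0            # hypothetical focal length, in pixels
ASSUMED_CAR_WIDTH_M = 1.8    # assumed typical car width

# A car that appears 60 px wide: the guess is about 30 m, and it is a good one
# as long as the car really is about 1.8 m wide.
car_estimate = mono_distance_m(FOCAL_PX, ASSUMED_CAR_WIDTH_M, 60.0)

# An unknown 0.9 m wide object that also appears 60 px wide is really
# about 15 m away, but the car-sized assumption still says about 30 m.
true_distance = mono_distance_m(FOCAL_PX, 0.9, 60.0)
print(car_estimate, true_distance)
```

The error scales directly with how wrong the size assumption is, which is why this works tolerably among cars and badly among objects of unknown size.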