“Just doing this, building a computer vision stack that’s powerful enough to splice the world in its components is something that will get us to a very basic semantic understanding of a scene. Need lots of data. But we do have tons of images, all the videos on YouTube. These are immediate business opportunities.”
“Once we’ve gained that sort of, once we are capable of explaining what happens in the picture, we would like to apply this at scale to all the images that we have on the internet, for example. To try to learn or teach computers what happens for real in the world.”
“The computer essentially lets you see a better version of the world than your normal eyes can see, right? Imagine you know this product, this is this is making life so much better.”