AI That Sees
Close your eyes for a moment. Now open them. In less than a blink, your brain figured out where you are, what objects are around you, how far away things are, and whether anyone you know is nearby. Your eyes and brain work together so fast and so well that you do not even notice how incredible it is. Now here is the remarkable part: scientists have been working for decades to teach computers to do something similar — to look at a picture or a video and understand what is in it. They call this computer vision. And today, AI that sees is all around us in ways you might not realize.
How Does AI See?
When you look at a photo of a dog, you instantly know it is a dog. But a computer sees something completely different: millions of tiny colored dots called pixels. A pixel is just a number that represents a color. A photo of a dog is really just a giant list of numbers to a computer. So how does AI turn those numbers into understanding? By learning from examples. Scientists show AI millions of labeled photos — this is a dog, this is a cat, this is a flower, this is a car. The AI studies all those examples and finds patterns in the numbers. It figures out which combinations of pixels tend to mean dog, which mean cat, and so on. After studying millions of examples, the AI can look at a brand-new photo it has never seen before and make a very good guess about what is in it. That is seeing with AI!
AI that sees learns by studying millions of labeled photos. It finds patterns in the number-dots that make up pictures, and uses those patterns to recognize objects in brand-new images it has never seen before.
Computer vision is used in so many places today. Your phone might use it to unlock with your face — AI recognizes your unique facial features and knows it is you. Self-driving cars use computer vision to see the road, spot other cars, read street signs, and notice pedestrians stepping off the sidewalk. Doctors use AI vision to look at X-rays and scans. The AI has studied thousands of images labeled with different conditions, so it can spot patterns that might be hard to notice. Farmer robots use computer vision to look at crops and identify which plants need more water or are getting sick — helping grow food more efficiently. When you search for photos online, computer vision helps sort and find the right images. When a website automatically adds captions to your uploaded pictures, that is computer vision too.
Match each AI vision use to what it does.
Terms
Definitions
Drag terms onto their definitions, or click a term then click a definition to match.
AI vision is not perfect. It can make mistakes — especially when it sees something unusual that was not well-represented in its training photos. Imagine an AI trained mostly on photos taken in bright daylight. Give it a dark or blurry photo and it might struggle. That is why scientists keep improving AI vision by giving it more diverse training examples — photos taken in all kinds of light, from many angles, in different conditions. The more varied the examples, the better the AI gets at handling the real messy world. Right now, the best AI vision systems can recognize objects in photos with extraordinary accuracy — in many specific tasks, matching or even surpassing what people can do. That progress happened in just the last few years. Imagine what the next few years will bring!
Every photo on your screen is made of tiny colored squares called pixels. Your phone screen might have millions of them! AI sees a photo as a giant list of numbers — one number for each pixel's color. Finding patterns in those numbers is how AI vision works.
What does a computer actually see when it looks at a photo?
How does AI that sees get better at recognizing things in photos?
Pixel Art Detective
- Get a piece of graph paper, or draw a 10-by-10 grid on regular paper.
- Fill in each square with one of three colors — let us say red, blue, and white.
- Make a simple hidden picture using those three colors — like a heart, a star, or an arrow.
- Swap your grid with a friend or family member without telling them what you drew.
- See if they can figure out the hidden picture!
- Talk about it: how is figuring out the hidden picture in your grid like how AI figures out what is in a photo by looking at its pixels?