Class 7 · CBSE AI · Strand C — NLP, Vision, and LLMs Deep-Dive
How computer vision works — convolution explained intuitively
CNNs spot patterns by sliding small filters across an image. The intuition behind computer vision. For Class 7.
Class 7 · CBSE AI · Strand C — NLP, Vision, and LLMs Deep-Dive
CNNs spot patterns by sliding small filters across an image. The intuition behind computer vision. For Class 7.
Rangoli making
Imagine a tiny template (like a small rangoli stamp) that you slide across a large blank floor, pressing it down at each position to see where it 'matches' the pattern you're looking for. A convolutional filter works exactly like this stamp — sliding it across the image and recording a strong response wherever the pattern it encodes appears.
Finding a specific motif in a saree border
To find where a particular flower motif repeats in a saree's border, you'd slide your finger along comparing each section to the flower in your memory. That systematic sliding-and-comparing is convolution — your mental image of the flower is the filter, and each position in the border is a location you're testing.
Every Dhee Learning session for this concept follows three stages. We share the questions Dhee actually asks, so you can hear what a session sounds like.
Stage 1 — Surface
How do you recognise a tiger, whether you see it in the top-left corner of a photo or the bottom-right corner? What does that tell you about what your brain is doing when it recognises objects?
Rote answer
"A convolutional neural network uses filters to detect features in images."
Understood
"I recognise the tiger by its stripes, orange colour, and shape — regardless of where in the image it appears. My brain has learned a 'tiger detector' pattern that it checks everywhere. That's exactly what a convolutional filter does — it's a pattern detector applied at every location."
Stage 2 — Reasoning
A CNN trained to detect faces fails badly when tested on upside-down faces, even though an upside-down face has exactly the same pixels — just flipped. Why does this reveal something important about what CNNs are actually learning?
Follow-up Dhee may use: How would you fix this? Would you change the architecture, the training data, or both?
Stage 3 — Application
You're building a vision system to identify damaged potholes on Indian roads from smartphone photos. List three specific image characteristics that would make your convolution-based model's job harder than detecting faces.
Misconception Dhee watches for: Assuming that any image classifier can detect any object with enough data — the choice of architecture and training strategy needs to match the specific visual properties of the target object.
Dhee turns this concept into a 15-minute spoken session — asking, listening, and probing — so your child builds the idea themselves.
CNNs spot patterns by sliding small filters across an image. The intuition behind computer vision. For Class 7.
CNNs 'see' images the way humans do — they detect statistical patterns in pixel matrices, with no perceptual experience or semantic understanding of what they are detecting.
Dhee opens with a question — for example: "How do you recognise a tiger, whether you see it in the top-left corner of a photo or the bottom-right corner? What does that tell you about what your brain is doing when it recognises objects?" — listens to your child's answer, then probes the reasoning behind it. The session ends when the child can apply the idea to a brand-new situation, not just recall it.