Class 4 · CBSE AI · Strand C — Garbage In, Garbage Out
Garbage In, Garbage Out — what makes AI data good
If you teach an AI from bad examples, it learns bad habits. The most important lesson in AI.
Class 4 · CBSE AI · Strand C — Garbage In, Garbage Out
If you teach an AI from bad examples, it learns bad habits. The most important lesson in AI.
Cooking / kitchen
If you put stale, rotten vegetables into a curry, no amount of good cooking will make it taste right. Data is like the ingredients — the dish (the AI) can only be as good as what you put in.
Cricket scoreboard
Imagine the scorekeeper writing random runs on the board instead of the actual score. At the end of the match, everyone would think the wrong team won. That scoreboard is 'garbage data'.
Every Dhee session for this concept follows three stages. We share the questions Dhee actually asks, so you can hear what a session sounds like.
Stage 1 — Surface
If you asked 10 friends what their favourite food is but wrote down the answers randomly without listening — would that be good data or bad data? Why?
Rote answer
"Bad data because it is incorrect."
Understood
"It would be bad because the answers don't actually match what my friends said, so any conclusion I draw would be wrong too."
Stage 2 — Reasoning
Imagine an AI is trained to recommend medicines to sick people, but the data it learned from had many wrong entries. What could go wrong?
Follow-up Dhee may use: Think about it this way — if you studied from a textbook with wrong answers, what would happen when you took a test?
Stage 3 — Application
Your school wants to build an AI that suggests what snacks to keep in the canteen. What would 'good data' look like for this AI, and what would 'bad data' look like?
Misconception Dhee watches for: Thinking that more data always means better data, regardless of its accuracy or fairness.
Spark turns this concept into a 15-minute spoken session — asking, listening, and probing — so your child builds the idea themselves.
If you teach an AI from bad examples, it learns bad habits. The most important lesson in AI.
More data always means better AI — in reality, a large amount of inaccurate data is worse than a small amount of accurate data.
Dhee opens with a question — for example: "If you asked 10 friends what their favourite food is but wrote down the answers randomly without listening — would that be good data or bad data? Why?" — listens to your child's answer, then probes the reasoning behind it. The session ends when the child can apply the idea to a brand-new situation, not just recall it.