Why this matters more than "don't talk to strangers"

Your child will interact with AI chatbots for the rest of their life. Every school assignment, every late-night curiosity, every piece of homework help, and eventually every professional task will involve AI in some form. The most important skill they can develop right now is not "never use AI" - every parent who has tried that rule with their teenagers knows how it ends. The most important skill is learning to evaluate whether an AI's output is safe, accurate, and worth trusting in the specific context where it appears.

That skill has a formal name in AI research: evaluation. It is the ability to look at what a model produces and make a judgment about whether it is reliable. Adults who work with AI professionally develop this instinct over months of use. Kids who learn to evaluate AI output at age 10 will have it as a native instinct at age 20, when it will matter for their careers, their health decisions, and their ability to navigate a world where AI-generated content is everywhere and not uniformly trustworthy.

This game teaches evaluation in 60 minutes using a hands-on build-and-test loop that works better than any lecture or rule-based conversation. It does not tell your child what to think about AI. It gives them a framework for thinking about AI themselves. That is a much more durable skill.

The 60-minute activity

Minutes 0 to 15: Collect training data

With your child, collect 25 to 30 example responses from real AI chatbots. You want a mix: helpful, accurate responses and responses that contain bad advice, unsafe recommendations, or factual errors. Use a family-friendly chatbot for this step and collect responses to benign questions (what to do when you feel sick, how to handle a conflict with a friend, what is a good snack before bed). Print or write each response on its own index card. Sort them together into two categories: "Safe AI response" and "Risky AI response."

Spend time on this categorization step. Ask your child why they put each card where they did. The discussion about what makes a response risky or safe is as valuable as the rest of the activity. Common patterns they will notice: risky responses sound confident about things the AI cannot know, give advice without important caveats, or suggest something that could cause harm without flagging the risk. Safe responses acknowledge uncertainty, recommend talking to an adult for important decisions, and match what your child already knows to be true.

Minutes 15 to 35: Train the model (Teachable Machine)

Go to teachablemachine.withgoogle.com. No account required, no download required, free to use. Create a new "Text" project (under New Project, choose Text). Create two classes: "Safe" and "Risky." Add 20 to 25 of the index card examples to the appropriate class by typing or pasting each one. Click "Train Model." The training takes 30 to 90 seconds depending on the number of examples. When training completes, test the model with 5 new examples your child has not seen yet. Record how many it gets right.

The accuracy of the first model is almost never perfect, and that is exactly the point. A model trained on 20 examples with inconsistent categories will make mistakes. Those mistakes are the most valuable moments of the whole activity.
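If you are curious what the Train Model button is doing under the hood, here is a rough stand-in for the same train-then-test loop in Python. It is a sketch, not Teachable Machine's actual internals: it assumes the scikit-learn library, and the example responses are invented placeholders for your index cards.

```python
# A rough stand-in for the train-then-test loop in the activity.
# Assumes scikit-learn is installed (pip install scikit-learn);
# the responses below are invented placeholders for your index cards.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

train_texts = [
    "Rest, drink water, and tell a parent if you still feel sick tomorrow.",
    "Skip dinner for a week and you will definitely lose weight fast.",
    "Talk to your friend calmly, and ask an adult if it keeps happening.",
    "Ignore the allergy warning, a little bit never hurts anyone.",
    # ...in the real activity you would paste in all 20 to 25 cards
]
train_labels = ["Safe", "Risky", "Safe", "Risky"]

# Turn each response into weighted word-count features, then fit a classifier.
model = make_pipeline(TfidfVectorizer(), LogisticRegression())
model.fit(train_texts, train_labels)

# Test on examples the model has never seen, just like the end of this step.
test_texts = [
    "A banana is a good snack before bed.",
    "Take extra medicine if one dose does not seem to work right away.",
]
test_labels = ["Safe", "Risky"]
print("Held-out accuracy:", model.score(test_texts, test_labels))
```

With this few training examples the accuracy will bounce around from run to run, which mirrors what you see on screen: small, inconsistent training data makes a shaky model.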

Minutes 35 to 50: Troubleshoot with GPT-5.5

When the model gets an example wrong, paste that example into GPT-5.5 and run this prompt with your child:

I am training an AI to tell apart safe and risky chatbot answers. When I show it [EXAMPLE], it thinks it is [WRONG CATEGORY]. Why might my AI be confused? What could I change about the training examples?

GPT-5.5 will explain what features of the text the model is keying on and why those features are misleading it. Common explanations: the model learned to associate confident tone with "Safe" when that's not always true, or it associated short responses with "Risky" based on a coincidence in the training data. Each explanation gives your child an insight into how machine learning actually works: the model finds patterns in the data you gave it, not patterns in the real world. If your data had a flaw, the model learned the flaw.
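You can plant that kind of flaw deliberately and watch the model learn it. In the sketch below (same assumptions as before: scikit-learn, invented examples), every Risky card happens to contain the word "just," so the model treats "just" as a risk signal - a pattern in the cards, not in the world.

```python
# Planting a data flaw on purpose: every Risky card below contains the
# word "just," so the model latches onto that word instead of actual
# safety. Assumes scikit-learn and numpy; all examples are invented.
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

train_texts = [
    "Rest, drink water, and tell a parent if you still feel sick tomorrow.",
    "A calm conversation helps, and a teacher can step in if it continues.",
    "A piece of fruit is a fine snack before bed.",
    "Just ignore the symptoms, they always go away.",
    "Just yell louder until your friend backs down.",
    "Just eat all the candy you want before bed.",
]
train_labels = ["Safe", "Safe", "Safe", "Risky", "Risky", "Risky"]

flawed_model = make_pipeline(TfidfVectorizer(), LogisticRegression())
flawed_model.fit(train_texts, train_labels)

# Inspect which words push the model toward "Risky."
vec = flawed_model.named_steps["tfidfvectorizer"]
clf = flawed_model.named_steps["logisticregression"]
weights = clf.coef_[0]  # sign convention: positive favors clf.classes_[1]
if list(clf.classes_).index("Risky") == 0:
    weights = -weights  # flip so that positive always means "more Risky"
words = vec.get_feature_names_out()
top_risky = [words[i] for i in np.argsort(weights)[-5:][::-1]]
print(top_risky)  # "just" ranks high: a flaw in the cards, not a safety signal

# A perfectly safe suggestion that happens to start with "just":
print(flawed_model.predict(["Just take a break and ask a parent for help."]))
# Likely flagged Risky purely because of the planted coincidence.
```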

Minutes 50 to 60: Turn it into a family game

Pull out new examples (from the chatbot, from a search engine snippet, from a social media post your child has seen) and test them against the model. Before each example runs, every player predicts whether the model will get it right; whoever calls it wins the round. Keep score. The competitive element keeps energy high, but the conversation between rounds is where the learning sticks. Why did the model disagree with you? Who was right? What does that tell you about the model's training data?
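If you built the Python stand-in earlier, the scorekeeping takes a few lines. This sketch assumes a trained `model` like the one in the first code block; the rounds and their labels are invented examples.

```python
# A minimal scoring loop for the family game. Assumes a trained `model`
# like the one in the first sketch; the rounds and labels are invented.
rounds = [
    ("Drink plenty of water when you exercise.", "Safe"),
    ("Skipping sleep before a test is fine, caffeine fixes everything.", "Risky"),
]

score = 0
for text, true_label in rounds:
    call = input(f'Will the model get this one right? (y/n)\n  "{text}"\n> ')
    verdict = model.predict([text])[0]
    model_was_right = verdict == true_label
    print(f"Model said {verdict}; the right answer was {true_label}.")
    if (call.strip().lower() == "y") == model_was_right:
        score += 1  # a point for correctly calling the model's hit or miss
print(f"You called {score} of {len(rounds)} rounds.")
```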

The core lesson (repeat this out loud)

"AI is only as good as the data it sees. Bias is not a mystery - it is a math problem. If the data is skewed, the model is skewed. If the data is incomplete, the model is incomplete. Your job as someone who uses AI is to notice when the output doesn't match reality, ask why, and decide whether to trust it anyway."

Say that out loud during the activity, not before it. The phrase lands differently when your child has just watched their own model make a mistake for a reason they cannot explain. Abstract principles become concrete understanding after a hands-on demonstration.
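For parents who want to see "if the data is skewed, the model is skewed" as arithmetic, here is a toy demonstration under the same scikit-learn assumption, with invented examples: nine Safe cards and one Risky card teach the model a lopsided base rate.

```python
# "If the data is skewed, the model is skewed": a toy imbalance demo.
# Nine Safe cards and one Risky card teach the model that nearly
# everything is Safe. Assumes scikit-learn; all examples are invented.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

safe_cards = ["Rest, eat well, and ask an adult for help."] * 9
risky_cards = ["Ignore every warning label, rules never matter."]
skewed_model = make_pipeline(TfidfVectorizer(), LogisticRegression())
skewed_model.fit(safe_cards + risky_cards, ["Safe"] * 9 + ["Risky"])

# A clearly risky sentence built from words the model has never seen:
print(skewed_model.predict(["Swallow random pills you find outside."]))
# Usually "Safe": with none of its learned words present, the model
# falls back on the lopsided base rate from its training cards.
```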

Why this beats lectures

Children remember what they build and do not remember what they are told. The research on this is consistent across decades of educational science: active learning, where the student constructs knowledge through doing, produces retention rates three to five times higher than passive instruction. A lecture about AI bias will be forgotten by Tuesday. A model your child trained, broke, and fixed with their own hands will be referenced in conversations for months.

The game also transfers. After this activity, when your child gets an odd answer from a homework helper or a weird recommendation from a streaming service, they have a mental model for why that might happen: "Maybe the training data was bad." That framing - curious and analytical rather than either blindly trusting or reflexively suspicious - is exactly the relationship you want your child to develop with AI technology they will use for the next six decades.

This weekend: Do the activity. Keep the Teachable Machine model - it takes 30 seconds to reopen. Next time your child gets a strange answer from a chatbot, pull it up and ask: "Would our detector flag this as risky? Why or why not?" That one question, asked regularly over the next year, is how you raise a generation that uses AI with critical judgment instead of passive acceptance.
