AI Glossary: Key Terms Like Hallucinations, Bias, and Generative AI Explained

AI is moving fast—and the language around it is moving even faster. If you’ve tried to keep up with product launches, policy debates, or even casual conversations among coworkers, you’ve probably noticed the same thing: the terms multiply, the definitions shift depending on who’s speaking, and suddenly everyone is using the same handful of words—bias, hallucinations, prompts, models—as if they’re universally understood. They aren’t. And that matters, because how people define these terms often determines what they think the technology can do, what risks they believe are real, and what they expect from vendors.

What follows is a deeper, more practical guide to the AI vocabulary that shows up again and again. Think of it less as a dictionary and more as a way to decode what someone is really saying when they use these phrases. The goal isn’t to make you an AI engineer overnight. It’s to help you read the news with fewer assumptions—and more clarity.

At the center of the conversation is a deceptively simple question: what does “AI” actually mean?

AI (Artificial Intelligence): the umbrella that hides the details
“AI” is the broad label for technologies that perform tasks associated with human intelligence—things like understanding language, recognizing images, planning actions, or making recommendations. But “AI” doesn’t tell you which approach is being used. A system that classifies spam emails is AI. So is a system that generates a novel paragraph in response to a prompt. So is a system that controls a robot arm in a warehouse.

In other words, “AI” is an umbrella term. When someone says “AI,” ask what kind: is it predictive (forecasting outcomes), generative (creating new content), or decision-making (choosing actions)? Is it trained on historical data, or does it interact with the world in real time? Is it optimized for accuracy, speed, cost, or safety? Without those details, “AI” can become marketing shorthand rather than a technical description.

Machine Learning (ML): where most modern AI comes from
Most of the AI you encounter today is powered by machine learning. ML is a subset of AI where systems learn patterns from data rather than being explicitly programmed with rules for every scenario. Instead of writing “if this then that” logic for every possible case, developers provide examples—inputs and desired outputs—and the system learns statistical relationships.

This is why ML is so closely tied to the quality and representativeness of data. If the training data reflects the real world poorly, the model may learn shortcuts. If the data is biased, the model can reproduce that bias. If the data is incomplete, the model may fail in edge cases.

A useful way to interpret ML news is to look for the phrase “trained on.” When a company claims performance improvements, the underlying story is often about training data, model architecture, fine-tuning, or evaluation methods—not magic.

Generative AI: the content-creating branch of the family tree
Generative AI is the part of AI that creates new content: text, images, audio, video, code, and more. Unlike traditional ML systems that might classify something into categories, generative systems produce outputs that weren’t directly present in the training set. They generate based on learned patterns.

This is where the public imagination took off. Generative AI tools can draft emails, summarize documents, write marketing copy, translate languages, and assist with coding. But generative AI is also where misunderstandings are most common. People sometimes treat generated text as if it were a database lookup or a guaranteed truth engine. In reality, generative models are typically optimized to produce plausible continuations of language (or plausible structures in images, audio, etc.), not to verify facts unless they’re explicitly connected to retrieval systems or grounded sources.

That distinction—plausibility versus correctness—is one of the most important ideas in the entire AI glossary.

Prompt: the instruction that shapes the output
A prompt is the input you give a generative AI system: a question, an instruction, a role (“You are a legal assistant”), constraints (“Use bullet points”), or context (“Here’s the document…”). Prompts are often treated like a simple interface, but they’re more like steering wheels.

Two prompts that look similar can lead to very different outputs because the model interprets them as signals about intent, format, and priorities. Prompt engineering—the practice of crafting prompts to get better results—has become its own mini-industry. But it’s also a reminder that generative AI is sensitive to framing. If you want reliable outcomes, you don’t just ask for an answer; you specify what “good” looks like.

In news coverage, prompts are sometimes described as “just a user input.” That’s true technically, but incomplete practically. Prompts are where user goals meet model behavior. They’re also where ambiguity can creep in. If your prompt is vague, the model will fill gaps with its learned patterns—sometimes correctly, sometimes not.

Model: the trained engine behind the curtain
A model is the underlying system trained on data to recognize patterns and generate outputs. When people talk about “the model,” they’re usually referring to a neural network architecture that has been trained through a process that adjusts internal parameters to reduce errors on training objectives.

Models vary widely. Some are small and specialized; others are large and general-purpose. Some are trained from scratch; others are pre-trained and then fine-tuned for specific tasks. Some are designed to follow instructions; others are better at generating fluent text but require additional scaffolding to behave safely.

When a company announces “a new model,” the announcement can mean many things: a different architecture, more training data, improved training techniques, better alignment to human preferences, or changes in how the system is deployed. The headline rarely captures the full story. If you want to understand what changed, look for details about training, evaluation, and safety measures.

Training Data: the raw material that shapes behavior
Training data is the dataset used to teach an AI system how to behave or what patterns to learn. This includes not only the content itself (text, images, code) but also how it’s curated, labeled, filtered, and balanced.

Training data influences everything: language style, knowledge coverage, and the kinds of mistakes the model makes. It also influences bias. If certain groups, dialects, or contexts are underrepresented, the model may perform worse for those cases. If the data contains stereotypes or skewed associations, the model may reproduce them.

One reason AI debates get heated is that training data is often opaque. Companies may not disclose exactly what was used, how it was cleaned, or how it was weighted. That opacity makes it harder for outsiders to evaluate fairness, robustness, and potential harms.

Bias: when the model’s “learning” becomes unfair outcomes
Bias in AI refers to systematic errors or unfair results that stem from the data, the training process, or the way the system is evaluated and deployed. Bias can show up as inaccurate predictions for certain groups, uneven performance across demographics, or outputs that reflect harmful stereotypes.

It’s tempting to think of bias as a moral flaw in the model. But bias is usually a property of the pipeline: the data collected, the labels assigned, the objective function used during training, and the metrics used to judge success. Even a technically “accurate” model can be biased if accuracy is measured in a way that ignores harm or unequal impact.

Bias is also not always obvious. A model might appear to work well overall while failing quietly in specific subpopulations. That’s why fairness evaluation is increasingly discussed alongside performance benchmarks. In news coverage, watch for whether companies talk about subgroup testing, error analysis, and mitigation strategies—or whether they only report aggregate scores.

Hallucination: plausible-sounding errors that can mislead
Hallucination is one of the most widely used AI terms, and also one of the most misunderstood. In everyday conversation, “hallucination” often means “the AI made something up.” Technically, it refers to outputs that are generated confidently but are incorrect, ungrounded, or not supported by the provided context.

The key word is plausible. Generative models are trained to produce coherent text. Coherence is not the same as truth. A model can generate a detailed explanation that sounds right while being wrong—especially when it lacks access to verified sources or when the prompt asks for information outside its training knowledge.

Hallucinations are more likely when:
1) the prompt is underspecified,
2) the user asks for niche or rapidly changing facts,
3) the system isn’t connected to retrieval or fact-checking tools,
4) the model is forced to answer despite uncertainty.

This is why “hallucination” is often paired with “grounding” and “retrieval.” Grounding means tying outputs to reliable sources—such as documents the user provides or information retrieved from trusted databases. Retrieval-augmented generation (RAG) is one common approach: the system fetches relevant information first, then generates an answer based on that material. When done well, it reduces hallucinations by limiting what the model can invent.

But grounding isn’t a magic shield. Retrieval can return irrelevant or incomplete information. Sources can be outdated. And even with retrieved context, the model can still misinterpret or overgeneralize. The best systems combine grounding with careful evaluation and user-facing transparency.

Why these terms matter beyond definitions
If you only memorize the glossary, you’ll still miss the real story. The real story is how these concepts connect to risk, reliability, and accountability.

Consider the chain of events in a typical generative AI workflow:
1) A user writes a prompt.
2) The system uses a model to generate an output.
3) The model’s behavior is shaped by training data and training objectives.
4) The output may be influenced by biases learned from data.
5) The output may include hallucinations if it isn’t grounded or if uncertainty isn’t handled properly.

Each step introduces opportunities for failure—and each failure mode maps to a different mitigation strategy. If the problem