Study Azure AI-900 Training Basics: key concepts, common traps, and exam decision cues.
This part of AI-900 is about whether you understand how the data supports the model. You do not need MLOps detail. You do need to know what features and labels are, why validation matters, and what it means when a model memorizes instead of generalizes.
Generalization: A model’s ability to perform well on new data rather than only on the data it saw during training.
| Term | Meaning | Fast exam anchor |
|---|---|---|
| feature | input used by the model | what the model reads |
| label | known target for supervised learning | what the model learns to predict |
| training data | examples used to fit the model | learning stage |
| validation data | separate data used to check generalization | performance check on unseen examples |
| Stage | What happens |
|---|---|
| training | the model learns patterns from historical examples |
| validation | the model is tested on data it did not simply memorize |
| inference | the trained model is used to predict on new input |
If the model performs very well on training data but poorly on new data, AI-900 expects you to recognize overfitting.
AI-900 keeps this conceptual. The goal is not to discuss every data-preparation technique. The goal is to recognize that:
| Mistake | Better rule |
|---|---|
| calling the target column a feature | the target in supervised learning is the label |
| treating validation data as more training data | validation exists to test generalization |
| assuming high training accuracy proves the model is good | high training performance can still hide overfitting |