Study AIF-C01 AI, ML and Data Fundamentals: key concepts, common traps, and exam decision cues.
AIF-C01 begins by testing whether you can keep the core AI language straight. The exam is not asking you to derive model equations. It is asking whether you know what AI, ML, deep learning, training, inference, features, labels, and data types mean so later service and use-case questions stay grounded.
Inference: Using a trained model to produce an output on new input data.
Feature: Input variable the model uses to learn patterns or make predictions.
Label: The known correct outcome used in supervised learning.
| Term | Best mental model |
|---|---|
| Artificial intelligence | broad field of systems that perform tasks associated with human intelligence |
| Machine learning | subset of AI that learns patterns from data |
| Deep learning | subset of ML that uses neural-network approaches |
| Training | process of learning from data |
| Inference | using the trained model on new data |
AWS expects you to recognize structured, semi-structured, and unstructured data at a broad level because the shape of the data often changes what kind of AI path even makes sense.
| Data shape | What it usually looks like | Why it matters on the exam |
|---|---|---|
| Structured | rows, columns, predictable fields | good fit for many classic ML use cases |
| Semi-structured | JSON, logs, tagged content | still machine-readable, but less rigid |
| Unstructured | free text, images, audio, video | often points toward NLP, vision, speech, or GenAI patterns |
Candidates often blur the two. AIF-C01 keeps bringing this distinction back because it changes both the use case and the AWS service fit:
This lesson is not only vocabulary. It is testing whether you can:
A company has historical loan-application data with known repayment outcomes and wants to predict whether a new applicant is likely to repay. Which ideas should you classify first?
Correct answer: A. The scenario gives labeled historical outcomes and a prediction goal, which points to supervised ML and a clear training-versus-inference split.