Study AIF-C01 GenAI Concepts, Tokens, Embeddings and Models: key concepts, common traps, and exam decision cues.
This lesson gives you the GenAI vocabulary that appears all over AIF-C01. AWS wants you to know what tokens, embeddings, vector search, modalities, and foundation models are before it asks you to choose patterns like RAG or fine-tuning.
Token: Unit of text or other model input/output used in model processing and pricing.
Embedding: Numeric representation that captures semantic similarity so related items sit closer together in vector space.
Context window: The amount of input and output context a model can consider in one interaction.
| Term | Best mental model |
|---|---|
| Foundation model | broad pretrained model usable across many tasks |
| Token | input/output unit that affects context and cost |
| Embedding | semantic representation for similarity and retrieval |
| Multimodal model | model that works across more than one content type |
| Context window | amount of input a model can consider at once |
These are not isolated glossary words. AIF-C01 uses them as a chain:
| Concept | Better reading |
|---|---|
| embedding | numerical meaning representation |
| vector store | storage and search layer for embeddings |
| retrieval | finding semantically relevant content |
| grounding | using trusted retrieved content to improve factual answers |
A team wants users to search thousands of internal documents by meaning rather than by exact keyword matches. Which concept is the strongest foundation for that design?
Correct answer: A. Semantic retrieval depends on turning content into embeddings that can be searched by similarity.