Study AIF-C01 FM Selection, Inference Parameters, RAG and Agents: key concepts, common traps, and exam decision cues.
This lesson covers the main application patterns AWS wants candidates to classify correctly. The exam often asks whether the stronger answer is a different model, a different inference configuration, a retrieval step, or an agentic workflow.
RAG: Retrieval-augmented generation, where the system retrieves relevant context before the model generates an answer.
Agent: Model-driven workflow component that can reason about steps and call tools or actions.
Inference parameters: Runtime settings such as temperature that shape how deterministic or creative the output becomes.
| Need | Strongest first fit |
|---|---|
| fresher answers grounded in business documents | RAG |
| sequential tool-using workflow | agent pattern |
| shorter, more deterministic output | lower creativity / more constrained inference settings |
| broader creative generation | more open-ended inference behavior |
| If the problem is mainly about… | Better reading |
|---|---|
| stale or missing business knowledge | add retrieval and grounding |
| multi-step action with tool use | consider an agent pattern |
| output randomness or creativity | inspect inference settings first |
| business fit across cost, latency, and modality | revisit model selection |
Many AIF-C01 questions are not really about one AWS product. They are about picking the right FM application pattern for the risk, freshness, and workflow requirement.
A support assistant answers well most of the time, but it keeps missing new policy updates stored in internal documents. The team does not need tool calling or multi-step action. Which change is strongest first?
Correct answer: A. The issue is missing current knowledge, which points first to retrieval and grounding rather than agency.