AWS MLA-C01 Cheat Sheet: ML Pipelines, Endpoints, and Drift
March 28, 2026
AWS MLA-C01 cheat sheet for ML pipelines, endpoints, drift, traps, and final review.
On this page
Keep this cheat sheet open while drilling questions. MLA‑C01 rewards “production ML realism”: data quality gates, repeatability, safe deployments, drift monitoring, cost controls, and least-privilege security.
Quick facts (MLA-C01)
Item
Value
Questions
65 (multiple-choice + multiple-response)
Time
130 minutes
Passing score
720 (scaled 100–1000)
Cost
150 USD
Domains
D1 28% • D2 26% • D3 22% • D4 24%
Fast strategy (what the exam expects)
If the question says best-fit managed ML, the answer is often SageMaker (Feature Store, Pipelines, Model Registry, managed endpoints).
If the scenario is “data is messy,” think data quality checks, profiling, transformations, and feature consistency (train/serve).
If the scenario is “accuracy dropped in prod,” think drift, monitoring baselines, A/B or shadow, and retraining triggers.
If the scenario is “cost is spiking,” think right-sizing, endpoint type selection, auto scaling, Spot / Savings Plans, and budgets/tags.
If there’s “security/compliance,” include least privilege IAM, encryption, VPC isolation, and audit logging.
Read the last sentence first to capture constraints: latency, cost, ops effort, compliance, auditability.
ML decision flow
Use this when the question is really asking which lifecycle layer is failing.
flowchart TD
S["Scenario"] --> D["Data quality or feature mismatch?"]
D -->|yes| P["Fix prep, profiling, or Feature Store first"]
D -->|no| M["Model / deployment / monitoring issue?"]
M -->|yes| O["Tune, register, deploy, or monitor"]
M -->|no| G["Check governance, cost, and security controls"]
MLA-C01 answer sequence
Use this when the stem mixes data prep, model choice, deployment, and monitoring.
flowchart TD
S["Scenario"] --> D["Check data quality and feature consistency"]
D --> M["Choose model, training, or tuning path"]
M --> P["Pick deployment and orchestration pattern"]
P --> O["Add monitoring, drift, and scaling controls"]
O --> G["Verify governance, security, and cost fit"]
Domain weights (how to allocate your time)
Domain
Weight
Prep focus
Domain 1: Data Preparation for ML
28%
Ingest/ETL, feature engineering, data quality and bias basics
Domain 2: ML Model Development
26%
Model choice, training/tuning, evaluation, Clarify/Debugger/Registry
Domain 3: Deployment + Orchestration
22%
Endpoint types, scaling, IaC, CI/CD for ML pipelines
Domain 4: Monitoring + Security
24%
Drift/model monitor, infra monitoring + costs, security controls
Final 20-minute recall (exam day)
Cue -> best answer (pattern map)
If the question says…
Usually best answer
Data is messy/inconsistent before training
Data Wrangler/DataBrew + quality checks
Train/serve feature mismatch
SageMaker Feature Store
Need systematic hyperparameter search
SageMaker Automatic Model Tuning
Need fairness/explainability evidence
SageMaker Clarify
Training instability / convergence issues
SageMaker Debugger
Accuracy degraded in production
SageMaker Model Monitor + drift triggers + retraining
Govern model promotion and rollback
SageMaker Model Registry + approval workflow
Constant low-latency traffic
Real-time endpoint
Spiky traffic with low idle tolerance
Serverless endpoint
Long-running or non-interactive inference
Async endpoint or batch transform
Must-memorize MLA defaults
Topic
Fast recall
First failure domain
Data quality and leakage before model changes
Metric selection
Match metric to business cost (precision vs recall trade-off)
Drift controls
Baselines, alerts, and versioned retraining pipeline
Cost controls
Right-size, auto scale, pick correct endpoint mode, use Spot where safe