Python Institute PCAD cheat sheet for data science basics, models, traps, and final review.
Use this cheat sheet for Certified Associate Python for Data Science (PCAD) after you know the basics but before you start a timed practice block. The goal is not to memorize a vendor catalog; the goal is to classify the scenario and reject attractive wrong answers quickly.
Use this when the stem mixes data engineering, SQL, data prep, validation, or analysis method.
flowchart TD
S["Scenario"] --> Q["Identify the business question"]
Q --> T["Name the data shape"]
T --> C["Choose cleaning or transformation path"]
C --> V["Verify result and documentation"]
| Lane | Decision rule | Reject when |
|---|---|---|
| Python foundations | Use core syntax, functions, data structures, files, and exceptions as the base for data work. | Blaming a library when the issue is basic Python type or scope behavior. |
| Data handling | Understand tabular data, missing values, filtering, grouping, sorting, joins, and reshaping concepts. | Cleaning data without knowing what rows, columns, keys, and missing values mean. |
| Statistics and visualization | Read summary statistics, distributions, correlation, charts, and common interpretation traps. | Confusing correlation with causation or average with distribution. |
| Machine learning basics | Understand train/test split, features, labels, overfitting, metrics, and model evaluation. | Judging models only by training accuracy. |
| Data ethics and workflow | Recognize bias, privacy, reproducibility, documentation, and responsible use. | Publishing results without reproducible steps or data-quality notes. |
| Trap | Better instinct |
|---|---|
| Library memorization only | Understand the data operation and shape change, not just method names. |
| No holdout data | Evaluate on unseen data to avoid false confidence. |
| Missing-value shortcuts | Know when dropping, imputing, or flagging values changes meaning. |
| Unexplained charts | Tie chart choice to question, variable type, and audience. |
| If the stem says | Start with |
|---|---|
| least privilege, private access, compliance, or audit | identity scope, data boundary, policy enforcement, logging, and ownership |
| least operational effort | managed service, native integration, simple workflow, and fewer moving parts |
| high availability, recovery, or outage | failure domain, recovery objective, health check, rollback, and validation |
| performance, scale, or cost | bottleneck evidence, traffic pattern, sizing, caching, batching, and quotas |
| troubleshoot, diagnose, or investigate | symptom, recent change, logs, metrics, status, dependency, and smallest safe test |
Use IT Mastery for the exact product route, practice status, spaced review when available, and close-answer explanation practice as coverage expands.
Open the exact IT Mastery route here: PCAD on MasteryExamPrep.
Data science Python questions reward data-shape awareness: clean, summarize, visualize, model, evaluate, and explain responsibly.